Publications


Super Study Guide: Transformers & Large Language Models
Authors:
Afshine Amini & Shervine Amidi
TR Translation by Dr. Merve Ayyüce KIZRAK

This book is a concise and illustrated guide for anyone who wants to understand the inner workings of large language models in the context of interviews, projects or to satisfy their own curiosity.

It is divided into 5 parts:

  • Foundations: primer on neural networks and important deep learning concepts for training and evaluation
  • Embeddings: tokenization algorithms, word-embeddings (word2vec) and sentence embeddings (RNN, LSTM, GRU)
  • Transformers: motivation behind its self-attention mechanism, detailed overview on the encoder-decoder architecture and related variations such as BERT, GPT and T5, along with tips and tricks on how to speed up computations
  • Large language models: main techniques to tune Transformer-based models, such as prompt engineering, (parameter efficient) finetuning and preference tuning
  • Applications: most common problems including sentiment extraction, machine translation, retrieval-augmented generation and many more

Buy the book pdf or hardcopy version here!


Publication TitleYear
Super Study Guide: Transformers & Large Language Models (in Turkish)2025
Data-Centric AI Approach on Aerial Object Detection for Smart Transportation (Accepted by Machine Learning for Societial Improvement, Modernization and Progress)2022
Data Sharing and Privacy Issues Arising with COVID-19 Data and Applications 2022
Privacy-Preserving Mechanisms with Explainability in Assistive AI Technologies, SpringerLink, Advances in Assistive Technologies2021
Crowd Density Estimation by Using Attention Based Capsule Network and Multi-Column CNN, IEEE Access 2021
Limitations and Challenges on the Diagnosis of COVID-19 Using Radiology Images and Deep Learning2021
Cluster-Based Monitoring and Location Estimation for Crowd Counting 2020
Classification of Recyclable Materials Using Efficient Deep Learning Models and Benchmarking of GPU Performance 2020
Differential Privacy Practice on Diagnosis of COVID-19 Radiology Imaging Using EfficientNet2020
Voting-Based Multiple Classification Approach for Turkish News Texts2019
Deep and Wide Convolutional Neural Network Model for Highly Dense Crowd 2019
BOOK: New Generation Technologies in Health, Chapter: Artificial Intelligence in Health2019
Uçak Motoru Sağlığı için Uzun-Kısa Süreli Bellek Yöntemi ile Öngörücü Bakım - Predictive Maintenance of Aircraft Motor Health with Long-Short Term Memory Method2019
Predictive Maintenance of Aircraft Motor Health with Long-Short Term Memory Method2018
A Comprehensive Survey of Deep Learning in Crowd Analysis (in Turkish) 2018
RecycleNet: Intelligent Waste Sorting Using Deep Neural Networks2018
Recognition of Sign Language Using Capsule Networks (in Turkish) 2018
A Musical Information Retrieval System for Classical Turkish Music Makams2017
A Novel Approach for People Counting and Tracking from Crowd video2017
BOOK: Automatic Acute Lymphocytic Leukemia Diagnosis Based on Kernel Ridge Regression Method2017
Classification of Classic Turkish Music Makams by Using Deep Belief Networks (in Turkish)2017
Classification of Classic Turkish Music Makams by using deep belief Networks2016
Classification of Classic Turkish Music Makams2014
Classification of EEG Signals by Using Support Vector Machines2013
Automatic Acute Lymphocytic Leukemia Diagnosis Based on Kernel Ridge Regression Method2012
A New Median Filter Based Fingerprint Recognition Algorithm2011
Circularly Polarized Microstrip Patch Antenna with Slits2010
DVB-T’de OFDM Performans Analizi2010
A New Way of Looking of Fingerprint Recognition Median-Filtering Fingerprint Recognition Algorithm (HMFA)2009
A new median filter based fingerprint recognition algorithm (HMPA) (in Turkish)2009