Deep learning stands on two pillars: GPUs and large datasets. Thus, deep networks suffer when trained from scratch on small datasets; this is the well-known overfitting problem. This paper [1] proposes knowledge evolution to reduce both overfitting and the burden of data collection. In the paper, knowledge evolution is supported…
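Since the teaser cuts off before the mechanism, here is a minimal sketch of the generation loop as I understand it from [1]; the mask-based weight split, the split ratio, and the reinitialization scheme below are my assumptions, not code from the paper.

```python
import torch
import torch.nn as nn

# Hypothetical sketch: a random binary mask splits the weights into a
# "fit" hypothesis (kept across generations) and a "reset" hypothesis
# (reinitialized every generation). keep_ratio is an assumed value.
def split_masks(model, keep_ratio=0.5):
    return {name: (torch.rand_like(p) < keep_ratio).float()
            for name, p in model.named_parameters()}

def evolve(model, train_fn, num_generations=3):
    masks = split_masks(model)
    for _ in range(num_generations):
        train_fn(model)  # ordinary training on the small dataset
        # Keep the fit hypothesis; reinitialize the reset hypothesis.
        with torch.no_grad():
            for name, p in model.named_parameters():
                fresh = torch.empty_like(p)
                nn.init.normal_(fresh, std=0.01)
                p.mul_(masks[name]).add_(fresh * (1 - masks[name]))
    return model
```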


This paper [1] leverages two simple ideas to solve an important problem: batch normalization breaks down when the batch size b is small, e.g., b=2. …
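To see why small batches hurt batch normalization, here is a quick numerical illustration (mine, not the paper's): with b=2, the per-batch mean is a very noisy estimate of the population statistics that the layer is supposed to normalize with.

```python
import torch

# Illustration: batch-norm statistics at b=2 are high-variance
# estimates of the population statistics.
torch.manual_seed(0)
population = torch.randn(10_000)  # true mean ~0, std ~1

for b in (2, 256):
    means = torch.stack([population[torch.randperm(10_000)[:b]].mean()
                         for _ in range(1_000)])
    print(f"b={b:>3}: std of batch means = {means.std():.3f}")
# b=2 gives far noisier statistics, so the normalized activations
# (and hence training) fluctuate wildly from step to step.
```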


Every software engineer has used a debugger to debug their code. Yet, a neural network debugger… That’s news! This paper [1] proposes a debugger to debug and visualize attention in convolutional neural networks (CNNs).

Before describing the CNN debugger, I want to highlight a few attributes of program debuggers (e.g., gdb)…


Thanks, Kubra, for sharing your thoughts. I have never used Deep Image Prior. It is a cool idea, yet its training cost is a barrier. As you mentioned, the total time spent on training is optimized; that's correct. Yet, this training cost is incurred for every image, i.e., every image is a training sample; there is no separate inference step.

Furthermore, I had a colleague who used a deep prior for RGB-D images [1]. The training cost becomes more severe as the number of dimensions increases.

So, yes, for 2D images the cost might be manageable, but I would proceed with caution for higher dimensions (e.g., videos). Thanks again for sharing your input :)
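To make the cost argument concrete, here is the gist of the Deep Image Prior loop (a sketch with a toy network, not the code of [1]): the optimization below restarts from scratch for every single image, which is exactly the per-image cost I mean.

```python
import torch
import torch.nn as nn

# Deep Image Prior sketch: for EVERY image, a freshly initialized
# network is fit from scratch; nothing is amortized across images.
# The tiny architecture and step count here are assumed, not DIP's.
def restore(corrupted, mask, steps=3000):
    # corrupted: (1, 3, H, W); mask: 1 at known pixels, 0 elsewhere.
    net = nn.Sequential(
        nn.Conv2d(32, 64, 3, padding=1),
        nn.ReLU(),
        nn.Conv2d(64, 3, 3, padding=1),
    )
    z = torch.randn(1, 32, *corrupted.shape[-2:])  # fixed random input
    opt = torch.optim.Adam(net.parameters(), lr=1e-3)
    for _ in range(steps):  # this loop IS the per-image "inference"
        opt.zero_grad()
        loss = ((net(z) - corrupted) ** 2 * mask).mean()
        loss.backward()
        opt.step()
    return net(z).detach()
```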

[1] Depth Completion Using a View-constrained Deep Prior


The metric learning literature assumes binary labels, where samples belong to either the same class or different classes. While this binary perspective has motivated fundamental ranking losses (e.g., contrastive and triplet loss), it has reached a point of stagnation [2]. Thus, one novel direction for metric learning is continuous (non-binary) similarity…
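For reference, the binary assumption is baked into the losses themselves. Here is a minimal triplet loss (the standard formulation, not code from [2]): the labels only say "same class" or "different class", so similarity is binary by design.

```python
import torch
import torch.nn.functional as F

# Standard triplet loss: pull the anchor toward the positive and
# away from the negative by at least `margin`. Similarity is binary:
# a sample is either a positive or a negative, nothing in between.
def triplet_loss(anchor, positive, negative, margin=0.2):
    d_pos = F.pairwise_distance(anchor, positive)
    d_neg = F.pairwise_distance(anchor, negative)
    return F.relu(d_pos - d_neg + margin).mean()
```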


This paper [1] quantifies the financial and environmental costs (CO2 emissions) of training a deep network. It also draws attention to the inequality between academia and industry in terms of computational resources. The paper uses NLP architectures to present its case. …


This paper [1] proposes an unsupervised framework for hard training-example mining. The proposed framework has two phases. Given a collection of unlabelled images, the first phase identifies positive and negative image pairs. Then, the second phase leverages these pairs to fine-tune a pretrained network.
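Before the phase-by-phase details below, this is roughly the shape of the pipeline (my sketch; the cosine-similarity thresholds are placeholders, not the paper's actual mining criterion).

```python
import torch
import torch.nn.functional as F

# Sketch of the two-phase framework; the threshold-based mining rule
# is hypothetical, standing in for the paper's phase-one criterion.
def mine_pairs(embeddings, pos_thresh=0.9, neg_thresh=0.3):
    # Phase 1: guess positive/negative pairs from a pretrained embedding.
    e = F.normalize(embeddings, dim=1)
    sim = e @ e.T
    sim.fill_diagonal_(0.5)  # exclude trivial self-pairs
    return (sim > pos_thresh).nonzero(), (sim < neg_thresh).nonzero()

def fine_tune_step(net, images, optimizer, margin=0.2):
    # Phase 2: contrastive fine-tuning on the mined pairs.
    emb = net(images)
    pos, neg = mine_pairs(emb.detach())
    d = torch.cdist(emb, emb)
    loss = d[pos[:, 0], pos[:, 1]].mean() \
         + F.relu(margin - d[neg[:, 0], neg[:, 1]]).mean()
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```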

Phase #1:

The first phase leverages…


This paper [1] proposes a tool, L2-CAF, to visualize attention in convolutional neural networks. L2-CAF is a generic visualization tool: it can do everything CAM [3] and Grad-CAM [2] can do, but not vice versa.

Given a pre-trained CNN, an input x generates an output NT(x) — this…
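The teaser cuts off here, but the core idea, as I understand the paper, is a constrained optimization: find a unit-L2-norm spatial filter over the last feature map such that the filtered features still reproduce the original network output. The sketch below is mine; the shapes, step count, and `head` callable are assumptions.

```python
import torch

# My sketch of the L2-CAF idea: optimize a unit-norm attention filter
# f over the last conv feature map so that masking by f preserves the
# original network output NT(x).
def l2_caf(head, feat, steps=200, lr=0.1):
    # feat: last conv feature map (1, C, H, W);
    # head: the layers after it (e.g., pooling + classifier).
    target = head(feat).detach()  # original output NT(x)
    f = torch.rand(1, 1, *feat.shape[-2:], requires_grad=True)
    opt = torch.optim.Adam([f], lr=lr)
    for _ in range(steps):
        f_unit = f / f.norm()  # enforce the unit-L2-norm constraint
        loss = (head(feat * f_unit) - target).pow(2).sum()
        opt.zero_grad()
        loss.backward()
        opt.step()
    return (f / f.norm()).detach().squeeze()  # (H, W) attention map
```

For a ResNet-style classifier, `head` would be the average pooling plus the fully connected layer; the returned map highlights where the network attends.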


Metric learning trains a feature embedding that quantifies the similarity between objects and enables retrieval. Metric learning losses can be categorized into two classes: pair-based and proxy-based. The next figure highlights the difference between the two classes. Pair-based losses pull similar samples together while pushing different samples apart (data-to-data relations)…
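To contrast with the pair-based family, here is a minimal proxy-based loss in the style of Proxy-NCA (my sketch, a standard formulation rather than code from the paper): each sample is compared against learned per-class proxies (data-to-proxy relations) instead of against other samples.

```python
import torch
import torch.nn.functional as F

# Minimal proxy-based loss (Proxy-NCA style): samples interact with
# learned class proxies, never directly with each other.
class ProxyLoss(torch.nn.Module):
    def __init__(self, num_classes, dim):
        super().__init__()
        self.proxies = torch.nn.Parameter(torch.randn(num_classes, dim))

    def forward(self, embeddings, labels):
        # Negative squared distance to each proxy acts as a logit;
        # cross-entropy pulls each sample toward its class proxy.
        d = torch.cdist(F.normalize(embeddings, dim=1),
                        F.normalize(self.proxies, dim=1)) ** 2
        return F.cross_entropy(-d, labels)
```

Because the proxies are shared across the whole dataset, a batch of B samples yields B-to-num_classes comparisons instead of the O(B^2) comparisons of pair-based losses.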


This (G)old paper [1] tackles an interesting question: Why Does Unsupervised Pre-training Help Deep Learning? The authors support their conclusions with a ton of experiments. Yet, the findings contradict a common belief about unsupervised learning. That’s why I have mixed feelings about this paper. …

Ahmed Taha

I write reviews on computer vision papers. Writing tips are welcome.
