Metric learning literature assumes binary labels, where samples belong to either the same or different classes. While this binary perspective has motivated fundamental ranking losses (e.g., Contrastive and Triplet loss), it has reached a stagnant point [2]. Thus, one novel direction for metric learning is continuous (non-binary) similarity. This paper [1] promotes metric learning beyond binary supervision as shown in the next Figure.

(a) Existing methods categorize neighbors into positive and negative classes, and learn a metric space where positive images are close to the anchor and negative ones far apart. In such a space, the distance between a pair of images is not necessarily related to their semantic similarity, since the order and degrees of similarity between them are disregarded. (b) This paper [1] preserves distance ratios from the label space in the learned metric space to overcome the aforementioned limitation.

Binary metric learning is insufficient for objects with continuous similarity criteria, such as image captions, human poses, and scene graphs. Thus, this paper [1] proposes a triplet-loss variant, dubbed the log-ratio loss, that…
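The post cuts off here, but the loss itself is compact. Below is a minimal PyTorch sketch of a log-ratio loss, assuming squared Euclidean distances in the embedding space and precomputed continuous distances (e.g., between human poses) in the label space; all names are mine, not the paper's code.

```python
import torch

def log_ratio_loss(anchor, nbr_i, nbr_j, label_dist_i, label_dist_j, eps=1e-8):
    """Sketch: match distance ratios in the embedding space to distance
    ratios in the continuous label space (no binary positive/negative split).

    anchor, nbr_i, nbr_j: embeddings of shape (batch, dim).
    label_dist_i, label_dist_j: label-space distances of shape (batch,).
    """
    # Squared Euclidean distances in the learned embedding space.
    d_i = (anchor - nbr_i).pow(2).sum(dim=1)
    d_j = (anchor - nbr_j).pow(2).sum(dim=1)

    # The ratio of embedding distances should match the ratio of label
    # distances; working in log space turns the ratio into a difference.
    log_ratio_embed = torch.log(d_i + eps) - torch.log(d_j + eps)
    log_ratio_label = torch.log(label_dist_i + eps) - torch.log(label_dist_j + eps)

    return (log_ratio_embed - log_ratio_label).pow(2).mean()
```

Note that, unlike the triplet loss, nothing here depends on which neighbor is "positive"; only the degree of similarity matters.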


This paper [1] quantifies the financial and environmental costs (CO2 emissions) of training a deep network. It also draws attention to the inequality between academia and industry in terms of computational resources. The paper uses NLP architectures to present its case. Yet, the discussed issues are very relevant to the computer vision community.

The paper compares the amount of CO2 emitted by familiar consumption benchmarks (e.g., a car's lifetime emissions) versus common NLP models (e.g., a transformer). Table 1 shows that training a transformer network (with neural architecture search) emits significantly more CO2 than a fuel car does over its entire lifetime.

Table 1: Estimated CO2 emissions from training common NLP models, compared to familiar consumption.

Then, the paper compares both the…


This paper [1] proposes an unsupervised framework for hard training-example mining. The proposed framework has two phases. Given a collection of unlabelled images, the first phase identifies positive and negative image pairs. Then, the second phase leverages these pairs to fine-tune a pretrained network.

Phase #1:

The first phase leverages a pretrained network to project the unlabelled images into an embedding space (manifold) as shown in Fig.1.

Figure 1: A pretrained network embeds images into a manifold (feature space)

The manifold is used to create pairs/triplets from the unlabelled images. For an anchor image, the manifold provides two types of nearest neighbors: Euclidean (NN^e) and manifold (NN^m), as shown in Fig.2. The…
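The post is truncated here, but the two neighbor types can be sketched. Euclidean neighbors come from raw pairwise distances, while manifold neighbors follow the structure of the data manifold; the paper uses a similarity-graph process for the latter, and the sketch below stands in with kNN-graph shortest paths (a simplification on my part; all names are mine).

```python
import numpy as np
from scipy.sparse.csgraph import shortest_path
from sklearn.neighbors import kneighbors_graph

def euclidean_and_manifold_nn(embeddings, anchor_idx, k=5, graph_k=10):
    """Sketch: two notions of 'nearest neighbor' on the same embeddings.

    NN^e: closest points by straight-line (Euclidean) distance.
    NN^m: closest points by distance along a kNN graph, a common
    approximation of distance on the data manifold.
    """
    # Euclidean neighbors: raw pairwise distances to the anchor.
    diff = embeddings - embeddings[anchor_idx]
    euc_dist = np.linalg.norm(diff, axis=1)
    nn_e = np.argsort(euc_dist)[1:k + 1]  # skip the anchor itself

    # Manifold neighbors: shortest-path distances over a kNN graph.
    graph = kneighbors_graph(embeddings, graph_k, mode='distance')
    geo_dist = shortest_path(graph, directed=False, indices=[anchor_idx])[0]
    nn_m = np.argsort(geo_dist)[1:k + 1]

    return nn_e, nn_m
```

Samples that are close in NN^m but far in NN^e (and vice versa) are exactly the informative, hard examples this mining strategy is after.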


This paper [1] proposes a tool, L2-CAF, to visualize attention in convolutional neural networks. L2-CAF is a generic visualization tool that can do everything CAM [3] and Grad-CAM [2] can do, but the opposite is not true.

Given a pre-trained CNN, an input x generates an output NT(x); this is the solid green path in the next Figure. For the same input x, if the last convolutional layer's output is multiplied by a constrained attention filter f, the network generates another output FT(x, f); this is the dashed orange path. The filter f is randomly initialized, then optimized using…
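The post is truncated, but the optimization is simple to sketch: the network weights stay frozen, and only the unit-norm filter f is optimized so that the filtered output FT(x, f) matches the unconstrained output NT(x). A minimal PyTorch sketch, assuming the network is split into a conv backbone and a head (my split and names, not the paper's API):

```python
import torch
import torch.nn.functional as F

def l2_caf(backbone, head, x, steps=100, lr=0.1):
    """Sketch of the L2-CAF idea: find a unit-norm spatial attention
    filter f such that masking the last conv features barely changes
    the network output. The network itself is never modified."""
    with torch.no_grad():
        feat = backbone(x)          # (1, C, H, W) last conv features
        nt_out = head(feat)         # solid path: unconstrained output NT(x)

    h, w = feat.shape[2], feat.shape[3]
    f = torch.randn(1, 1, h, w, requires_grad=True)   # random init
    opt = torch.optim.Adam([f], lr=lr)

    for _ in range(steps):
        f_unit = f / f.norm()       # enforce the unit L2-norm constraint
        ft_out = head(feat * f_unit)  # dashed path: filtered output FT(x, f)
        loss = F.mse_loss(ft_out, nt_out)
        opt.zero_grad()
        loss.backward()
        opt.step()

    return (f / f.norm()).detach().squeeze()  # attention heatmap
```

Because only f is optimized, the tool works on any pre-trained CNN without retraining or architectural changes.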


Metric learning learns a feature embedding that quantifies the similarity between objects and enables retrieval. Metric learning losses can be categorized into two classes: pair-based and proxy-based. The next figure highlights the difference between the two classes. Pair-based losses pull similar samples together while pushing different samples apart (data-to-data relations). Proxy-based losses compute class representative(s) during training. Then, samples are pulled towards their class representatives and pushed away from different representatives (data-to-proxy relations).

Proxy-based losses compute class representatives (stars) for each class. Data samples (circles) are pulled towards their own class representatives and pushed away from the others (data-to-proxy). Pair-based losses pull similar data samples together and push different data samples apart (data-to-data). The solid green lines indicate a pull force, while the red dashed lines indicate a push force.
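To make the two families concrete, here is a hedged sketch of one loss from each: a Proxy-NCA-style proxy loss and a contrastive pair loss (both simplified; names are mine):

```python
import torch
import torch.nn.functional as F

class ProxyNCA(torch.nn.Module):
    """Sketch of a proxy-based loss (Proxy-NCA style): one learnable
    proxy per class; samples interact only with proxies (data-to-proxy)."""

    def __init__(self, num_classes, dim):
        super().__init__()
        self.proxies = torch.nn.Parameter(torch.randn(num_classes, dim))

    def forward(self, embeddings, labels):
        e = F.normalize(embeddings, dim=1)
        p = F.normalize(self.proxies, dim=1)
        dist = torch.cdist(e, p).pow(2)       # (batch, num_classes)
        # Minimize distance to the own-class proxy relative to all others.
        return F.cross_entropy(-dist, labels)

def contrastive_pair_loss(emb_a, emb_b, same_class, margin=0.5):
    """Sketch of a pair-based loss: pull positive pairs together and
    push negative pairs beyond a margin (data-to-data relations)."""
    d = F.pairwise_distance(emb_a, emb_b)
    pos = same_class * d.pow(2)
    neg = (1 - same_class) * F.relu(margin - d).pow(2)
    return (pos + neg).mean()
```

The proxy loss touches each sample once per batch (fast, but coarse), while the pair loss must enumerate pairs (rich relations, but many more terms), which previews the trade-offs summarized next.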

The next table summarizes the pros and cons of both proxy-based and pair-based losses. For instance, pair-based losses leverage fine-grained semantic relations between samples but suffer from slow…


This (G)old paper [1] tackles an interesting question: Why Does Unsupervised Pre-training Help Deep Learning? The authors support their conclusions with a ton of experiments. Yet, the findings contradict a common belief about unsupervised learning. That's why I have conflicting feelings about this paper. I will present the paper first, then follow up with my comments.

The authors strive to understand how unsupervised pretraining helps. There are two main hypotheses:

  1. Better optimization: Unsupervised pretraining puts the network in a region of parameter space where basins of attraction run deeper than when starting with random parameters. In simple words, the network…

This paper proposes PixelPlayer, a system to ground audio inside a video (frames) without manual supervision. Given an input video, PixelPlayer separates the accompanying audio into components and spatially localizes these components in the video. PixelPlayer enables us to listen to the sound originating from each pixel in the video as shown in the next Figure.

“Which pixels are making sounds?” Energy distribution of sound in pixel space. Overlaid heatmaps show the volumes from each pixel.

To train PixelPlayer using a neural network, a dataset is needed. The authors introduce a musical instrument video dataset for the proposed task, called the MUSIC (Multimodal Sources of Instrument Combinations) dataset. This dataset is crawled from YouTube with no manual annotation. MUSIC dataset…
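The dataset description is truncated here. On the training side, PixelPlayer's self-supervision comes from the mix-and-separate idea: mix the audio of two videos and train the network to recover each one. A minimal sketch, assuming magnitude spectrograms and an assumed audio_net(mix, vision_feat) -> mask interface (the real model splits this into an audio U-Net, a video analysis network, and a synthesizer; all names here are mine):

```python
import torch.nn.functional as F

def mix_and_separate_loss(audio_net, vision_feat_1, vision_feat_2,
                          spec_1, spec_2, eps=1e-8):
    """Sketch of mix-and-separate self-supervision: mix the magnitude
    spectrograms of two videos, then predict a mask per video that
    recovers its own spectrogram from the mixture."""
    mix = spec_1 + spec_2                       # synthetic mixture

    # Predict a separation mask for each video, conditioned on its frames.
    mask_1 = audio_net(mix, vision_feat_1)
    mask_2 = audio_net(mix, vision_feat_2)

    # Self-supervised targets: ratio masks from the known components
    # (the paper also explores binary masks; ratio masks keep this short).
    target_1 = spec_1 / (mix + eps)
    target_2 = spec_2 / (mix + eps)

    return F.l1_loss(mask_1, target_1) + F.l1_loss(mask_2, target_2)
```

No human labels are needed: mixing two clips manufactures both the input and the ground truth for free.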


The goal in video highlight detection is to retrieve a moment, in the form of a short video clip, that captures a user's primary attention or interest within an unedited video, as shown in the next Figure. An efficient highlight detection approach improves the video browsing experience, enhances social video sharing, and facilitates video recommendation. Supervised highlight detection approaches require a dataset of unedited videos with their corresponding manually annotated highlights, i.e., video-highlight pairs. Such datasets are very expensive to collect and create.

Video frames from three shorter user-generated video clips (top row) and one longer user-generated video (second row). Although all recordings capture the same event (surfing), video segments from shorter user-generated videos are more likely to be highlights than those from longer videos, since users tend to be more selective about their content. The height of the red curve indicates the highlight score over time. We leverage this natural phenomenon as a free latent supervision signal in large-scale Web video.

This paper [1] avoids the expensive supervision entailed by collecting video-highlight pairs. The authors propose…
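The post is cut off, but the figure caption already names the free supervision signal: duration. A hedged sketch of a duration-based ranking objective (not necessarily the paper's exact loss; all names are mine):

```python
import torch.nn.functional as F

def duration_ranking_loss(score_net, clips_short, clips_long, margin=1.0):
    """Sketch: weak supervision from video duration. Segments sampled
    from short user-generated videos are treated as likely highlights
    and should outscore segments from long videos.

    score_net: model mapping a batch of clips to scalar highlight
    scores of shape (batch,).
    """
    s_short = score_net(clips_short)    # scores for short-video segments
    s_long = score_net(clips_long)      # scores for long-video segments

    # Margin ranking: s_short should exceed s_long by at least `margin`.
    return F.relu(margin - s_short + s_long).mean()
```

Since durations come for free with every Web video, this signal scales to large unlabeled collections without any manual highlight annotation.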


Standard classification architectures (e.g., ResNet and DenseNet) achieve great performance. However, they cannot answer the following question: what is the nearest neighbor image to a given query image? This question reveals an underlying limitation of the softmax loss. The softmax loss, used in training classification models, is prone to overfitting. It achieves superior classification performance, yet with an inferior class embedding. To address this limitation, recent literature [2,3] assumes a fixed number of modes per class, as shown in the next figure. This assumption requires expert user input and adds complexity for imbalanced datasets. …
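To make the nearest-neighbor question concrete, one common baseline (not the method of [1]) extracts penultimate-layer features from a pretrained classifier and retrieves by cosine similarity; it is exactly this embedding that the softmax loss leaves in a subpar state:

```python
import torch
import torch.nn.functional as F
import torchvision.models as models

# Use a pretrained classifier's penultimate layer as an embedding.
resnet = models.resnet18(weights="IMAGENET1K_V1")
resnet.fc = torch.nn.Identity()   # drop the softmax classifier head
resnet.eval()

@torch.no_grad()
def nearest_neighbor(query, gallery):
    """query: (1, 3, H, W); gallery: (N, 3, H, W) preprocessed images.
    Returns the gallery index closest to the query in embedding space."""
    q = F.normalize(resnet(query), dim=1)        # (1, d)
    g = F.normalize(resnet(gallery), dim=1)      # (N, d)
    return (g @ q.t()).squeeze(1).argmax().item()  # cosine similarity
```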


Transfer learning (a.k.a. fine-tuning) is a core advantage of deep supervised learning. However, supervised learning requires labeled datasets, which are expensive to acquire. Unsupervised/self-supervised learning is a cheaper alternative to supervised approaches. To avoid costly annotation, an unsupervised learning approach leverages a pretext task as a supervision signal. For example, Gidaris et al. [2] rotate images and predict the rotation angle as a supervision signal; see the sketch below. Similarly, Pathak et al. [3] recover an image patch from the surrounding pixels. This paper [1] proposes a novel pretext task for unsupervised learning.
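Before getting to [1]'s task, here is what a pretext task looks like in practice, sketched with the rotation prediction of Gidaris et al. [2] (a minimal version; names are mine):

```python
import torch
import torch.nn.functional as F

def rotation_pretext_batch(images):
    """Sketch of the rotation pretext task [2]: rotate each image by
    0/90/180/270 degrees and use the rotation index as a free label.

    images: (N, C, H, W) with H == W.
    """
    rotated, labels = [], []
    for k in range(4):  # k * 90 degrees
        rotated.append(torch.rot90(images, k, dims=(2, 3)))
        labels.append(torch.full((images.size(0),), k, dtype=torch.long))
    return torch.cat(rotated), torch.cat(labels)

def rotation_pretext_loss(model, images):
    # The model trains as a plain 4-way classifier; the labels come
    # from the transform itself, so no manual annotation is needed.
    x, y = rotation_pretext_batch(images)
    return F.cross_entropy(model(x), y)
```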

The paper proposes a new simple, yet efficient, pretext task, i.e., If…

Ahmed Taha

I write reviews on computer vision papers. Writing tips are welcome.
