In TDS Archive, by Dmitrii Eliuseev: 16, 8, and 4-bit Floating Point Formats — How Does it Work? Let's go into bits and bytes. (Sep 30, 2023)
In AIGuys, by Vishal Rajput: RetNet: Transformer killer is here. Can RetNet replace the Transformer? Early results look very promising. (Sep 14, 2023)
Joe El Khoury (GenAI Engineer): Introducing Deformable Attention Transformer. This post is based on the findings of this paper. (Jun 21, 2023)
Zain ul Abideen: Attention Is All You Need: The Core Idea of the Transformer. An overview of the Transformer model and its key components. (Jun 26, 2023)
In TDS Archive, by Chen Margalit: Simplifying Transformers: State of the Art NLP Using Words You Understand — Part 2: Input. A deep dive into how transformers' inputs are constructed. (Jul 26, 2023)
Fareed Khan: Understanding Transformers: A Step-by-Step Math Example — Part 1. I understand that the transformer architecture may seem scary, and you might have encountered various explanations on YouTube or in blogs… (Jun 5, 2023)
In Towards AI, by Quadric: (Vision) Transformers: Rise of the Chimera. It's 2023, and transformers are having a moment. No, I'm not talking about the latest installment of the Transformers movie franchise… (Jun 21, 2023)
Hunter Phillips: Overview: The Implemented Transformer. The transformer is a state-of-the-art model introduced in "Attention is All You Need" in 2017. There are great articles describing various… (May 8, 2023)