Jesus Rodriguez — Understanding FlashAttention-3: One of the Most Important Algorithms to Make Transformers Fast. The new version takes full advantage of H100 capabilities to improve attention in transformer models. Jul 15, 2024

Dr. Ashish Bamania, in Level Up Coding — Superfast Matrix-Multiplication-Free LLMs Are Finally Here. A deep dive into matrix-multiplication-free LLMs that could drastically reduce AI's current reliance on GPUs. Jun 20, 2024

Jacky, in AI Advances — NVIDIA GPUs Now Support Copilot+! Is AI PC Set for a Revolution? Exploring the impact of Microsoft's latest AI service and NVIDIA's support on the future of personal computing. Jun 13, 2024

Daniel Warfield, in Intuitively and Exhaustively Explained — CUDA for AI — Intuitively and Exhaustively Explained. Parallelized AI from scratch in CUDA. Jun 14, 2024