Shaw Talebi - How to Train LLMs to “Think” (o1 & DeepSeek-R1) - Advanced reasoning models explained (Feb 12)
Alberto Romero - Debunking 10 Popular Myths About DeepSeek - I’ve heard too many wrong takes that need correction (Feb 4)
Thomas Smith, in The Generator - Why DeepSeek Doesn’t Matter - The new AI model is a hack, not an “extinction level event.” Here’s why. (Jan 29)
Cezary Gesikowski, in Generative AI - Deep Research in ChatGPT: AI Agent for In-Depth Knowledge Work - The days of a million open browser tabs are numbered (Feb 3)
Jan Kammerath - DeepSeek: Is It A Stolen ChatGPT? - While I was drowning in emails, fiddling around with Xcode and the Neural Cores in my MacBook, DeepSeek popped up on X and Reddit. It… (Jan 27)
Dr. Ashish Bamania, in Level Up Coding - DeepSeek-R1 Beats OpenAI’s o1, Revealing All Its Training Secrets Out In The Open - A deep dive into how DeepSeek-R1 was trained from scratch and how this open-source research will accelerate AI progress like never before. (Jan 27)
Alberto Romero - DeepSeek Is Chinese But Its AI Models Are From Another Planet - OpenAI and the US are in deep trouble (Jan 22)
Yash Thube, in Towards AI - DeepSeek-R1: The Open-Source AI That Thinks Like OpenAI’s Best - What’s the hype about? 👀 (Jan 21)
Alberto Romero - This Rumor About GPT-5 Changes Everything - Let’s start the year on an exciting note (Jan 16)
Fareed Khan, in Level Up Coding - Building a 2 Billion Parameter LLM from Scratch Using Python - It starts making sense (Jan 15)
Mr Tony Momoh, in AI Mind - The $6 Million AI That’s Making OpenAI Nervous: How a Tiny Chinese Startup Is Disrupting Silicon… - In the world of artificial intelligence, David just threw a stone at Goliath. And this time, David spent 99% less money. (Jan 22)
Rendy Dalimunthe - Understanding Test-Time Compute: A New Mechanism Allowing AI to “Think Harder” - Exploring How AI Adapts to Complex Tasks with Dynamic Reasoning Power (Nov 22, 2024)
Tim Urista, Senior Cloud Engineer, in AI Advances - Dramatically Reduce Inference Costs with DeepSeek-V3: A New Era in Open-Source LLMs (Dec 27, 2024)
Vishal Rajput, in AIGuys - OpenAI Achieved AGI With Its New o3 Model? - Does adding extensive search during inference mean better generalization? (Dec 23, 2024)
Vishal Rajput, in AIGuys - Are Tiny Transformers The Future Of Scaling? - Can we reduce the O(N²) cost of the attention mechanism? (Nov 12, 2024)
Devansh - LLMs Are NOT Reaching Their Limits - A response to Gary Marcus and many other “AI skeptics” (Nov 10, 2024)
Yuki Shizuya, in Generative AI - Unlocking Mixture-of-Experts (MoE) LLMs - Your MoE model can be used as an embedding model for free. (Nov 3, 2024)
Don Lim - Reasoning Tokens and Techniques Used in System 2 LLMs Such as OpenAI o1 - What is the System 2 model? (Sep 16, 2024)
Synced, in SyncedReview - Stanford’s Landmark Study: AI-Generated Ideas Rated More Novel Than Expert Concepts - Recent advancements in large language models (LLMs) have generated enthusiasm about their potential to accelerate scientific innovation… (Sep 18, 2024)