Large Language Models
-
Overcome Failing Document Ingestion & RAG Strategies with Agentic Knowledge Distillation
Machine LearningIntroducing the pyramid search approach
17 min read -
And how to order a cheeseburger with an LLM
28 min read -
Exploring the sources of randomness in GPT-4o from the known and controllable to the opaque…
14 min read -
Understanding hallucinations as emergent cognitive effects of the training pipeline
11 min read -
For a long time, one of the common ways to start new Node.js projects was…
7 min read -
While building my own LLM-based application, I found many prompt engineering guides, but few equivalent…
8 min read -
Key architecture innovation behind DeepSeek-V2 and DeepSeek-V3 for faster inference
9 min read -
Exploring techniques to prompt VLMs
21 min read -
A deep dive into “Not All Tokens Are What You Need for Pretraining”
7 min read