Large Language Models | Towards Data Science

AI-generated image showing agents building a pyramid of knowledge

Overcome Failing Document Ingestion & RAG Strategies with Agentic Knowledge Distillation

Machine Learning

Introducing the pyramid search approach

Tula Masterman

March 5, 2025

17 min read

Generative AI Is Declarative

Artificial Intelligence

And how to order a cheeseburger with an LLM

Michael Herman

March 5, 2025

28 min read

Avoidable and Unavoidable Randomness in GPT-4o

Machine Learning

Exploring the sources of randomness in GPT-4o from the known and controllable to the opaque…

Vincent Vatter

March 3, 2025

14 min read

Unraveling Large Language Model Hallucinations

Machine Learning

Understanding hallucinations as emergent cognitive effects of the training pipeline

Prashal Ruchiranga

February 28, 2025

11 min read

How to Use an LLM-Powered Boilerplate for Building Your Own Node.js API

Large Language Models

For a long time, one of the common ways to start new Node.js projects was…

Uladzimir Yancharuk

February 20, 2025

7 min read

A Comprehensive Guide to LLM Temperature 🔥🌡️

Large Language Models

While building my own LLM-based application, I found many prompt engineering guides, but few equivalent…

Kelsey Wang

February 7, 2025

8 min read

DeepSeek-V3 Explained 1: Multi-head Latent Attention

Deep Learning

Key architecture innovation behind DeepSeek-V2 and DeepSeek-V3 for faster inference

Shirley Li

January 31, 2025

9 min read

High Level Overview of VLMs. The picture of the cute dog is from Josh Frenette on Unsplash. This image is inspired by the representation of VLMs provided in this blog from HuggingFace (https://huggingface.co/blog/vlms) (Overall Image By Author)

Prompting Vision Language Models

Large Language Models

Exploring techniques to prompt VLMs

Anand Subramanian

January 29, 2025

21 min read

Figure 1. Created by the author based on the figure presented in the original paper, with additional explanations and interpretations

Beyond Causal Language Modeling

Artificial Intelligence

A deep dive into “Not All Tokens Are What You Need for Pretraining”

Masatake Hirono

January 27, 2025

7 min read