Notes
Research, engineering write-ups, and the dead ends in between.
Distribution Matching Is Not Enough: Two Failure Modes in Latent Text Drifting
May 25, 2026
Probing Latent Directions in Video Diffusion Models
May 25, 2026
Hybrid Lexical–Semantic Retrieval for Tool Selection in Agent Systems
April 30, 2026
From Single-GPU to Distributed Training: A Framework for Making the Right Call
April 20, 2026
Distributed Data Parallel: How It Actually Works
April 20, 2026
Tensor Parallelism and Sequence Parallelism
April 20, 2026
Pipeline Parallelism: How It Actually Works
April 20, 2026
ZeRO and FSDP: Model Sharding
April 20, 2026
Kinetic-4B: A 4-Billion Parameter Model That Outperforms Claude Haiku at Tool Calling
April 1, 2026
LLM Inference at the Edge
March 30, 2026