<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"><channel><title>Saurabh Ghatnekar — AI Engineer — Systems, Data &amp; Alignment</title><description>Saurabh Ghatnekar is an AI engineer working across the LLM stack: training and inference systems (kernels, parallelism, quantization), data pipelines (curation, deduplication, mixing), and alignment (SFT, RL, preference and synthetic data). Deep-dive writing on how frontier models actually get built.</description><link>https://saurabh.works/</link><language>en-us</language><item><title>SFT, DPO, or RLHF? Choosing the Right Post-Training Recipe</title><link>https://saurabh.works/blog/sft-dpo-rlhf-post-training-guide/</link><guid isPermaLink="true">https://saurabh.works/blog/sft-dpo-rlhf-post-training-guide/</guid><description>When supervised fine-tuning is enough, when preference optimization pays off, and where verifiable rewards fit — a practical decision guide.</description><pubDate>Fri, 03 Jul 2026 00:00:00 GMT</pubDate><category>alignment</category><category>post-training</category><category>sft</category><category>dpo</category><category>rlhf</category><category>preference-data</category><category>verifiers</category><author>connect@saurabh.works (Saurabh Ghatnekar)</author></item><item><title>Data Curation for LLMs: Filtering, Deduplication, and Mixing in Practice</title><link>https://saurabh.works/blog/llm-data-curation-filtering-deduplication-mixing/</link><guid isPermaLink="true">https://saurabh.works/blog/llm-data-curation-filtering-deduplication-mixing/</guid><description>A practical walkthrough of the LLM data pipeline — quality filtering, exact and near deduplication with MinHash, decontamination, and mixture weights.</description><pubDate>Thu, 25 Jun 2026 00:00:00 GMT</pubDate><category>data</category><category>data-curation</category><category>deduplication</category><category>minhash</category><category>data-mixing</category><category>llm-pretraining</category><author>connect@saurabh.works (Saurabh Ghatnekar)</author></item><item><title>How to Fit Large Language Models on Small GPUs</title><link>https://saurabh.works/blog/fit-large-language-models-on-small-gpus/</link><guid isPermaLink="true">https://saurabh.works/blog/fit-large-language-models-on-small-gpus/</guid><description>Where GPU memory actually goes during LLM training, and how activation checkpointing, quantization, 8-bit optimizers, and CPU offloading win it back.</description><pubDate>Fri, 12 Jun 2026 00:00:00 GMT</pubDate><category>systems</category><category>gpu-memory</category><category>activation-checkpointing</category><category>quantization</category><category>cpu-offloading</category><category>llm-training</category><author>connect@saurabh.works (Saurabh Ghatnekar)</author></item></channel></rss>