Blog

#rag

Posts on AI engineering, LLM systems, and software development.

Sort:

Local LlmsMay 5, 2026#10

Local RAG and embeddings

Build a working local RAG pipeline in about 30 lines using nomic-embed-text, Chroma, and Llama 3.2. And why running it on your own machine beats the cloud for personal notes.

AI Chroma Embeddings LLM Local Llms

Read →

AI FoundationsMarch 13, 2026#8

Prompt, RAG, fine-tune: three ways to shape a model

Three levers for shaping what an LLM does: prompting (ask better), RAG (give it the right context), fine-tuning (change the weights). What each costs, what each fixes, and how to pick.

AI Fine Tuning Fundamentals LLM Prompting

Read →

AI FoundationsMarch 11, 2026#7

RAG: giving a model memory it doesn't have

RAG is the pattern of fetching relevant text from a search system and putting it in the LLM's context window before asking your question. Not magic, not fine-tuning, just better prompts.

AI Embeddings Fundamentals LLM RAG

Read →