Back to Portfolio
AI & ML Insights

AI & ML Insights by Krishil Agrawal.

Deep dives into real-world machine learning systems, AI architectures, and engineering challenges — written for engineers who build, not just read.

6+Technical deep dives
100%Production-focused
0Beginner tutorials
2026Current & up-to-date
FeaturedAgentic AISystem DesignGenAI / LLMs

MCP vs A2A — The 2 Protocols Every AI Developer Needs to Know

MCP connects your agent to tools. A2A connects your agent to other agents. Those are two very different problems — and confusing them will wreck your architecture before you write a single line of code.

12 min read·April 2026
All Articles5 articles
SecurityDeep DiveAgentic AI

Why Agentic AI Is Still Broken: 5 Security Failures Killing Real Deployments

Agentic AI promises autonomy, but prompt injection, tool misuse, and broken trust chains are silently killing deployments. Here's what's really broken and how to fix it.

10 min read·April 2026
GenAI / LLMsSystem Design

Scaling RAG Systems: From Naive Retrieval to Agentic Chunking

80% of RAG failures happen at the chunking layer, not the LLM. Here's how to move from fixed-size splitting to intelligent, context-aware chunking.

14 min read·April 2026
GenAI / LLMsSystem DesignAgentic AI

GraphRAG vs Vector RAG — When Relationships Beat Similarity

GraphRAG beats Vector RAG in 4 specific scenarios. Learn when entity relationships outperform semantic similarity — with diagrams, examples, and code.

12 min read·April 2026
GenAI / LLMsAgentic AIDeep Dive

LLMs Don't Have Memory. So How Do They Remember?

Every LLM starts completely fresh. No memory of you, your preferences, or your last conversation. So how do AI assistants seem to remember anything? Here's the complete engineering answer.

15 min read·April 2026
Machine LearningGenAI / LLMsDeep Dive

Vectorization vs Embeddings — The Difference Every ML Engineer Must Understand

Both convert text to numbers. Both produce vectors. And yet they solve fundamentally different problems — and using them interchangeably will break your models in ways that are very hard to debug.

14 min read·April 2026
Why Read

What Makes These Articles Different

System-Level Thinking

Every article covers the full engineering stack — not just the model API, but the retrieval layer, the memory architecture, and the deployment constraints.

Production Constraints First

Topics are chosen based on failure modes in real systems — the gaps between demos and deployments that most tutorials never address.

Grounded in Research

When numbers appear — token costs, latency figures, accuracy deltas — they come from cited sources and real benchmarks, not intuition.

Stay Sharp

New AI Engineering deep-dives — every month.

RAG systems. Agentic architectures. LLM deployment patterns. No filler.