Home
Field Guide
Tutorials
Topics
- Developer Productivity Engineering workflows, tooling, and execution systems
- Templates Reusable checklists, docs, and operating artifacts
- AI Playground Interactive learning labs and runnable examples
- Subagent Evals Evaluation harnesses for multi-agent behavior

Praveen Yellamaraju Production AI Systems Field Guide

LLM API

4 Posts

Exploring LLM API and related topics

Filter by Topic

All AI AI Agents AI Architecture AI Engineering AI Evaluation AI Literacy AI/ML Agent Harness Agentic AI Agentic Workflows Agents Anthropic Architecture Automation Benchmarks Best Practices Blockchain Career Claude Claude Code Codex Data Quality Developer Productivity Development Engineering Eval Generation Feedback Loops Gemini Governance LLM LLM API Leadership MLOps Machine Learning OpenAI Production Production AI Prompt Engineering Python RAG SOTA Security Self-Improving AI Structured Prompting Supply Chain Systems Thinking Testing Versioning npm

What Happens When You Call an LLM API

March 31, 2026 · 12 min read

Your prompt travels through 7 infrastructure layers before a single token comes back. A plain-language walkthrough of API gateways, tokenization, prefill, decode, post-processing, billing, and the network physics underneath.

Read article →

AI/ML Architecture LLM API Production

Context Window vs Attention Window: What AI Architects Must Understand

February 12, 2026 · 5 min read

Context size is not the same as attention behavior. A practical guide for LLM architecture, RAG design, and long-context system trade-offs.

Read article →

AI/ML Architecture RAG LLM API Best Practices

Recursive Language Models: Why Smarter Navigation Beats Bigger Memory

January 21, 2026 · 8 min read

RLMs solve the context window problem by letting AI write code to explore information. The result? Tasks going from 0% to 91% success. Here's how it works and when to use it.

Read article →

AI/ML Architecture LLM API Production

The Anatomy of a Production LLM Call

January 9, 2026 · 12 min read

Beyond the Quickstart: Authentication, Error Handling, and Cost Management

Read article →

Python LLM API OpenAI Anthropic Gemini Production

Explore

All Posts
About Me
Get in Touch

Acting AI Advisor performing architecture and design responsibilities for intelligent, enterprise-scale solutions. Writing about agentic systems, prompt engineering, and the future of AI.

Connect

LinkedIn
Email
Newsletter
RSS Feed

Site

Blog
About
Resume
Privacy