Praveen Yellamaraju
  • Home
  • Field Guide
  • Tutorials
  • Topics
    • Developer Productivity Engineering workflows, tooling, and execution systems
    • Templates Reusable checklists, docs, and operating artifacts
    • AI Playground Interactive learning labs and runnable examples
    • Subagent Evals Evaluation harnesses for multi-agent behavior
    • About Me Profile, focus areas, and projects
    • Resume Experience and background
    • Contact Start a conversation
Praveen Yellamaraju Production AI Systems Field Guide
LLM API

4 Posts

Exploring LLM API and related topics

What Happens When You Call an LLM API

March 31, 2026 · 12 min read

Your prompt travels through 7 infrastructure layers before a single token comes back. A plain-language walkthrough of API gateways, tokenization, prefill, decode, post-processing, billing, and the network physics underneath.

Read article →
AI/ML · Architecture · LLM API · Production

Context Window vs Attention Window: What AI Architects Must Understand

February 12, 2026 · 5 min read

Context size is not the same as attention behavior. A practical guide for LLM architecture, RAG design, and long-context system trade-offs.

Read article →
AI/ML · Architecture · RAG · LLM API · Best Practices

Recursive Language Models: Why Smarter Navigation Beats Bigger Memory

January 21, 2026 · 8 min read

RLMs solve the context window problem by letting AI write code to explore information. The result? Tasks going from 0% to 91% success. Here's how it works and when to use it.

Read article →
AI/ML · Architecture · LLM API · Production

The Anatomy of a Production LLM Call

January 9, 2026 · 12 min read

Beyond the Quickstart: Authentication, Error Handling, and Cost Management

Read article →
Python · LLM API · OpenAI · Anthropic · Gemini · Production

Acting AI Advisor performing architecture and design responsibilities for intelligent, enterprise-scale solutions. Writing about agentic systems, prompt engineering, and the future of AI.

© 2026 Praveen Srinag Yellamaraju. All rights reserved.