#llm_10

10 entries

2026-03-19 ai agents react

Agent Patterns: ReAct, Reflection & Planning — From One LLM Call to a Production Loop

ReAct, Reflection, and Planning for LLM agents — when to use each, guardrails against runaway loops, and links to tool use and orchestration.

2026-03-11 ai agents function-calling

Function Calling & Tool Use — JSON Schema, the Agent Loop, Parallel Calls & Security

How LLM function calling bridges models to the world: JSON Schema tools, the request→execute→result loop, parallel calls, validation, MCP, and security.

2026-03-03 ai llm models

Choosing an LLM for Agents: A Durable Framework Beyond Leaderboards

A senior engineer's framework for model selection — capability tiers, context, modality, cost, privacy, tool use — plus routing, cascades, and why benchmarks lie.

2026-02-23 ai llm agents

Evaluating LLMs & Agents: Golden Sets, Metrics, LLM-as-Judge, and Regression in CI

Why eval is the hardest part of shipping agents — golden datasets, offline vs online metrics, LLM-as-judge rubrics, human agreement, and regression in CI.

2026-02-15 ai llm fine-tuning

Fine-tuning vs Prompting vs RAG: A Decision Framework for Adapting LLMs

When to prompt, retrieve, or fine-tune: knowledge vs behavior, data needs, cost, privacy, SFT/LoRA/DPO — and why most teams start with prompt + RAG.

2026-02-07 ai agents context

Context Engineering & Agent Memory — Packing the Window Without Losing the Thread

How senior engineers pack system prompts, tools, history, RAG, and output reserve into a fixed context window — and manage memory when the budget breaks.

2026-01-30 ai llm agents

Stopping Criteria & Output Control — When Generation Ends and What to Do About It

EOS tokens, max_tokens, stop sequences, and finish_reason handling for production LLM agents — streaming, truncation, and runaway cost guards.

2026-01-22 ai agents prompt-engineering

Prompt Engineering for Agents — Messages, Personas, Few-Shot & Structured Output

Agent prompt design: messages/roles, personas, few-shot trade-offs, CoT vs reasoning models, JSON schemas, templates, injection guards, iteration.

2026-01-14 ai llm sampling

Sampling for Agents: Temperature, Top-p, Top-k — When Randomness Helps or Hurts

How LLMs turn logits into tokens — temperature, top_p, top_k, penalties, seeds — and why agent builders tune sampling differently for tool calls vs brainstorming.

2026-01-06 ai agents tokens

Tokens & Context Windows — The Hard Budget Every AI Agent Must Respect

Tokens are the atomic unit of LLM memory and cost. Learn subword tokenization, context window math, and agent budgeting before you ship.