Posts by tag 'architecture' — Page 2 — AI Infrastructure Leader | Keynote Speaker

Apr 15, 2026 · AI Infrastructure

Rack-Scale AI Design: The End of Component Scaling

We have hit the physical limits of what a single chip can do. The new unit of compute for AI infrastructure isn't the GPU; it's the fully integrated rack.

Apr 14, 2026 · AI Infrastructure

TTFT (Time To First Token): Measuring Inference Correctly

TTFT reveals the real bottleneck in LLM inference. Learn why Time To First Token matters more than average latency, and how to separate prefill vs decode.

Apr 10, 2026 · Agentic AI

The Infinite Board Problem: Pruning State in Long-Running Reasoning Loops

How to manage the shared state size in complex reasoning loops to prevent context window overflow without losing critical history.

Mar 13, 2026 · AI Engineering

ADK vs. LangChain: The Protocol-First Shift

Class-based chains are a legacy pattern. Discover why Google ADK and its open Agent Protocol are the future of interoperable, production-grade multi-agent systems.

Feb 25, 2026 · AI Infrastructure

Stateful Agents on K8s: Redis is Your Bottleneck, Not the Vector DB

Agents are stateless. Their memory is not. Scaling the LLM reasoning loop is trivial compared to solving the transactional concurrency of agent memory on Kubernetes.

Feb 21, 2026 · Agentic AI

A2A Architectures: Tools are not just Functions (The Two-Phase Commit)

Why Agent-to-Agent (A2A) interactions and Side Effects require a 'Two-Phase Commit' for safety.

Search

Tag: architecture