Category 'AI Infrastructure' — Page 4 — AI Infrastructure Leader | Keynote Speaker

Mar 11, 2026 · AI Infrastructure

Beyond Vibe-Checks: Trajectory Evaluation & Synthetic Adversaries

Is your agent actually reasoning, or just lucky? Discover why trajectory analysis and synthetic red-teaming are the only ways to build production-grade autonomous systems.

Feb 25, 2026 · AI Infrastructure

Stateful Agents on K8s: Redis is Your Bottleneck, Not the Vector DB

Agents are stateless. Their memory is not. Scaling the LLM reasoning loop is trivial compared to solving the transactional concurrency of agent memory on Kubernetes.

Feb 24, 2026 · AI Infrastructure

Writing Pallas Kernels for JAX: Stepping Outside the XLA Safety Net

When XLA's heuristics fail for custom attention mechanisms, you can't just hope for a compiler update. Here is how you write Triton-like kernels directly in Python using JAX Pallas.

Feb 19, 2026 · AI Infrastructure

Single-Batch Inference: Speculative Decoding on an A100

See how speculative decoding performs for single-batch requests on an NVIDIA A100. We analyze acceptance rates, latency, and the mechanics of the draft model gamble.

Feb 6, 2026 · AI Infrastructure

My Profiling Nightmare: The Warp Stall

A war story of chasing a 5ms latency spike to a single loose thread. How to read Nsight Systems and spot Warp Divergence.

Feb 3, 2026 · AI Infrastructure

JAX XLA: Why Your GPU is Idle 40% of the Time

Recompilation is the silent killer of training throughput. If you see 'Jit' in your profiler, you are losing money. We dive into XLA internals.

Strictly Necessary

Analytics