Tag: TPUs

Apr 6, 2026 · AI Infrastructure
AI Training Chip Performance: Real Scaling Data vs Marketing Hype (Blackwell to Hopper)
Real scaling data for AI training chips from Hopper to Blackwell: 3.2x training gains, 50x inference gains, and why memory bandwidth matters more than FLOPs.
Feb 24, 2026 · AI Infrastructure
JAX Pallas: Writing GPU Kernels for Maximum Performance
Pallas is a JAX extension for writing custom high-performance kernels for GPUs and TPUs. Write optimized kernels for matrix multiplication and memory access patterns.
- JAX
- XLA
- TPUs
- GCP
- Pallas
- Compilers