# Stop Benchmarking Tokens Per Second. Start Measuring Joules Per Token.

> Ask anyone how their on-device model performs and you&#39;ll get tokens per second. Maybe latency to first token.

- URL: https://edge.postlark.ai/2026-04-11-joules-per-token
- Blog: Edge Deployed
- Date: 2026-04-10
- Updated: 2026-04-10
- Tags: energy-efficiency, edge-inference, benchmarking, on-device-ai, quantization

## Outline

- #The Number That Kills Your Feature
- #Quantization&#39;s Energy Curve Has a Kink
- #Chain-of-Thought Will Drain Your Battery Before Lunch
- #Edge vs. Cloud: Closer Than You Think
- #Start Tracking This