#Hugging Face

Hugging Face Explains Async Continuous Batching: Up to 25% Inference Throughput Gains

Tools May 15, 2026

Hugging Face's engineering blog details how asynchronous continuous batching eliminates CPU-GPU idle gaps that waste nearly a quarter of LLM inference runtime.

Hugging Face Adds Private Datasets to the Open ASR Leaderboard to Fight Benchmark Gaming

Research May 6, 2026

Hugging Face introduces private ASR evaluation datasets from Appen Inc. and DataoceanAI to block benchmaxxing, with scores visible via an opt-in toggle.