SUNNYVALE — Cerebras Systems, a pioneer in accelerating generative AI, announced record-breaking performance for DeepSeek-R1-Distill-Llama-70B inference, achieving more than 1,500 tokens per second – 57 times faster than GPU-based solutions. This unprecedented speed enables instant reasoning capabilities for one of the industry’s most sophisticated open-weight models, running entirely on U.S.-based AI infrastructure with zero data retention. […]