DeepInfra Lands $107 Million Series B

<p><strong>PALO ALTO<&sol;strong> &&num;8212&semi; <a title&equals;"" href&equals;"https&colon;&sol;&sol;www&period;globenewswire&period;com&sol;Tracker&quest;data&equals;BqdISHG8f5qZNoyaG-NWCfkztP10VNGeP1EWBUucW&lowbar;osM3Cv16tefKBIdFdZ-Y0Kgl5mxLJWuDHd-bh3-L4WUA&equals;&equals;" target&equals;"&lowbar;blank" rel&equals;"nofollow noopener"><u>DeepInfra<&sol;u><&sol;a>&comma; a cloud platform for high-throughput AI inference&comma; has landed &dollar;107 million in Series B funding to scale its inference cloud and global capacity&period; Processing nearly five trillion tokens per week&comma; DeepInfra enables enterprises and scaleups to run open-source and agent-driven AI workloads with improved cost&comma; performance and security&period;<&sol;p>&NewLine;<p>Developed by the team behind the popular messenger app&comma; <a title&equals;"" href&equals;"https&colon;&sol;&sol;www&period;globenewswire&period;com&sol;Tracker&quest;data&equals;vfC3bYvPsxyyRqr64O167h7-KxQB-KS6D9gW9yqR6fefhH6CbbUgCOBY2FNsAO0q" target&equals;"&lowbar;blank" rel&equals;"nofollow noopener"><u>imo<&sol;u><&sol;a>&comma; which has scaled across more than 200 million users globally&comma; DeepInfra’s latest round is co-led by <a title&equals;"" href&equals;"https&colon;&sol;&sol;www&period;globenewswire&period;com&sol;Tracker&quest;data&equals;RWle43USoK58Q2Tit78Qtszk6DefsgRapX2paFWqSGTaE-xeR1b5-nJWSiOQ5IyC79kw6PWF5EhYfm9MtkEu9g&equals;&equals;" target&equals;"&lowbar;blank" rel&equals;"nofollow noopener"><u>500 Global<&sol;u><&sol;a> and <a title&equals;"" href&equals;"https&colon;&sol;&sol;www&period;globenewswire&period;com&sol;Tracker&quest;data&equals;HIm16AkFGfd9Lo8cJx0EafIa8XwGThHcZDs4k9xK1CP8JXqw6qwaiRZEzDWmP0jXzJHJ0tMdZYqp&lowbar;95vqxKVh6FGoKAnzk6898xm7OcdGqgN7XhA5yMUheZE3koC5KGZ" target&equals;"&lowbar;blank" rel&equals;"nofollow noopener"><u>Georges Harik<&sol;u><&sol;a>&comma; one of Google’s earliest engineers&comma; with participation from A&period;Capital Ventures&comma; Crescent Cove&comma; Felicis&comma; NVIDIA&comma; Peak6&comma; Samsung Next&comma; Supermicro and Upper90&period;<&sol;p>&NewLine;<p>&OpenCurlyDoubleQuote;When we launched nearly four years ago&comma; we believed inference would become the dominant driver of enterprise AI workloads – and we are now at this inflection point&comma;” said <a title&equals;"" href&equals;"https&colon;&sol;&sol;www&period;globenewswire&period;com&sol;Tracker&quest;data&equals;ZS5ioPFYNh9bmt-MYNnmK23Os8Ne2Y2S15rqmBP9lUDJKBAxJLRgj1MIyxhhNROZ8uRCKX-B766qQt6oIBIwjKjlsLXxmqCIqJMhEtTPWZr-BU0H5Z2hOSJQUr9Ol6qj" target&equals;"&lowbar;blank" rel&equals;"nofollow noopener"><u>Nikola Borisov<&sol;u><&sol;a>&comma; co-founder and CEO&comma; DeepInfra&period; &OpenCurlyDoubleQuote;What’s happening now is incredibly exciting – open-source models are rapidly reaching parity with proprietary systems&comma; unlocking a new wave of innovation at a fraction of the cost and enabling widespread adoption&period; At the same time&comma; agent-based systems are driving continuous&comma; high-volume demand&period; Inference is no longer a thin layer – it’s the system constraint that will define the majority of workloads&period; Most cloud platforms weren’t built for this always-on&comma; distributed model&comma; so we built DeepInfra from the ground up to deliver better economics&comma; performance&comma; and security&period;”<&sol;p>&NewLine;<p>The investment reflects 500 Global’s portfolio thesis across the AI stack&period; The firm’s conviction is that infrastructure will be as defining a category as the models themselves&period;<&sol;p>&NewLine;<p>&&num;8220&semi;Demand for AI is causing every layer of the AI stack to innovate&comma; and inference is no exception&period; In the agentic age&comma; new workflows are arising on a rapid basis&comma; as evidenced recently by OpenClaw and AutoResearch&period; Enterprises and developers building with open source and agent-driven AI need infrastructure that was designed to be flexible&comma; fast and reliable&period; We backed DeepInfra because&comma; in our assessment&comma; this team has already proven they can build and operate distributed systems at global scale&comma; and because we believe purpose-built inference infrastructure will be fundamental to the next phase of AI as compute was to the last&comma;&&num;8221&semi; said Tony Wang&comma; Managing Partner&comma; 500 Global&period;<&sol;p>&NewLine;<p>DeepInfra is an early infrastructure collaborator in NVIDIA’s open AI ecosystem&comma; supporting <a title&equals;"" href&equals;"https&colon;&sol;&sol;www&period;globenewswire&period;com&sol;Tracker&quest;data&equals;XAPLgWa59uWtDMfq7bJW4DIeamyQTp3gMoGQZ0JxeYuLUY7u96XTyfFosPGAAGnqzANvsPtEMOybzqBaTKraD0a1F-BXG1yhn22GEak1WKUaoc&lowbar;XTv&lowbar;hVfMmvJzO&lowbar;iuvMlwHjFSNIoNoPtyJAP12lg&equals;&equals;" target&equals;"&lowbar;blank" rel&equals;"nofollow noopener"><u>Nemotron models<&sol;u><&sol;a>&comma; <a title&equals;"" href&equals;"https&colon;&sol;&sol;www&period;globenewswire&period;com&sol;Tracker&quest;data&equals;M3r0S4THglxb&lowbar;ruvZGLjmvb2l79Mzr24iUtZ4ZA0IgibE&lowbar;7&lowbar;xbK&lowbar;EZ8nZ4oK4IUmA&lowbar;zUQ3U02r2BHyMdAujqdpc&lowbar;3skI5KE49YAYbIpQXAGqDv4xA49V7yZsbzS1R&lowbar;wJ" target&equals;"&lowbar;blank" rel&equals;"nofollow noopener"><u>NemoClaw agent framework<&sol;u><&sol;a>&comma; and the <a title&equals;"" href&equals;"https&colon;&sol;&sol;www&period;globenewswire&period;com&sol;Tracker&quest;data&equals;HVY3RiwWFHqvR6VwCvx4tp6xGiOe2cCSm2zo4rmxFrEvt6BIY0u3bWD9p97RRUKnTR9d-1p1F9GUhVpIWFvcDSpQEz4SlCbZT0sbPJQVXpC7qf16DiYacj1VvTv2CPIwIcb2OMN7fRHO7Q6sAty3mA&equals;&equals;" target&equals;"&lowbar;blank" rel&equals;"nofollow noopener"><u>NVIDIA Dynamo inference software<&sol;u><&sol;a>&period; As NVIDIA advances the Nemotron family of open-source models and agent-based systems like OpenClaw drive increased inference demand&comma; DeepInfra is one of the vendors providing the infrastructure layer for these systems in production&period; The platform supports more than 190 open-source models through OpenAI-compatible APIs and offers a fully managed&comma; enterprise-ready environment with built-in security&comma; including zero data retention and SOC 2 and ISO 27001 certification&period;<&sol;p>&NewLine;<p>&&num;8220&semi;DeepInfra gives us access to best-in-class models with the reliability and speed we need to ship&period; The performance speaks for itself&period; They help us keep up with the pace of innovation in this space&comma;” said Jesse Proudman&comma; president and CTO&comma; <a title&equals;"" href&equals;"https&colon;&sol;&sol;www&period;globenewswire&period;com&sol;Tracker&quest;data&equals;3adCIEEybI2-b6mF4&lowbar;PcMXck7wJxDYqLZhU0AvIqKHViBZTx27rlpzjfl854WxYis7F-MYrexCordPj4SRnEoA&equals;&equals;" target&equals;"&lowbar;blank" rel&equals;"nofollow noopener"><u>Venice AI<&sol;u><&sol;a>&period;<&sol;p>&NewLine;

Editor

Google Unveils Gemini Intelligence for Android

Google is introducing Gemini Intelligence on Android, which brings the best of Gemini to Android's…

53 minutes

Kleiner Perkins Leads $400 Million Round in Mind Robotics

PALO ALTO -- Mind Robotics has scored a $400 million financing led by Kleiner Perkins,…

1 day

Luxury Home Prices in Bay Area Rise Due to ChatGPT

Zip codes in the San Francisco Bay Area known for luxury homes saw a 13.4%…

1 day

Airbnb Activates Anti-Party System for Memorial Day

Ahead of Memorial Day this month, Airbnb says it is activating its anti-party technology for…

2 days

KLA to Split Shares 10-for-1 on June 4

MILPITAS -- KLA Corporation announced that its board of directors approved a Ten‑for‑One forward stock…

2 days

A16z Leads $60 Million Funding in Tessera Labs

SAN JOSE -- Tessera Labs has announced $60 million in oversubscribed funding led by Andreessen…

2 days