Content/digitalocean.com
Open by Design: How NVIDIA and DigitalOcean Are Building the Stack for the Always-On Agentic Era
DigitalOcean and NVIDIA highlighted the importance of open-source AI models and infrastructure for building agentic systems in a Deploy 2026 session. They emphasized model flexibility, evaluation stan…
Launch/digitalocean.com
DigitalOcean Serverless Inference: A Deep Dive
DigitalOcean introduced Serverless Inference, a fully managed API-first platform supporting 30+ foundation models across text, code, vision, image, video, and speech. It offers single-endpoint access,…
Feature/digitalocean.com
The Inference Tax: How Prefix-Aware Routing Eliminates the Hidden Cost of LLMs at Scale
DigitalOcean unveiled a prefix-aware routing and caching mechanism to eliminate redundant LLM inference costs, targeting the 'prefill tax' where identical system prompts and shared contexts are recomp…