In a Deploy 2026 session, DigitalOcean and NVIDIA highlighted the importance of open-source AI models and infrastructure for building agentic systems. They empha...
DigitalOcean unveiled a prefix-aware routing and caching mechanism to eliminate redundant LLM inference costs, targeting the 'prefill tax' where identical syste...
DigitalOcean launched its Inference Router in Public Preview, enabling dynamic model routing for AI coding agents like OpenCode to optimize cost, latency, and q...
Anthropic released Claude Opus 4.8, improving benchmarks and agentic task reliability while adding user-controlled effort levels, dynamic workflows for large-sc...
DigitalOcean introduced Batch Inference on its AI-Native Cloud, enabling high-volume asynchronous AI workloads at up to 50% lower cost than real-time inference....
DigitalOcean’s AI-Native Cloud, powered by NVIDIA HGX B300 GPUs, enabled Hippocratic AI’s Polaris system to scale to 10 million patient calls with a 99.9% clini...
DigitalOcean made request-based autoscaling generally available on App Platform, enabling apps to scale automatically based on live HTTP traffic signals lik...
DigitalOcean introduced Inference Router, an infrastructure-level tool that automatically routes LLM requests to the best-fit model based on task requirements, ...
DigitalOcean introduced a unified AI inference platform featuring serverless inference with 50+ models, dedicated GPU options, and an Intelligent Router that dy...