Top mises à jour de 2026
Open by Design: How NVIDIA and DigitalOcean Are Building the Stack for the Always-On Agentic Era
DigitalOcean and NVIDIA highlighted the importance of open-source AI models and infrastructure for building agentic systems in a Deploy 2026 session. They emphasized model flexibility, evaluation stan…
DigitalOcean Serverless Inference: A Deep Dive
DigitalOcean introduced Serverless Inference, a fully managed API-first platform supporting 30+ foundation models across text, code, vision, image, video, and speech. It offers single-endpoint access,…
The Inference Tax: How Prefix-Aware Routing Eliminates the Hidden Cost of LLMs at Scale
DigitalOcean unveiled a prefix-aware routing and caching mechanism to eliminate redundant LLM inference costs, targeting the 'prefill tax' where identical system prompts and shared contexts are recomp…
OpenCode Now Supports DigitalOcean Inference Router for Intelligent Model Routing
DigitalOcean launched its Inference Router in Public Preview, enabling dynamic model routing for AI coding agents like OpenCode to optimize cost, latency, and quality. The integration allows OpenCode …
Introducing Claude Opus 4.8
Anthropic released Claude Opus 4.8, improving benchmarks and agentic task reliability while adding user-controlled effort levels, dynamic workflows for large-scale coding, and a 2.5× faster fast mode …
Scalable, Cost-Efficient AI: Introducing Unified Batch Inference on DigitalOcean
DigitalOcean introduced Batch Inference on its AI-Native Cloud, enabling high-volume asynchronous AI workloads at up to 50% lower cost than real-time inference. The feature supports OpenAI and Anthrop…
News Details
DigitalOcean’s AI-Native Cloud, powered by NVIDIA HGX B300 GPUs, enabled Hippocratic AI’s Polaris system to scale to 10 million patient calls with a 99.9% clinical safety score. The collaboration deli…
Request-Based Autoscaling Is Now Generally Available on App Platform
DigitalOcean launched generally available request-based autoscaling on App Platform, enabling apps to scale automatically based on live HTTP traffic signals like requests per second and P95 latency. P…
How We Built DigitalOcean Inference Router
DigitalOcean introduced Inference Router, an infrastructure-level tool that automatically routes LLM requests to the best-fit model based on task requirements, optimizing for cost, latency, or quality…
Your Model Doesn't Matter. Your Infrastructure Does.
DigitalOcean introduced a unified AI inference platform featuring serverless inference with 50+ models, dedicated GPU options, and an Intelligent Router that dynamically selects models based on cost, …
United Republic of Tanzania Tax Information
DigitalOcean now charges 18% VAT to Tanzanian customers and requires 15% withholding tax for certain digital service payments to non-residents, effective since July 2023. Tanzanian businesses must wit…
What's New on DigitalOcean's Inference Engine
DigitalOcean added several new AI models to its Inference Engine, including Kimi K2.6, DeepSeek-V4-Pro, GPT-5.5, GPT Image 2.0, and Claude Opus 4.7. These models enable autonomous workflows, long-cont…
20 premières affichées. Cliquez un mois ci-dessus pour l'archive complète de Digitalocean.
Suivez Digitalocean en pilote automatique
- · Brief IA hebdomadaire — résumé narratif de ce qui a été livré, chaque lundi 9 h
- · Alertes par e-mail ou Slack, ou discutez avec l'archive depuis votre tableau de bord
- · Ajoutez Digitalocean + jusqu'à 4 autres concurrents gratuitement, sans carte bancaire