Microsoft used Nvidia's GTC conference this week to roll out a series of enterprise AI announcements spanning agent infrastructure, real-time voice interactions and next-generation GPU deployments.
FriendliAI — founded by the researcher behind continuous batching, the technique at the core of vLLM — is launching ...
The Azure Kubernetes Service (AKS) team at Microsoft has shared guidance for running Anyscale's managed Ray service at scale.