Akamai and NVIDIA Make a Big Leap in Global Edge-Based AI Inference
Table of Contents
Mobile – Akamai Technologies officially introduced Akamai Cloud Inference, a platform designed to revolutionize the way artificial intelligence (AI) is implemented and operated by extending inference capabilities from the data center to the internet edge layer. This solution promises faster,more secure and ultra-low latency AI performance globally,addressing the needs of a new generation of applications that demand instant data processing close to the user.
Different from customary approaches, Akamai Cloud Inference delivers intelligent, adaptable agentic AI directly at the edge, right where the data is generated. The platform combines the power of Akamai’s global distributed architecture with NVIDIA Blackwell AI infrastructure, creating a highly efficient computing ecosystem to accelerate real-time decision making.
Menurut Dr. Tom Leighton, CEO of Akamai, this innovation is a continuation of the company’s vision which two decades ago pioneered the Content Delivery Network (CDN) to overcome the “world wide wait.” Now, the same steps are being applied to the world of AI, where the main challenge is increasing inference capacity and performance as the use of intelligent applications expands. “With the support of NVIDIA technology, we are ready to expand our AI inference network to thousands of locations around the world,” said Leighton.
Likewise, Jensen Huang, Founder and CEO of NVIDIA, added that the inference phase is now the most intensive part of the AI process because it demands large-scale computing and real-time reasoning capabilities. “The collaboration with Akamai will bring inference capabilities closer to users wherever they are, so that generative AI can run faster, more efficiently, and be more accessible,” explained Huang.
this latest platform also combines an NVIDIA RTX PRO Server with an RTX 6000 Blackwell Server Edition GPU, NVIDIA BlueField-3 DPU, and NVIDIA AI Enterprise software. All integrated with Akamai’s global edge network covering more than 4,200 locations worldwide. This combination allows AI to be run in a distributed manner with high scalability, and utilizes the latest technology such as the BlueField-4 DPU to speed up and secure data transfer between computing layers from the center to the edge.
Akamai and NVIDIA Partner to Bring AI Inference to the Edge
Akamai and NVIDIA have announced a collaboration to deliver global edge-based AI inference, bringing the power of artificial intelligence closer to users and enabling faster, more adaptive AI experiences.This partnership leverages NVIDIA’s inference software and Akamai’s extensive edge network to accelerate generative and physical AI applications worldwide.
expanding AI Capabilities with Edge Computing
Traditionally, AI workloads have been processed in centralized data centers. This can introduce latency, impacting the responsiveness of AI-powered applications. By moving AI inference to the edge – closer to the end-user – Akamai and NVIDIA aim to overcome these limitations. this distributed approach allows for real-time processing, reduced bandwidth consumption, and enhanced privacy.
The Akamai Cloud inference Platform
The core of this initiative is the Akamai Cloud Inference platform, now operational in 20 locations globally with plans for continued expansion.This platform combines Akamai’s globally distributed edge servers with NVIDIA’s AI Enterprise software suite.Specifically, it utilizes NVIDIA’s TensorRT and Triton Inference Server to optimize and deploy AI models at scale. https://www.akamai.com/blog/cloud-computing/akamai-and-nvidia-make-a-big-leap-in-global-edge-based-ai-inference
Key features of the Akamai Cloud Inference platform include:
* Distributed AI: Processing AI tasks across a vast network of edge servers, minimizing latency and maximizing responsiveness.
* Intelligent Orchestration: An automated system that dynamically routes AI workloads to the most efficient locations – from the edge to centralized data centers – based on factors like model requirements and network conditions.This simplifies infrastructure management for developers.
* Scalability: The platform is designed to handle a growing number of AI models and inference requests, ensuring consistent performance as demand increases.
* Global Reach: With deployments in 20 locations and ongoing expansion, Akamai Cloud Inference provides a truly global footprint for AI applications.
Benefits of Edge-Based AI Inference
The collaboration between Akamai and NVIDIA offers several key benefits:
* Reduced Latency: Bringing AI processing closer to the user considerably reduces response times, crucial for applications like real-time video analytics, autonomous systems, and interactive AI assistants.
* Enhanced User Experience: Faster AI inference translates to a more seamless and responsive user experience.
* Bandwidth Optimization: Processing data at the edge reduces the amount of data that needs to be transmitted to and from centralized data centers,lowering bandwidth costs and improving network efficiency.
* Improved Privacy: Edge computing can help protect sensitive data by processing it locally, reducing the risk of data breaches during transmission.
* Scalability and Flexibility: The distributed nature of the platform allows for easy scaling to meet changing demands and supports a wide range of AI models and applications.
Applications and Future Outlook
Akamai and NVIDIA envision a wide range of applications for their edge-based AI inference platform, including:
* Generative AI: Accelerating the performance of large language models (LLMs) and other generative AI applications.
* Computer vision: Enabling real-time object detection, image recognition, and video analytics for applications like security surveillance, retail analytics, and autonomous vehicles.
* natural Language Processing (NLP): Improving the speed and accuracy of NLP tasks such as sentiment analysis, machine translation, and chatbot interactions.
* AI-Powered Security: Enhancing threat detection and response capabilities by analyzing network traffic and user behavior in real-time.
Akamai’s goal is to make generative and physical AI faster, more adaptive, and globally accessible by shifting AI processing from centralized data centers to the edge.https://www.nvidia.com/en-us/news/akamai-nvidia-ai-inference/ This partnership represents a notable step towards realizing that vision, paving the way for a new generation of AI-powered applications that are more responsive, efficient, and user-amiable.
Tags: Akamai Cloud Inference,Akamai,NVIDIA,AI,Edge Computing,AI Inference.