Amazon Launches EC2 G7 Instances with NVIDIA RTX PRO 4500 GPUs


Amazon Launches G7 Instances for Enhanced GPU Performance

Amazon Web Services (AWS) has announced the general availability of Amazon Elastic Compute Cloud (EC2) G7 instances, which provide GPU acceleration for artificial intelligence (AI) inference, graphics rendering, and data analytics workloads. With this launch, AWS becomes the first major cloud provider to support NVIDIA’s RTX PRO 4500 Blackwell Server Edition GPUs.

Key Features of G7 Instances

G7 instances bring several improvements over their predecessors, the G6 instances. Built on custom sixth-generation Intel Xeon Scalable processors and NVIDIA RTX PRO 4500 GPUs, they deliver up to 4.6 times the AI inference performance and 2.1 times the graphics performance of G6 instances. The gains apply to a range of GPU-enabled applications, including AI inference, video transcoding, and virtual desktop infrastructure (VDI).

  • Faster GPU Memory: Each NVIDIA RTX PRO 4500 GPU has 32 GB of memory, giving G7 instances 1.33 times the GPU memory capacity and 2.45 times the memory bandwidth of G6 instances. Both improvements benefit AI inference and graphics performance.
  • High-Performance Networking and Storage: G7 instances provide up to 700 Gbps of Elastic Fabric Adapter (EFA)-enabled networking throughput, seven times that of G6 instances, for the low-latency, high-bandwidth connectivity that demanding applications need. They also support up to 7.6 TB of local NVMe SSD storage, so large datasets can be kept close to the compute resources.
  • Advanced Video Encoding/Decoding: With ninth-generation NVENC and sixth-generation NVDEC engines, G7 instances handle high-resolution video workflows, including 4:2:2 encoding and decoding. They support 1.5 times as many concurrent video streams as previous generations.
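
The ratios quoted above imply a G6 baseline that can be back-calculated as a sanity check. The following is a minimal Python sketch that uses only the figures stated in this article; the derived G6 values are inferences from those ratios, not numbers quoted from AWS, and the helper name `implied_baseline` is made up for illustration.

```python
# Figures quoted in the article for G7 instances.
G7_GPU_MEMORY_GB = 32         # GPU memory per GPU
G7_EFA_GBPS = 700             # EFA-enabled networking throughput
MEMORY_CAPACITY_RATIO = 1.33  # G7 vs. G6 GPU memory capacity, as quoted
NETWORK_RATIO = 7             # G7 vs. G6 networking throughput, as quoted


def implied_baseline(g7_value: float, ratio: float) -> float:
    """Return the G6 value implied by a G7 figure and a G7/G6 ratio."""
    return g7_value / ratio


# Derived (not quoted) G6 figures.
g6_gpu_memory_gb = implied_baseline(G7_GPU_MEMORY_GB, MEMORY_CAPACITY_RATIO)
g6_efa_gbps = implied_baseline(G7_EFA_GBPS, NETWORK_RATIO)

print(f"Implied G6 GPU memory: ~{g6_gpu_memory_gb:.0f} GB per GPU")
print(f"Implied G6 EFA throughput: {g6_efa_gbps:.0f} Gbps")
```

If the quoted ratios are accurate, the implied baseline works out to roughly 24 GB of GPU memory per GPU and 100 Gbps of networking throughput for G6.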

Specifications Overview

G7 instances come in a range of sizes for different workloads, with up to eight NVIDIA RTX PRO 4500 GPUs per instance.

The following list summarizes the key specifications of each instance size:

  • g7.2xlarge: 1 GPU, 32 GB GPU memory, 8 vCPUs, 32 GiB memory, up to 600 GB storage
  • g7.4xlarge: 1 GPU, 32 GB GPU memory, 16 vCPUs, 64 GiB memory, up to 600 GB storage
  • g7.8xlarge: 1 GPU, 32 GB GPU memory, 32 vCPUs, 128 GiB memory, up to 950 GB storage
  • g7.12xlarge: 2 GPUs, 64 GB GPU memory, 48 vCPUs, 192 GiB memory, up to 1900 GB storage
  • g7.24xlarge: 4 GPUs, 128 GB GPU memory, 96 vCPUs, 384 GiB memory, up to 3800 GB storage
  • g7.48xlarge: 8 GPUs, 256 GB GPU memory, 192 vCPUs, 768 GiB memory, up to 2 x 3800 GB storage
  • g7.metal*: specifications similar to g7.48xlarge; coming soon

*Note: The g7.metal instance type will be available in the near future.
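
To make the size list easier to work with, it can be captured as data and queried. The sketch below is illustrative only: the `G7Size` type and the `smallest_fit` helper are hypothetical names, the numbers are copied from the list above, and the g7.48xlarge storage figure assumes the listing means two 3800 GB volumes (which matches the 7.6 TB maximum mentioned earlier). The g7.metal size is omitted because its exact specifications are not given.

```python
from typing import NamedTuple, Optional


class G7Size(NamedTuple):
    name: str
    gpus: int
    gpu_memory_gb: int     # total GPU memory across all GPUs
    vcpus: int
    memory_gib: int
    local_storage_gb: int  # maximum local NVMe storage


# Values copied from the list in this article, ordered smallest to largest.
G7_SIZES = [
    G7Size("g7.2xlarge", 1, 32, 8, 32, 600),
    G7Size("g7.4xlarge", 1, 32, 16, 64, 600),
    G7Size("g7.8xlarge", 1, 32, 32, 128, 950),
    G7Size("g7.12xlarge", 2, 64, 48, 192, 1900),
    G7Size("g7.24xlarge", 4, 128, 96, 384, 3800),
    G7Size("g7.48xlarge", 8, 256, 192, 768, 2 * 3800),  # assumed: two 3800 GB volumes
]


def smallest_fit(min_gpu_memory_gb: int, min_vcpus: int = 0) -> Optional[G7Size]:
    """Return the first (smallest) size meeting both minimums, or None."""
    for size in G7_SIZES:
        if size.gpu_memory_gb >= min_gpu_memory_gb and size.vcpus >= min_vcpus:
            return size
    return None


# Example: a model that needs about 48 GB of GPU memory.
choice = smallest_fit(min_gpu_memory_gb=48)
print(choice.name if choice else "no single G7 size fits")
```

A linear scan is enough here because the list is short and already ordered by size; a real selection would also weigh price and availability, which this article does not list.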

Use Cases and Getting Started

G7 instances support NVIDIA GPUDirect P2P (peer-to-peer) communication in multi-GPU configurations and run a range of operating systems, including Amazon Linux and Windows Server.

AWS offers several ways to get started. Users can launch AWS Deep Learning AMIs or NVIDIA Workstation AMIs, which come prepackaged with the drivers needed for AI workloads. For Amazon Elastic Kubernetes Service (EKS), AWS recommends building EKS AMIs with specific NVIDIA driver versions.
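
As a rough illustration of launching one of these instances programmatically, the sketch below uses the boto3 `run_instances` call. It is a minimal example under stated assumptions, not an AWS-documented procedure for G7: the AMI ID is a placeholder that must be replaced with a real Deep Learning or NVIDIA Workstation AMI ID for the chosen region, the helper names are hypothetical, and running `launch` requires boto3, valid AWS credentials, and sufficient G7 quota.

```python
def build_launch_params(ami_id: str, instance_type: str = "g7.2xlarge") -> dict:
    """Build the keyword arguments for EC2 run_instances (single instance)."""
    return {
        "ImageId": ami_id,              # placeholder; supply a real AMI ID
        "InstanceType": instance_type,
        "MinCount": 1,
        "MaxCount": 1,
    }


def launch(ami_id: str, region: str = "us-west-2") -> str:
    """Launch one instance and return its instance ID.

    Requires boto3 and AWS credentials; this call creates a billable resource.
    """
    import boto3  # imported lazily so the helper above works without boto3

    ec2 = boto3.client("ec2", region_name=region)
    response = ec2.run_instances(**build_launch_params(ami_id))
    return response["Instances"][0]["InstanceId"]


# Inspect the request without contacting AWS.
params = build_launch_params("ami-0123456789abcdef0")  # hypothetical AMI ID
print(params)
```

Separating parameter construction from the API call makes it possible to review or log the request before anything billable is created.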

AWS Regional Availability and Pricing Options

EC2 G7 instances are currently available in two AWS Regions: US East (Ohio) and US West (Oregon). Regional availability and future expansion can be tracked through the CloudFormation resources tab of the AWS Capabilities by Region page.

G7 instances can be purchased On-Demand, through Savings Plans, or as Spot Instances. Dedicated Instances are also supported for the larger sizes, g7.12xlarge through g7.48xlarge.
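
Availability in a given Region can also be checked from code. The following sketch assumes the boto3 EC2 client and its `describe_instance_type_offerings` call; the `is_offered` helper and the `G7_LAUNCH_REGIONS` mapping are hypothetical names, and the region codes are the standard ones for the two Regions named above.

```python
# Region codes for the two launch Regions named in the article.
G7_LAUNCH_REGIONS = {
    "us-east-2": "US East (Ohio)",
    "us-west-2": "US West (Oregon)",
}


def is_offered(ec2_client, instance_type: str) -> bool:
    """Return True if the client's Region offers the given instance type.

    ec2_client is expected to be a boto3 EC2 client, e.g.
    boto3.client("ec2", region_name="us-east-2").
    """
    response = ec2_client.describe_instance_type_offerings(
        LocationType="region",
        Filters=[{"Name": "instance-type", "Values": [instance_type]}],
    )
    return len(response["InstanceTypeOfferings"]) > 0
```

A typical use would loop over `G7_LAUNCH_REGIONS`, create one client per region code, and call `is_offered(client, "g7.2xlarge")`; doing so requires boto3 and AWS credentials.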

What This Means

Amazon EC2 G7 instances give AWS customers more GPU capacity for high-performance workloads such as AI inference and data analytics. By pairing NVIDIA’s Blackwell-generation GPUs with custom Intel Xeon processors, AWS is positioning itself as a leading provider for businesses that need advanced computational power.

This development not only benefits enterprises looking to optimize their workloads but also paves the way for innovations in fields such as machine learning and real-time data processing where speed and efficiency are paramount.


Neil S