Amazon Launches G7 Instances for Enhanced GPU Performance
Amazon Web Services (AWS) has announced the general availability of its latest Amazon Elastic Compute Cloud (EC2) G7 instances, which are designed to provide high-performance GPU acceleration for artificial intelligence (AI) inference, graphics rendering, and data analytics workloads. This launch marks a significant advancement in cloud computing capabilities, as AWS becomes the first major cloud provider to support NVIDIA’s RTX PRO 4500 Blackwell Server Edition GPUs.
Key Features of G7 Instances
The introduction of G7 instances brings several enhancements over their predecessors, the G6 instances. With custom sixth-generation Intel Xeon Scalable processors and NVIDIA’s state-of-the-art GPUs, G7 instances deliver up to 4.6 times the AI inference performance and 2.1 times the graphics performance compared to G6 instances. This leap in performance is particularly beneficial for a variety of GPU-enabled applications including AI inference, video transcoding, and virtual desktop infrastructure (VDI).
- Faster GPU Memory: The NVIDIA RTX PRO 4500 GPUs feature 32 GB of memory per GPU, offering 1.33 times the memory capacity and 2.45 times the memory bandwidth compared to G6 instances. This upgrade is crucial for enhancing both AI inference and graphics performance.
- High-Performance Networking and Storage: G7 instances provide an impressive 700 Gbps of Elastic Fabric Adapter (EFA)-enabled networking throughput—seven times that of G6 instances. This capability ensures low-latency, high-bandwidth connectivity essential for demanding applications. Additionally, they support up to 7.6 TB of local NVMe SSD storage, allowing users to keep large datasets close to compute resources.
- Advanced Video Encoding/Decoding: Equipped with ninth-generation NVENC and sixth-generation NVDEC engines, G7 instances can handle high-resolution video workflows with support for 4:2:2 encoding and decoding. They deliver 1.5 times the number of concurrent video streams compared to previous generations.
Specifications Overview
The specifications for the EC2 G7 instances are robust, catering to a wide range of computational needs. Each instance can be configured with up to eight NVIDIA RTX PRO 4500 GPUs and offers various sizes tailored for different workloads.
The following table summarizes key specifications across different instance types:
- g7.2xlarge: 1 GPU, 32 GB GPU memory, 8 vCPUs, 32 GiB memory, up to 600 GB storage
- g7.4xlarge: 1 GPU, 32 GB GPU memory, 16 vCPUs, 64 GiB memory, up to 600 GB storage
- g7.8xlarge: 1 GPU, 32 GB GPU memory, 32 vCPUs, 128 GiB memory, up to 950 GB storage
- g7.12xlarge: 2 GPUs, 64 GB GPU memory, 48 vCPUs, 192 GiB memory, up to 1900 GB storage
- g7.24xlarge: 4 GPUs, 128 GB GPU memory, 96 vCPUs, 384 GiB memory, up to 3800 GB storage
- g7.48xlarge: 8 GPUs, 256 GB GPU memory, 192 vCPUs, 768 GiB memory, up to two x3800 GB storage
- g7.metal:* Similar specs as g7.48xlarge but coming soon
*Note: The g7.metal instance type will be available in the near future.
Use Cases and Getting Started
The versatility of G7 instances makes them suitable for a wide range of applications beyond traditional computing tasks. They support NVIDIA GPUDirect P2P (peer-to-peer) communication for multi-GPU configurations and are compatible with various operating systems including Amazon Linux and Windows Server.
AWS provides several options for users looking to leverage these new instances effectively. Users can utilize AWS Deep Learning AMIs or NVIDIA Workstation AMIs that come prepackaged with necessary drivers for AI workloads. For those using Amazon Elastic Kubernetes Service (EKS), building EKS AMIs with specific NVIDIA driver versions is recommended.
AWS Regional Availability and Pricing Options
The EC2 G7 instances are currently available in two AWS regions: US East (Ohio) and US West (Oregon). Future regional expansion plans can be tracked through AWS’s CloudFormation resources tab on their website.
AWS offers multiple purchasing options for these instances including On-Demand pricing models as well as Savings Plans and Spot Instances. Dedicated Instances are also supported for larger instance sizes like g7.12xlarge through g7.48xlarge.
What This Means
The launch of Amazon EC2 G7 instances signifies a substantial enhancement in cloud computing capabilities specifically tailored for high-performance tasks such as AI inference and data analytics. By leveraging cutting-edge NVIDIA technology alongside powerful Intel processors, AWS is positioning itself as a leader in providing robust solutions for businesses requiring advanced computational power.
This development not only benefits enterprises looking to optimize their workloads but also paves the way for innovations in fields such as machine learning and real-time data processing where speed and efficiency are paramount.
For more information, read the original report here.


































