Elastic GPU Service

Powerful parallel computing capabilities based on GPU technology.

Elastic GPU Service (EGS) is a GPU-based computing service ideal for scenarios such as deep learning, video processing, scientific computing, and visualization. EGS solutions use the following GPUs: AMD FirePro S7150, NVIDIA Tesla M40, NVIDIA Tesla P100, NVIDIA Tesla P4, and NVIDIA Tesla V100.


Choose How You Pay

Elastic GPU Service provides different purchasing methods. Based on your needs, you can select Pay-As-You-Go or Subscription. The following prices are indicative. You can find the exact prices on the product purchase order page.
  • Pay-As-You-Go

    Bills you for the exact amount of resources you use. Activate or release resources at any time with no hardware or maintenance costs.

  • Subscription

    Subscription fees are lower on average than Pay-As-You-Go fees, which makes this option suited to long-term resource needs at reduced cost.
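
The choice between the two purchasing methods comes down to how many hours you expect to run an instance per billing period. The following is a minimal sketch of that comparison, assuming hourly Pay-As-You-Go billing and a flat monthly Subscription fee. The prices and the function names are hypothetical placeholders, not actual EGS prices; use the figures from the product purchase order page.

```python
def break_even_hours(subscription_fee: float, hourly_rate: float) -> float:
    """Hours of use per billing period at which both methods cost the same."""
    if hourly_rate <= 0:
        raise ValueError("hourly_rate must be positive")
    return subscription_fee / hourly_rate


def cheaper_option(hours_used: float, subscription_fee: float, hourly_rate: float) -> str:
    """Return the purchasing method with the lower cost for the given usage."""
    pay_as_you_go_cost = hours_used * hourly_rate
    if subscription_fee < pay_as_you_go_cost:
        return "Subscription"
    return "Pay-As-You-Go"


# Hypothetical prices for illustration only (not real EGS prices).
MONTHLY_FEE = 600.0   # flat Subscription fee per month
HOURLY_RATE = 1.5     # Pay-As-You-Go price per instance-hour

print(break_even_hours(MONTHLY_FEE, HOURLY_RATE))     # 400.0
print(cheaper_option(100, MONTHLY_FEE, HOURLY_RATE))  # Pay-As-You-Go
print(cheaper_option(500, MONTHLY_FEE, HOURLY_RATE))  # Subscription
```

With these placeholder prices, an instance that runs more than 400 hours in a month costs less under Subscription.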

Benefits

Deep Learning
Online deep learning training and inference services, image recognition, content identification, and voice recognition
Video Processing
HD media encoding, 4K/8K HD live streaming, video conferencing, and source film restoration
Scientific Computing
Video rendering, collision simulation, computational finance, genetic engineering, and climate prediction
Visualization
Engineering design, non-linear editing, and distance education applications

Features

Common Scenarios

Online rendering in the cloud

Online rendering using Cloud Desktop

You can quickly access a GA1 instance using Cloud Desktop for a richer visual and interactive rendering experience. You can also use the Remote Desktop Protocol (RDP) for real-time online rendering and graphics editing. With RDP, you can access a GA1 instance from anywhere and do rendering and graphics editing work on multiple types of devices. Data is stored in Network Attached Storage (NAS) or Alibaba Cloud Object Storage Service (OSS). You can pull data from your internal network at any time, which ensures data security. In workplaces, Express Connect and NAT Gateway can be used to improve the network experience and reduce costs.

GA1 instances currently support only Windows Server 2008 R2 (64-bit), Windows 7 (64-bit), CentOS 7.3 (64-bit), and Ubuntu 16.04 (64-bit). Support for Windows Server 2016 and Windows 10 is coming soon.

Benefits

Visualized instances

With the powerful computing performance of GA1 instances, you can complete online editing from anywhere.

Service integrations

GA1 instances can be integrated with services such as Express Connect, NAT Gateway, OSS, and NAS.

Integrations and Configurations

Excellent acceleration capability suitable for general-purpose GPU computation

Acceleration engine provided for deep learning

A GN4 instance is based on the NVIDIA Tesla M40 GPU (Maxwell architecture) and provides up to 14 TFLOPS of single-precision floating-point performance. This delivers the large-scale parallel floating-point performance required in deep learning and other general-purpose GPU computation scenarios. GN4 instances can be seamlessly integrated into an elastic computing ecosystem to provide solutions that are ideal for either online or offline computation scenarios. Additionally, integrating Container Service into your workflow can help simplify deployment and O&M, and provide resource scheduling services.

Benefits

Elastic expansion

GN4 instances can interwork with Auto Scaling and Server Load Balancer to achieve elastic expansion.

Fast deployment

Using Container Service can speed up service deployment, O&M, and resource scheduling.

Integrations and Configurations

Outstanding floating-point computation acceleration capability

Outstanding computation acceleration performance

A GN5 instance is based on the NVIDIA Tesla P100 GPU and provides up to 74.4 TFLOPS of single-precision floating-point performance. This delivers the large-scale parallel floating-point performance required in deep learning and other general-purpose GPU computation scenarios. A GN5 instance also provides up to 37.6 TFLOPS of double-precision floating-point performance for scenarios that need it, such as scientific computing. GN5 instances support GPUDirect P2P technology, which lets GPUs communicate with each other directly over the PCI Express (PCIe) bus and greatly reduces inter-GPU communication latency. GN5 instances can be seamlessly integrated into an elastic computing ecosystem to provide solutions that are ideal for either online or offline computation scenarios.

Additionally, making full use of Container Service can help simplify deployment and O&M, and provide resource scheduling services. The Image Market provides a GN5 instance image that is equipped with an NVIDIA GPU driver and a deep learning framework, which simplifies deployment.
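
To put the peak throughput figures quoted for GN5 in perspective, the sketch below converts them into a best-case lower bound on compute time for a job of known size. The workload size and the function name are hypothetical, and real jobs rarely sustain peak throughput, so treat the result as a floor rather than a prediction.

```python
def min_compute_seconds(total_flop: float, peak_tflops: float) -> float:
    """Best-case seconds to execute total_flop operations at peak throughput."""
    if peak_tflops <= 0:
        raise ValueError("peak_tflops must be positive")
    return total_flop / (peak_tflops * 1e12)


# Peak figures quoted for a GN5 instance.
GN5_PEAK_FP32_TFLOPS = 74.4
GN5_PEAK_FP64_TFLOPS = 37.6

# Hypothetical workload: 10^18 floating-point operations.
workload_flop = 1e18

fp32_hours = min_compute_seconds(workload_flop, GN5_PEAK_FP32_TFLOPS) / 3600
fp64_hours = min_compute_seconds(workload_flop, GN5_PEAK_FP64_TFLOPS) / 3600
print(f"single precision: at least {fp32_hours:.1f} h")
print(f"double precision: at least {fp64_hours:.1f} h")
```

For this placeholder workload, the single-precision floor is roughly 3.7 hours, and the double-precision floor is about twice that because the quoted double-precision peak is about half the single-precision peak.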

Benefits

Elastic expansion

GN5 instances can interwork with Auto Scaling and Server Load Balancer to achieve elastic expansion.

Fast deployment

Using Container Service can speed up service deployment, O&M, and resource scheduling.

Integrations and Configurations

Extraordinary deep learning inference capabilities

Optimal deep learning inference capabilities

A GN5i instance is based on the NVIDIA Tesla P4 GPU and provides up to 11 TFLOPS of single-precision floating-point performance and 44 TOPS of INT8 computing capability, which makes it ideal for deep learning scenarios, especially inference. A single GPU consumes only 75 W of power while maintaining high-performance output. GN5i instances can be seamlessly integrated into an elastic computing ecosystem to provide solutions that are ideal for either online or offline computation scenarios. Additionally, making full use of Container Service can help simplify deployment and O&M, and provide resource scheduling services. The Image Market provides a GN5i instance image that is equipped with an NVIDIA GPU driver and a deep learning framework, which simplifies deployment.

Benefits

Elastic expansion

GN5i instances can interwork with Auto Scaling and Server Load Balancer to achieve elastic expansion.

Fast deployment

Using Container Service can speed up service deployment, O&M, and resource scheduling.

Integrations and Configurations

Multi-region High-speed Interconnection

Widely spread services and high-speed data interconnection

Cloud services can be built entirely on VPC, with users spread across all regions. To speed up user access, the networks of the service systems in different nodes must be interconnected at high speed.

Advantages

Secure Isolation

Services are deployed on Alibaba Cloud VPC, which is secure and reliable.

High Reliability

Express Connect is used to connect different VPC instances, ensuring the quality of cross-region interconnection.

High Bandwidth

VPC with Express Connect provides a maximum interconnection bandwidth of 10 Gbit/s, easily meeting the needs of massive applications.
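
As a rough illustration of what the 10 Gbit/s maximum means in practice, the sketch below estimates the ideal time to move a dataset between regions. It assumes the link is fully available to a single transfer and ignores protocol overhead, so actual transfers take longer. The dataset size and the function name are hypothetical.

```python
def transfer_seconds(size_bytes: float, bandwidth_gbit_per_s: float = 10.0) -> float:
    """Ideal transfer time in seconds at the given link bandwidth."""
    if bandwidth_gbit_per_s <= 0:
        raise ValueError("bandwidth_gbit_per_s must be positive")
    size_bits = size_bytes * 8
    return size_bits / (bandwidth_gbit_per_s * 1e9)


# Hypothetical dataset: 1 TB (10^12 bytes) over the 10 Gbit/s maximum.
print(transfer_seconds(1e12))  # 800.0
```

At the quoted maximum, a 1 TB dataset needs at least 800 seconds, or a little over 13 minutes.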

Upgraded Support For You

1 on 1 Presale Consultation, 24/7 Technical Support, Faster Response, and More Tickets.

1 on 1 Presale Consultation

Consulting by experienced cloud experts.

24/7 Technical Support

Service hours extended from 10 hours a day, 5 days a week, to 24/7.

6 Free Tickets per Quarter

The number of free tickets doubled from 3 to 6 per quarter.

Faster Response

After-sale response time shortened from 36 hours to 18 hours.