ml.inf2.8xlarge
Amazon SageMaker AI instance ml.inf2.8xlarge with 16 vCPUs, 128 GiB RAM. Available in 14 regions starting from $1722.80 per month. 
| Instance Type | ml.inf2.8xlarge | 
|---|---|
| Instance Family | Accelerated Computing Instances | 
| Details | Machine Learning (ML) inference ml – SageMaker ML optimized inf – AWS Inferentia 2 – Generation 8xlarge – Size | 
| vCPUs | 16 | 
| Memory | 128 GiB | 
| CPU Architecture | x86_64 | 
| GPU | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Has GPU | no | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Instances | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Sizes | ml.inf2.xlarge, ml.inf2.8xlarge, ml.inf2.24xlarge, ml.inf2.48xlarge | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| EBS (Elastic Block Store) | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| EBS Optimized | default | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| EBS Baseline IOPS | 40000 IOPS | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| EBS Max IOPS | 40000 IOPS | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| EBS Baseline Bandwidth | 10000 Mbps | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| EBS Max Bandwidth | 10000 Mbps | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Instance Storage | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Instance Storage | no | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Networking | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Max Network Interfaces | 8 | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| IPv6 Support | yes | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| ENA Support | required | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Regional Availability
US
GovCloud (US-East) / us-gov-east-1
GovCloud (US-West) / us-gov-west-1
US East (Ohio) / us-east-2
US East (Virginia) / us-east-1
US West (N. California) / us-west-1
US West (Oregon) / us-west-2
GovCloud (US-East) / us-gov-east-1
GovCloud (US-West) / us-gov-west-1
US East (Ohio) / us-east-2
US East (Virginia) / us-east-1
US West (N. California) / us-west-1
US West (Oregon) / us-west-2
Central America
Mexico (Central) / mx-central-1
Mexico (Central) / mx-central-1
Canada
Canada (Central) / ca-central-1
Canada West (Calgary) / ca-west-1
Canada (Central) / ca-central-1
Canada West (Calgary) / ca-west-1
Europe
Europe (Ireland) / eu-west-1
Europe (London) / eu-west-2
Europe (Frankfurt) / eu-central-1
Europe (Milan) / eu-south-1
Europe (Paris) / eu-west-3
Europe (Spain) / eu-south-2
Europe (Stockholm) / eu-north-1
Europe (Zurich) / eu-central-2
Europe (Ireland) / eu-west-1
Europe (London) / eu-west-2
Europe (Frankfurt) / eu-central-1
Europe (Milan) / eu-south-1
Europe (Paris) / eu-west-3
Europe (Spain) / eu-south-2
Europe (Stockholm) / eu-north-1
Europe (Zurich) / eu-central-2
Africa
Africa (Cape Town) / af-south-1
Africa (Cape Town) / af-south-1
South America
South America (São Paulo) / sa-east-1
South America (São Paulo) / sa-east-1
Asia Pacific
Asia Pacific (Hong Kong) / ap-east-1
Asia Pacific (Hyderabad) / ap-south-2
Asia Pacific (Jakarta) / ap-southeast-3
Asia Pacific (Malaysia) / ap-southeast-5
Asia Pacific (Melbourne) / ap-southeast-4
Asia Pacific (Mumbai) / ap-south-1
Asia Pacific (New Zealand) / ap-southeast-6
Asia Pacific (Osaka) / ap-northeast-3
Asia Pacific (Seoul) / ap-northeast-2
Asia Pacific (Singapore) / ap-southeast-1
Asia Pacific (Sydney) / ap-southeast-2
Asia Pacific (Taipei) / ap-east-2
Asia Pacific (Thailand) / ap-southeast-7
Asia Pacific (Tokyo) / ap-northeast-1
Asia Pacific (Hong Kong) / ap-east-1
Asia Pacific (Hyderabad) / ap-south-2
Asia Pacific (Jakarta) / ap-southeast-3
Asia Pacific (Malaysia) / ap-southeast-5
Asia Pacific (Melbourne) / ap-southeast-4
Asia Pacific (Mumbai) / ap-south-1
Asia Pacific (New Zealand) / ap-southeast-6
Asia Pacific (Osaka) / ap-northeast-3
Asia Pacific (Seoul) / ap-northeast-2
Asia Pacific (Singapore) / ap-southeast-1
Asia Pacific (Sydney) / ap-southeast-2
Asia Pacific (Taipei) / ap-east-2
Asia Pacific (Thailand) / ap-southeast-7
Asia Pacific (Tokyo) / ap-northeast-1
Middle East
Israel (Tel Aviv) / il-central-1
Middle East (Bahrain) / me-south-1
Middle East (UAE) / me-central-1
Israel (Tel Aviv) / il-central-1
Middle East (Bahrain) / me-south-1
Middle East (UAE) / me-central-1
Regional Prices
| Geography | Region | Region | Instance Price | 
|---|---|---|---|
| Hidden | Region 10 (hidden-region-10) | hidden-region-10 | 2.3600 | 
| Hidden | Region 11 (hidden-region-11) | hidden-region-11 | 2.3600 | 
| Hidden | Region 9 (hidden-region-9) | hidden-region-9 | 2.3600 | 
| Hidden | Region 4 (hidden-region-4) | hidden-region-4 | 2.3615 | 
| Hidden | Region 3 (hidden-region-3) | hidden-region-3 | 2.5920 | 
| Hidden | Region 5 (hidden-region-5) | hidden-region-5 | 2.8290 | 
| Asia Pacific | Asia Pacific (Mumbai) (ap-south-1) | ap-south-1 | 2.9420 | 
| Hidden | Region 1 (hidden-region-1) | hidden-region-1 | 3.0720 | 
| Asia Pacific | Asia Pacific (Singapore) (ap-southeast-1) | ap-southeast-1 | 3.1680 | 
| Hidden | Region 7 (hidden-region-7) | hidden-region-7 | 3.3120 | 
| Asia Pacific | Asia Pacific (Tokyo) (ap-northeast-1) | ap-northeast-1 | 3.3950 | 
| Hidden | Region 2 (hidden-region-2) | hidden-region-2 | 3.3950 | 
| Hidden | Region 6 (hidden-region-6) | hidden-region-6 | 3.5400 | 
| Hidden | Region 8 (hidden-region-8) | hidden-region-8 | 4.0200 | 
Similar Alternative Instances
| Instance Type | Instance Family | vCPUs | Memory (GiB) | GPU | Instance Price | 
|---|---|---|---|---|---|
| ml.r6g.4xlarge | Standard Instances | 16 | 128 | no | 0.9677 | 
| ml.r6gd.4xlarge | Memory Optimized Instances | 16 | 128 | no | 1.1059 | 
| ml.m7i.8xlarge | Standard Instances | 16 | 128 | no | 1.9350 | 
| ml.r8g.4xlarge | Memory Optimized Instances | 16 | 128 | no | 2.2600 | 
| ml.g4dn.8xlarge | Accelerated Computing Instances | 16 | 128 | yes | 2.7200 |