Multi-Instance GPU (MIG) expands the performance and value of NVIDIA Blackwell and Hopper™ generation GPUs. MIG can partition the GPU into as many as seven instances, each fully isolated with its own high-bandwidth memory, cache, and compute cores. This gives administrators the ability to support every workload, from the smallest to the largest, with guaranteed quality of service (QoS), extending the reach of accelerated computing resources to every user.
Without MIG, different jobs running on the same GPU, such as different AI inference requests, compete for the same resources. A job consuming larger memory bandwidth starves others, resulting in several jobs missing their latency targets. With MIG, jobs run simultaneously on different instances, each with dedicated resources for compute, memory, and memory bandwidth, resulting in predictable performance with QoS and maximum GPU utilization.
Blackwell and Hopper GPUs support MIG with multi-tenant, multi-user configurations in virtualized environments across up to seven GPU instances, securely isolating each instance with confidential computing at the hardware and hypervisor level. Dedicated video decoders for each MIG instance deliver secure, high-throughput intelligent video analytics (IVA) on shared infrastructure. With concurrent MIG profiling, administrators can monitor right-sized GPU acceleration and allocate resources for multiple users.
Researchers with smaller workloads can use MIG to securely isolate a portion of a GPU rather than renting a full cloud instance, with assurance that their data is secure at rest, in transit, and in use. This gives cloud service providers more flexibility to price and address smaller customer opportunities.
MIG enables fine-grained GPU provisioning by IT and DevOps teams. Each MIG instance behaves like a standalone GPU to applications, so there’s no change to the CUDA® platform. MIG can be used in all major enterprise computing environments.
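Because each instance appears to an application as its own GPU, a process can be pinned to one instance by its device UUID. The following is a minimal sketch, assuming the usual `nvidia-smi -L` listing layout and the `CUDA_VISIBLE_DEVICES` environment variable; the helper names (`parse_mig_devices`, `env_for_instance`) and the UUIDs in the sample listing are illustrative placeholders, not real devices or part of any NVIDIA API.

```python
import os
import re

# Matches MIG device lines in the layout `nvidia-smi -L` typically prints, e.g.
#   "  MIG 1g.10gb     Device  0: (UUID: MIG-xxxxxxxx-...)"
# (assumed layout; verify against the output on your own system).
MIG_LINE = re.compile(
    r"^\s*MIG\s+(\S+)\s+Device\s+(\d+):\s+\(UUID:\s+(MIG-[0-9a-fA-F-]+)\)"
)


def parse_mig_devices(listing: str) -> list[dict]:
    """Extract profile name, device index, and UUID for each MIG line."""
    devices = []
    for line in listing.splitlines():
        match = MIG_LINE.match(line)
        if match:
            devices.append(
                {
                    "profile": match.group(1),
                    "index": int(match.group(2)),
                    "uuid": match.group(3),
                }
            )
    return devices


def env_for_instance(uuid: str) -> dict:
    """Copy the current environment and expose only the given MIG instance."""
    env = dict(os.environ)
    env["CUDA_VISIBLE_DEVICES"] = uuid
    return env


# Placeholder listing with made-up UUIDs, standing in for real command output.
SAMPLE_LISTING = """\
GPU 0: NVIDIA H100 80GB HBM3 (UUID: GPU-00000000-0000-0000-0000-000000000000)
  MIG 1g.10gb     Device  0: (UUID: MIG-11111111-1111-1111-1111-111111111111)
  MIG 1g.10gb     Device  1: (UUID: MIG-22222222-2222-2222-2222-222222222222)
"""

if __name__ == "__main__":
    for device in parse_mig_devices(SAMPLE_LISTING):
        print(device["index"], device["profile"], device["uuid"])
```

The dictionary returned by `env_for_instance` could be passed as the `env` argument of `subprocess.run` to launch an unmodified CUDA application on a single instance. On a real system the listing would come from running `nvidia-smi -L` rather than from the sample string.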
Feature | Blackwell Ultra GPU | Blackwell GPU* | H100 GPU |
---|---|---|---|
Confidential computing | Yes | Yes | Yes |
Instance types | Up to 7x 34GB, up to 4x 70GB, up to 2x 140GB, up to 1x 288GB | Up to 7x 23GB, up to 4x 45GB, up to 2x 95GB, up to 1x 192GB | 7x 10GB, 4x 20GB, 2x 40GB, 1x 80GB |
GPU profiling and monitoring | Concurrently on all instances | Concurrently on all instances | Concurrently on all instances |
Secure Tenants | 7x | 7x | 7x |
Media decoders | Dedicated NVJPEG and NVDEC per instance | Dedicated NVJPEG and NVDEC per instance | Dedicated NVJPEG and NVDEC per instance |
Preliminary specifications; may be subject to change. *Sizes shown are for Blackwell GPUs in GB200 NVL72. MIG sizes for Blackwell GPUs in HGX B200 are lower; refer to the technical documentation.
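To illustrate how the instance types in the table might guide right-sizing, here is a minimal sketch that picks the smallest listed instance whose memory covers a workload's requirement. It considers memory only, which is a simplifying assumption; the function name `smallest_fitting_instance` is hypothetical, and the Blackwell GPU figures are the GB200 NVL72 sizes from the table.

```python
# (max instances per GPU, memory per instance in GB), copied from the table.
INSTANCE_TYPES = {
    "Blackwell Ultra GPU": [(7, 34), (4, 70), (2, 140), (1, 288)],
    "Blackwell GPU": [(7, 23), (4, 45), (2, 95), (1, 192)],  # GB200 NVL72 sizes
    "H100 GPU": [(7, 10), (4, 20), (2, 40), (1, 80)],
}


def smallest_fitting_instance(gpu: str, required_gb: float):
    """Return (max_instances, size_gb) for the smallest instance type that
    has at least `required_gb` of memory, or None if none is large enough.

    Memory is the only criterion here; compute needs are ignored.
    """
    for count, size_gb in sorted(INSTANCE_TYPES[gpu], key=lambda t: t[1]):
        if size_gb >= required_gb:
            return count, size_gb
    return None


if __name__ == "__main__":
    # A workload needing 16 GB on an H100 fits the 20GB instance type,
    # of which up to four can share one GPU.
    print(smallest_fitting_instance("H100 GPU", 16))
```

Under this memory-only assumption, a 16 GB workload on an H100 maps to the 4x 20GB configuration, so up to four such workloads could share one GPU.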