Can I run CLIP ViT-H/14 on NVIDIA Jetson AGX Orin 64GB?

Perfect
Yes, you can run this model!
GPU VRAM 64.0GB
Required 2.0GB
Headroom +62.0GB

VRAM Usage

3% used (2.0GB of 64.0GB)

Performance Estimate

Tokens/sec ~90.0
Batch size 32

Technical Analysis

The NVIDIA Jetson AGX Orin 64GB is well suited to running CLIP ViT-H/14. Its 64GB of LPDDR5 is unified memory shared between the CPU and GPU rather than dedicated VRAM, so the operating system and other processes draw from the same pool, but the model's 2GB FP16 requirement still leaves roughly 62GB of headroom. That is enough to load and run the model comfortably, even with larger batches or additional pipeline stages. The Ampere-architecture GPU, with 2048 CUDA cores and 64 Tensor Cores, accelerates the matrix multiplications that dominate CLIP inference.
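The memory figures above follow from simple arithmetic, which the sketch below reproduces. The parameter count of roughly one billion is an assumption made for illustration (it is not stated on this page), and the estimate covers weights only, not activations or framework overhead.

```python
# Rough weight-memory estimate.
# ASSUMED_PARAMS is an illustrative assumption, not a figure from this page.
ASSUMED_PARAMS = 1.0e9          # ~1 billion parameters (assumed)
TOTAL_MEMORY_GB = 64.0          # Jetson AGX Orin 64GB, from the page
BYTES_PER_PARAM = {"fp32": 4, "fp16": 2, "int8": 1}


def weight_memory_gb(num_params: float, precision: str) -> float:
    """Memory needed for the weights alone, in GB (10^9 bytes)."""
    return num_params * BYTES_PER_PARAM[precision] / 1e9


def headroom_gb(required_gb: float, total_gb: float = TOTAL_MEMORY_GB) -> float:
    """Memory left over after the requirement is subtracted from the total."""
    return total_gb - required_gb


for precision in ("fp32", "fp16", "int8"):
    need = weight_memory_gb(ASSUMED_PARAMS, precision)
    print(f"{precision}: {need:.1f} GB weights, {headroom_gb(need):.1f} GB headroom")
```

Under that assumption, FP16 weights come to about 2GB, matching the requirement quoted above, and INT8 would roughly halve it.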

The memory bandwidth of 0.21 TB/s is adequate, but it is shared with the CPU, so minimizing data movement still pays off. The 60W TDP may limit sustained peak performance, although for inference workloads like CLIP this is generally not a major bottleneck. The estimated throughput of about 90 tokens/sec is reasonable for real-time or near-real-time vision applications, and the large memory headroom is what makes a batch size of 32 practical.
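The throughput figure is an estimate, so it is worth measuring on the device. Below is a minimal timing sketch that sweeps batch sizes and reports latency and throughput for each. `fake_clip_forward` is a hypothetical stand-in that only sleeps; in practice it would be replaced by a real forward pass.

```python
import time
from typing import Callable, Dict, Sequence


def sweep_batch_sizes(
    run_batch: Callable[[int], None],
    batch_sizes: Sequence[int] = (1, 8, 16, 32),
    repeats: int = 5,
) -> Dict[int, Dict[str, float]]:
    """Time run_batch(batch_size) and report latency and throughput per size."""
    results: Dict[int, Dict[str, float]] = {}
    for batch_size in batch_sizes:
        run_batch(batch_size)  # warm-up call, excluded from timing
        start = time.perf_counter()
        for _ in range(repeats):
            run_batch(batch_size)
        elapsed = time.perf_counter() - start
        latency_s = elapsed / repeats  # mean time per batch
        results[batch_size] = {
            "latency_s": latency_s,
            "items_per_s": batch_size / latency_s,
        }
    return results


def fake_clip_forward(batch_size: int) -> None:
    """Hypothetical stand-in for a real CLIP forward pass (just sleeps)."""
    time.sleep(0.001 + 0.0005 * batch_size)


for size, stats in sweep_batch_sizes(fake_clip_forward).items():
    print(f"batch {size}: {stats['latency_s'] * 1000:.1f} ms, "
          f"{stats['items_per_s']:.0f} items/s")
```

When timing real GPU work, the stand-in should block until the device has finished (for example by synchronizing the CUDA stream), otherwise the clock only measures kernel launch time.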

Recommendation

With memory capacity not a constraint, focus on making the inference pipeline efficient. Start by using NVIDIA's TensorRT to quantize the CLIP model to INT8, which can further reduce memory usage and improve inference speed; validate embedding quality afterwards, since INT8 quantization can cost some accuracy. Experiment with different batch sizes to find the best balance between throughput and latency. Also monitor GPU temperature and power consumption during sustained use so that thermal throttling does not become a limiting factor.
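For the temperature monitoring mentioned above, one option is to poll the Linux thermal zones directly. The sketch below assumes the board exposes the standard sysfs thermal interface under `/sys/class/thermal`, with temperatures reported in millidegrees Celsius; zone names vary by device and are not taken from this page.

```python
from pathlib import Path
from typing import Dict


def read_thermal_zones(base: str = "/sys/class/thermal") -> Dict[str, float]:
    """Return {zone type: temperature in degrees C} for each readable zone."""
    temps: Dict[str, float] = {}
    for zone in sorted(Path(base).glob("thermal_zone*")):
        try:
            name = (zone / "type").read_text().strip()
            millideg = int((zone / "temp").read_text().strip())
        except (OSError, ValueError):
            continue  # skip zones that are unreadable or report no value
        temps[name] = millideg / 1000.0  # sysfs reports millidegrees
    return temps


if __name__ == "__main__":
    for name, celsius in read_thermal_zones().items():
        print(f"{name}: {celsius:.1f} C")
```

Calling this in a loop alongside an inference run gives a simple trace that shows whether temperatures climb toward throttling under sustained load.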

If you encounter performance bottlenecks, consider mixed-precision inference or model distillation to produce a smaller, faster version of CLIP. Also make sure you are running a recent JetPack release, since on Jetson the NVIDIA drivers and CUDA toolkit are delivered through JetPack. For deployment, consider Triton Inference Server to manage and scale your CLIP inference workloads.

Recommended Settings

Batch size 32
Context length 77
Other settings Enable CUDA graph capture; use asynchronous data loading; optimize pre- and post-processing
Inference framework TensorRT
Suggested quantization INT8
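
The asynchronous data loading setting listed above can be sketched with a background thread that preprocesses upcoming batches while the current one is being consumed. This is a minimal illustration using only the standard library; `prefetch_batches` and its parameters are hypothetical names, not part of TensorRT or any CLIP library.

```python
import queue
import threading
from typing import Callable, Iterable, Iterator, List, TypeVar

T = TypeVar("T")
_DONE = object()  # sentinel marking the end of the stream


def prefetch_batches(
    items: Iterable[T],
    preprocess: Callable[[T], T],
    batch_size: int = 32,
    max_prefetch: int = 2,
) -> Iterator[List[T]]:
    """Yield preprocessed batches while a worker thread prepares the next ones."""
    buffer: queue.Queue = queue.Queue(maxsize=max_prefetch)

    def worker() -> None:
        try:
            batch: List[T] = []
            for item in items:
                batch.append(preprocess(item))
                if len(batch) == batch_size:
                    buffer.put(batch)  # blocks once max_prefetch batches wait
                    batch = []
            if batch:
                buffer.put(batch)  # final partial batch
        finally:
            buffer.put(_DONE)  # always signal completion, even on error

    threading.Thread(target=worker, daemon=True).start()
    while True:
        batch = buffer.get()
        if batch is _DONE:
            return
        yield batch


for batch in prefetch_batches(range(5), lambda x: x * 2, batch_size=2):
    print(batch)
```

The bounded queue keeps memory use predictable: the worker stalls once `max_prefetch` batches are waiting, so preprocessing never runs far ahead of inference.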

Frequently Asked Questions

Is CLIP ViT-H/14 compatible with NVIDIA Jetson AGX Orin 64GB?
Yes, CLIP ViT-H/14 is perfectly compatible with the NVIDIA Jetson AGX Orin 64GB due to ample VRAM.
What VRAM is needed for CLIP ViT-H/14?
CLIP ViT-H/14 requires approximately 2.0GB of VRAM when using FP16 precision.
How fast will CLIP ViT-H/14 run on NVIDIA Jetson AGX Orin 64GB?
You can expect CLIP ViT-H/14 to run at an estimated 90 tokens/sec on the NVIDIA Jetson AGX Orin 64GB.