HPC-Inference

Batch inference solution for large-scale image datasets on HPC
About
Problem: Many batch inference workflows are bottlenecked by I/O and sequential processing, leaving GPUs idle and stretching total processing time.
Key Bottlenecks
- Slow sequential large file loading (Disk → RAM)
- Single-threaded image preprocessing
- Data transfer delays (CPU ↔ GPU)
- GPU idle time waiting for data
- Sequential output writing
HPC-Inference Solutions
- Parallel data loading: Eliminates disk I/O bottlenecks with optimized dataset loaders
- Asynchronous preprocessing: Keeps GPUs fed with continuous data queues
- SLURM integration: Deploy seamlessly on HPC clusters
- Multi-GPU distribution: Scales across HPC nodes for maximum throughput
- Resource profiling: Logs timing metrics and CPU/GPU usage rates to help optimize your configuration
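To make the asynchronous-preprocessing idea concrete, here is a minimal stdlib sketch (not the package's actual loader): a producer thread preprocesses items into a bounded queue while a consumer, standing in for the GPU step, drains it, so compute never waits on decoding. All names here are illustrative assumptions.

```python
import queue
import threading

def preprocess(item):
    # Stand-in for image decode + transform work done on the CPU.
    return item * 2

def producer(items, q):
    # Preprocess on a CPU thread so the consumer never starves.
    for item in items:
        q.put(preprocess(item))
    q.put(None)  # sentinel: no more data

def consume_all(items, max_prefetch=8):
    # A bounded queue caps RAM use while keeping data ready ahead of time.
    q = queue.Queue(maxsize=max_prefetch)
    t = threading.Thread(target=producer, args=(items, q), daemon=True)
    t.start()
    results = []
    while True:
        batch = q.get()  # the consumer (e.g. a GPU step) finds data waiting
        if batch is None:
            break
        results.append(batch)
    t.join()
    return results

print(consume_all(range(5)))  # → [0, 2, 4, 6, 8]
```

In PyTorch, the same overlap is what DataLoader worker processes and prefetching provide; the package's loaders build on that mechanism rather than hand-rolled threads.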
Core Features
The hpc_inference package's core functionality includes customized PyTorch datasets:
- ParquetImageDataset for image data stored as compressed binary columns across multiple large Parquet files
- ImageFolderDataset for image data stored in folders using open file formats such as JPG, PNG, TIFF, etc.
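The real dataset classes are defined in the package; purely as an illustrative sketch, a folder-backed dataset typically indexes image files by extension and decodes them lazily per item. The class and method names below are assumptions, not the package's actual interface.

```python
from pathlib import Path

IMAGE_EXTS = {".jpg", ".jpeg", ".png", ".tiff", ".tif"}

class FolderImagePaths:
    """Toy stand-in for an ImageFolderDataset-style index over a directory tree."""

    def __init__(self, root):
        # Recursively collect image paths up front; decoding happens per item.
        self.paths = sorted(
            p for p in Path(root).rglob("*") if p.suffix.lower() in IMAGE_EXTS
        )

    def __len__(self):
        return len(self.paths)

    def __getitem__(self, idx):
        # A real dataset would decode the image here (e.g. with PIL) and
        # apply preprocessing transforms before returning a tensor.
        return self.paths[idx]
```

With PyTorch, a dataset shaped like this plugs into a DataLoader with num_workers > 0, so decoding and preprocessing run in parallel worker processes.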
The package also comes with a suite of ready-to-use job scripts to perform efficient batch inference using pretrained models on HPCs.
Use Cases
- Image Folder Dataset - Process images from directory structures
- Parquet Dataset - Handle compressed image data in Parquet format
- Large scale CLIP embedding - Generate embeddings for massive datasets
- Large scale face detection - Detect faces across large image collections
- Large scale animal detection - Use MegaDetector for wildlife analysis
- Grid search profiling - Optimize processing parameters
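Grid-search profiling amounts to timing the same short workload under each candidate configuration and ranking by throughput. Below is a minimal sketch under that assumption; the parameter names and the simulated workload are illustrative, not the package's CLI.

```python
import itertools
import time

def run_workload(batch_size, num_workers):
    # Stand-in for a short profiling run of the real inference pipeline.
    # Here, larger batches and more workers simply cost less simulated time.
    items = 64
    per_item = 0.0001 / max(num_workers, 1)
    time.sleep(items / batch_size * per_item)
    return items

def grid_search(batch_sizes, worker_counts):
    # Time every (batch_size, num_workers) combination; keep images/sec.
    results = {}
    for bs, nw in itertools.product(batch_sizes, worker_counts):
        start = time.perf_counter()
        n = run_workload(bs, nw)
        elapsed = time.perf_counter() - start
        results[(bs, nw)] = n / elapsed  # throughput in images per second
    best = max(results, key=results.get)
    return best, results

best, results = grid_search([8, 32], [1, 4])
print("best (batch_size, num_workers):", best)
```

In practice the profiled quantity would come from the package's logged timing and CPU/GPU utilization metrics rather than a sleep-based stand-in.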
Acknowledgement
This project is a joint effort between the Imageomics Institute and the ABC Global Center.