Batch Inference

Batch inference in Valohai runs as a standard execution, letting you process datasets or file collections at scale without managing infrastructure.

How it works

Batch inference uses the same execution system you use for training:

  1. Define a step in valohai.yaml with your inference code (a minimal sketch follows these steps)

  2. Specify inputs (model files and data to process)

  3. Run the execution via CLI, API, or schedule it

  4. Collect results from outputs
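To make steps 1 and 2 concrete, here is a minimal sketch of what such a step could look like in valohai.yaml. The step name, Docker image, commands, and input sources below are placeholders rather than part of any existing project; adapt them to your own code and data.

```yaml
- step:
    name: batch-inference          # hypothetical step name
    image: python:3.11             # any Docker image with your dependencies
    command:
      - pip install -r requirements.txt
      - python predict.py          # your inference script
    inputs:
      - name: model                # trained model file(s) to load
        default: datum://production-model
      - name: data                 # the files to run predictions on
        default: s3://example-bucket/batch/*.csv
```

Inside the execution, Valohai downloads each input under /valohai/inputs/<input-name>/ and uploads anything the script writes to /valohai/outputs/ as versioned outputs, so predict.py only needs to read and write local files. You can then launch the step from the command line (for example with vh execution run batch-inference), through the API, or on a schedule.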

Key advantage: You already know this system. If you've run training jobs, you can run inference jobs.

What you can do

  • Process thousands of images, CSVs, or other file types

  • Schedule recurring inference jobs (e.g., nightly predictions)

  • Trigger inference via API when new data arrives

  • Chain inference into pipelines after training completes (see the pipeline sketch after this list)

  • Track inference metrics alongside training metrics
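For the pipeline case, the sketch below shows one way to chain the two: a training node followed by a batch inference node, with the trained model passed along the edge. The step names, node names, and the output file pattern are assumptions; they have to match steps and output files that actually exist in your valohai.yaml.

```yaml
- pipeline:
    name: train-then-predict
    nodes:
      - name: train
        type: execution
        step: train-model        # assumed training step defined elsewhere in the file
      - name: predict
        type: execution
        step: batch-inference    # the inference step sketched earlier
    edges:
      # Feed model files produced by the training node
      # into the "model" input of the inference node.
      - [train.output.model*, predict.input.model]
```

When the training node finishes, the pipeline starts the inference node automatically with the fresh model as its input, so a retrain-and-repredict cycle becomes a single pipeline run.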

Example use cases

Image classification at scale: process a directory of product images to tag inventory items.

Batch predictions on tabular data: run monthly churn predictions on your entire customer database.

Document processing: extract entities from legal documents or medical records in batches.

When to use batch inference

Choose batch inference when:

  • You're processing datasets, not individual requests

  • Latency requirements are in minutes or hours, not milliseconds

  • You want to leverage Valohai's execution tracking and versioning

  • You need to schedule or automate inference runs

Need lower latency? Check out Real-Time Endpoints for sub-second predictions.

Next steps

See the practical examples, or jump straight to defining your inference step in valohai.yaml.
