Batch Inference

Batch inference complements the low-latency API when you need to score or enrich large portfolios.

When to choose batch

Use case	Recommended path
Nightly portfolio rescoring	Batch
Real-time onboarding	REST API
Marketing list enrichment	Batch
High-touch underwriting	REST API with on-demand enrichment

Workflow

Upload data

Upload a file with model input data via the Batch API.

Submit job

Request a batch prediction via the Batch API specifying the previously uploaded file and an available model. Avra responds with a batch_id so you can track the job.

Processing

Jobs run in prioritized queues. Expect minutes for thousands of records and hours for millions.

Waiting

Poll GET /v1/api/batches/{id} checking job status or subscribe to the batch-lifecycle webhook to be notified on status transitions.

Download results

Request a download link at GET /v1/api/batches/{id}/result to obtain a single-use download URL to your result file.

Refer to the API Reference for endpoint schemas, payload examples, and error handling best practices when submitting batches.Workflow steps may vary when using SFTP Batch Inference. Please refer to the documentation for more details.

Limits

Up to 10M entities per batch by default — reach out for higher quotas.
Five concurrent batches per workspace.
CSV uploads capped at 500 MB.

Integrations

Use SFTP Batch Inference for scheduled large files.
Configure webhooks via the dashboard or per-request webhook_url. See Webhooks.

Monitoring

Track batch throughput and failures in the dashboard under Data Operations → Batches.

⌘I

​When to choose batch

​Workflow

​Limits

​Integrations

​Monitoring

When to choose batch

Workflow

Limits

Integrations

Monitoring