Models

Valohai Model Hub is a centralized registry for managing machine learning models throughout their lifecycle, from training to production deployment.

Centralized model documentation with version control. Customize fields and use markdown to document model details in the main view. The sidebar highlights the current approved version with download options and displays version history with approval status indicators (approved/pending/rejected).

Why Use Model Hub

The Problem: Scattered Model Artifacts

Without a model registry:

  • Models saved as random files in cloud storage

  • No clear "which model is in production?"

  • Can't trace which data trained which model

  • Manual approval processes via Slack/email

  • No version comparison or rollback capability

  • Team members can't find the right model

Example chaos:


The Solution: Centralized Model Registry

Centralized model inventory across all projects. Browse models with their latest version status (approved/pending), last modified date, associated projects, and ownership. Provides visibility into all models in the organization with filtering and search capabilities for easy discovery.

Model Hub provides:

  • Single source of truth for all models

  • Automatic versioning from training pipelines

  • Built-in approval workflow (pending → approved → rejected)

  • Complete lineage tracking (data → code → model)

  • Version comparison with metrics and artifacts

  • Access control for governance

  • model:// URIs for consistent references

Example organization:


Model Hub vs. Saving Model Files

When to Use Model Hub

Scenario
Use Model Hub
Just Save Files

Production models

✅ Yes

❌ No

Need approval workflow

✅ Yes

❌ No

Multiple model versions

✅ Yes

❌ No

Team collaboration

✅ Yes

❌ No

Lineage tracking needed

✅ Yes

✅ Yes

Quick experiment checkpoint

⚠️ Optional

✅ Yes

Intermediate training artifacts

❌ No

✅ Yes


Workflow Comparison

Without Model Hub:

With Model Hub:


Key Features

Automatic Lineage Tracking

Every model version automatically tracks:

  • Training data — Which dataset versions were used

  • Code version — Exact commit that trained the model

  • Hyperparameters — All parameters from training

  • Environment — Docker image, dependencies

  • Training metrics — Loss, accuracy, custom metrics

  • Artifacts — Model files, checkpoints, configs

Benefit: "Which data trained the production model?" → One click to see complete history.

Model lineage view showing end-to-end traceability. The model (1) was created by execution #1805, which received preprocessed data from job #1799 that used training datasets and images.zip. Each execution is clickable to view full details. The right side shows downstream usage: this model has been consumed by 3 jobs, with one successfully generating predictions (3).

Approval Workflow

Built-in state management for model lifecycle:

States:

  • Pending — Newly created, awaiting review

  • Approved — Validated for production use

  • Rejected — Not suitable for production

Benefit: Clear approval trail for compliance and governance.

Complete audit trail of all model activities. Track when the model was used in executions (1), approved by team members like Sofia (2), and modified by users like Drazen (3). Every edit, usage, and approval of the model or its versions is automatically logged with timestamp, user, and location details.

Version Comparison

Compare model versions side-by-side:

  • Training metrics (accuracy, loss, F1)

  • Hyperparameters

  • Training data

  • File sizes and artifacts

  • Training duration

Benefit: "Is v2 really better than v1?" → Compare metrics directly.

Compare model versions visually by plotting performance metrics. Select metrics for the horizontal axis (2) and vertical axis (3) to visualize tradeoffs across different model versions (1).

Unified Access

All models in one place:

  • Organization view → All models across projects

  • Project view → Models relevant to this project

  • Search and filter by tags, state, metrics

  • Download artifacts or use in pipelines

Benefit: No hunting through S3 buckets or file shares.


Use Cases

Production Model Management

Scenario: Deploy and monitor production models with approval gates.

Benefit: Controlled releases with audit trail.


A/B Testing

Scenario: Compare model variants in production.

Benefit: Safe model rollout with easy rollback.


Model Lineage & Compliance

Scenario: Audit which data trained production models.

Benefit: Complete audit trail for compliance.


Team Collaboration

Scenario: Multiple data scientists training models, ML engineer deploying.

Benefit: Clear handoff between roles.


Experiment Tracking

Scenario: Track dozens of training runs, pick best.

Benefit: Organized experimentation with clear winners.


model:// URI Format

Models are referenced using model:// URIs, similar to datum:// and dataset:// links.

Format

Examples:

Using model:// URIs

In valohai.yaml:

In code:

Benefit: Consistent, versioned references across all workflows.


Model Hub vs. Other Registries

Feature
Valohai Model Hub
MLflow
W&B
SageMaker Model Registry

Built-in lineage

✅ Automatic

⚠️ Manual logging

⚠️ Manual logging

❌ Limited

Approval workflow

✅ Built-in

❌ No

❌ No

✅ Manual

Versioned inputs

model:// URIs

⚠️ Manual paths

⚠️ Manual paths

✅ ARNs

Access control

✅ Built-in

⚠️ Enterprise only

✅ Yes

✅ Yes

Training integration

✅ Automatic

⚠️ Manual tracking

⚠️ Manual tracking

⚠️ Manual

Reproducibility

✅ Full pipeline

⚠️ Model only

⚠️ Model only

⚠️ Model only

Valohai differentiator: Automatic lineage from full pipeline execution, not just model files.


Getting Started

Ready to use Model Hub? Follow these guides:

  1. Create and Manage Models — Create models, versions, approval workflow

  2. Model Artifacts & Versioning — Save models from training, use in deployment



Next Steps

  • Create your first model in Model Hub

  • Set up automatic model versioning from training

  • Configure approval workflow for your team

  • Deploy using versioned model:// URIs

Last updated

Was this helpful?