Package Datasets for Faster Downloads
The Problem: Small Files Are Slow
The Solution: Automatic Dataset Packaging
When to Use Dataset Packaging
Ideal Use Cases
When Not to Use
Packaging vs Manual Tar Files
Approach
When to Use
Enable Dataset Packaging
How to Package a Dataset Version
Step 1: Enable in Execution Configuration

Step 2: Create Dataset with Packaging Flag
Step 3: Use the Packaged Dataset
Complete Example: Time-Series Dataset
Complete Example: Image Classification
How It Works Behind the Scenes
During Dataset Creation
During Execution Using Packaged Dataset
Verify Packaging Worked
Check Execution Logs
Expert Mode UI
Method
Download Time
Setup Overhead
Method
Download Time
Setup Overhead
Method
Download Time
Setup Overhead
Current Limitations
Programmatic Creation Only
Requires Environment Variable
Best Practices
Start with Small Test Dataset
Name Dataset Versions Clearly
Monitor First Execution
Combine with Dataset Versioning
Next Steps
Last updated
Was this helpful?
