LavenderDataLoader - Fault Handling
Lavender Data is designed to handle failures gracefully, ensuring your training pipeline remains robust.
Handling Failed Samples
By default, Lavender Data will skip failed samples.
You can control this behavior with the skip_on_failure
and max_retry_count
parameters.
Parameter | Description | Default |
---|---|---|
skip_on_failure | Skip the sample if it fails (after max_retry_count attempts), instead of stopping the iteration | False |
max_retry_count | Retry each failed sample up to this number of times | 0 |
dataloader = LavenderDataLoader( dataset_id=dataset.id, shardsets=[shardset.id], skip_on_failure=True, # Skip the sample if it fails (after max_retry_count attempts) max_retry_count=3, # Retry each failed sample up to 3 times)