WebDataset is a PyTorch Dataset (IterableDataset) implementation that provides efficient access to datasets stored in POSIX tar archives, using only sequential/streaming data access. This brings ...
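
As a rough illustration of what sequential access to a tar archive looks like, here is a minimal sketch that uses only Python's standard `tarfile` module. It is not WebDataset's implementation. The helper names `make_shard` and `iter_samples`, the sample keys, and the rule that files sharing a basename form one sample are assumptions made for this example.

```python
import io
import tarfile


def make_shard(samples):
    """Pack {key: {ext: bytes}} into an in-memory tar archive.

    Files that belong to one sample share a basename (hypothetical layout).
    """
    buf = io.BytesIO()
    with tarfile.open(fileobj=buf, mode="w") as tar:
        for key, fields in samples.items():
            for ext, payload in fields.items():
                info = tarfile.TarInfo(name=f"{key}.{ext}")
                info.size = len(payload)
                tar.addfile(info, io.BytesIO(payload))
    buf.seek(0)
    return buf


def iter_samples(fileobj):
    """Yield one dict per sample, reading the archive strictly front to back."""
    current_key, sample = None, {}
    # Mode "r|" treats the input as a non-seekable stream: no random access.
    with tarfile.open(fileobj=fileobj, mode="r|") as tar:
        for member in tar:
            if not member.isfile():
                continue
            # Split at the first dot: "utt0001.wav" -> ("utt0001", "wav").
            key, _, ext = member.name.partition(".")
            if key != current_key and sample:
                yield sample
                sample = {}
            current_key = key
            sample.setdefault("__key__", key)
            sample[ext] = tar.extractfile(member).read()
    if sample:
        yield sample


if __name__ == "__main__":
    shard = make_shard({
        "utt0001": {"wav": b"RIFF", "txt": b"hello"},
        "utt0002": {"wav": b"RIFF2", "txt": b"world"},
    })
    for s in iter_samples(shard):
        print(s["__key__"], sorted(k for k in s if k != "__key__"))
```

Because the reader never seeks, the same loop would work on a pipe or a network stream, which is the property the snippet above attributes to sequential/streaming access.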
This repo provides scripts to fine-tune openai/whisper. The repository includes tools for data preprocessing, converting data to WebDataset format, and fine-tuning Whisper. Shrutilipi (AI4Bharat) ...