You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Support Audio feature for TAR archives in sequential access (huggingface#3129)
* Add test fixture for TAR WAV file
* Add test iter_archive
* Test dataset with Audio feature for TAR archive
* Add Audio method to decode from bytes instead of path
* Add Audio support for bytes besides path
* Fix docstring
* Remove archived attribute from test audio with TAR archive
* Remove archived attribute from Audio feature
* Implement Audio.encode_example
* Call Audio.encode_example from encode_nested_example
* Fix docs
* Enhance Audio.decode_example to accept a string
* Fix docs
* Implement private Audio._storage_dtype to specify cached dtype
* Change Audio._storage_dtype dynamically when encoding a string
* Update test of Audio instantiation
* Set ArrowWriter.schema property dynamically calculated from features
* Update ArrowWriter.write_examples_on_file
* Update ArrowWriter._build_writer
* Fix code quality
* Replace _schema with schema and condition on schema in ArrowWriter
* Add test for MP3 TAR audio file
* Refactor Audio decode_example
* Pass raw bytes to torchaudio.load
* Revert "Pass raw bytes to torchaudio.load"
This reverts commit c973209.
* Pass format to load in _decode_example_with_torchaudio
* Fix filename extension in test
* Fix Audio tests CI
* Fix Audio tests CI
* Fix audio test CI by checking out PR HEAD commit instead of merge commit
* Change default Audio storage dtype to string
* Rename Audio decode functions
* Refactor Audio decode_example
* Force CI re-run
* Refactor and rename
* Fix docstring
* Fix docstrings
0 commit comments