Add Docs for AudioEncoder #717

NicolasHug · 2025-06-09T12:34:03Z

This PR adds docstrings and a tutorial for the AudioEncoder, plus some minor changes listed in the comments below

NicolasHug · 2025-06-09T12:35:13Z

docs/source/conf.py

Changed in this file, as well as the files renaming, are meant to separate our "tutorials" page into 2 separate sections: one for decoding, one for encoding.

NicolasHug · 2025-06-09T12:36:22Z

src/torchcodec/encoders/_audio_encoder.py

@@ -16,8 +26,11 @@ def __init__(self, samples: Tensor, *, sample_rate: int):
            raise ValueError(
                f"Expected samples to be a Tensor, got {type(samples) = }."
            )
+        if samples.ndim == 1:
+            # make it 2D and assume 1 channel
+            samples = samples[None, :]


Drive-by, I think this makes sense, i.e. if the input tensor is 1D we assume it's 1 channel instead of raising an error

Is that the most idomatic way to do that? There's also unsqueeze, view or reshape. Minor, but I'm surprised at how this works.

scotts · 2025-06-09T15:32:58Z

README.md

-  uses the version of FFmpeg you already have installed. FFmpeg is a mature
-  library with broad coverage available on most systems. It is, however, not
-  easy to use. TorchCodec abstracts FFmpeg's complexity to ensure it is used
+* Relying on [FFmpeg](https://www.ffmpeg.org/) to do the decoding / encoding.


Nit: I prefer "x and y" as opposed to "x / y" in prose.

scotts · 2025-06-09T15:39:12Z

docs/source/conf.py

+            order = [
+                "audio_encoding.py",
+            ]
+


Can we add a comment explaining that we have two top-level galleries, and for that reason, we need to figure out which gallery we're using (decoding versus encoding)? I was real confused until I concluded that must be what's going on.

scotts · 2025-06-09T15:41:47Z

Moving the examples into decoding and encoding subdirectories makes sense, but I'm curious: will that change the resulting tutorial URL? We already have some pointers to these tutorials floating around blogs and social media.

Dan-Flores · 2025-06-10T14:57:55Z

examples/encoding/audio_encoding.py

+
+# %%
+# We first instantiate an :class:`~torchcodec.encoders.AudioEncoder`. We pass it
+# the samples to be encoded. The samples must a 2D tensors of shape


Nit: "The samples must be a 2D tensors of shape"

NicolasHug added 3 commits June 9, 2025 11:41

Docstrings

110327c

Some reorg

d7ec60e

Add tuto

ee05058

facebook-github-bot added the CLA Signed This label is managed by the Meta Open Source bot. label Jun 9, 2025

NicolasHug commented Jun 9, 2025

View reviewed changes

Add smoke test

a0d7af3

scotts reviewed Jun 9, 2025

View reviewed changes

scotts approved these changes Jun 9, 2025

View reviewed changes

Dan-Flores reviewed Jun 10, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add Docs for AudioEncoder #717

Add Docs for AudioEncoder #717

Uh oh!

NicolasHug commented Jun 9, 2025

Uh oh!

NicolasHug Jun 9, 2025

Uh oh!

NicolasHug Jun 9, 2025

Uh oh!

scotts Jun 9, 2025

Uh oh!

scotts Jun 9, 2025

Uh oh!

scotts Jun 9, 2025

Uh oh!

scotts commented Jun 9, 2025 •

edited

Loading

Uh oh!

Dan-Flores Jun 10, 2025

Uh oh!

Uh oh!

Add Docs for AudioEncoder #717

Are you sure you want to change the base?

Add Docs for AudioEncoder #717

Uh oh!

Conversation

NicolasHug commented Jun 9, 2025

Uh oh!

NicolasHug Jun 9, 2025

Choose a reason for hiding this comment

Uh oh!

NicolasHug Jun 9, 2025

Choose a reason for hiding this comment

Uh oh!

scotts Jun 9, 2025

Choose a reason for hiding this comment

Uh oh!

scotts Jun 9, 2025

Choose a reason for hiding this comment

Uh oh!

scotts Jun 9, 2025

Choose a reason for hiding this comment

Uh oh!

scotts commented Jun 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Dan-Flores Jun 10, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

scotts commented Jun 9, 2025 •

edited

Loading