Add Buffer class #2044

mthrok · 2021-11-28T20:05:24Z

Part of #1986. Splitting the PR for easier review.

Add Buffer class that is responsible for converting AVFrame to Tensor.
Note: The API to retrieve the buffered Tensors is tentative.
For the overall architecture, see https://github.com/mthrok/audio/blob/ffmpeg/torchaudio/csrc/ffmpeg/README.md.

Note: Without a change to build process, the code added here won't be compiled. The build process will be updated later.
Needs to be imported after #2043.

facebook-github-bot · 2021-12-08T03:31:00Z

@mthrok has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

nateanl · 2021-12-13T13:30:05Z

torchaudio/csrc/ffmpeg/buffer.h

+  std::deque<torch::Tensor> chunks;
+  AVMediaType media_type;


Shall we add reference to the variables?

Suggested change

std::deque<torch::Tensor> chunks;

AVMediaType media_type;

std::deque<torch::Tensor>& chunks;

AVMediaType& media_type;

It's just an enum, so it's cheap and safer to copy.

nateanl · 2021-12-13T13:32:37Z

torchaudio/csrc/ffmpeg/buffer.cpp

+    case AVMEDIA_TYPE_AUDIO:
+      push_audio_frame(frame);
+      break;
+    case AVMEDIA_TYPE_VIDEO:


Where are AVMEDIA_TYPE_AUDIO and AVMEDIA_TYPE_VIDEO defined? Or it's taken care by ffmpeg?

https://ffmpeg.org/doxygen/4.1/group__lavu__misc.html#ga9a84bba4713dfced21a1a56163be1f48

nateanl · 2021-12-13T13:40:27Z

torchaudio/csrc/ffmpeg/buffer.cpp

+torch::Tensor convert_audio_tensor(AVFrame* pFrame) {
+  // ref: https://ffmpeg.org/doxygen/4.1/filter__audio_8c_source.html#l00215
+  AVSampleFormat format = static_cast<AVSampleFormat>(pFrame->format);
+  int num_channels = pFrame->channels;


I feel like num_channels can be defined when initializing the Buffer object? As each Buffer object will receive signal from one Streamer, which fixed the num_channels after initialization.

I agree with you, but unfortunately the shape information is not necessarily available at the time Buffer object is initialized, due to a filter graph operation inserted before the buffer, which can change number of audio channels, image size, image channels and rate. The structure passed around does not have the explicit information of these.

Let me figure this out later. along with the buffer size concern you brought up.

nateanl · 2021-12-13T13:48:48Z

torchaudio/csrc/ffmpeg/buffer.cpp

+} // namespace
+
+void Buffer::push_audio_frame(AVFrame* pFrame) {
+  chunks.push_back(convert_audio_tensor(pFrame));


Do we want to check the size of chunks so that if it exceeds the buffer size, we might want to pop out the last chunk?

Yeah, let me follow-up this one. I am still figuring out a safe way to process chunks of different rates at the same time. (sample rate / frame rate)

facebook-github-bot · 2021-12-23T19:32:30Z

@mthrok has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

Summary: Part of #1986. Splitting the PR for easier review. Add StreamProcessor class that bundles `Buffer`, `FilterGraph` and `Decoder`. Note: The API to retrieve the buffered Tensors is tentative. For the overall architecture, see https://github.com/mthrok/audio/blob/ffmpeg/torchaudio/csrc/ffmpeg/README.md. Note: Without a change to build process, the code added here won't be compiled. The build process will be updated later. Needs to be imported after #2044. Pull Request resolved: #2045 Reviewed By: carolineechen Differential Revision: D33299858 Pulled By: mthrok fbshipit-source-id: d85bececed475f45622743f137dd59cb1390ceed

Summary: Part of pytorch#1986. Splitting the PR for easier review. Add Buffer class that is responsible for converting `AVFrame` to `Tensor`. Note: The API to retrieve the buffered Tensors is tentative. For the overall architecture, see https://github.com/mthrok/audio/blob/ffmpeg/torchaudio/csrc/ffmpeg/README.md. Note: Without a change to build process, the code added here won't be compiled. The build process will be updated later. Needs to be imported after pytorch#2043. Pull Request resolved: pytorch#2044 Reviewed By: carolineechen Differential Revision: D32940553 Pulled By: mthrok fbshipit-source-id: 8b8b2222ad7b47edc17e9139420e8a71c00d726a

Summary: Part of pytorch#1986. Splitting the PR for easier review. Add StreamProcessor class that bundles `Buffer`, `FilterGraph` and `Decoder`. Note: The API to retrieve the buffered Tensors is tentative. For the overall architecture, see https://github.com/mthrok/audio/blob/ffmpeg/torchaudio/csrc/ffmpeg/README.md. Note: Without a change to build process, the code added here won't be compiled. The build process will be updated later. Needs to be imported after pytorch#2044. Pull Request resolved: pytorch#2045 Reviewed By: carolineechen Differential Revision: D33299858 Pulled By: mthrok fbshipit-source-id: d85bececed475f45622743f137dd59cb1390ceed

pytorch-probot bot added the ciflow/default label Nov 28, 2021

facebook-github-bot added the CLA Signed label Nov 28, 2021

mthrok mentioned this pull request Nov 28, 2021

Add StreamProcessor class #2045

Closed

mthrok force-pushed the ffmpeg-buffer branch 2 times, most recently from 314bbeb to fdeab57 Compare December 8, 2021 03:30

nateanl reviewed Dec 13, 2021

View reviewed changes

Add Buffer class

1b44c5e

mthrok force-pushed the ffmpeg-buffer branch from fdeab57 to 1b44c5e Compare December 23, 2021 19:21

carolineechen approved these changes Dec 23, 2021

View reviewed changes

facebook-github-bot closed this in c6de2a1 Dec 24, 2021

mthrok deleted the ffmpeg-buffer branch December 24, 2021 01:26

mthrok mentioned this pull request Jan 5, 2022

Streaming API progress overview #2138

Closed

30 tasks

mthrok added module: IO new feature prototype labels Feb 14, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Buffer class #2044

Add Buffer class #2044

mthrok commented Nov 28, 2021

facebook-github-bot commented Dec 8, 2021

nateanl Dec 13, 2021

mthrok Dec 17, 2021

nateanl Dec 13, 2021

mthrok Dec 17, 2021

nateanl Dec 13, 2021

mthrok Dec 17, 2021

nateanl Dec 13, 2021

mthrok Dec 17, 2021

facebook-github-bot commented Dec 23, 2021

Add Buffer class #2044

Add Buffer class #2044

Conversation

mthrok commented Nov 28, 2021

facebook-github-bot commented Dec 8, 2021

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

facebook-github-bot commented Dec 23, 2021