Avoid relying on PyAV-provided video frame count #6929

SpecLad · 2023-09-29T12:44:00Z

Motivation and context

They are not reliable.

In particular, MP4 has a feature called "edit lists" that allows you to set a custom playback order for the media data. With edit lists, you could only specify that a particular range of frames should be played, or that a range should be played multiple times, etc. See the following for technical details:

https://developer.apple.com/documentation/quicktime-file-format/edit_list_atom

FFmpeg follows edit lists when decoding videos. However, the frame count returned by PyAV's Stream.frames property is the number of frames in the raw media data and does not reflect the modifications applied by an edit list.

When we build a video manifest, we use Stream.frames if it's non-zero. Therefore, in the presence of an edit list we will obtain a frame count that does not match the actual number of frames that we can get out of the video.

FWIW, edit lists are probably not the only way that Stream.frames could be inaccurate, it's just the reason behind a specific problem I encountered.

Since we already have to handle the situation where Stream.frames is not available, just pretend it doesn't exist and always count frames by traversing the entire video. I don't think it even matters much, since we have to do it anyway to build the rest of the manifest.

We also have to stop validating the frame count in a user-provided manifest, which is unfortunate, but it doesn't seem worthwhile to decode the entire video just for that.

How has this been tested?

I checked that dataset_manifest/create.py now calculates the correct number of frames for a file with an edit list. I also tested the same file by uploading it to CVAT.

Checklist

I submit my changes into the develop branch
I have added a description of my changes into the CHANGELOG file
~~[ ] I have updated the documentation accordingly~~
~~[ ] I have added tests to cover my changes~~
~~[ ] I have linked related issues (see GitHub docs)~~
[ ] I have increased versions of npm packages if it is necessary
(cvat-canvas,
cvat-core,
cvat-data and
cvat-ui)

License

I submit my code changes under the same MIT License that covers the project.
Feel free to contact the maintainers if that's a concern.

They are not reliable. In particular, MP4 has a feature called "edit lists" that allows you to set a custom playback order for the media data. With edit lists, you could only specify that a particular range of frames should be played, or that a range should be played multiple times, etc. See the following for technical details: https://developer.apple.com/documentation/quicktime-file-format/edit_list_atom FFmpeg follows edit lists when decoding videos. However, the frame count returned by PyAV's `Stream.frames` property is the number of frames in the raw media data and does not reflect the modifications applied by an edit list. When we build a video manifest, we use `Stream.frames` if it's non-zero. Therefore, in the presence of an edit list we will obtain a frame count that does not match the actual number of frames that we can get out of the video. FWIW, edit lists are probably not the only way that `Stream.frames` could be inaccurate, it's just the reason behind a specific problem I encountered. Since we already have to handle the situation where `Stream.frames` is not available, just pretend it doesn't exist and always count frames by traversing the entire video. I don't think it even matters much, since we have to do it anyway to build the rest of the manifest. We also have to stop validating the frame count in a user-provided manifest, which is unfortunate, but it doesn't seem worthwhile to decode the entire video just for that.

codecov · 2023-09-29T14:13:58Z

Codecov Report

Merging #6929 (b98f3d3) into develop (d497bb6) will increase coverage by 0.00%.
The diff coverage is 100.00%.

@@           Coverage Diff            @@
##           develop    #6929   +/-   ##
========================================
  Coverage    82.52%   82.52%           
========================================
  Files          360      360           
  Lines        38908    38895   -13     
  Branches      3544     3544           
========================================
- Hits         32108    32100    -8     
+ Misses        6800     6795    -5

Components	Coverage Δ
cvat-ui	`77.62% <ø> (-0.01%)`	⬇️
cvat-server	`87.00% <100.00%> (+0.02%)`	⬆️

nmanovic

LGTM

They are not reliable. In particular, MP4 has a feature called "edit lists" that allows you to set a custom playback order for the media data. With edit lists, you could only specify that a particular range of frames should be played, or that a range should be played multiple times, etc. See the following for technical details: https://developer.apple.com/documentation/quicktime-file-format/edit_list_atom FFmpeg follows edit lists when decoding videos. However, the frame count returned by PyAV's `Stream.frames` property is the number of frames in the raw media data and does not reflect the modifications applied by an edit list. When we build a video manifest, we use `Stream.frames` if it's non-zero. Therefore, in the presence of an edit list we will obtain a frame count that does not match the actual number of frames that we can get out of the video. FWIW, edit lists are probably not the only way that `Stream.frames` could be inaccurate, it's just the reason behind a specific problem I encountered. Since we already have to handle the situation where `Stream.frames` is not available, just pretend it doesn't exist and always count frames by traversing the entire video. I don't think it even matters much, since we have to do it anyway to build the rest of the manifest. We also have to stop validating the frame count in a user-provided manifest, which is unfortunate, but it doesn't seem worthwhile to decode the entire video just for that.

SpecLad force-pushed the no-easy-frame-count branch from ebe2f31 to b98f3d3 Compare September 29, 2023 13:14

SpecLad marked this pull request as ready for review September 29, 2023 13:24

SpecLad requested review from azhavoro, mdacoca and Marishka17 as code owners September 29, 2023 13:24

nmanovic approved these changes Oct 2, 2023

View reviewed changes

nmanovic merged commit 4cdd68e into cvat-ai:develop Oct 2, 2023

SpecLad deleted the no-easy-frame-count branch October 9, 2023 15:51

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Avoid relying on PyAV-provided video frame count #6929

Avoid relying on PyAV-provided video frame count #6929

SpecLad commented Sep 29, 2023 •

edited

Loading

codecov bot commented Sep 29, 2023

nmanovic left a comment

Avoid relying on PyAV-provided video frame count #6929

Avoid relying on PyAV-provided video frame count #6929

Conversation

SpecLad commented Sep 29, 2023 • edited Loading

Motivation and context

How has this been tested?

Checklist

License

codecov bot commented Sep 29, 2023

Codecov Report

nmanovic left a comment

Choose a reason for hiding this comment

SpecLad commented Sep 29, 2023 •

edited

Loading