
Analysis gets stuck with corrupt files #286

Closed
paulijar opened this issue Feb 17, 2021 · 3 comments

@paulijar
Contributor

I got a report from a user that scanning the files is insanely slow, taking an hour or more per file. It turned out that his library contained some corrupted mp3 files which were causing the slowness.

I got some sample files and could reproduce the issue: calling getID3::analyze on these files seemed to cause some kind of busy loop, as CPU load hit 100% and nothing else was happening. I waited for 45 minutes and the analysis still hadn't finished. As mentioned, though, my user reported that the scanning did eventually move on to the next files.

Now, obviously these files are broken and no metadata can be extracted from them. But could getID3 maybe bail out a bit sooner on these? I'll send the sample files by email.
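For reference, the call that hangs is just the standard getID3 usage, roughly like this (paths and filenames here are placeholders for one of the corrupt samples):

```php
<?php
require_once 'getid3/getid3.php';

$getID3 = new getID3();

// On a corrupt sample this call pegs one CPU core and does not return
// within any reasonable time (45+ minutes in my test).
$info = $getID3->analyze('/path/to/corrupt-sample.mp3');

var_dump($info['error'] ?? 'no error reported');
```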

@JamesHeinrich
Owner

I have downloaded the sample files and confirmed there's something wrong (even corrupt MP3s shouldn't take more than a couple of seconds to analyze or be rejected). I have not had time to look in detail at where or how it's getting stuck, but I will look at it in the next day or two. Thanks for the samples.

JamesHeinrich added a commit that referenced this issue Feb 20, 2021
#286
Prevent apparently-mp3 files with large number of 0xFF chars from stalling scanning
@JamesHeinrich
Owner

Should be fixed in 1490b43

Your files are "special" in that they consist almost entirely of 0xFF bytes, which means the code that looks for the next valid MPEG-audio sequence has to examine every single byte (at least within the first 128kB of the file); that is where the immense slowdown was taking place. I have added a failsafe escape route where the loop is broken after examining 1000 false syncs. On my test system those corrupt files now finish scanning (with the appropriate error) in about 0.1s each.
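For illustration, the shape of the fix is roughly the following. This is a minimal sketch, not the actual getID3 code; the function names, the $maxFalseSyncs parameter, and the simplified header check are invented for this example.

```php
<?php
// Sketch of the idea behind the fix: while scanning for the next MPEG-audio
// frame sync, count false syncs and bail out once a limit (e.g. 1000) is
// reached instead of examining every remaining byte of the scanned region.

function isPlausibleMpegHeader($header4) {
    if (strlen($header4) < 4) {
        return false;
    }
    $b1 = ord($header4[1]);
    if (($b1 & 0xE0) !== 0xE0)        return false; // sync bits must continue into the second byte
    if ((($b1 >> 3) & 0x03) === 0x01) return false; // reserved MPEG version
    if ((($b1 >> 1) & 0x03) === 0x00) return false; // reserved layer
    if (((ord($header4[2]) >> 4) & 0x0F) === 0x0F) return false; // invalid bitrate index
    return true;
}

function findNextValidFrame($data, $maxFalseSyncs = 1000) {
    $falseSyncs = 0;
    $length = strlen($data);
    for ($offset = 0; $offset < $length - 4; $offset++) {
        if (ord($data[$offset]) !== 0xFF) {
            continue; // not a sync byte, keep scanning
        }
        if (isPlausibleMpegHeader(substr($data, $offset, 4))) {
            return $offset; // plausible frame header found
        }
        // A file consisting almost entirely of 0xFF bytes hits this branch at
        // nearly every offset; the cap turns an effectively unbounded scan
        // into a quick failure.
        if (++$falseSyncs >= $maxFalseSyncs) {
            return false; // give up; the caller reports the file as unreadable
        }
    }
    return false;
}
```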

@paulijar
Contributor Author

Now I finally had time to test the fix, and it seems to work fine. Thanks!
