bbc cannot extract playlist #28115
Comments
This is yet another duplicate of TL;DR (or DS = didn't search):
Workarounds:
Reformat
Inspect Page Source and search for
What's important is the pid values, e.g.
save it as a text file named |
Fixed in a400024.
BBC reel playlist URLs are not handled properly(does not handle |
It would be easy to fix if there was a way of distinguishing when the
The extractor doesn't need to know whether the parameter was set explicitly by the user or is the default value (the default value has to be treated the same way as if the user had passed the option). You can look at other extractors that handle the |
The default is wrong: it should be Apparently the other extractors can't handle the case where basically the same page is fetched using different URLs that imply distinct default playlist handling. For instance:
The correct logic for a URL that has both a single video and a playlist is:
If Even if the different URL formats were handled by separate extractors, it wouldn't help to disambiguate the |
This URL would not be considered to imply a video; instead, that would be determined after checking the |
Actually, like the other URL formats of this type, the vpid= type has a focused video; unlike the others it's not the first in the list under the video. So it does imply a video, especially as its vpid is mentioned.
There is also the third case https://www.bbc.com/reel/video/p099tghy/is-phrenology-the-weirdest-pseudoscience-of-them-all- which is apparently identical to https://www.bbc.com/reel/playlist/mind-matters. I reviewed the results of
Suppose that a user goes to a page with a video and wants to archive the video. The page (say, https://www.bbc.com/reel/video/p099tghy/is-phrenology-the-weirdest-pseudoscience-of-them-all-) happens to have a playlist that can be extracted, so the user ends up with 57 (12, in this case) other unexpected videos. The unhappy user can make --no-playlist the configuration default to avoid such a surprise.
Then the same user goes to a playlist page (say, https://www.bbc.com/reel/playlist/mind-matters) that happens to have an active video and finds that only that video is fetched. The user is unhappy again.
Whereas, if --no-playlist and --yes-playlist operate independently (equivalently, |
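To make the three URL styles mentioned in this comment concrete, here is a minimal sketch of how they might be told apart. It is not the extractor's actual code: `classify_reel_url` is a hypothetical helper, and the rule that a `vpid=` query implies a focused video follows the view argued in this comment, which is disputed elsewhere in the thread.

```python
import re
from urllib.parse import urlparse, parse_qs


def classify_reel_url(url):
    """Return (kind, pid) for a BBC Reel URL.

    kind is 'video' or 'playlist' (None if the URL is not recognised);
    pid is the focused video id if the URL names one, else None.
    Hypothetical helper for illustration only.
    """
    parsed = urlparse(url)

    # Style 1: /reel/video/<pid>/<slug> names a video directly.
    m = re.match(r'/reel/video/(?P<pid>[^/]+)', parsed.path)
    if m:
        return 'video', m.group('pid')

    # Styles 2 and 3: /reel/playlist/<slug>, optionally with ?vpid=<pid>.
    if re.match(r'/reel/playlist/[^/]+', parsed.path):
        vpid = parse_qs(parsed.query).get('vpid', [None])[0]
        if vpid:
            # Assumption: a vpid query means the URL focuses on one video.
            return 'video', vpid
        return 'playlist', None

    return None, None
```

With the URLs quoted in this thread, the bare playlist URL classifies as a playlist, while the `?vpid=p0962h5x` form and the `/reel/video/p099tghy/...` form both classify as a video with the corresponding pid.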
for https://www.bbc.com/reel/playlist/mind-matters, it's a playlist URL and it will be treated this way regardless of |
The same page can have both a video and a playlist and the interpretation of which is to be processed depends only on the URL. https://www.bbc.com/reel/playlist/mind-matters is a URL "referring to a video and a playlist", to quote the manual, so --no-playlist ought to be respected. But the other two URL styles that I quoted, which are plainly video and not playlist URLs, refer to an essentially identical page. They should yield the video by default but then it's impossible to override that with --yes-playlist because the option processing doesn't record that --yes-playlist was used. In summary, the change from False to None in
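The behaviour argued for in this comment can be sketched as a three-state decision. This is only an illustration under stated assumptions, not youtube-dl's real option handling: `resolve_target` and `url_kind` are hypothetical names, and the sketch assumes the `noplaylist` parameter is None when neither --no-playlist nor --yes-playlist was given.

```python
def resolve_target(url_kind, noplaylist=None):
    """Decide whether to fetch the single video or the whole playlist.

    url_kind:   'video' or 'playlist', i.e. what the URL itself implies.
    noplaylist: True for --no-playlist, False for --yes-playlist,
                None when the user gave neither (assumed default).
    Hypothetical helper for illustration only.
    """
    if url_kind not in ('video', 'playlist'):
        raise ValueError('url_kind must be "video" or "playlist"')

    if noplaylist is True:
        # --no-playlist: always prefer the single video.
        return 'video'
    if noplaylist is False:
        # --yes-playlist: always prefer the playlist.
        return 'playlist'
    # Neither option given: fall back to what the URL implies.
    return url_kind
```

With a boolean default of False, the last branch can never be reached, which is why a video-style URL could not be overridden with --yes-playlist while still yielding the video by default.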
See https://github.com/dirkf/youtube-dl/tree/df-bbcreel-playlist-patch. |
It's a breaking change: it would change the default behaviour.
the description of the option states this:
so the option would apply only to |
Checklist
Verbose log
Description
Microsoft Windows XP [Version 5.1.2600]
(C) Copyright 1985-2001 Microsoft Corp.
N:\Movies>youtube-dl --version
2021.02.04.1
N:\Movies>youtube-dl --verbose https://www.bbc.com/reel/playlist/mind-matters?vpid=p0962h5x
[debug] System config: []
[debug] User config: []
[debug] Custom config: []
[debug] Command-line args: ['--verbose', 'https://www.bbc.com/reel/playlist/mind-matters?vpid=p0962h5x']
[debug] Encodings: locale cp1252, fs mbcs, out cp437, pref cp1252
[debug] youtube-dl version 2021.02.04.1
[debug] Python version 3.4.4 (CPython) - Windows-XP-5.1.2600-SP3
[debug] exe versions: ffmpeg N-77883-gd7c75a5, ffprobe N-77883-gd7c75a5, phantomjs 1.9.7
[debug] Proxy map: {}
[bbc] mind-matters: Downloading webpage
ERROR: Unable to extract playlist data; please report this issue on https://yt-dl.org/bug . Make sure you are using the latest version; type youtube-dl -U to update. Be sure to call youtube-dl with the --verbose flag and include its complete output.
Traceback (most recent call last):
File "C:\Users\dst\AppData\Roaming\Build archive\youtube-dl\ytdl-org\tmpgi7ngq0n\build\youtube_dl\YoutubeDL.py", line 806, in wrapper
File "C:\Users\dst\AppData\Roaming\Build archive\youtube-dl\ytdl-org\tmpgi7ngq0n\build\youtube_dl\YoutubeDL.py", line 827, in __extract_info
File "C:\Users\dst\AppData\Roaming\Build archive\youtube-dl\ytdl-org\tmpgi7ngq0n\build\youtube_dl\extractor\common.py", line 532, in extract
File "C:\Users\dst\AppData\Roaming\Build archive\youtube-dl\ytdl-org\tmpgi7ngq0n\build\youtube_dl\extractor\bbc.py", line 1176, in _real_extract
File "C:\Users\dst\AppData\Roaming\Build archive\youtube-dl\ytdl-org\tmpgi7ngq0n\build\youtube_dl\extractor\common.py", line 1010, in _search_regex
youtube_dl.utils.RegexNotFoundError: Unable to extract playlist data; please report this issue on https://yt-dl.org/bug . Make sure you are using the latest version; type youtube-dl -U to update. Be sure to call youtube-dl with the --verbose flag and include its complete output.
N:\Movies>