[chaturbate] fix url extraction and parsing #23012

ghost · 2019-11-08T02:19:03Z

Before submitting a pull request make sure you have:

At least skimmed through adding new extractor tutorial and youtube-dl coding conventions sections
Searched the bugtracker for similar pull requests
Checked the code with flake8

In order to be accepted and merged into youtube-dl each piece of code must be in public domain or released under Unlicense. Check one of the following options:

I am the original author of this code and I am willing to release it under Unlicense
I am not the original author of this code but it is in public domain or released under Unlicense (provide reliable evidence)

What is the purpose of your pull request?

Bug fix
Improvement
New extractor
New feature

Description of your pull request and other information

Fixes issue #23010

The chaturbate extractor broke some time this evening. The regex looking for .m3u8 URLs no longer matched anything. Additionally, the URL now needs a bit of extra processing.

The page source contains a large JavaScript string containing an encoded JSON object. The nested quotes, as well as other special characters, are encoded as e.g. \u0022. So, we match the URL delimited by \u0022 or \u0027 (double or single quotes, respectively) and then decode any \uXXXX sequences in the match group.

ghost · 2019-11-09T14:46:56Z

Noob here, how do I implement these changes?

ghost · 2019-11-10T06:14:58Z

@Ubernoob7 Here's one way to do it:

Download my fork: https://github.com/throwaway396/youtube-dl/archive/master.zip
Extract it somewhere convenient
Open a terminal (command prompt) in the youtube-dl-master folder you just extracted
Run python youtube_dl/__main__.py "https://chaturbate.com/model/"

I'm assuming you already have Python installed and in your PATH.

casualreader · 2019-11-10T20:07:07Z

A simple interim fix: inserting a cookie "cb_legacy=1" reverts to the old behaviour.

*** youtube_dl/extractor/chaturbate.py.orig
--- youtube_dl/extractor/chaturbate.py

*** 31,37 ****
def _real_extract(self, url):
video_id = self._match_id(url)

! webpage = self._download_webpage(url, video_id)

      m3u8_urls = []

--- 31,39 ----
def _real_extract(self, url):
video_id = self._match_id(url)

! webpage = self._download_webpage(url, video_id, headers={
! 'Cookie': 'cb_legacy=1',
! })

      m3u8_urls = []

youtube-dl.patch.txt

pcjamesy · 2019-11-13T21:31:02Z

Question for you, I have a crontab and a run-one system to automagiclly record streams when they go live, or come back from one of the private modes. The script it as follows "run-one youtube-dl -o '/home/ubuntu/video/%(title)s.%(ext)s' https://chaturbate.com/URL/"

How would I take your branch that i've installed and make it run from anyfolder when "youtube-dl" is called?

purrsevere · 2019-11-13T22:14:26Z

@pcjamesy Install it globally.

git clone https://github.com/throwaway396/youtube-dl /tmp/youtube-dl
cd /tmp/youtube-dl 
python3 setup.py install

CashMoney6980 · 2019-11-21T04:32:03Z

@pcjamesy Install it globally.

git clone https://github.com/throwaway396/youtube-dl /tmp/youtube-dl
cd /tmp/youtube-dl 
python3 setup.py install

This worked for me! Many thanks!

Alex999Rus · 2020-12-11T11:06:42Z

guys who can record a video for me how to fix it on windows 7

[chaturbate] fix url extraction and parsing

9dd209b

ghost mentioned this pull request Nov 8, 2019

[chaturbate] fix url extraction and parsing #23011

Closed

9 tasks

pcjamesy approved these changes Nov 8, 2019

View reviewed changes

kenorb mentioned this pull request Nov 18, 2019

[Chaturbate] 403: Forbidden for socks5 proxy #23133

Closed

6 tasks

dstftw closed this in f0f6a7e Nov 21, 2019

meunierd referenced this pull request in meunierd/youtube-dl Feb 13, 2020

[chaturbate] Fix extraction (closes #23010, closes #23012)

82fe256

pareronia referenced this pull request in pareronia/youtube-dl Jun 22, 2020

[chaturbate] Fix extraction (closes #23010, closes #23012)

b8b99d1

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[chaturbate] fix url extraction and parsing #23012

[chaturbate] fix url extraction and parsing #23012

ghost commented Nov 8, 2019

ghost commented Nov 9, 2019

ghost commented Nov 10, 2019

casualreader commented Nov 10, 2019

pcjamesy commented Nov 13, 2019

purrsevere commented Nov 13, 2019

CashMoney6980 commented Nov 21, 2019

Alex999Rus commented Dec 11, 2020

[chaturbate] fix url extraction and parsing #23012

[chaturbate] fix url extraction and parsing #23012

Conversation

ghost commented Nov 8, 2019

Before submitting a pull request make sure you have:

In order to be accepted and merged into youtube-dl each piece of code must be in public domain or released under Unlicense. Check one of the following options:

What is the purpose of your pull request?

Description of your pull request and other information

ghost commented Nov 9, 2019

ghost commented Nov 10, 2019

casualreader commented Nov 10, 2019

pcjamesy commented Nov 13, 2019

purrsevere commented Nov 13, 2019

CashMoney6980 commented Nov 21, 2019

Alex999Rus commented Dec 11, 2020