cbsnews.com broken for weeks #26165

Open
opened 2026-02-21 14:10:40 -05:00 by deekerman · 1 comment
Owner

Originally created by @Tetracerus on GitHub (Apr 8, 2023).

  • I'm reporting a broken site support
  • I've verified that I'm running youtube-dl version 2021.12.17
  • I've checked that all provided URLs are alive and playable in a browser
  • I've checked that all URLs and arguments with special characters are properly quoted or escaped
  • I've searched the bugtracker for similar issues including closed ones

Verbose log

$ youtube-dl -v -F https://www.cbsnews.com/video/sunday-morning-full-episode-4-2-2023/
[debug] System config: []
[debug] User config: []
[debug] Custom config: []
[debug] Command-line args: ['-v', '-F', 'https://www.cbsnews.com/video/sunday-morning-full-episode-4-2-2023/']
[debug] Encodings: locale UTF-8, fs utf-8, out UTF-8, pref UTF-8
[debug] youtube-dl version 2021.12.17
[debug] Python version 3.6.8 (CPython) - Linux-4.18.0-425.13.1.el8_7.x86_64-x86_64-with-centos-8.7-Green_Obsidian
[debug] exe versions: ffmpeg 4.2.8, ffprobe 4.2.8
[debug] Proxy map: {}
[cbsnews] sunday-morning-full-episode-4-2-2023: Downloading webpage
[cbsnews] A0u8tIz70VHrE_UwZAjPeMOAa54TH4T5: Downloading XML
ERROR: A0u8tIz70VHrE_UwZAjPeMOAa54TH4T5: Failed to parse XML  (caused by ParseError('no element found: line 1, column 0',)); please report this issue on https://yt-dl.org/bug . Make sure you are using the latest version; see  https://yt-dl.org/update  on how to update. Be sure to call youtube-dl with the --verbose flag and include its complete output.
Traceback (most recent call last):
  File "/usr/local/bin/youtube_dl/extractor/common.py", line 857, in _parse_xml
    return compat_etree_fromstring(xml_string.encode('utf-8'))
  File "/usr/local/bin/youtube_dl/compat.py", line 2611, in compat_etree_fromstring
    return etree.XML(text, parser=etree.XMLParser(target=_TreeBuilder()))
  File "/usr/lib64/python3.6/xml/etree/ElementTree.py", line 1315, in XML
    return parser.close()
  File "<string>", line None
xml.etree.ElementTree.ParseError: no element found: line 1, column 0
Traceback (most recent call last):
  File "/usr/local/bin/youtube_dl/extractor/common.py", line 857, in _parse_xml
    return compat_etree_fromstring(xml_string.encode('utf-8'))
  File "/usr/local/bin/youtube_dl/compat.py", line 2611, in compat_etree_fromstring
    return etree.XML(text, parser=etree.XMLParser(target=_TreeBuilder()))
  File "/usr/lib64/python3.6/xml/etree/ElementTree.py", line 1315, in XML
    return parser.close()
  File "<string>", line None
xml.etree.ElementTree.ParseError: no element found: line 1, column 0

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "/usr/local/bin/youtube_dl/YoutubeDL.py", line 818, in wrapper
    return func(self, *args, **kwargs)
  File "/usr/local/bin/youtube_dl/YoutubeDL.py", line 839, in __extract_info
    ie_result = ie.extract(url)
  File "/usr/local/bin/youtube_dl/extractor/common.py", line 535, in extract
    ie_result = self._real_extract(url)
  File "/usr/local/bin/youtube_dl/extractor/cbsnews.py", line 108, in _real_extract
    return self._extract_video_info(item['mpxRefId'], 'cbsnews')
  File "/usr/local/bin/youtube_dl/extractor/cbs.py", line 63, in _extract_video_info
    content_id, query={'partner': site, 'contentId': content_id})
  File "/usr/local/bin/youtube_dl/extractor/common.py", line 850, in _download_xml
    expected_status=expected_status)
  File "/usr/local/bin/youtube_dl/extractor/common.py", line 834, in _download_xml_handle
    fatal=fatal), urlh
  File "/usr/local/bin/youtube_dl/extractor/common.py", line 861, in _parse_xml
    raise ExtractorError(errmsg, cause=ve)
youtube_dl.utils.ExtractorError: A0u8tIz70VHrE_UwZAjPeMOAa54TH4T5: Failed to parse XML  (caused by ParseError('no element found: line 1, column 0',)); please report this issue on https://yt-dl.org/bug . Make sure you are using the latest version; see  https://yt-dl.org/update  on how to update. Be sure to call youtube-dl with the --verbose flag and include its complete output.

Description

cbsnews.com videos stopped working a few weeks ago.

I just grabbed the latest snapshot of master minutes ago and it's still not working.

For this video:
https://www.cbsnews.com/video/sunday-morning-full-episode-4-2-2023/

We just need to parse index.html to find the URL to master.m3u8 file. Snippet:

Sunday Morning Full Episode 4/2","timestamp":1680440400000,"duration":3760,"durationLabel":"01:02:40","label":null,"images":{"sd":"https://assets1.cbsnewsstatic.com/hub/i/r/2023/04/02/a457a56d-498d-420b-9ba1-d91fab2ee07b/thumbnail/640x360/6bd395cc9a6be06149ad04014c1d9296/smjanepauley040223-1848688-640x360.jpg","hd":"https://assets3.cbsnewsstatic.com/hub/i/r/2023/04/02/a457a56d-498d-420b-9ba1-d91fab2ee07b/thumbnail/1280x720/58ec9371cb82766448dfed42a78b2182/smjanepauley040223-1848688-640x360.jpg"},"previewUrl":"https://splice.amlg.io/api/v2/video/A0u8tIz70VHrE_UwZAjPeMOAa54TH4T5/preview/","video":"https://prod.vodvideo.cbsnews.com/cbsnews/vr/hls/2023/03/22/2185212995641/1848685_hls/master.m3u8","video2":"https://prod.vodvideo.cbsnews.com/cbsnews/vr/hls/2023/03/22/2185212995641/1848685_hls/master.m3u8","format":"application/x-mpegURL","url":"https://www.cbsnews.com/video/sunday-morning-full-episode-4-2-2023/

If we fetch https://prod.vodvideo.cbsnews.com/cbsnews/vr/hls/2023/03/22/2185212995641/1848685_hls/master.m3u8 - it contains metadata on all the video and audio formats. Snippet:

#EXT-X-STREAM-INF:BANDWIDTH=634827,AVERAGE-BANDWIDTH=516066,CODECS="avc1.640028,mp4a.40.2",RESOLUTION=640x360,FRAME-RATE=29.970,VIDEO-RANGE=SDR,AUDIO="audio_aac",CLOSED-CAPTIONS=NONE 0402_SUNMO_FULL_1_1848685_375/stream.m3u8

That SUNMO_FULL.../stream.m3u8 is the video playlist:

Video/audio links (VLC can play) are:
AAC Audio: https://prod.vodvideo.cbsnews.com/cbsnews/vr/hls/2023/03/22/2185212995641/1848685_hls/0402_SUNMO_FULL_1_en-US_1848685_aac_128/stream.m3u8
640x360: https://prod.vodvideo.cbsnews.com/cbsnews/vr/hls/2023/03/22/2185212995641/1848685_hls/0402_SUNMO_FULL_1_1848685_375/stream.m3u8
768x432: https://prod.vodvideo.cbsnews.com/cbsnews/vr/hls/2023/03/22/2185212995641/1848685_hls/0402_SUNMO_FULL_1_1848685_750/stream.m3u8
960x540: https://prod.vodvideo.cbsnews.com/cbsnews/vr/hls/2023/03/22/2185212995641/1848685_hls/0402_SUNMO_FULL_1_1848685_1500/stream.m3u8
1280x720: https://prod.vodvideo.cbsnews.com/cbsnews/vr/hls/2023/03/22/2185212995641/1848685_hls/0402_SUNMO_FULL_1_1848685_2100/stream.m3u8
HLS 1920x1080: https://prod.vodvideo.cbsnews.com/cbsnews/vr/hls/2023/03/22/2185212995641/1848685_hls/0402_SUNMO_FULL_1_1848685_3000/stream.m3u8

Originally created by @Tetracerus on GitHub (Apr 8, 2023). - [X] I'm reporting a broken site support - [X] I've verified that I'm running youtube-dl version **2021.12.17** - [X] I've checked that all provided URLs are alive and playable in a browser - [X] I've checked that all URLs and arguments with special characters are properly quoted or escaped - [X] I've searched the bugtracker for similar issues including closed ones ## Verbose log <!-- Provide the complete verbose output of youtube-dl that clearly demonstrates the problem. Add the `-v` flag to your command line you run youtube-dl with (`youtube-dl -v <your command line>`), copy the WHOLE output and insert it below. It should look similar to this: [debug] System config: [] [debug] User config: [] [debug] Command-line args: [u'-v', u'http://www.youtube.com/watch?v=BaW_jenozKcj'] [debug] Encodings: locale cp1251, fs mbcs, out cp866, pref cp1251 [debug] youtube-dl version 2021.12.17 [debug] Python version 2.7.11 - Windows-2003Server-5.2.3790-SP2 [debug] exe versions: ffmpeg N-75573-g1d0487f, ffprobe N-75573-g1d0487f, rtmpdump 2.4 [debug] Proxy map: {} <more lines> --> ``` $ youtube-dl -v -F https://www.cbsnews.com/video/sunday-morning-full-episode-4-2-2023/ [debug] System config: [] [debug] User config: [] [debug] Custom config: [] [debug] Command-line args: ['-v', '-F', 'https://www.cbsnews.com/video/sunday-morning-full-episode-4-2-2023/'] [debug] Encodings: locale UTF-8, fs utf-8, out UTF-8, pref UTF-8 [debug] youtube-dl version 2021.12.17 [debug] Python version 3.6.8 (CPython) - Linux-4.18.0-425.13.1.el8_7.x86_64-x86_64-with-centos-8.7-Green_Obsidian [debug] exe versions: ffmpeg 4.2.8, ffprobe 4.2.8 [debug] Proxy map: {} [cbsnews] sunday-morning-full-episode-4-2-2023: Downloading webpage [cbsnews] A0u8tIz70VHrE_UwZAjPeMOAa54TH4T5: Downloading XML ERROR: A0u8tIz70VHrE_UwZAjPeMOAa54TH4T5: Failed to parse XML (caused by ParseError('no element found: line 1, column 0',)); please report this issue on https://yt-dl.org/bug . Make sure you are using the latest version; see https://yt-dl.org/update on how to update. Be sure to call youtube-dl with the --verbose flag and include its complete output. Traceback (most recent call last): File "/usr/local/bin/youtube_dl/extractor/common.py", line 857, in _parse_xml return compat_etree_fromstring(xml_string.encode('utf-8')) File "/usr/local/bin/youtube_dl/compat.py", line 2611, in compat_etree_fromstring return etree.XML(text, parser=etree.XMLParser(target=_TreeBuilder())) File "/usr/lib64/python3.6/xml/etree/ElementTree.py", line 1315, in XML return parser.close() File "<string>", line None xml.etree.ElementTree.ParseError: no element found: line 1, column 0 Traceback (most recent call last): File "/usr/local/bin/youtube_dl/extractor/common.py", line 857, in _parse_xml return compat_etree_fromstring(xml_string.encode('utf-8')) File "/usr/local/bin/youtube_dl/compat.py", line 2611, in compat_etree_fromstring return etree.XML(text, parser=etree.XMLParser(target=_TreeBuilder())) File "/usr/lib64/python3.6/xml/etree/ElementTree.py", line 1315, in XML return parser.close() File "<string>", line None xml.etree.ElementTree.ParseError: no element found: line 1, column 0 During handling of the above exception, another exception occurred: Traceback (most recent call last): File "/usr/local/bin/youtube_dl/YoutubeDL.py", line 818, in wrapper return func(self, *args, **kwargs) File "/usr/local/bin/youtube_dl/YoutubeDL.py", line 839, in __extract_info ie_result = ie.extract(url) File "/usr/local/bin/youtube_dl/extractor/common.py", line 535, in extract ie_result = self._real_extract(url) File "/usr/local/bin/youtube_dl/extractor/cbsnews.py", line 108, in _real_extract return self._extract_video_info(item['mpxRefId'], 'cbsnews') File "/usr/local/bin/youtube_dl/extractor/cbs.py", line 63, in _extract_video_info content_id, query={'partner': site, 'contentId': content_id}) File "/usr/local/bin/youtube_dl/extractor/common.py", line 850, in _download_xml expected_status=expected_status) File "/usr/local/bin/youtube_dl/extractor/common.py", line 834, in _download_xml_handle fatal=fatal), urlh File "/usr/local/bin/youtube_dl/extractor/common.py", line 861, in _parse_xml raise ExtractorError(errmsg, cause=ve) youtube_dl.utils.ExtractorError: A0u8tIz70VHrE_UwZAjPeMOAa54TH4T5: Failed to parse XML (caused by ParseError('no element found: line 1, column 0',)); please report this issue on https://yt-dl.org/bug . Make sure you are using the latest version; see https://yt-dl.org/update on how to update. Be sure to call youtube-dl with the --verbose flag and include its complete output. ``` ## Description <!-- Provide an explanation of your issue in an arbitrary form. Provide any additional information, suggested solution and as much context and examples as possible. If work on your issue requires account credentials please provide them or explain how one can obtain them. --> cbsnews.com videos stopped working a few weeks ago. I just grabbed the latest snapshot of master minutes ago and it's still not working. For this video: https://www.cbsnews.com/video/sunday-morning-full-episode-4-2-2023/ We just need to parse index.html to find the URL to master.m3u8 file. Snippet: `Sunday Morning Full Episode 4/2","timestamp":1680440400000,"duration":3760,"durationLabel":"01:02:40","label":null,"images":{"sd":"https://assets1.cbsnewsstatic.com/hub/i/r/2023/04/02/a457a56d-498d-420b-9ba1-d91fab2ee07b/thumbnail/640x360/6bd395cc9a6be06149ad04014c1d9296/smjanepauley040223-1848688-640x360.jpg","hd":"https://assets3.cbsnewsstatic.com/hub/i/r/2023/04/02/a457a56d-498d-420b-9ba1-d91fab2ee07b/thumbnail/1280x720/58ec9371cb82766448dfed42a78b2182/smjanepauley040223-1848688-640x360.jpg"},"previewUrl":"https://splice.amlg.io/api/v2/video/A0u8tIz70VHrE_UwZAjPeMOAa54TH4T5/preview/","video":"https://prod.vodvideo.cbsnews.com/cbsnews/vr/hls/2023/03/22/2185212995641/1848685_hls/master.m3u8","video2":"https://prod.vodvideo.cbsnews.com/cbsnews/vr/hls/2023/03/22/2185212995641/1848685_hls/master.m3u8","format":"application/x-mpegURL","url":"https://www.cbsnews.com/video/sunday-morning-full-episode-4-2-2023/` If we fetch https://prod.vodvideo.cbsnews.com/cbsnews/vr/hls/2023/03/22/2185212995641/1848685_hls/master.m3u8 - it contains metadata on all the video and audio formats. Snippet: `#EXT-X-STREAM-INF:BANDWIDTH=634827,AVERAGE-BANDWIDTH=516066,CODECS="avc1.640028,mp4a.40.2",RESOLUTION=640x360,FRAME-RATE=29.970,VIDEO-RANGE=SDR,AUDIO="audio_aac",CLOSED-CAPTIONS=NONE 0402_SUNMO_FULL_1_1848685_375/stream.m3u8` That SUNMO_FULL.../stream.m3u8 is the video playlist: Video/audio links (VLC can play) are: AAC Audio: https://prod.vodvideo.cbsnews.com/cbsnews/vr/hls/2023/03/22/2185212995641/1848685_hls/0402_SUNMO_FULL_1_en-US_1848685_aac_128/stream.m3u8 640x360: https://prod.vodvideo.cbsnews.com/cbsnews/vr/hls/2023/03/22/2185212995641/1848685_hls/0402_SUNMO_FULL_1_1848685_375/stream.m3u8 768x432: https://prod.vodvideo.cbsnews.com/cbsnews/vr/hls/2023/03/22/2185212995641/1848685_hls/0402_SUNMO_FULL_1_1848685_750/stream.m3u8 960x540: https://prod.vodvideo.cbsnews.com/cbsnews/vr/hls/2023/03/22/2185212995641/1848685_hls/0402_SUNMO_FULL_1_1848685_1500/stream.m3u8 1280x720: https://prod.vodvideo.cbsnews.com/cbsnews/vr/hls/2023/03/22/2185212995641/1848685_hls/0402_SUNMO_FULL_1_1848685_2100/stream.m3u8 HLS 1920x1080: https://prod.vodvideo.cbsnews.com/cbsnews/vr/hls/2023/03/22/2185212995641/1848685_hls/0402_SUNMO_FULL_1_1848685_3000/stream.m3u8
Author
Owner

@dirkf commented on GitHub (Apr 8, 2023):

This bug is already addressed in https://github.com/yt-dlp/yt-dlp/issues/6565 and can be fixed by back-porting the extractor changes from https://github.com/yt-dlp/yt-dlp/pull/6681.

@dirkf commented on GitHub (Apr 8, 2023): This bug is already addressed in https://github.com/yt-dlp/yt-dlp/issues/6565 and can be fixed by back-porting the extractor changes from https://github.com/yt-dlp/yt-dlp/pull/6681.
Sign in to join this conversation.
No milestone
No project
No assignees
1 participant
Notifications
Due date
The due date is invalid or out of range. Please use the format "yyyy-mm-dd".

No due date set.

Dependencies

No dependencies set.

Reference
starred/youtube-dl-ytdl-org#26165
No description provided.