aboutsummaryrefslogtreecommitdiffstats
path: root/youtube/yt_data_extract/everything_else.py
Commit message (Collapse)AuthorAgeFilesLines
* Update channel to new ctoken formatJesus E2023-05-281-5/+6
| | | | | | Huge thanks to @michaelweiser Different sortings still don't work for videos and playlists
* Fix videos added to playlist from channel missing author_idJames Taylor2021-08-311-1/+5
| | | | Signed-off-by: Jesús <heckyel@hyperbola.info>
* Fix comment reply url extraction due to youtube changes0.1.0James Taylor2021-08-231-3/+7
| | | | Signed-off-by: Jesús <heckyel@hyperbola.info>
* Switch to new comments api now that old one is being disabledJames Taylor2021-08-091-4/+2
| | | | | | | | | | | watch_comment api periodically gives the error "Top level comments mweb servlet is turned down." The continuation items for the new api are in a different arrangement in the json, so changes were necessary to the extract_items function. Signed-off-by: Jesús <heckyel@hyperbola.info>
* Use extract_approx_int for comment likesJames Taylor2021-06-101-2/+2
| | | | | | | | Full digits no longer available Closes #64 Signed-off-by: Jesús <heckyel@hyperbola.info>
* Fix comment like extraction due to Youtube changesJames Taylor2021-05-171-0/+2
| | | | | | Variable name changed from likeCount to voteCount Signed-off-by: Jesús <heckyel@hyperbola.info>
* Channel about: Add http:// to links without itJames Taylor2021-05-061-0/+2
| | | | | | So that the link is not interpretted as a relative link Signed-off-by: Jesús <heckyel@hyperbola.info>
* Channel: Allow going to next pages of playlists pageJames Taylor2021-03-151-1/+1
| | | | | | | Uses previous and next buttons. Now can view more than just first page of playlists page Signed-off-by: Jesús <heckyel@hyperbola.info>
* Fix comment repliesJames Taylor2021-02-261-0/+6
| | | | | | | | | | | | | Comment reply protobuf now requires the channel id of the uploader of the video. Otherwise the endpoint returns 500. Instead of making the protobuf ourselves and passing this data around through query parameters, just use the ctoken provided to us but modify the max_replies field from 10 to 250. Fixes #53 Signed-off-by: Jesús <heckyel@hyperbola.info>
* channel: replace page #s w/ next page button using provided ctokenJames Taylor2020-12-181-0/+2
| | | | | | Since yt doesn't accept page #'s when sorting by oldest Signed-off-by: Jesús <heckyel@hyperbola.info>
* remove trailing whitespaceszrose5842020-10-211-1/+1
|
* yt_data_extract: normalize thumbnail and author urlsJames Taylor2020-10-191-6/+6
| | | | | | | | | | for instance, urls that start with // become https:// adjustment required in comments.py because the url was left as a relative url in yt_data_extract by mistake and was using URL_ORIGIN prefix as fix. see #31
* Switch to mobile api endpoint to fix 'Unknown error' blockageJames Taylor2020-08-111-9/+18
| | | | See https://github.com/iv-org/invidious/issues/1319#issuecomment-671732646
* extract_items: Handle case where continuation has multipleJames Taylor2020-08-111-1/+2
| | | | | | | | | | | | [something]Continuation renderers, all of which are junk except one. Check the items in each one until the one which contains the items being sought is found. The usage in extract_comments_info needed to be changed to specify the items being sought. It was unspecified before which is strictly incorrect since extract_items by default looks for video/playlist/channel thumbnail items. It was relying on this special case for continuations. But now that wouldn't work anymore.
* extract_channel_info: Improve error extractionJames Taylor2020-08-111-3/+6
| | | | | | | | Use extract_str function since it's not always 'simpleText' Make sure we don't output an empty error message if we don't know what it is. channel.py: Don't check if error message is empty, check if it's None
* Regression: Fix channel extraction 'items' key not present when there's no ↵James Taylor2019-12-231-2/+3
| | | | | | items. Examples: Empty channels, no search results
* Channel: Change search results to use next and previous page buttonsJames Taylor2019-12-231-1/+3
| | | | Because youtube doesn't give the number of search results, so previous behavior would give an error if a page number out of range was selected.
* Rewrite channel extraction with proper error handling and new extraction ↵James Taylor2019-12-211-43/+35
| | | | | | names. Extract subscriber_count correctly. Don't just shove english strings into info['stats']. Actually give semantic names for the stats.
* Extraction: Move stuff around in files and put underscores in front of ↵James Taylor2019-12-191-2/+2
| | | | | | internal helper function names Move get_captions_url in watch_extraction to bottom next to other exported, public functions
* Extraction: Split yt_data_extract.py into multiple filesJames Taylor2019-12-191-0/+273