Commit message (Collapse) | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | Fix playlist id extraction for radio renderers | James Taylor | 2019-12-31 | 1 | -1/+1 |
| | |||||
* | Extraction: Correctly extract view_count for vids with 0 views. | James Taylor | 2019-12-30 | 1 | -1/+9 |
| | | | | Also change superfluous use of multi_get to item.get nearby | ||||
* | extract_items: allow extracting items that are normally dug into for more | James Taylor | 2019-12-26 | 1 | -5/+5 |
| | | | | | By checking first if it's in item_types rather than checking if it can be dug into first. For example: this allows extracting things like sectionListRenderer | ||||
* | yt_data_extract: Split up extract_items so renderer extraction works ↵ | James Taylor | 2019-12-26 | 1 | -47/+48 |
| | | | | | | independently extract_items_from_renderer will extract given just a renderer rather than a response | ||||
* | yt_data_extract.common: Simplify usage of get functions and remove dead code | James Taylor | 2019-12-26 | 1 | -18/+11 |
| | | | | | | | Change usage of multi_deep_get to multi_get where possible Remove checking of type from calls to get functions (because it's very unlikely Youtube suddenly changes the type without changing the name of the variable or anything, and it takes up unnecessary space) Remove all default=None arguments from get functions, since those are superflous. Remove list_types constant since it's no longer in use. | ||||
* | yt_data_extract: Simplify extract_items so it needs only 1 while loop | James Taylor | 2019-12-26 | 1 | -32/+31 |
| | |||||
* | extract_item_info: Don't extract author, author_id, etc. for channel items | James Taylor | 2019-12-24 | 1 | -7/+8 |
| | | | | Philosophically, a channel doesn't create itself. | ||||
* | Fix extract_approx_int not working for non-approx ints, make extract_int ↵ | James Taylor | 2019-12-24 | 1 | -2/+2 |
| | | | | | | | | more robust For example, "354 subscribers" wasn't being extracted correctly be extract_approx_int. Make extract_approx_int and extract_int only extract integers that are words. So e.g. 342 will not be extracted from internetuser342 | ||||
* | Regression: Fix channel extraction 'items' key not present when there's no ↵ | James Taylor | 2019-12-23 | 1 | -2/+3 |
| | | | | | | items. Examples: Empty channels, no search results | ||||
* | Channel: Change search results to use next and previous page buttons | James Taylor | 2019-12-23 | 1 | -1/+3 |
| | | | | Because youtube doesn't give the number of search results, so previous behavior would give an error if a page number out of range was selected. | ||||
* | Rewrite channel extraction with proper error handling and new extraction ↵ | James Taylor | 2019-12-21 | 2 | -45/+40 |
| | | | | | | names. Extract subscriber_count correctly. Don't just shove english strings into info['stats']. Actually give semantic names for the stats. | ||||
* | Fix extract_approx_int. Fixes incorrect subscriber count on channels. | James Taylor | 2019-12-21 | 1 | -2/+2 |
| | | | | It wasn't working because decimals such as 15.1M weren't considered, so it was extracting "1M" | ||||
* | Fix regression: date extraction broken. Move constants to correct file in ↵ | James Taylor | 2019-12-20 | 2 | -2/+2 |
| | | | | yt_data_extract | ||||
* | Extraction: Move non-stateful signature decryption functionality into ↵ | James Taylor | 2019-12-19 | 2 | -1/+98 |
| | | | | yt_data_extract | ||||
* | Extraction: Move stuff around in files and put underscores in front of ↵ | James Taylor | 2019-12-19 | 3 | -38/+37 |
| | | | | | | internal helper function names Move get_captions_url in watch_extraction to bottom next to other exported, public functions | ||||
* | Extraction: Move html post processing stuff from yt_data_extract to util | James Taylor | 2019-12-19 | 2 | -41/+1 |
| | |||||
* | Extraction: Split yt_data_extract.py into multiple files | James Taylor | 2019-12-19 | 4 | -0/+1188 |