aboutsummaryrefslogtreecommitdiffstats
path: root/youtube_dl/extractor/common.py
Commit message (Collapse)AuthorAgeFilesLines
...
* Add temporary _sort_formats helper functionPhilipp Hagemeister2013-12-241-0/+41
|
* Add a resolution field and improve general --list-formats outputPhilipp Hagemeister2013-12-241-3/+10
|
* [myvideo] Use RTMP instead of RTMPT (Fixes #2032)Philipp Hagemeister2013-12-231-0/+5
|
* [bliptv] Remove support for direct downloadsPhilipp Hagemeister2013-12-231-2/+0
| | | | This is now handled by the generic IE
* [aparat] Add support (Fixes #2012)Philipp Hagemeister2013-12-201-0/+2
|
* [generic] Detect ooyala videos (fixes #2013)Jaime Marquínez Ferrándiz2013-12-191-2/+4
|
* [youtube] Do not warn for videos with allow_rating=0Philipp Hagemeister2013-12-171-3/+4
| | | | | This fixes #1982 Test video: http://www.youtube.com/watch?v=gi2uH3YxohU
* _search_regex's "isatty" call fails with Py2exe'sItay Brandes2013-12-161-1/+1
| | | | | | | | | _search_regex calls the sys.stderr.isatty() function for unix systems. Py2exe uses a custom Stderr() stream which doesn't have an `isatty()` function, leading to it's crash. Fixes easily with checking that it's a unix system first.
* Reorder info_dict documentationPhilipp Hagemeister2013-12-161-25/+27
|
* Document duration fieldPhilipp Hagemeister2013-12-161-1/+2
|
* [mtv] Fixup incorrectly encoded XML documentsPhilipp Hagemeister2013-12-101-1/+4
|
* Add fatal=False parameter to _download_* functions.Philipp Hagemeister2013-12-091-7/+23
| | | | This allows us to simplify the calls in the youtube extractor even further.
* [9gag] Like/dislike count (#1895)Philipp Hagemeister2013-12-051-0/+3
|
* [smotri] SimplifyPhilipp Hagemeister2013-12-021-1/+2
|
* [zdf] Use _download_xmlPhilipp Hagemeister2013-11-281-1/+2
|
* Merge branch 'opener-to-ydl'Philipp Hagemeister2013-11-251-2/+2
|\
| * Merge branch 'master' into opener-to-ydlPhilipp Hagemeister2013-11-241-1/+9
| |\
| * | Move the opener to the YoutubeDL object.Philipp Hagemeister2013-11-221-2/+2
| | | | | | | | | | | | | | | This is the first step towards being able to just import youtube_dl and start using it. Apart from removing global state, this would fix problems like #1805.
* | | Remove quality_name field and improve zdf extractorPhilipp Hagemeister2013-11-251-1/+0
| | |
* | | [zdf/common] Use API in ZDF extractor.Philipp Hagemeister2013-11-251-0/+2
| |/ |/| | | | | | | This also comes with a lot of extra format fields Fixes #1518
* | [collegehumor] Encode the xml before calling ↵Jaime Marquínez Ferrándiz2013-11-241-0/+6
| | | | | | | | | | | | xml.etree.ElementTree.fromstring (fixes #1822) Uses a new helper method in InfoExtractor: _download_xml
* | Match --download-archive during playlist processing (Fixes #1745)Philipp Hagemeister2013-11-221-1/+3
|/
* Add support for tou.tv (Fixes #1792)Philipp Hagemeister2013-11-201-0/+28
|
* Add automatic generation of format note based on bitrate and codecsPhilipp Hagemeister2013-11-161-0/+4
|
* Don't accept '>' inside the content attribute in OpenGraph regexesJaime Marquínez Ferrándiz2013-11-151-3/+5
|
* Improve the OpenGraph regexJaime Marquínez Ferrándiz2013-11-151-5/+9
| | | | | * Do not accept '>' between the property and content attributes. * Recognize the properties if the content attribute is before the property attribute using two regexes (fixes the extraction of the description for SlideshareIE).
* [common] Simplify og_search_propertyPhilipp Hagemeister2013-11-121-3/+3
|
* Fix AssertionError when og property not foundMarcin Cieślak2013-11-051-1/+3
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | On tvp.pl some webpages contain OpenGraph metadata and some don't. If og property is not found, _og_search_description fails with WARNING: unable to extract OpenGraph description; please report this issue on http://yt-dl.org/bug Traceback (most recent call last): File "/usr/home/saper/bin/youtube-dl", line 18, in <module> youtube_dl.main() File "/usr/home/saper/sw/youtube-dl/youtube_dl/__init__.py", line 766, in main _real_main(argv) File "/usr/home/saper/sw/youtube-dl/youtube_dl/__init__.py", line 719, in _real_main retcode = ydl.download(all_urls) File "/usr/home/saper/sw/youtube-dl/youtube_dl/YoutubeDL.py", line 715, in download videos = self.extract_info(url) File "/usr/home/saper/sw/youtube-dl/youtube_dl/YoutubeDL.py", line 348, in extract_info ie_result = ie.extract(url) File "/usr/home/saper/sw/youtube-dl/youtube_dl/extractor/common.py", line 125, in extract return self._real_extract(url) File "/usr/home/saper/sw/youtube-dl/youtube_dl/extractor/tvp.py", line 56, in _real_extract info['description'] = self._og_search_description(webpage) File "/usr/home/saper/sw/youtube-dl/youtube_dl/extractor/common.py", line 331, in _og_search_description return self._og_search_property('description', html, fatal=False, **kargs) File "/usr/home/saper/sw/youtube-dl/youtube_dl/extractor/common.py", line 325, in _og_search_property return unescapeHTML(escaped) File "/usr/home/saper/sw/youtube-dl/youtube_dl/utils.py", line 494, in unescapeHTML assert type(s) == type(u'') AssertionError The patch allows me to use: try: info['description'] = self._og_search_description(webpage) info['thumbnail'] = self._og_search_thumbnail(webpage) except RegexNotFoundError: pass
* Add the 'webpage_url' field to info_dictJaime Marquínez Ferrándiz2013-11-031-0/+3
| | | | | The url for the video page, it must allow to reproduce the result. It's automatically set by YoutubeDL if it's missing.
* Remove superfluous spacePhilipp Hagemeister2013-10-301-1/+1
|
* Merge remote-tracking branch 'origin/master'Philipp Hagemeister2013-10-281-4/+4
|\
| * [Instagram] get the non-https link, as they are serving Akamai cert from a ↵Filippo Valsorda2013-10-281-4/+4
| | | | | | | | instagram.com domain
* | New debug option --write-pagesPhilipp Hagemeister2013-10-281-0/+12
|/
* [vimeo] Fix pro videos and player.vimeo.com urlsJaime Marquínez Ferrándiz2013-10-231-2/+3
| | | | | The old process can still be used for those videos. Added RegexNotFoundError, which is raised by _search_regex if it can't extract the info.
* The 'format' field now defaults to '{format_id} - {width}x{height}{format_note}'Jaime Marquínez Ferrándiz2013-10-211-1/+4
| | | | Following the YoutubeIE format. The 'format_note' gives additional info about the format, for example '3D' or 'DASH video'.
* fix typosPhilipp Hagemeister2013-10-181-1/+1
|
* Allow users to specify an age limit (fixes #1545)Philipp Hagemeister2013-10-061-0/+10
| | | | | With these changes, users can now restrict what videos are downloaded by the intented audience, by specifying their age with --age-limit YEARS . Add rudimentary support in youtube, pornotube, and youporn.
* Clarify that url and ext are optional when formats is given (#980)Philipp Hagemeister2013-10-041-0/+2
|
* Document formats (for #980)Philipp Hagemeister2013-10-041-2/+13
|
* Fix detection of the webpage charset if it's declared using ' instead of "Jaime Marquínez Ferrándiz2013-08-291-1/+1
| | | | Like in "<meta charset='utf-8'/>"
* [sohu] Handle encoding, and fix testsPhilipp Hagemeister2013-08-281-2/+7
|
* Merge remote-tracking branch 'origin/reuse_ies'Philipp Hagemeister2013-08-281-0/+5
|\
| * YoutubeIE: reuse instances of InfoExtractors (closes #998)Jaime Marquínez Ferrándiz2013-07-081-0/+5
| | | | | | | | | | | | When a IE is added to the list, it's also added to a dictionary. When a IE is requested it first looks in the dictionary and if there's no instance it will create a new one. That way _real_initialize is only called once for each IE, saving time if it needs to login for example.
* | [addanime] improvePhilipp Hagemeister2013-08-281-1/+1
|\ \
* \ \ Merge pull request #937 from jaimeMF/subtitles_reworkJaime Marquínez Ferrándiz2013-08-231-1/+2
|\ \ \ | | | | | | | | Subtitles rework
| * | | Use a dictionary for storing the subtitlesJaime Marquínez Ferrándiz2013-07-201-1/+2
| |/ / | | | | | | | | | The errors while getting the subtitles are reported as warnings, if no subtitles are found return and empty dict.
* / / Cache suitable regular expressionsPhilipp Hagemeister2013-08-211-1/+7
|/ / | | | | | | This speeds up TestAllURLsMatching.test_no_duplicates by about 8000% at the cost of minimal memory overhead.
* | Use unescapeHTML for OpenGraph propertiesPhilipp Hagemeister2013-07-171-1/+3
| | | | | | | | These are attribute values, so we don't need the more complex and whitespace-destroying cleanHTML - we just need to unescape quotes, that's it.
* | Strip hash info from URL when making requests (Fixes #1038)Philipp Hagemeister2013-07-131-0/+5
| |
* | Improve OpenGraph property matchingPhilipp Hagemeister2013-07-131-6/+6
| |