09023ec | Xavier Grangier | 29 March 2015, 14:04:10 UTC | #217 - check if content value is not None | 29 March 2015, 14:04:10 UTC |
a059fa1 | Xavier Grangier | 29 March 2015, 13:42:02 UTC | Merge branch 'StevenMaude-patch-1' into develop | 29 March 2015, 13:42:02 UTC |
51f0cac | Xavier Grangier | 29 March 2015, 13:42:00 UTC | Merge branch 'patch-1' of https://github.com/StevenMaude/python-goose into StevenMaude-patch-1 | 29 March 2015, 13:42:00 UTC |
4132135 | Xavier Grangier | 29 March 2015, 13:41:23 UTC | Merge branch 'amalfra-develop' into develop | 29 March 2015, 13:41:23 UTC |
5db0166 | Amal Francis | 04 March 2015, 07:11:07 UTC | Type fix: Issue #204 | 04 March 2015, 07:11:07 UTC |
cc9d892 | Steven Maude | 19 February 2015, 12:35:13 UTC | Tidy README.rst Minor typo fixes. | 19 February 2015, 12:35:13 UTC |
65c8a1c | Xavier Grangier | 24 January 2015, 20:20:45 UTC | Merge branch 'nathanathan-patch-1' into develop | 24 January 2015, 20:20:45 UTC |
aee045d | Xavier Grangier | 24 January 2015, 20:19:04 UTC | #199 - test for empty title | 24 January 2015, 20:19:04 UTC |
3bf8f5e | Xavier Grangier | 24 January 2015, 20:18:41 UTC | #199 - pep8 | 24 January 2015, 20:18:41 UTC |
7981697 | Nathan Breit | 20 January 2015, 13:47:57 UTC | Check for empty title | 20 January 2015, 13:47:57 UTC |
8739747 | Xavier Grangier | 03 January 2015, 09:31:01 UTC | Merge branch 'release/1.0.25' into develop | 03 January 2015, 09:31:01 UTC |
c583da2 | Xavier Grangier | 03 January 2015, 09:30:45 UTC | bump version | 03 January 2015, 09:30:45 UTC |
1daa55c | Xavier Grangier | 02 January 2015, 20:19:19 UTC | Merge branch 'randvis-bugfixing/191' into develop | 02 January 2015, 20:19:19 UTC |
f9f1f1d | randvis | 02 January 2015, 14:51:42 UTC | 191 - keep available parsers list unchanged during multiple extract() calls | 02 January 2015, 14:55:08 UTC |
595209e | Xavier Grangier | 31 December 2014, 06:37:53 UTC | Merge branch 'release/1.0.24' into develop | 31 December 2014, 06:37:53 UTC |
ca1d824 | Xavier Grangier | 31 December 2014, 06:37:42 UTC | bump version | 31 December 2014, 06:37:42 UTC |
7f2f5fb | Xavier Grangier | 31 December 2014, 06:32:56 UTC | Merge branch 'feature/extractor-refactor-188' into develop | 31 December 2014, 06:32:56 UTC |
6959185 | Xavier Grangier | 31 December 2014, 06:30:42 UTC | #188 - add empty meta test case | 31 December 2014, 06:30:42 UTC |
9be09b8 | Xavier Grangier | 31 December 2014, 06:22:19 UTC | #188 - move title tests | 31 December 2014, 06:22:19 UTC |
41e951c | Xavier Grangier | 31 December 2014, 06:18:05 UTC | #188 - move authors tests | 31 December 2014, 06:18:05 UTC |
ea693a9 | Xavier Grangier | 31 December 2014, 06:06:12 UTC | #188 - test refactor | 31 December 2014, 06:06:12 UTC |
b762ea8 | Xavier Grangier | 31 December 2014, 05:56:28 UTC | #188 - move tweets tests case | 31 December 2014, 05:56:28 UTC |
0e6a771 | Xavier Grangier | 31 December 2014, 05:53:07 UTC | #188 - test refactor video image tags publishdate | 31 December 2014, 05:53:07 UTC |
c381993 | Xavier Grangier | 31 December 2014, 05:40:38 UTC | #188 - news extractos tests files | 31 December 2014, 05:40:38 UTC |
ff4449c | Xavier Grangier | 31 December 2014, 05:36:14 UTC | #188 - remove useless file | 31 December 2014, 05:36:14 UTC |
26ba835 | Xavier Grangier | 31 December 2014, 05:32:34 UTC | #188 - move image test case | 31 December 2014, 05:32:34 UTC |
6009d44 | Xavier Grangier | 31 December 2014, 05:24:10 UTC | #188 - tests refactor | 31 December 2014, 05:24:10 UTC |
49f50b0 | Xavier Grangier | 31 December 2014, 05:17:39 UTC | #188 - move test files | 31 December 2014, 05:17:39 UTC |
530ab52 | Xavier Grangier | 31 December 2014, 04:12:41 UTC | #188 - move domain extraction to meta extractor | 31 December 2014, 04:12:41 UTC |
4584341 | Xavier Grangier | 31 December 2014, 04:08:08 UTC | #188 - rename meta extractor file | 31 December 2014, 04:08:08 UTC |
08fd6b9 | Xavier Grangier | 31 December 2014, 04:07:38 UTC | #188 - move meta extraction to MetasExtractor class | 31 December 2014, 04:07:38 UTC |
8320262 | Xavier Grangier | 31 December 2014, 02:32:12 UTC | #188 - move publishdate extraction to PublishDateExtractor class | 31 December 2014, 02:32:12 UTC |
1cb9ed4 | Xavier Grangier | 31 December 2014, 02:23:10 UTC | #188 - move opengraph extraction to OpenGraphExtractor class | 31 December 2014, 02:23:10 UTC |
8eb74d8 | Xavier Grangier | 31 December 2014, 02:19:37 UTC | #188 - move tags extraction to TagsExtractor class | 31 December 2014, 02:19:37 UTC |
cd4cc7e | Xavier Grangier | 31 December 2014, 02:14:16 UTC | #188 - renamve authors class file | 31 December 2014, 02:14:16 UTC |
4de0f4b | Xavier Grangier | 31 December 2014, 02:13:46 UTC | #188 - move authors extraction to AuthorsExtractor class | 31 December 2014, 02:13:46 UTC |
2608e43 | Xavier Grangier | 31 December 2014, 02:11:04 UTC | #188 - move tweet extraction to TweetExtractor class | 31 December 2014, 02:11:04 UTC |
12dfda5 | Xavier Grangier | 31 December 2014, 02:07:45 UTC | #188 - move links extraction to LinksExtractor class | 31 December 2014, 02:07:45 UTC |
0492fb8 | Xavier Grangier | 31 December 2014, 02:02:45 UTC | #188 - move title extractor from content to title extractor class | 31 December 2014, 02:02:45 UTC |
a5e96e7 | Xavier Grangier | 31 December 2014, 01:50:40 UTC | #188 - ImageExtractor extends from BaseExtractor | 31 December 2014, 01:50:40 UTC |
9597fe1 | Xavier Grangier | 31 December 2014, 01:45:59 UTC | #188 - rename UpgradedImageIExtractor to ImageExtractor | 31 December 2014, 01:45:59 UTC |
ab81954 | Xavier Grangier | 31 December 2014, 01:41:22 UTC | #188 - move images extractor to extractors dir and correct videos | 31 December 2014, 01:41:22 UTC |
8d6d49e | Xavier Grangier | 31 December 2014, 01:31:03 UTC | #188 - move video to extractor directory | 31 December 2014, 01:31:03 UTC |
a957931 | Xavier Grangier | 31 December 2014, 01:20:28 UTC | #188 - correct import | 31 December 2014, 01:20:28 UTC |
cbbfba3 | Xavier Grangier | 31 December 2014, 01:17:59 UTC | #188 - add tags and author extractors | 31 December 2014, 01:17:59 UTC |
bcf4654 | Xavier Grangier | 31 December 2014, 01:14:18 UTC | #188 - create specific extractors classes | 31 December 2014, 01:14:18 UTC |
6ef3f68 | Xavier Grangier | 31 December 2014, 01:12:02 UTC | #188 - contentextractor inherits form baseextractor | 31 December 2014, 01:12:02 UTC |
731f104 | Xavier Grangier | 31 December 2014, 01:10:47 UTC | #188 - create a base extractor class | 31 December 2014, 01:10:47 UTC |
8eccabf | Xavier Grangier | 31 December 2014, 01:07:05 UTC | #188 - mv article extractor to extractors directory | 31 December 2014, 01:07:05 UTC |
3ebc97c | Xavier Grangier | 30 December 2014, 08:04:51 UTC | #187 - empty list | 30 December 2014, 08:04:51 UTC |
dd33aab | Xavier Grangier | 30 December 2014, 07:42:06 UTC | ignore egg files | 30 December 2014, 07:42:06 UTC |
bcfb9f3 | Xavier Grangier | 30 December 2014, 07:34:00 UTC | Merge branch 'release/1.0.23' into develop | 30 December 2014, 07:34:00 UTC |
4eda345 | Xavier Grangier | 30 December 2014, 07:33:45 UTC | bump version | 30 December 2014, 07:33:45 UTC |
57b1534 | Xavier Grangier | 30 December 2014, 07:31:00 UTC | #185 - movies info | 30 December 2014, 07:31:00 UTC |
ce6d8a1 | Xavier Grangier | 30 December 2014, 07:06:04 UTC | Merge branch 'feature/extract-183' into develop | 30 December 2014, 07:06:04 UTC |
22ded4b | Xavier Grangier | 30 December 2014, 07:04:47 UTC | #183 - use article tag for a top node | 30 December 2014, 07:04:47 UTC |
fe5f5e9 | Xavier Grangier | 30 December 2014, 06:47:52 UTC | #183 - pep8 | 30 December 2014, 06:47:52 UTC |
4632df7 | Xavier Grangier | 30 December 2014, 06:36:15 UTC | #182 - rename soup parser | 30 December 2014, 06:36:15 UTC |
0e6201d | Xavier Grangier | 30 December 2014, 06:30:47 UTC | #81 - use correct language for stopwords file | 30 December 2014, 06:30:47 UTC |
413037f | Xavier Grangier | 30 December 2014, 06:19:41 UTC | Merge branch 'KillaW0lf04-develop' into develop | 30 December 2014, 06:19:41 UTC |
8d18a8e | Xavier Grangier | 30 December 2014, 06:16:03 UTC | Merge branch 'develop' of https://github.com/KillaW0lf04/python-goose into KillaW0lf04-develop Conflicts: goose/crawler.py | 30 December 2014, 06:16:03 UTC |
b70075a | Xavier Grangier | 30 December 2014, 06:09:52 UTC | Merge branch 'feature/extract-115' into develop | 30 December 2014, 06:09:52 UTC |
c7ec678 | Xavier Grangier | 30 December 2014, 06:09:02 UTC | #115 - use known content tags to be article main body | 30 December 2014, 06:09:02 UTC |
b5ddaf1 | Xavier Grangier | 30 December 2014, 06:02:42 UTC | #115 - add issue 115 test files | 30 December 2014, 06:02:42 UTC |
e404f1b | Xavier Grangier | 30 December 2014, 06:02:06 UTC | #115 - remove businessinsider tests case due to no valid html | 30 December 2014, 06:02:06 UTC |
4fd94de | Xavier Grangier | 30 December 2014, 03:41:13 UTC | Merge branch 'feature/title-extraction-137' into develop | 30 December 2014, 03:41:13 UTC |
148ce9b | Xavier Grangier | 30 December 2014, 03:40:36 UTC | #137 - more explicit error message | 30 December 2014, 03:40:36 UTC |
bd96c94 | Xavier Grangier | 30 December 2014, 03:39:57 UTC | #137 - refactor title extraction based on opengraph, meta headling and title element | 30 December 2014, 03:39:57 UTC |
66b63fc | Xavier Grangier | 30 December 2014, 03:39:16 UTC | #137 - fetch opengraph before title | 30 December 2014, 03:39:16 UTC |
0e370dc | Xavier Grangier | 30 December 2014, 03:38:05 UTC | #137 - corrected title | 30 December 2014, 03:38:05 UTC |
d31112b | Xavier Grangier | 30 December 2014, 03:37:43 UTC | #137 - corrected title | 30 December 2014, 03:37:43 UTC |
3ff269e | Xavier Grangier | 30 December 2014, 03:37:16 UTC | #137 - use og:title in test case | 30 December 2014, 03:37:16 UTC |
655aca6 | Xavier Grangier | 30 December 2014, 03:36:44 UTC | #137 - test separator | 30 December 2014, 03:36:44 UTC |
b04f1e9 | Xavier Grangier | 30 December 2014, 02:09:47 UTC | #137 - opengraph title test case | 30 December 2014, 02:09:47 UTC |
502053d | Xavier Grangier | 30 December 2014, 01:42:22 UTC | Merge branch 'feature/empty-content-129' into develop | 30 December 2014, 01:42:22 UTC |
a36b5a8 | Xavier Grangier | 30 December 2014, 01:41:44 UTC | #129 - force articleBody to be the document root if found | 30 December 2014, 01:41:44 UTC |
e452c23 | Xavier Grangier | 30 December 2014, 01:41:16 UTC | #129 - add issue test case | 30 December 2014, 01:41:16 UTC |
6f00464 | Xavier Grangier | 30 December 2014, 00:51:10 UTC | Merge branch 'feature/article-infos-177' into develop | 30 December 2014, 00:51:10 UTC |
37e24b2 | Xavier Grangier | 30 December 2014, 00:49:55 UTC | #177 - add top image to returned dict | 30 December 2014, 00:49:55 UTC |
206f6e2 | Xavier Grangier | 30 December 2014, 00:48:11 UTC | #177 - info method return article data as dict | 30 December 2014, 00:48:11 UTC |
6338f68 | Xavier Grangier | 30 December 2014, 00:47:41 UTC | #177 - title is empty string by default | 30 December 2014, 00:47:41 UTC |
96caa3c | Xavier Grangier | 30 December 2014, 00:47:11 UTC | #177 - tags are a list | 30 December 2014, 00:47:11 UTC |
989ab24 | Xavier Grangier | 30 December 2014, 00:11:52 UTC | #175 - links extraction tests | 30 December 2014, 00:11:52 UTC |
321fb86 | Xavier Grangier | 30 December 2014, 00:05:04 UTC | #173 - authors extraction test case | 30 December 2014, 00:05:04 UTC |
848acf8 | Xavier Grangier | 29 December 2014, 23:53:02 UTC | #172 - tweet extraction tests | 29 December 2014, 23:53:02 UTC |
21a67ca | Xavier Grangier | 29 December 2014, 23:33:17 UTC | Merge branch 'pistolero-develop' into develop | 29 December 2014, 23:33:17 UTC |
90b041d | Xavier Grangier | 29 December 2014, 23:32:47 UTC | #171 - do not increment version yet | 29 December 2014, 23:32:47 UTC |
90b3cac | Sergey Kirillov | 29 December 2014, 12:00:27 UTC | Replaced bare except with except Exception | 29 December 2014, 12:32:02 UTC |
27709a4 | Xavier Grangier | 29 December 2014, 04:03:49 UTC | Merge branch 'feature/extract-twitter-169' into develop | 29 December 2014, 04:03:49 UTC |
af49328 | Xavier Grangier | 29 December 2014, 04:02:28 UTC | #169 - extract tweets | 29 December 2014, 04:02:28 UTC |
675c077 | Xavier Grangier | 29 December 2014, 04:01:32 UTC | #169 - extract tweets | 29 December 2014, 04:01:32 UTC |
cda2ef6 | Xavier Grangier | 29 December 2014, 02:48:55 UTC | #142 - extract authors | 29 December 2014, 02:48:55 UTC |
8a4ecf2 | Xavier Grangier | 29 December 2014, 02:27:31 UTC | Merge branch 'feature/links-139' into develop | 29 December 2014, 02:27:31 UTC |
0a3303e | Xavier Grangier | 29 December 2014, 02:26:33 UTC | #139 - article links extract method | 29 December 2014, 02:26:33 UTC |
4adf4bc | Xavier Grangier | 29 December 2014, 02:26:06 UTC | #139 - extract article links | 29 December 2014, 02:26:06 UTC |
124371e | Xavier Grangier | 29 December 2014, 02:25:40 UTC | #139 - article links property | 29 December 2014, 02:25:40 UTC |
517eb67 | Xavier Grangier | 29 December 2014, 01:55:14 UTC | Merge branch 'feature/opengraph-165' into develop | 29 December 2014, 01:55:14 UTC |
6bbe2db | Xavier Grangier | 29 December 2014, 01:54:33 UTC | #165 - rename article body extraction test | 29 December 2014, 01:54:33 UTC |
c2eb34e | Xavier Grangier | 29 December 2014, 01:53:09 UTC | #165 - opengraph extraction test | 29 December 2014, 01:53:09 UTC |
eb1274b | Xavier Grangier | 29 December 2014, 01:52:37 UTC | #165 - rename dict | 29 December 2014, 01:52:37 UTC |