https://github.com/grangier/python-goose
Revision 383eb8e5588f1b913b32b83a6a903af8380494e9 authored by Xavier Grangier on 16 June 2013, 08:23:01 UTC, committed by Xavier Grangier on 16 June 2013, 08:23:01 UTC
Improved article tag extraction
Tip revision: 383eb8e5588f1b913b32b83a6a903af8380494e9 authored by Xavier Grangier on 16 June 2013, 08:23:01 UTC
Merge pull request #20 from litso/improved_article_tags
Merge pull request #20 from litso/improved_article_tags
Tip revision: 383eb8e
File | Mode | Size |
---|---|---|
images | ||
resources | ||
utils | ||
__init__.py | -rw-r--r-- | 2.8 KB |
article.py | -rw-r--r-- | 3.0 KB |
cleaners.py | -rw-r--r-- | 9.5 KB |
configuration.py | -rw-r--r-- | 3.8 KB |
crawler.py | -rw-r--r-- | 5.0 KB |
extractors.py | -rw-r--r-- | 18.3 KB |
network.py | -rw-r--r-- | 1.5 KB |
outputformatters.py | -rw-r--r-- | 4.1 KB |
parsers.py | -rw-r--r-- | 5.8 KB |
text.py | -rw-r--r-- | 4.9 KB |
version.py | -rw-r--r-- | 937 bytes |
video.py | -rw-r--r-- | 924 bytes |
![swh spinner](/static/img/swh-spinner.gif)
Computing file changes ...