19cefd9 | Richard Smith-Unna | 12 October 2015, 10:28:55 UTC | Release v0.4.7. | 12 October 2015, 10:28:55 UTC |
90d7e6f | Richard Smith-Unna | 12 October 2015, 10:28:46 UTC | Get thresher with follow bug fix | 12 October 2015, 10:28:46 UTC |
d38d37b | Richard Smith-Unna | 08 August 2015, 04:47:22 UTC | Add optional saving of logs (closes #51) | 08 August 2015, 04:47:22 UTC |
2d6d655 | Richard Smith-Unna | 08 August 2015, 04:35:26 UTC | Change short arg for ratelimit to avoid conflict (fixes #53) | 08 August 2015, 04:35:26 UTC |
cb18b31 | Richard Smith-Unna | 08 August 2015, 04:33:08 UTC | Release v0.4.6. | 08 August 2015, 04:33:08 UTC |
c61837f | Richard Smith-Unna | 08 August 2015, 04:32:59 UTC | Update license field to current NPM spec | 08 August 2015, 04:32:59 UTC |
0cd2b75 | Richard Smith-Unna | 08 August 2015, 04:32:03 UTC | Bump thresher for DOI resolve fix (fixes #54) | 08 August 2015, 04:32:36 UTC |
7f5a4a6 | Richard Smith-Unna | 14 June 2015, 13:22:30 UTC | Release v0.4.5. | 14 June 2015, 13:22:30 UTC |
651ceb0 | Richard Smith-Unna | 14 June 2015, 13:22:22 UTC | Handle invalid attributes and log all failed captures in debug | 14 June 2015, 13:22:22 UTC |
24866c1 | Richard Smith-Unna | 14 June 2015, 13:02:50 UTC | Print version in first line of log | 14 June 2015, 13:02:50 UTC |
8e141ed | Richard Smith-Unna | 14 June 2015, 12:31:01 UTC | Release v0.4.4. | 14 June 2015, 12:31:01 UTC |
e0b694f | Richard Smith-Unna | 14 June 2015, 12:30:56 UTC | Bump thresher for new scraper validation | 14 June 2015, 12:30:56 UTC |
99a0983 | Richard Smith-Unna | 14 June 2015, 12:30:08 UTC | Validate scraper(dir)s before running (fixes #48) | 14 June 2015, 12:30:08 UTC |
04223ae | Richard Smith-Unna | 14 June 2015, 12:01:41 UTC | Release v0.4.3. | 14 June 2015, 12:01:41 UTC |
f44d8dc | Richard Smith-Unna | 14 June 2015, 11:58:48 UTC | Refactor CLI code Rate-limited loops now avoid using recursion. Argument paths are expanded before use. Element capture statistics are reported for each URL (closes #7) | 14 June 2015, 11:58:48 UTC |
bbc729a | Richard Smith-Unna | 14 June 2015, 11:57:14 UTC | Bump thresher depedency | 14 June 2015, 11:57:14 UTC |
6978162 | Richard Smith-Unna | 14 May 2015, 15:05:47 UTC | Merge branch 'master' of github.com:ContentMine/quickscrape | 14 May 2015, 15:05:47 UTC |
baa1dcb | Richard Smith-Unna | 14 May 2015, 15:05:39 UTC | Release v0.4.2. | 14 May 2015, 15:05:39 UTC |
2e7b22f | Richard Smith-Unna | 14 May 2015, 15:04:41 UTC | Bump thresher dependency | 14 May 2015, 15:04:41 UTC |
e6d064c | Richard Smith-Unna | 14 May 2015, 14:59:18 UTC | Export version globally for logging | 14 May 2015, 14:59:18 UTC |
5eef420 | Richard Smith-Unna | 06 May 2015, 10:14:53 UTC | Clarify installation instructions (see #41) | 06 May 2015, 10:14:53 UTC |
8448f0b | Ross Mounce | 04 May 2015, 14:20:56 UTC | No OS specific instructions any more | 04 May 2015, 14:20:56 UTC |
f087ce1 | Richard Smith-Unna | 11 April 2015, 12:00:43 UTC | Make it clear that headless scraping is optional | 11 April 2015, 12:00:43 UTC |
e971ae5 | Richard Smith-Unna | 11 April 2015, 11:57:34 UTC | Load version number from package.json This avoids accidentally forgetting to update the version number in multiple places when releasing. | 11 April 2015, 11:57:38 UTC |
1b183ba | Richard Smith-Unna | 11 April 2015, 11:51:03 UTC | Release v0.4.1. | 11 April 2015, 11:51:03 UTC |
f65967a | Richard Smith-Unna | 11 April 2015, 11:50:56 UTC | Fix version number reporting | 11 April 2015, 11:50:56 UTC |
d7ee835 | Richard Smith-Unna | 11 April 2015, 11:49:29 UTC | Simpler install using cross-platform NVM | 11 April 2015, 11:49:29 UTC |
6ce34fd | Richard Smith-Unna | 11 April 2015, 11:01:06 UTC | Release v0.4.0. | 11 April 2015, 11:01:06 UTC |
677ee3d | Richard Smith-Unna | 11 April 2015, 11:00:58 UTC | Update README for v0.4.0 | 11 April 2015, 11:00:58 UTC |
167be4c | Richard Smith-Unna | 11 April 2015, 10:57:22 UTC | Print help and exit when run with no arguments (fixes #36) | 11 April 2015, 10:57:22 UTC |
3b064cd | Richard Smith-Unna | 11 April 2015, 10:26:00 UTC | Trim empty lines from URLlist files (fixes #29) Many text editors add a terminal newline to files on save. This was previously being interpreted as an invalid URL. Fixed by filterng the URLs loaded from `--urllist` to remove empty entries. | 11 April 2015, 10:26:00 UTC |
8662a1e | Richard Smith-Unna | 11 April 2015, 09:38:12 UTC | Clear event listeners on thresher after each URL (fixes #33) When the URLlist is iterated over using recursive setTimeOuts, the Thresher object is staying in scope for all the nested calls. This is leading to event listeners accumulating on the Thresher object, and multiple identical handlers being called for the same event. As a shim, I have simply cleared all the event listeners after each URL finishes processing. | 11 April 2015, 09:38:16 UTC |
ac23ddb | Richard Smith-Unna | 11 April 2015, 09:36:52 UTC | Update dependencies | 11 April 2015, 09:36:52 UTC |
1cbc298 | Richard Smith-Unna | 10 April 2015, 21:37:42 UTC | Release v0.3.7. | 10 April 2015, 21:37:42 UTC |
f8b16d9 | Richard Smith-Unna | 10 April 2015, 21:37:37 UTC | Update README for v0.3.7 | 10 April 2015, 21:37:37 UTC |
4bd505b | Richard Smith-Unna | 10 April 2015, 21:36:17 UTC | Update thresher dependency to v0.1.3 | 10 April 2015, 21:36:36 UTC |
539f00a | Richard Smith-Unna | 10 April 2015, 21:36:06 UTC | Fix dates in bibJSON | 10 April 2015, 21:36:06 UTC |
b6983a8 | Richard Smith-Unna | 10 April 2015, 20:14:17 UTC | remove spurious files | 10 April 2015, 20:14:17 UTC |
71695d6 | Richard Smith-Unna | 10 April 2015, 20:12:17 UTC | Merge branch 'master' of github.com:ContentMine/quickscrape | 10 April 2015, 20:12:17 UTC |
cc3879e | Richard Smith-Unna | 31 March 2015, 09:06:28 UTC | fix error when missing log message | 31 March 2015, 09:06:28 UTC |
9e36288 | Richard Smith-Unna | 31 March 2015, 09:06:02 UTC | tidy bibjson html capture | 31 March 2015, 09:06:02 UTC |
f92e632 | Richard Smith-Unna | 22 January 2015, 09:38:34 UTC | readme tidy | 22 January 2015, 09:38:34 UTC |
192cb1c | Richard Smith-Unna | 22 January 2015, 09:35:42 UTC | no longer need unsafe-perms option | 22 January 2015, 09:35:42 UTC |
7012cb6 | Peter Murray-Rust | 12 January 2015, 16:14:39 UTC | removed files | 12 January 2015, 16:14:39 UTC |
980f10e | Peter Murray-Rust | 12 January 2015, 15:58:11 UTC | added bmc scraper | 12 January 2015, 15:58:11 UTC |
3722aea | Peter Murray-Rust | 12 January 2015, 15:52:56 UTC | Merge branch 'master' of https://github.com/ContentMine/quickscrape added scrapers for BMC | 12 January 2015, 15:52:56 UTC |
ab740ce | Peter Murray-Rust | 12 January 2015, 15:52:29 UTC | converted existing MDPI scraper to BMC trials | 12 January 2015, 15:52:29 UTC |
c86dabe | Richard Smith-Unna | 11 January 2015, 16:05:05 UTC | update README | 11 January 2015, 16:05:05 UTC |
f4a4587 | Richard Smith-Unna | 11 January 2015, 15:34:54 UTC | Release v0.3.6. | 11 January 2015, 15:34:54 UTC |
a615c09 | Richard Smith-Unna | 11 January 2015, 15:34:42 UTC | bump patch | 11 January 2015, 15:34:42 UTC |
e5764d8 | Richard Smith-Unna | 11 January 2015, 15:33:29 UTC | typo | 11 January 2015, 15:34:08 UTC |
4f37372 | Richard Smith-Unna | 11 January 2015, 14:28:11 UTC | Release v0.3.5. | 11 January 2015, 14:28:11 UTC |
a35be25 | Richard Smith-Unna | 11 January 2015, 14:27:53 UTC | bump thresher dependency version; bump version | 11 January 2015, 14:27:53 UTC |
fa2c3ec | Richard Smith-Unna | 11 January 2015, 12:38:18 UTC | Release v0.3.4. | 11 January 2015, 12:38:18 UTC |
531ea16 | Richard Smith-Unna | 11 January 2015, 12:38:14 UTC | prep for v0.3.4 | 11 January 2015, 12:38:14 UTC |
9105b8b | Richard Smith-Unna | 11 January 2015, 12:37:52 UTC | fix ref/table/fig output key | 11 January 2015, 12:37:52 UTC |
97dfcca | Richard Smith-Unna | 10 January 2015, 14:58:24 UTC | Release v0.3.3. | 10 January 2015, 14:58:24 UTC |
96f659f | Richard Smith-Unna | 10 January 2015, 14:54:52 UTC | add --outformat option | 10 January 2015, 14:54:52 UTC |
6ad698c | Richard Smith-Unna | 06 October 2014, 21:20:14 UTC | Release v0.3.2. | 06 October 2014, 21:20:14 UTC |
46be3a9 | Richard Smith-Unna | 06 October 2014, 21:20:02 UTC | prep for v0.3.2 | 06 October 2014, 21:20:02 UTC |
7d6870f | Richard Smith-Unna | 02 October 2014, 21:13:53 UTC | Release v0.3.1. | 02 October 2014, 21:13:53 UTC |
678a0d5 | Richard Smith-Unna | 02 October 2014, 21:13:43 UTC | prep for v0.3.1 | 02 October 2014, 21:13:43 UTC |
8dcf3c3 | Richard Smith-Unna | 02 October 2014, 20:59:35 UTC | Release v0.3.0. | 02 October 2014, 20:59:35 UTC |
173ee67 | Richard Smith-Unna | 02 October 2014, 20:59:14 UTC | prep for v0.3.0 | 02 October 2014, 20:59:14 UTC |
12688ba | Richard Smith-Unna | 02 October 2014, 20:35:38 UTC | write out structured JSON correctly | 02 October 2014, 20:35:38 UTC |
c7cfa03 | Richard Smith-Unna | 02 October 2014, 18:49:02 UTC | add mac DS_Store to gitignore | 02 October 2014, 18:49:02 UTC |
600a567 | Richard Smith-Unna | 22 September 2014, 10:28:56 UTC | tidy up logging | 22 September 2014, 10:28:56 UTC |
4942aeb | Richard Smith-Unna | 21 September 2014, 13:33:27 UTC | integrate thresher updates | 21 September 2014, 13:33:27 UTC |
1d1b53d | Richard Smith | 08 September 2014, 16:23:33 UTC | use new thresher interface | 08 September 2014, 16:24:24 UTC |
76f5bce | Richard Smith-Unna | 14 August 2014, 08:27:11 UTC | Release v0.2.8. | 14 August 2014, 08:27:11 UTC |
2abea18 | Richard Smith-Unna | 14 August 2014, 08:27:01 UTC | prepare for version bump | 14 August 2014, 08:27:01 UTC |
c62e8ce | Richard Smith-Unna | 02 August 2014, 15:10:06 UTC | Merge pull request #26 from Mec-iS/patch-1 Typo at line 10 | 02 August 2014, 15:10:06 UTC |
458b833 | Lorenzo | 02 August 2014, 14:05:47 UTC | Typo at line 10 the correct property's name for `thresher` object at line 10 is `ScraperBox` | 02 August 2014, 14:05:47 UTC |
715d6e4 | Richard Smith | 22 July 2014, 16:03:15 UTC | Release v0.2.7. | 22 July 2014, 16:03:15 UTC |
9d44283 | Richard Smith | 22 July 2014, 16:02:50 UTC | prep for another version bump | 22 July 2014, 16:02:50 UTC |
69e1e8c | Richard Smith | 22 July 2014, 15:57:23 UTC | Release v0.2.6. | 22 July 2014, 15:57:23 UTC |
81dc2f0 | Richard Smith | 22 July 2014, 15:57:14 UTC | prep for version bump | 22 July 2014, 15:57:14 UTC |
bbaa653 | Richard Smith | 22 July 2014, 15:56:02 UTC | Release v0.2.6. | 22 July 2014, 15:56:02 UTC |
38e9720 | Richard Smith | 22 July 2014, 15:55:52 UTC | integrate latest thresher API changes | 22 July 2014, 15:55:52 UTC |
996d5d6 | Matt Swain | 22 July 2014, 15:27:48 UTC | Reflect changes to Thresher API | 22 July 2014, 15:27:48 UTC |
334acaf | Richard Smith | 21 July 2014, 22:50:15 UTC | flush work | 21 July 2014, 22:50:15 UTC |
3d189fd | Richard Smith-Unna | 18 July 2014, 09:07:55 UTC | add libfontconfig to ubuntu instructions | 18 July 2014, 09:07:55 UTC |
9ed63c3 | Richard Smith-Unna | 17 July 2014, 08:59:44 UTC | Merge pull request #21 from scraperdragon/master Correct path in `quickscrape --scraper` | 17 July 2014, 08:59:44 UTC |
9fb6690 | Dragon Dave McKee | 16 July 2014, 20:19:07 UTC | Correct path in `quickscrape --scraper` `peerj.json` isn't in the root directory of `journal-scrapers` | 16 July 2014, 20:19:07 UTC |
cd5af47 | Richard Smith | 10 July 2014, 15:34:07 UTC | Release v0.2.5. | 10 July 2014, 15:34:07 UTC |
764e94f | Richard Smith | 10 July 2014, 15:33:53 UTC | dependency bump | 10 July 2014, 15:33:53 UTC |
d44a87d | Richard Smith-Unna | 10 July 2014, 15:26:06 UTC | fix thresher link | 10 July 2014, 15:26:06 UTC |
9ade74f | Richard Smith | 10 July 2014, 15:24:31 UTC | Release v0.2.4. | 10 July 2014, 15:24:31 UTC |
72f320c | Richard Smith | 10 July 2014, 15:24:23 UTC | prep for version bump | 10 July 2014, 15:24:23 UTC |
45a015f | Richard Smith | 10 July 2014, 14:53:15 UTC | Release v0.2.3. | 10 July 2014, 14:53:15 UTC |
c4fa699 | Richard Smith | 10 July 2014, 14:52:48 UTC | prepare for version bump | 10 July 2014, 14:52:48 UTC |
7805cfe | Richard Smith-Unna | 10 July 2014, 14:45:22 UTC | expand | 10 July 2014, 14:45:22 UTC |
7e7907f | Richard Smith-Unna | 10 July 2014, 14:37:23 UTC | reformat | 10 July 2014, 14:37:23 UTC |
4753972 | Richard Smith | 10 July 2014, 14:31:48 UTC | fix header | 10 July 2014, 14:31:48 UTC |
4467686 | Richard Smith | 10 July 2014, 14:31:03 UTC | restructure | 10 July 2014, 14:31:03 UTC |
887e785 | Richard Smith | 10 July 2014, 14:24:03 UTC | typos | 10 July 2014, 14:24:03 UTC |
b3c4d26 | Richard Smith | 10 July 2014, 14:22:15 UTC | add expanded description | 10 July 2014, 14:22:15 UTC |
87a5aaa | Richard Smith | 10 July 2014, 08:36:53 UTC | add scraper detection using new thresher ScraperBox API | 10 July 2014, 08:36:53 UTC |
1ff282c | Richard Smith | 10 July 2014, 08:36:32 UTC | yank coverage - no longer relevant for one-page app | 10 July 2014, 08:36:32 UTC |
00d5edf | Richard Smith | 06 July 2014, 12:12:20 UTC | Merge branch 'master' of github.com:ContentMine/quickscrape | 06 July 2014, 12:12:20 UTC |