855c6e6 | Luca Foppiano | 02 November 2020, 06:22:30 UTC | various experiments | 02 November 2020, 06:22:30 UTC |
0b80328 | Luca Foppiano | 30 October 2020, 07:02:35 UTC | wip docker file | 30 October 2020, 07:02:35 UTC |
3ca47dc | Luca Foppiano | 30 October 2020, 06:53:53 UTC | wip docker file | 30 October 2020, 06:53:53 UTC |
294c2b5 | Luca Foppiano | 14 October 2020, 03:23:17 UTC | adding some more logging | 14 October 2020, 03:23:17 UTC |
50b784b | Luca Foppiano | 12 October 2020, 02:44:13 UTC | output the model name for debugging | 12 October 2020, 02:44:13 UTC |
b0560b5 | Luca Foppiano | 12 October 2020, 00:15:17 UTC | improve documentation on delft configuration | 12 October 2020, 00:15:17 UTC |
bda9314 | lopez | 07 October 2020, 16:09:29 UTC | more self-explaining property file regarding DL models | 07 October 2020, 16:09:29 UTC |
2fd52bd | lopez | 07 October 2020, 15:46:38 UTC | add jni class for DeLFT classifier via JEP | 07 October 2020, 15:46:38 UTC |
ad5276c | lopez | 05 October 2020, 09:09:21 UTC | review benchmarking doc and licenses | 05 October 2020, 09:09:21 UTC |
570c8c0 | lopez | 04 October 2020, 08:31:59 UTC | update citation deep learning model and add corresponding benchmarks | 04 October 2020, 08:31:59 UTC |
41d5e6a | lopez | 02 October 2020, 17:48:27 UTC | update doc | 02 October 2020, 17:48:27 UTC |
01ab5c9 | lopez | 28 September 2020, 15:22:24 UTC | fix error case sentence segmentation without sentence | 28 September 2020, 15:22:24 UTC |
f43e301 | lopez | 24 September 2020, 15:32:55 UTC | support for page number ranges with postfix letter | 24 September 2020, 15:32:55 UTC |
ebdbb89 | kermitt2 | 24 September 2020, 13:20:50 UTC | Merge branch 'master' of https://github.com/kermitt2/grobid | 24 September 2020, 13:20:50 UTC |
32b3123 | kermitt2 | 24 September 2020, 13:20:31 UTC | upate citation model | 24 September 2020, 13:20:31 UTC |
a62daf3 | lopez | 23 September 2020, 22:00:31 UTC | fix on page range string manipulation | 23 September 2020, 22:00:31 UTC |
39fa6a9 | lopez | 23 September 2020, 20:01:35 UTC | better normalisation of page ranges | 23 September 2020, 20:01:35 UTC |
295e468 | lopez | 23 September 2020, 16:58:15 UTC | review serialization of series and editors in bib. ref. | 23 September 2020, 16:58:15 UTC |
4be789e | kermitt2 | 23 September 2020, 10:51:57 UTC | Merge branch 'master' of https://github.com/kermitt2/grobid | 23 September 2020, 10:51:57 UTC |
d18f5e4 | kermitt2 | 23 September 2020, 10:51:39 UTC | add specific series field in the citation model | 23 September 2020, 10:51:39 UTC |
0a22345 | lopez | 23 September 2020, 02:43:47 UTC | typo in formatting doc | 23 September 2020, 02:43:47 UTC |
a69290a | lopez | 23 September 2020, 02:06:21 UTC | fix issue with space characters in generated training data for citation model, add specific <series> label | 23 September 2020, 02:06:21 UTC |
ac47e1f | kermitt2 | 20 September 2020, 18:36:06 UTC | updated citation model for better identifier coverage | 20 September 2020, 18:36:06 UTC |
295be65 | lopez | 20 September 2020, 07:53:59 UTC | review handling of identifiers in ref bib, add training data | 20 September 2020, 07:53:59 UTC |
4d1938c | Patrice Lopez | 16 September 2020, 23:01:34 UTC | Merge pull request #640 from OrionNebula/OrionNebula--gradle_sentence_segmentation Add "sentence-segmentation/**" to Gradle package | 16 September 2020, 23:01:34 UTC |
6d3d927 | Olivia Trewin | 16 September 2020, 22:19:24 UTC | Add "sentence-segmentation/**" to Gradle package This adds the `sentence-segmentation` folder to the Gradle package definition. Without this, sentence segmentation does not work inside the Docker container (as the Ruby source is not found). | 16 September 2020, 22:19:24 UTC |
51ec543 | lopez | 16 September 2020, 06:03:01 UTC | update url with https:// | 16 September 2020, 06:03:01 UTC |
6d82c9e | lopez | 09 September 2020, 19:47:41 UTC | customize pmc eval | 09 September 2020, 19:47:41 UTC |
e262107 | lopez | 09 September 2020, 17:28:14 UTC | add doi, pmid and pmcid fields in the end-to-end eval | 09 September 2020, 17:28:14 UTC |
f56c7af | Patrice Lopez | 08 September 2020, 08:04:51 UTC | Merge pull request #636 from elifesciences/enable-dist-zip-for-docker-build enabled service dist zip for docker build | 08 September 2020, 08:04:51 UTC |
d43df05 | Daniel Ecer | 07 September 2020, 20:46:46 UTC | enabled service dist zip for docker build | 07 September 2020, 20:46:46 UTC |
46e4f91 | lopez | 25 August 2020, 23:53:38 UTC | add optional sentence segments and coordinates in figure/tacle captions and notes | 25 August 2020, 23:53:38 UTC |
d171bfc | lopez | 25 August 2020, 06:31:10 UTC | fix problem with sentence coordinates | 25 August 2020, 06:31:10 UTC |
3ba122e | lopez | 24 August 2020, 23:05:46 UTC | clean useless property | 24 August 2020, 23:05:46 UTC |
cc8a78a | Patrice Lopez | 24 August 2020, 21:59:28 UTC | Merge pull request #625 from kermitt2/sentence-segmentation Sentence segmentation | 24 August 2020, 21:59:28 UTC |
c34f810 | lopez | 24 August 2020, 21:53:43 UTC | Update change log | 24 August 2020, 21:53:43 UTC |
ff0bd69 | lopez | 24 August 2020, 21:44:55 UTC | Revert "adding REST API for the sentence segmentation" This reverts commit 3b81561eb221ba23013d74fa6beb33f321230cac. | 24 August 2020, 21:44:55 UTC |
ec78709 | Patrice Lopez | 21 August 2020, 16:46:10 UTC | fix deprecated crossref field query.title | 21 August 2020, 16:46:10 UTC |
50f2fac | Patrice Lopez | 21 August 2020, 16:19:04 UTC | Merge pull request #627 from kermitt2/revert-624-bugfix/crossref-api Revert "update call to crossref when title is supplied" | 21 August 2020, 16:19:04 UTC |
8f2e44c | Patrice Lopez | 21 August 2020, 16:11:07 UTC | Revert "update call to crossref when title is supplied" | 21 August 2020, 16:11:07 UTC |
3b81561 | Luca Foppiano | 21 August 2020, 04:53:59 UTC | adding REST API for the sentence segmentation | 21 August 2020, 04:56:55 UTC |
9b8437d | lopez | 21 August 2020, 03:44:34 UTC | add optional sentence segmentation in the figure and table captions | 21 August 2020, 03:44:34 UTC |
a1468ae | lopez | 21 August 2020, 03:08:04 UTC | Modify schema to allow text structure under figDesc, so that we can add sentence segmentation | 21 August 2020, 03:08:04 UTC |
f39a3bc | lopez | 20 August 2020, 22:30:05 UTC | set back default config | 20 August 2020, 22:30:05 UTC |
aaf0982 | lopez | 20 August 2020, 21:08:09 UTC | add some additional safety tests | 20 August 2020, 21:08:09 UTC |
a9f8678 | lopez | 20 August 2020, 20:58:36 UTC | add segmentSentences option to batch processing too | 20 August 2020, 20:58:36 UTC |
c594d67 | lopez | 20 August 2020, 20:03:02 UTC | document the sentence segmentation option | 20 August 2020, 20:03:02 UTC |
7d7e952 | lopez | 20 August 2020, 19:40:27 UTC | cleaning | 20 August 2020, 19:40:27 UTC |
9297f1d | lopez | 20 August 2020, 19:31:05 UTC | reorganize sentence segmentation resources | 20 August 2020, 19:31:05 UTC |
c5e9ff4 | lopez | 20 August 2020, 17:43:13 UTC | fix issues related to the string modification as returned by the pragmatic segmenter | 20 August 2020, 17:43:13 UTC |
70e4239 | lopez | 20 August 2020, 06:17:23 UTC | fix jruby multithreaded runtime | 20 August 2020, 06:17:23 UTC |
6551c95 | lopez | 20 August 2020, 05:36:10 UTC | add pragmatic segmenter implementation via jruby | 20 August 2020, 05:36:10 UTC |
68b8f2c | lopez | 20 August 2020, 03:05:22 UTC | manage OpenNLP detector for multithread (not thread safe) | 20 August 2020, 03:05:22 UTC |
1bea9a9 | lopez | 20 August 2020, 01:50:39 UTC | update XML schema to support <s> | 20 August 2020, 01:50:39 UTC |
7a21aed | lopez | 20 August 2020, 01:34:35 UTC | fix sentence coordinate issue | 20 August 2020, 01:34:35 UTC |
947a07e | Patrice Lopez | 20 August 2020, 01:10:29 UTC | Merge pull request #624 from kermitt2/bugfix/crossref-api update call to crossref when title is supplied | 20 August 2020, 01:10:29 UTC |
dce0230 | lopez | 20 August 2020, 01:10:06 UTC | add tests | 20 August 2020, 01:10:06 UTC |
469c52a | lopez | 20 August 2020, 00:40:18 UTC | avoid useless serialization of the header; fix abstract structuring | 20 August 2020, 00:40:18 UTC |
21dc59c | Luca Foppiano | 19 August 2020, 23:35:03 UTC | query title on query.bibliographics | 19 August 2020, 23:35:03 UTC |
f9f060b | lopez | 19 August 2020, 23:17:13 UTC | support forbidden spans in sentence segmentation | 19 August 2020, 23:17:13 UTC |
2d3070a | lopez | 19 August 2020, 20:17:39 UTC | add factory interface for implementation of sentence segmentation ; add OpenNLP implementation | 19 August 2020, 20:17:39 UTC |
a11dec8 | lopez | 19 August 2020, 19:35:11 UTC | double check idiom for lexicon lazy loading in multithreaded usage | 19 August 2020, 19:35:11 UTC |
cfa50ab | lopez | 19 August 2020, 15:22:04 UTC | fix tests | 19 August 2020, 15:22:04 UTC |
1354bf6 | lopez | 19 August 2020, 15:07:12 UTC | add <s> segmentation option, validation of the approach with openNLP | 19 August 2020, 15:07:12 UTC |
99831f7 | lopez | 18 August 2020, 14:31:24 UTC | avoid string concat | 18 August 2020, 14:31:24 UTC |
0860f61 | lopez | 12 August 2020, 20:58:45 UTC | update release documentation | 12 August 2020, 20:58:45 UTC |
ad877a2 | lopez | 12 August 2020, 20:18:39 UTC | Merge branch 'master' of https://github.com/kermitt2/grobid | 12 August 2020, 20:18:39 UTC |
d452205 | lopez | 12 August 2020, 20:18:25 UTC | update doc for release 0.6.1 | 12 August 2020, 20:18:25 UTC |
a348041 | lopez | 12 August 2020, 18:41:30 UTC | [Gradle Release Plugin] - new version commit: '0.6.2-SNAPSHOT'. | 12 August 2020, 18:41:30 UTC |
95c609b | lopez | 12 August 2020, 18:38:19 UTC | [Gradle Release Plugin] - pre tag commit: '0.6.1'. | 12 August 2020, 18:38:19 UTC |
7ddbd46 | lopez | 12 August 2020, 13:27:12 UTC | update doc | 12 August 2020, 13:27:12 UTC |
aeb0db8 | Patrice Lopez | 12 August 2020, 11:29:41 UTC | Merge pull request #597 from kermitt2/feature/update_jdk_and_gradle Update to gradle 6.5.1 to support JDK 13 and 14 | 12 August 2020, 11:29:41 UTC |
6b890a8 | lopez | 12 August 2020, 11:18:51 UTC | modify end-to-end eval action names | 12 August 2020, 11:18:51 UTC |
9bb1380 | Patrice Lopez | 12 August 2020, 09:45:18 UTC | Merge pull request #563 from kermitt2/keep_affiliations_after_consolidation keep the extracted affiliations if none found from consolidation. | 12 August 2020, 09:45:18 UTC |
f1c3cf9 | lopez | 12 August 2020, 09:00:15 UTC | some fixes | 12 August 2020, 09:00:15 UTC |
3b08564 | Luca Foppiano | 12 August 2020, 02:42:51 UTC | Merge branch 'master' into feature/update_jdk_and_gradle | 12 August 2020, 02:42:51 UTC |
1323638 | lopez | 11 August 2020, 22:54:11 UTC | refine matching | 11 August 2020, 22:54:11 UTC |
d8fe0a5 | lopez | 11 August 2020, 22:14:06 UTC | Merge branch 'master' into keep_affiliations_after_consolidation | 11 August 2020, 22:14:06 UTC |
c7b4b20 | lopez | 11 August 2020, 22:12:57 UTC | fix issues with #562 observed after merging | 11 August 2020, 22:12:57 UTC |
a998055 | lopez | 11 August 2020, 20:24:28 UTC | back to previous approach, with improvements | 11 August 2020, 20:24:28 UTC |
886bf8a | lopez | 11 August 2020, 15:58:05 UTC | Merge branch 'master' into keep_affiliations_after_consolidation | 11 August 2020, 15:58:05 UTC |
678c816 | Patrice Lopez | 11 August 2020, 15:56:23 UTC | Merge pull request #562 from kermitt2/512_PDF_and_authors_with_ORCID Check whether annotation is orcid and add orcid to author. | 11 August 2020, 15:56:23 UTC |
1501601 | lopez | 11 August 2020, 14:49:49 UTC | Merge branch 'master' into keep_affiliations_after_consolidation | 11 August 2020, 14:49:49 UTC |
227a804 | lopez | 11 August 2020, 14:48:25 UTC | defaulting crossref consolidation | 11 August 2020, 14:48:25 UTC |
753b072 | Patrice Lopez | 11 August 2020, 14:20:48 UTC | Merge branch 'master' into 512_PDF_and_authors_with_ORCID | 11 August 2020, 14:20:48 UTC |
5f93224 | lopez | 11 August 2020, 13:08:43 UTC | fix tests | 11 August 2020, 13:08:43 UTC |
82995a1 | lopez | 11 August 2020, 11:42:54 UTC | some simplification | 11 August 2020, 11:42:54 UTC |
8255d99 | lopez | 11 August 2020, 11:37:35 UTC | use Person deduplication for injecting consolidated authors | 11 August 2020, 11:37:35 UTC |
853d82d | lopez | 11 August 2020, 10:37:46 UTC | Merge branch 'master' into keep_affiliations_after_consolidation | 11 August 2020, 10:37:46 UTC |
6c2a7c8 | Patrice Lopez | 11 August 2020, 09:27:50 UTC | Merge pull request #591 from kermitt2/add-developers-guide Add users guide | 11 August 2020, 09:27:50 UTC |
670d06d | Patrice Lopez | 11 August 2020, 09:12:45 UTC | Merge pull request #580 from kermitt2/update_header [WIP] Full update of the header model | 11 August 2020, 09:12:45 UTC |
bda53a9 | Patrice Lopez | 11 August 2020, 07:49:49 UTC | Merge pull request #609 from bananaoomarang/more-env-config-overrides Allow overriding org.* properties with env vars | 11 August 2020, 07:49:49 UTC |
c3a4926 | lopez | 11 August 2020, 07:35:24 UTC | Merge branch 'master' into update_header | 11 August 2020, 07:35:24 UTC |
24774b3 | Patrice Lopez | 11 August 2020, 07:23:10 UTC | Merge pull request #559 from elifesciences/make-crf-engine-model-specific allow model specific crf engine configuration | 11 August 2020, 07:23:10 UTC |
933090c | lopez | 11 August 2020, 07:04:25 UTC | add json preprocessor format | 11 August 2020, 07:04:25 UTC |
1098177 | Luca Foppiano | 03 August 2020, 01:16:17 UTC | including dependencies in the fat jar | 03 August 2020, 01:16:17 UTC |
cf446ec | Patrice Lopez | 02 August 2020, 19:24:14 UTC | Merge pull request #614 from mash/callout-to-reference-author-match Improve callout to reference match | 02 August 2020, 19:24:14 UTC |
11cbbcf | Masakazu Ohtsuka | 02 August 2020, 10:03:12 UTC | remove parentheses often seen in callouts, for better callout to reference match | 02 August 2020, 10:36:47 UTC |
e9c371b | Daniel Ecer | 27 July 2020, 13:39:15 UTC | fixed pass in model to getGrobidCRFEngine | 27 July 2020, 13:39:15 UTC |
05d999f | Daniel Ecer | 27 July 2020, 09:25:44 UTC | Merge branch 'master' into make-crf-engine-model-specific | 27 July 2020, 09:25:44 UTC |