https://github.com/kermitt2/grobid

sort by:
Revision Author Date Message Commit Date
855c6e6 various experiments 02 November 2020, 06:22:30 UTC
0b80328 wip docker file 30 October 2020, 07:02:35 UTC
3ca47dc wip docker file 30 October 2020, 06:53:53 UTC
294c2b5 adding some more logging 14 October 2020, 03:23:17 UTC
50b784b output the model name for debugging 12 October 2020, 02:44:13 UTC
b0560b5 improve documentation on delft configuration 12 October 2020, 00:15:17 UTC
bda9314 more self-explaining property file regarding DL models 07 October 2020, 16:09:29 UTC
2fd52bd add jni class for DeLFT classifier via JEP 07 October 2020, 15:46:38 UTC
ad5276c review benchmarking doc and licenses 05 October 2020, 09:09:21 UTC
570c8c0 update citation deep learning model and add corresponding benchmarks 04 October 2020, 08:31:59 UTC
41d5e6a update doc 02 October 2020, 17:48:27 UTC
01ab5c9 fix error case sentence segmentation without sentence 28 September 2020, 15:22:24 UTC
f43e301 support for page number ranges with postfix letter 24 September 2020, 15:32:55 UTC
ebdbb89 Merge branch 'master' of https://github.com/kermitt2/grobid 24 September 2020, 13:20:50 UTC
32b3123 upate citation model 24 September 2020, 13:20:31 UTC
a62daf3 fix on page range string manipulation 23 September 2020, 22:00:31 UTC
39fa6a9 better normalisation of page ranges 23 September 2020, 20:01:35 UTC
295e468 review serialization of series and editors in bib. ref. 23 September 2020, 16:58:15 UTC
4be789e Merge branch 'master' of https://github.com/kermitt2/grobid 23 September 2020, 10:51:57 UTC
d18f5e4 add specific series field in the citation model 23 September 2020, 10:51:39 UTC
0a22345 typo in formatting doc 23 September 2020, 02:43:47 UTC
a69290a fix issue with space characters in generated training data for citation model, add specific <series> label 23 September 2020, 02:06:21 UTC
ac47e1f updated citation model for better identifier coverage 20 September 2020, 18:36:06 UTC
295be65 review handling of identifiers in ref bib, add training data 20 September 2020, 07:53:59 UTC
4d1938c Merge pull request #640 from OrionNebula/OrionNebula--gradle_sentence_segmentation Add "sentence-segmentation/**" to Gradle package 16 September 2020, 23:01:34 UTC
6d3d927 Add "sentence-segmentation/**" to Gradle package This adds the `sentence-segmentation` folder to the Gradle package definition. Without this, sentence segmentation does not work inside the Docker container (as the Ruby source is not found). 16 September 2020, 22:19:24 UTC
51ec543 update url with https:// 16 September 2020, 06:03:01 UTC
6d82c9e customize pmc eval 09 September 2020, 19:47:41 UTC
e262107 add doi, pmid and pmcid fields in the end-to-end eval 09 September 2020, 17:28:14 UTC
f56c7af Merge pull request #636 from elifesciences/enable-dist-zip-for-docker-build enabled service dist zip for docker build 08 September 2020, 08:04:51 UTC
d43df05 enabled service dist zip for docker build 07 September 2020, 20:46:46 UTC
46e4f91 add optional sentence segments and coordinates in figure/tacle captions and notes 25 August 2020, 23:53:38 UTC
d171bfc fix problem with sentence coordinates 25 August 2020, 06:31:10 UTC
3ba122e clean useless property 24 August 2020, 23:05:46 UTC
cc8a78a Merge pull request #625 from kermitt2/sentence-segmentation Sentence segmentation 24 August 2020, 21:59:28 UTC
c34f810 Update change log 24 August 2020, 21:53:43 UTC
ff0bd69 Revert "adding REST API for the sentence segmentation" This reverts commit 3b81561eb221ba23013d74fa6beb33f321230cac. 24 August 2020, 21:44:55 UTC
ec78709 fix deprecated crossref field query.title 21 August 2020, 16:46:10 UTC
50f2fac Merge pull request #627 from kermitt2/revert-624-bugfix/crossref-api Revert "update call to crossref when title is supplied" 21 August 2020, 16:19:04 UTC
8f2e44c Revert "update call to crossref when title is supplied" 21 August 2020, 16:11:07 UTC
3b81561 adding REST API for the sentence segmentation 21 August 2020, 04:56:55 UTC
9b8437d add optional sentence segmentation in the figure and table captions 21 August 2020, 03:44:34 UTC
a1468ae Modify schema to allow text structure under figDesc, so that we can add sentence segmentation 21 August 2020, 03:08:04 UTC
f39a3bc set back default config 20 August 2020, 22:30:05 UTC
aaf0982 add some additional safety tests 20 August 2020, 21:08:09 UTC
a9f8678 add segmentSentences option to batch processing too 20 August 2020, 20:58:36 UTC
c594d67 document the sentence segmentation option 20 August 2020, 20:03:02 UTC
7d7e952 cleaning 20 August 2020, 19:40:27 UTC
9297f1d reorganize sentence segmentation resources 20 August 2020, 19:31:05 UTC
c5e9ff4 fix issues related to the string modification as returned by the pragmatic segmenter 20 August 2020, 17:43:13 UTC
70e4239 fix jruby multithreaded runtime 20 August 2020, 06:17:23 UTC
6551c95 add pragmatic segmenter implementation via jruby 20 August 2020, 05:36:10 UTC
68b8f2c manage OpenNLP detector for multithread (not thread safe) 20 August 2020, 03:05:22 UTC
1bea9a9 update XML schema to support <s> 20 August 2020, 01:50:39 UTC
7a21aed fix sentence coordinate issue 20 August 2020, 01:34:35 UTC
947a07e Merge pull request #624 from kermitt2/bugfix/crossref-api update call to crossref when title is supplied 20 August 2020, 01:10:29 UTC
dce0230 add tests 20 August 2020, 01:10:06 UTC
469c52a avoid useless serialization of the header; fix abstract structuring 20 August 2020, 00:40:18 UTC
21dc59c query title on query.bibliographics 19 August 2020, 23:35:03 UTC
f9f060b support forbidden spans in sentence segmentation 19 August 2020, 23:17:13 UTC
2d3070a add factory interface for implementation of sentence segmentation ; add OpenNLP implementation 19 August 2020, 20:17:39 UTC
a11dec8 double check idiom for lexicon lazy loading in multithreaded usage 19 August 2020, 19:35:11 UTC
cfa50ab fix tests 19 August 2020, 15:22:04 UTC
1354bf6 add <s> segmentation option, validation of the approach with openNLP 19 August 2020, 15:07:12 UTC
99831f7 avoid string concat 18 August 2020, 14:31:24 UTC
0860f61 update release documentation 12 August 2020, 20:58:45 UTC
ad877a2 Merge branch 'master' of https://github.com/kermitt2/grobid 12 August 2020, 20:18:39 UTC
d452205 update doc for release 0.6.1 12 August 2020, 20:18:25 UTC
a348041 [Gradle Release Plugin] - new version commit: '0.6.2-SNAPSHOT'. 12 August 2020, 18:41:30 UTC
95c609b [Gradle Release Plugin] - pre tag commit: '0.6.1'. 12 August 2020, 18:38:19 UTC
7ddbd46 update doc 12 August 2020, 13:27:12 UTC
aeb0db8 Merge pull request #597 from kermitt2/feature/update_jdk_and_gradle Update to gradle 6.5.1 to support JDK 13 and 14 12 August 2020, 11:29:41 UTC
6b890a8 modify end-to-end eval action names 12 August 2020, 11:18:51 UTC
9bb1380 Merge pull request #563 from kermitt2/keep_affiliations_after_consolidation keep the extracted affiliations if none found from consolidation. 12 August 2020, 09:45:18 UTC
f1c3cf9 some fixes 12 August 2020, 09:00:15 UTC
3b08564 Merge branch 'master' into feature/update_jdk_and_gradle 12 August 2020, 02:42:51 UTC
1323638 refine matching 11 August 2020, 22:54:11 UTC
d8fe0a5 Merge branch 'master' into keep_affiliations_after_consolidation 11 August 2020, 22:14:06 UTC
c7b4b20 fix issues with #562 observed after merging 11 August 2020, 22:12:57 UTC
a998055 back to previous approach, with improvements 11 August 2020, 20:24:28 UTC
886bf8a Merge branch 'master' into keep_affiliations_after_consolidation 11 August 2020, 15:58:05 UTC
678c816 Merge pull request #562 from kermitt2/512_PDF_and_authors_with_ORCID Check whether annotation is orcid and add orcid to author. 11 August 2020, 15:56:23 UTC
1501601 Merge branch 'master' into keep_affiliations_after_consolidation 11 August 2020, 14:49:49 UTC
227a804 defaulting crossref consolidation 11 August 2020, 14:48:25 UTC
753b072 Merge branch 'master' into 512_PDF_and_authors_with_ORCID 11 August 2020, 14:20:48 UTC
5f93224 fix tests 11 August 2020, 13:08:43 UTC
82995a1 some simplification 11 August 2020, 11:42:54 UTC
8255d99 use Person deduplication for injecting consolidated authors 11 August 2020, 11:37:35 UTC
853d82d Merge branch 'master' into keep_affiliations_after_consolidation 11 August 2020, 10:37:46 UTC
6c2a7c8 Merge pull request #591 from kermitt2/add-developers-guide Add users guide 11 August 2020, 09:27:50 UTC
670d06d Merge pull request #580 from kermitt2/update_header [WIP] Full update of the header model 11 August 2020, 09:12:45 UTC
bda53a9 Merge pull request #609 from bananaoomarang/more-env-config-overrides Allow overriding org.* properties with env vars 11 August 2020, 07:49:49 UTC
c3a4926 Merge branch 'master' into update_header 11 August 2020, 07:35:24 UTC
24774b3 Merge pull request #559 from elifesciences/make-crf-engine-model-specific allow model specific crf engine configuration 11 August 2020, 07:23:10 UTC
933090c add json preprocessor format 11 August 2020, 07:04:25 UTC
1098177 including dependencies in the fat jar 03 August 2020, 01:16:17 UTC
cf446ec Merge pull request #614 from mash/callout-to-reference-author-match Improve callout to reference match 02 August 2020, 19:24:14 UTC
11cbbcf remove parentheses often seen in callouts, for better callout to reference match 02 August 2020, 10:36:47 UTC
e9c371b fixed pass in model to getGrobidCRFEngine 27 July 2020, 13:39:15 UTC
05d999f Merge branch 'master' into make-crf-engine-model-specific 27 July 2020, 09:25:44 UTC
back to top