https://github.com/kermitt2/grobid

Revision Message Commit Date
1b8eaf9 Merge branch 'master' into improvement/remove-log-on-file-docker 18 April 2022, 06:53:40 UTC
1e6903e update benchmarks in doc 16 April 2022, 22:09:20 UTC
7e09747 move to 0.7.2-SNAPSHOT 16 April 2022, 18:49:28 UTC
910777d prepare release 0.7.1 16 April 2022, 18:23:15 UTC
df80445 update DeLFT version to 0.3.1 16 April 2022, 16:54:10 UTC
272dff6 retrain one model 16 April 2022, 16:25:18 UTC
4e7671b update benchmarks 16 April 2022, 16:24:45 UTC
85df3f5 Merge branch 'master' of github.com:kermitt2/grobid 16 April 2022, 15:21:34 UTC
102c686 fix for #908 16 April 2022, 15:17:55 UTC
026725c missing serialization of final feature for header 16 April 2022, 01:34:28 UTC
b0a58f7 update benchmarks 15 April 2022, 21:31:27 UTC
bf906e5 update DOI only request to glutton 14 April 2022, 18:54:01 UTC
d7f31c4 update parameters to processFulltextAssetDocument 13 April 2022, 15:42:11 UTC
0dbbae6 Merge pull request #870 from kermitt2/biblio-glutton-v0.2 Support for biblio-glutton v0.2 12 April 2022, 16:41:06 UTC
6fa5690 Merge branch 'master' into biblio-glutton-v0.2 11 April 2022, 16:18:53 UTC
4281f29 minor updates 11 April 2022, 16:18:37 UTC
8436dde missing branch switch 02 April 2022, 14:19:48 UTC
7305af9 Merge pull request #896 from kermitt2/delft-0.3.0 Update to DeLFT 0.3.0 31 March 2022, 19:25:13 UTC
ec6ab85 Merge branch 'master' into delft-0.3.0 31 March 2022, 09:38:54 UTC
fa142af adapt docker image for new delft version 31 March 2022, 09:27:55 UTC
0caf242 update docker file and script for new delft version 30 March 2022, 16:20:19 UTC
79ebdd7 more robustness wrt sentence segmenter output 26 March 2022, 19:17:10 UTC
7466fb6 filter out possible corrupted characters introduced by the sentence segmenter 23 March 2022, 20:56:08 UTC
d8e2afd filter out possible corrupted characters introduced by the sentence segmenter 23 March 2022, 20:48:12 UTC
dcc8438 cleaning and complete classifier runtime options 19 March 2022, 20:15:13 UTC
4a8393c fix classifier name for python compatibility 19 March 2022, 17:56:05 UTC
1c08cf3 cleaning 17 March 2022, 17:07:36 UTC
4e25350 change to JEP SharedInterpreter to support python packages not working with python subinterpreter; adapt DL classifier 15 March 2022, 17:27:54 UTC
b22e634 update dockerfile for DL models 14 March 2022, 23:54:25 UTC
e6d2dfe remove non relevant model 13 March 2022, 23:19:44 UTC
8811d4e update doc and resources 13 March 2022, 20:50:05 UTC
856ce28 update and add new DL models 12 March 2022, 23:44:07 UTC
d2a1e90 add JEP library install script 10 March 2022, 15:17:35 UTC
4c1c79a adapt library loading to new jep/delft 10 March 2022, 13:41:36 UTC
0626a2c update circleci image, fourth try 09 March 2022, 22:19:44 UTC
840fc2f update circleci image, third try 09 March 2022, 21:45:37 UTC
dacae89 update circleci image, second try 09 March 2022, 21:42:16 UTC
78c077b update circleci image 09 March 2022, 21:38:29 UTC
966c3b6 restaure default config file 09 March 2022, 20:47:46 UTC
1d888c1 upgrade JEP to 4.0.2; update wapiti and jep binaries; update DeLFT scripting; update citation models to delft 0.3.0 09 March 2022, 20:26:41 UTC
285f183 update doc for delft 0.3.0 and JEP version for python 3.8 09 March 2022, 18:28:14 UTC
a953035 support transformer name in the config 09 March 2022, 15:09:40 UTC
071fb8b Merge branch 'master' into biblio-glutton-v0.2 09 March 2022, 14:53:54 UTC
1f1ebd4 add more standard language codes 22 February 2022, 16:50:20 UTC
621f5a1 fix for #836 17 February 2022, 04:32:50 UTC
3373b7b correct model name for reference-segmenter 28 January 2022, 18:43:11 UTC
ed65612 avoid invalid xml:id in generated blank XML 23 December 2021, 17:22:20 UTC
5d7d211 Merge branch 'master' into biblio-glutton-v0.2 19 December 2021, 15:57:28 UTC
f466930 add pre-compiled regex 19 December 2021, 15:51:00 UTC
2e30c27 add some training data 19 December 2021, 11:56:21 UTC
7e69bd1 merge manually #511 19 December 2021, 10:52:29 UTC
b9b47ff better source linking in faq 18 December 2021, 19:50:17 UTC
b6bf216 doc typo 18 December 2021, 17:47:19 UTC
585dbc2 format last quote 18 December 2021, 17:42:47 UTC
d6c0384 make quotes more readable 18 December 2021, 17:40:39 UTC
b409e27 add faq entry on licensing 18 December 2021, 17:36:33 UTC
20a8208 Merge branch 'master' of github.com:kermitt2/grobid 11 December 2021, 16:51:37 UTC
d8ef689 markdown fix 11 December 2021, 16:51:28 UTC
b3f1199 Merge pull request #824 from kermitt2/bugfix/test-build-on-ci Add assemble build task to CI 09 December 2021, 11:47:22 UTC
8523c09 improve upload of test results 09 December 2021, 07:30:35 UTC
ba0d2d3 update circleci to upload correctly the test results 09 December 2021, 07:15:03 UTC
1248738 Revert "make simpler tests in ci" This reverts commit c8f1a4dcb0bcb27f6374026f331316eecdd87acf. 09 December 2021, 06:47:04 UTC
6e5236e Merge pull request #868 from kermitt2/fix-867 Fix regex, #867 07 December 2021, 04:58:47 UTC
dd0251d fix catastrophic backtracking issue with atomic grouping; tests 04 December 2021, 09:20:57 UTC
fe81e85 review comments 04 December 2021, 08:56:19 UTC
938d12a Merge pull request #854 from bnewbold/crossref-citation-training Annotated crossref citation training data 23 November 2021, 03:27:37 UTC
49c14ac back to <orgName> for university of disseration 20 November 2021, 01:51:52 UTC
c64b1fe update citation annotations based on review 15 November 2021, 23:08:19 UTC
ca379f7 bioRxiv training refs: normalize <orgname> -> <orgName> 15 November 2021, 23:04:20 UTC
d02420e updates to reference annotation docs Based on PR review 15 November 2021, 23:03:37 UTC
dc1f94a better handling of non PDF in the demo 15 November 2021, 01:29:27 UTC
cfd2f05 one training data correction 13 November 2021, 20:11:34 UTC
b400916 Merge pull request #857 from bnewbold/bnewbold-arxiv-url-abs visualizer: use arxiv.org/abs/<id> instead of arxiv.org/<id> 13 November 2021, 16:50:12 UTC
4939802 Merge pull request #853 from bnewbold/multiple-editor TEI: multiple editors each get <editor> around <persName> 13 November 2021, 15:55:14 UTC
23a300d add a few more missing XML escape cases 13 November 2021, 15:49:25 UTC
7a37e75 Merge branch 'master' of github.com:kermitt2/grobid 13 November 2021, 15:08:23 UTC
9b347e0 cleaning unused TEI serialization variant 13 November 2021, 15:08:11 UTC
f73ec07 Merge pull request #852 from bnewbold/escape-more-identifiers Escape more identifiers, including ISSN 13 November 2021, 14:57:33 UTC
344f97a Merge pull request #850 from shisheng-1/Modify_GRADLE_1 Improve GRADLE build Performance 13 November 2021, 14:45:28 UTC
443a5ce Merge pull request #832 from bnewbold/minor-author-training-fixes training: small fixes to name (citation) data 13 November 2021, 14:43:13 UTC
58c76eb one more unescaped XML case: <label> in raw affiliations Also added a regression test for this specific case. 12 November 2021, 23:53:15 UTC
beb12b0 visualizer: use arxiv.org/abs/<id> instead of arxiv.org/<id> 12 November 2021, 22:58:34 UTC
4a432b2 delete "org.gradle.configureondemand = true" 12 November 2021, 03:11:02 UTC
2daad1d add citation and name-citation training date from crossref 'unstructured' references These are hand-labeled from these notes: https://gist.github.com/bnewbold/b437e363e6a0429719c65c751babe84d 09 November 2021, 03:05:45 UTC
235dcea citations.xml: small URL labeling adjustments Trying to reduce incidents of non-URL tokens getting added to start or end of URLs. 09 November 2021, 03:04:16 UTC
b554a26 TEI: multiple editors each get <editor> around <persName> 09 November 2021, 00:35:39 UTC
de205c7 escape more identifiers, including ISSN, in TEI-XML output 09 November 2021, 00:17:00 UTC
fece11a regression test for unescaped ISSN numbers 09 November 2021, 00:16:24 UTC
e58d075 Improve GRADLE build Performance 08 November 2021, 15:45:05 UTC
c8f1a4d make simpler tests in ci 24 October 2021, 19:02:43 UTC
beebd9a change default delft architecture from the configuration with existing ones to avoid confusion #843 20 October 2021, 09:19:23 UTC
f29a6f5 Typo #841 18 October 2021, 00:06:23 UTC
61252d7 Merge pull request #833 from kermitt2/feature/change-base-image-delft-docker Docker with GPU 12 October 2021, 14:30:01 UTC
336f951 update documentation 12 October 2021, 13:52:57 UTC
63a6bff avoid cookies warning 14 September 2021, 09:33:09 UTC
78c3659 better score formatting 14 September 2021, 08:24:06 UTC
da3f142 Fix typo in dockerfile 14 September 2021, 04:08:04 UTC
80b24a0 update web queries to glutton 12 September 2021, 14:26:35 UTC
436ffbc change base image of grobid docker for delft 10 September 2021, 09:33:03 UTC
98d9ed9 Merge pull request #831 from kermitt2/bugfix/fix-wrongly-assigned-day-consolidation fix deserialised consolidation day 08 September 2021, 05:03:16 UTC