1b8eaf9 | Luca Foppiano | 18 April 2022, 06:53:40 UTC | Merge branch 'master' into improvement/remove-log-on-file-docker | 18 April 2022, 06:53:40 UTC |
1e6903e | lopez | 16 April 2022, 22:09:20 UTC | update benchmarks in doc | 16 April 2022, 22:09:20 UTC |
7e09747 | lopez | 16 April 2022, 18:49:28 UTC | move to 0.7.2-SNAPSHOT | 16 April 2022, 18:49:28 UTC |
910777d | lopez | 16 April 2022, 18:23:15 UTC | prepare release 0.7.1 | 16 April 2022, 18:23:15 UTC |
df80445 | lopez | 16 April 2022, 16:54:10 UTC | update DeLFT version to 0.3.1 | 16 April 2022, 16:54:10 UTC |
272dff6 | Patrice Lopez | 16 April 2022, 16:25:18 UTC | retrain one model | 16 April 2022, 16:25:18 UTC |
4e7671b | lopez | 16 April 2022, 16:24:45 UTC | update benchmarks | 16 April 2022, 16:24:45 UTC |
85df3f5 | lopez | 16 April 2022, 15:21:34 UTC | Merge branch 'master' of github.com:kermitt2/grobid | 16 April 2022, 15:21:34 UTC |
102c686 | lopez | 16 April 2022, 15:17:55 UTC | fix for #908 | 16 April 2022, 15:17:55 UTC |
026725c | Patrice Lopez | 16 April 2022, 01:34:28 UTC | missing serialization of final feature for header | 16 April 2022, 01:34:28 UTC |
b0a58f7 | lopez | 15 April 2022, 21:31:27 UTC | update benchmarks | 15 April 2022, 21:31:27 UTC |
bf906e5 | lopez | 14 April 2022, 18:54:01 UTC | update DOI only request to glutton | 14 April 2022, 18:54:01 UTC |
d7f31c4 | lopez | 13 April 2022, 15:42:11 UTC | update parameters to processFulltextAssetDocument | 13 April 2022, 15:42:11 UTC |
0dbbae6 | Patrice Lopez | 12 April 2022, 16:41:06 UTC | Merge pull request #870 from kermitt2/biblio-glutton-v0.2 Support for biblio-glutton v0.2 | 12 April 2022, 16:41:06 UTC |
6fa5690 | lopez | 11 April 2022, 16:18:53 UTC | Merge branch 'master' into biblio-glutton-v0.2 | 11 April 2022, 16:18:53 UTC |
4281f29 | lopez | 11 April 2022, 16:18:37 UTC | minor updates | 11 April 2022, 16:18:37 UTC |
8436dde | lopez | 02 April 2022, 14:19:48 UTC | missing branch switch | 02 April 2022, 14:19:48 UTC |
7305af9 | Patrice Lopez | 31 March 2022, 19:25:13 UTC | Merge pull request #896 from kermitt2/delft-0.3.0 Update to DeLFT 0.3.0 | 31 March 2022, 19:25:13 UTC |
ec6ab85 | lopez | 31 March 2022, 09:38:54 UTC | Merge branch 'master' into delft-0.3.0 | 31 March 2022, 09:38:54 UTC |
fa142af | lopez | 31 March 2022, 09:27:55 UTC | adapt docker image for new delft version | 31 March 2022, 09:27:55 UTC |
0caf242 | lopez | 30 March 2022, 16:20:19 UTC | update docker file and script for new delft version | 30 March 2022, 16:20:19 UTC |
79ebdd7 | lopez | 26 March 2022, 19:17:10 UTC | more robustness wrt sentence segmenter output | 26 March 2022, 19:17:10 UTC |
7466fb6 | lopez | 23 March 2022, 20:56:08 UTC | filter out possible corrupted characters introduced by the sentence segmenter | 23 March 2022, 20:56:08 UTC |
d8e2afd | lopez | 23 March 2022, 20:48:12 UTC | filter out possible corrupted characters introduced by the sentence segmenter | 23 March 2022, 20:48:12 UTC |
dcc8438 | lopez | 19 March 2022, 20:15:13 UTC | cleaning and complete classifier runtime options | 19 March 2022, 20:15:13 UTC |
4a8393c | lopez | 19 March 2022, 17:56:05 UTC | fix classifier name for python compatibility | 19 March 2022, 17:56:05 UTC |
1c08cf3 | lopez | 17 March 2022, 17:07:36 UTC | cleaning | 17 March 2022, 17:07:36 UTC |
4e25350 | lopez | 15 March 2022, 17:27:54 UTC | change to JEP SharedInterpreter to support python packages not working with python subinterpreter; adapt DL classifier | 15 March 2022, 17:27:54 UTC |
b22e634 | lopez | 14 March 2022, 23:54:25 UTC | update dockerfile for DL models | 14 March 2022, 23:54:25 UTC |
e6d2dfe | lopez | 13 March 2022, 23:19:44 UTC | remove non relevant model | 13 March 2022, 23:19:44 UTC |
8811d4e | lopez | 13 March 2022, 20:50:05 UTC | update doc and resources | 13 March 2022, 20:50:05 UTC |
856ce28 | lopez | 12 March 2022, 23:44:07 UTC | update and add new DL models | 12 March 2022, 23:44:07 UTC |
d2a1e90 | lopez | 10 March 2022, 15:17:35 UTC | add JEP library install script | 10 March 2022, 15:17:35 UTC |
4c1c79a | lopez | 10 March 2022, 13:41:36 UTC | adapt library loading to new jep/delft | 10 March 2022, 13:41:36 UTC |
0626a2c | lopez | 09 March 2022, 22:19:44 UTC | update circleci image, fourth try | 09 March 2022, 22:19:44 UTC |
840fc2f | lopez | 09 March 2022, 21:45:37 UTC | update circleci image, third try | 09 March 2022, 21:45:37 UTC |
dacae89 | lopez | 09 March 2022, 21:42:16 UTC | update circleci image, second try | 09 March 2022, 21:42:16 UTC |
78c077b | lopez | 09 March 2022, 21:38:29 UTC | update circleci image | 09 March 2022, 21:38:29 UTC |
966c3b6 | lopez | 09 March 2022, 20:47:46 UTC | restaure default config file | 09 March 2022, 20:47:46 UTC |
1d888c1 | lopez | 09 March 2022, 20:26:41 UTC | upgrade JEP to 4.0.2; update wapiti and jep binaries; update DeLFT scripting; update citation models to delft 0.3.0 | 09 March 2022, 20:26:41 UTC |
285f183 | lopez | 09 March 2022, 18:28:14 UTC | update doc for delft 0.3.0 and JEP version for python 3.8 | 09 March 2022, 18:28:14 UTC |
a953035 | lopez | 09 March 2022, 15:09:40 UTC | support transformer name in the config | 09 March 2022, 15:09:40 UTC |
071fb8b | lopez | 09 March 2022, 14:53:54 UTC | Merge branch 'master' into biblio-glutton-v0.2 | 09 March 2022, 14:53:54 UTC |
1f1ebd4 | lopez | 22 February 2022, 16:50:20 UTC | add more standard language codes | 22 February 2022, 16:50:20 UTC |
621f5a1 | lopez | 17 February 2022, 04:32:50 UTC | fix for #836 | 17 February 2022, 04:32:50 UTC |
3373b7b | lopez | 28 January 2022, 18:43:11 UTC | correct model name for reference-segmenter | 28 January 2022, 18:43:11 UTC |
ed65612 | lopez | 23 December 2021, 17:22:20 UTC | avoid invalid xml:id in generated blank XML | 23 December 2021, 17:22:20 UTC |
5d7d211 | lopez | 19 December 2021, 15:57:28 UTC | Merge branch 'master' into biblio-glutton-v0.2 | 19 December 2021, 15:57:28 UTC |
f466930 | lopez | 19 December 2021, 15:51:00 UTC | add pre-compiled regex | 19 December 2021, 15:51:00 UTC |
2e30c27 | lopez | 19 December 2021, 11:56:21 UTC | add some training data | 19 December 2021, 11:56:21 UTC |
7e69bd1 | lopez | 19 December 2021, 10:52:29 UTC | merge manually #511 | 19 December 2021, 10:52:29 UTC |
b9b47ff | lopez | 18 December 2021, 19:50:17 UTC | better source linking in faq | 18 December 2021, 19:50:17 UTC |
b6bf216 | lopez | 18 December 2021, 17:47:19 UTC | doc typo | 18 December 2021, 17:47:19 UTC |
585dbc2 | lopez | 18 December 2021, 17:42:47 UTC | format last quote | 18 December 2021, 17:42:47 UTC |
d6c0384 | lopez | 18 December 2021, 17:40:39 UTC | make quotes more readable | 18 December 2021, 17:40:39 UTC |
b409e27 | lopez | 18 December 2021, 17:36:33 UTC | add faq entry on licensing | 18 December 2021, 17:36:33 UTC |
20a8208 | lopez | 11 December 2021, 16:51:37 UTC | Merge branch 'master' of github.com:kermitt2/grobid | 11 December 2021, 16:51:37 UTC |
d8ef689 | lopez | 11 December 2021, 16:51:28 UTC | markdown fix | 11 December 2021, 16:51:28 UTC |
b3f1199 | Patrice Lopez | 09 December 2021, 11:47:22 UTC | Merge pull request #824 from kermitt2/bugfix/test-build-on-ci Add assemble build task to CI | 09 December 2021, 11:47:22 UTC |
8523c09 | Luca Foppiano | 09 December 2021, 07:30:35 UTC | improve upload of test results | 09 December 2021, 07:30:35 UTC |
ba0d2d3 | Luca Foppiano | 09 December 2021, 07:15:03 UTC | update circleci to upload correctly the test results | 09 December 2021, 07:15:03 UTC |
1248738 | Luca Foppiano | 09 December 2021, 06:47:04 UTC | Revert "make simpler tests in ci" This reverts commit c8f1a4dcb0bcb27f6374026f331316eecdd87acf. | 09 December 2021, 06:47:04 UTC |
6e5236e | Patrice Lopez | 07 December 2021, 04:58:47 UTC | Merge pull request #868 from kermitt2/fix-867 Fix regex, #867 | 07 December 2021, 04:58:47 UTC |
dd0251d | lopez | 04 December 2021, 09:20:57 UTC | fix catastrophic backtracking issue with atomic grouping; tests | 04 December 2021, 09:20:57 UTC |
fe81e85 | lopez | 04 December 2021, 08:56:19 UTC | review comments | 04 December 2021, 08:56:19 UTC |
938d12a | Patrice Lopez | 23 November 2021, 03:27:37 UTC | Merge pull request #854 from bnewbold/crossref-citation-training Annotated crossref citation training data | 23 November 2021, 03:27:37 UTC |
49c14ac | Bryan Newbold | 20 November 2021, 01:51:52 UTC | back to <orgName> for university of disseration | 20 November 2021, 01:51:52 UTC |
c64b1fe | Bryan Newbold | 15 November 2021, 23:08:19 UTC | update citation annotations based on review | 15 November 2021, 23:08:19 UTC |
ca379f7 | Bryan Newbold | 15 November 2021, 23:04:20 UTC | bioRxiv training refs: normalize <orgname> -> <orgName> | 15 November 2021, 23:04:20 UTC |
d02420e | Bryan Newbold | 15 November 2021, 23:03:36 UTC | updates to reference annotation docs Based on PR review | 15 November 2021, 23:03:37 UTC |
dc1f94a | lopez | 15 November 2021, 01:29:27 UTC | better handling of non PDF in the demo | 15 November 2021, 01:29:27 UTC |
cfd2f05 | lopez | 13 November 2021, 20:11:34 UTC | one training data correction | 13 November 2021, 20:11:34 UTC |
b400916 | Patrice Lopez | 13 November 2021, 16:50:12 UTC | Merge pull request #857 from bnewbold/bnewbold-arxiv-url-abs visualizer: use arxiv.org/abs/<id> instead of arxiv.org/<id> | 13 November 2021, 16:50:12 UTC |
4939802 | Patrice Lopez | 13 November 2021, 15:55:14 UTC | Merge pull request #853 from bnewbold/multiple-editor TEI: multiple editors each get <editor> around <persName> | 13 November 2021, 15:55:14 UTC |
23a300d | lopez | 13 November 2021, 15:49:25 UTC | add a few more missing XML escape cases | 13 November 2021, 15:49:25 UTC |
7a37e75 | lopez | 13 November 2021, 15:08:23 UTC | Merge branch 'master' of github.com:kermitt2/grobid | 13 November 2021, 15:08:23 UTC |
9b347e0 | lopez | 13 November 2021, 15:08:11 UTC | cleaning unused TEI serialization variant | 13 November 2021, 15:08:11 UTC |
f73ec07 | Patrice Lopez | 13 November 2021, 14:57:33 UTC | Merge pull request #852 from bnewbold/escape-more-identifiers Escape more identifiers, including ISSN | 13 November 2021, 14:57:33 UTC |
344f97a | Patrice Lopez | 13 November 2021, 14:45:28 UTC | Merge pull request #850 from shisheng-1/Modify_GRADLE_1 Improve GRADLE build Performance | 13 November 2021, 14:45:28 UTC |
443a5ce | Patrice Lopez | 13 November 2021, 14:43:13 UTC | Merge pull request #832 from bnewbold/minor-author-training-fixes training: small fixes to name (citation) data | 13 November 2021, 14:43:13 UTC |
58c76eb | Bryan Newbold | 12 November 2021, 23:53:12 UTC | one more unescaped XML case: <label> in raw affiliations Also added a regression test for this specific case. | 12 November 2021, 23:53:15 UTC |
beb12b0 | Bryan Newbold | 12 November 2021, 22:58:34 UTC | visualizer: use arxiv.org/abs/<id> instead of arxiv.org/<id> | 12 November 2021, 22:58:34 UTC |
4a432b2 | shisheng-1 | 12 November 2021, 03:11:02 UTC | delete "org.gradle.configureondemand = true" | 12 November 2021, 03:11:02 UTC |
2daad1d | Bryan Newbold | 09 November 2021, 03:05:42 UTC | add citation and name-citation training date from crossref 'unstructured' references These are hand-labeled from these notes: https://gist.github.com/bnewbold/b437e363e6a0429719c65c751babe84d | 09 November 2021, 03:05:45 UTC |
235dcea | Bryan Newbold | 09 November 2021, 03:04:13 UTC | citations.xml: small URL labeling adjustments Trying to reduce incidents of non-URL tokens getting added to start or end of URLs. | 09 November 2021, 03:04:16 UTC |
b554a26 | Bryan Newbold | 09 November 2021, 00:35:39 UTC | TEI: multiple editors each get <editor> around <persName> | 09 November 2021, 00:35:39 UTC |
de205c7 | Bryan Newbold | 09 November 2021, 00:17:00 UTC | escape more identifiers, including ISSN, in TEI-XML output | 09 November 2021, 00:17:00 UTC |
fece11a | Bryan Newbold | 09 November 2021, 00:16:24 UTC | regression test for unescaped ISSN numbers | 09 November 2021, 00:16:24 UTC |
e58d075 | shisheng-1 | 08 November 2021, 15:45:05 UTC | Improve GRADLE build Performance | 08 November 2021, 15:45:05 UTC |
c8f1a4d | lopez | 24 October 2021, 19:02:43 UTC | make simpler tests in ci | 24 October 2021, 19:02:43 UTC |
beebd9a | Luca Foppiano | 20 October 2021, 09:18:44 UTC | change default delft architecture from the configuration with existing ones to avoid confusion #843 | 20 October 2021, 09:19:23 UTC |
f29a6f5 | Luca Foppiano | 18 October 2021, 00:06:23 UTC | Typo #841 | 18 October 2021, 00:06:23 UTC |
61252d7 | Patrice Lopez | 12 October 2021, 14:30:01 UTC | Merge pull request #833 from kermitt2/feature/change-base-image-delft-docker Docker with GPU | 12 October 2021, 14:30:01 UTC |
336f951 | Patrice Lopez | 12 October 2021, 13:52:57 UTC | update documentation | 12 October 2021, 13:52:57 UTC |
63a6bff | lopez | 14 September 2021, 09:33:09 UTC | avoid cookies warning | 14 September 2021, 09:33:09 UTC |
78c3659 | lopez | 14 September 2021, 08:24:06 UTC | better score formatting | 14 September 2021, 08:24:06 UTC |
da3f142 | Luca Foppiano | 14 September 2021, 04:08:04 UTC | Fix typo in dockerfile | 14 September 2021, 04:08:04 UTC |
80b24a0 | lopez | 12 September 2021, 14:26:35 UTC | update web queries to glutton | 12 September 2021, 14:26:35 UTC |
436ffbc | Luca Foppiano | 10 September 2021, 09:33:03 UTC | change base image of grobid docker for delft | 10 September 2021, 09:33:03 UTC |
98d9ed9 | Patrice Lopez | 08 September 2021, 05:03:16 UTC | Merge pull request #831 from kermitt2/bugfix/fix-wrongly-assigned-day-consolidation fix deserialised consolidation day | 08 September 2021, 05:03:16 UTC |