https://github.com/kermitt2/grobid

sort by:
Revision Author Date Message Commit Date
5b14536 revert delft to 0.3.3 in docker 28 March 2024, 03:23:45 UTC
8460241 typos 26 March 2024, 12:03:50 UTC
cb10576 Merge pull request #1088 from kermitt2/reduce-image-size Reduce docker image size for the NN grobid version 11 March 2024, 19:43:23 UTC
32ba274 update delft version 28 February 2024, 08:20:43 UTC
a28a9d7 reduce the number of layers and the image size 28 February 2024, 08:01:34 UTC
d4822e1 fix error case with empty affiliation when creating pre-annotated training 18 February 2024, 16:15:14 UTC
3893a8d update date 17 February 2024, 23:07:08 UTC
989afea fix bug with empty affiliation 13 February 2024, 09:09:48 UTC
17cb0fc ensure ressearch infrastructures info are present with processHeaderFundingDocument 12 February 2024, 12:06:27 UTC
4daa2ce Merge pull request #1085 from kermitt2/research-infrastructures Research infrastructure recognition 11 February 2024, 23:32:01 UTC
8751dcb update model 11 February 2024, 23:11:59 UTC
212d822 review model 11 February 2024, 21:21:30 UTC
84d4191 update model 11 February 2024, 20:30:25 UTC
2c9ac67 update training data 11 February 2024, 19:12:11 UTC
c893cb4 update model 11 February 2024, 19:10:35 UTC
62655f4 better filtering of infrasructure 11 February 2024, 17:28:33 UTC
f6fa0ec Merge branch 'research-infrastructures' of github.com:kermitt2/grobid into research-infrastructures 11 February 2024, 16:43:47 UTC
041903f some training 11 February 2024, 16:43:42 UTC
a9b0f3f update training data 11 February 2024, 15:26:32 UTC
a66f6ab funding number fix 11 February 2024, 15:24:58 UTC
a352cdd adapt resources; update features 11 February 2024, 14:03:31 UTC
6f933bd update model to recognize research infrastructures 10 February 2024, 20:46:31 UTC
c20f668 update training data 10 February 2024, 20:46:02 UTC
0076518 adapt trainer and parser 10 February 2024, 20:45:27 UTC
4a37538 add resources; training data 10 February 2024, 18:18:54 UTC
cab0947 prepare for research infrastructure results 10 February 2024, 15:32:37 UTC
5a4b9b8 add research infrastructure annotations 10 February 2024, 13:19:40 UTC
ed9fef7 Merge pull request #1078 from kermitt2/copyrights-licenses Copyrights owner and licenses identification models 10 February 2024, 11:40:22 UTC
b829eff review eval set 10 February 2024, 10:32:32 UTC
261f975 comments 09 February 2024, 19:57:20 UTC
6b890fa fix conflicts with master 09 February 2024, 17:51:51 UTC
bcce229 update XML schema 09 February 2024, 16:43:16 UTC
e1ecac9 fix tests 09 February 2024, 14:34:23 UTC
d480e12 Merge branch 'master' of github.com:kermitt2/grobid 09 February 2024, 14:11:50 UTC
ecff63a update crf model patent-citation 09 February 2024, 14:11:21 UTC
16b9abb update XML schema 09 February 2024, 09:15:55 UTC
86b03e8 minor XML fix 08 February 2024, 17:49:22 UTC
269c897 Merge pull request #1082 from kermitt2/review-patent Review patent process 07 February 2024, 18:45:14 UTC
1ebc6c8 review serialization 06 February 2024, 20:20:01 UTC
8282dad fix usage of parameters 06 February 2024, 17:21:07 UTC
5750ad7 add tests 06 February 2024, 16:07:09 UTC
92d3c1d add tests, cleaning 06 February 2024, 14:51:28 UTC
6357f98 avoid adding bert models 06 February 2024, 09:47:11 UTC
08c0405 review sequence segmentation following max sequence length 05 February 2024, 22:53:32 UTC
017bc28 extend default config 05 February 2024, 20:49:10 UTC
0d524f7 review method profile and fix test 05 February 2024, 20:47:07 UTC
53f8c1d add additional tokenizer mode 05 February 2024, 20:46:24 UTC
80ca203 fix language code mismatch for Korean 05 February 2024, 20:45:45 UTC
a01d5ec cleaning used model 05 February 2024, 16:04:53 UTC
1550756 update model 05 February 2024, 16:04:19 UTC
d66896e review process and serialization 05 February 2024, 16:04:01 UTC
a0b5bc2 update US patent application mapping to year 05 February 2024, 16:03:27 UTC
b0145fa cleaning; review instance 05 February 2024, 12:34:02 UTC
fbb238c refactor process for DL models batch 04 February 2024, 21:50:22 UTC
a4eb584 remove outdated xml parser 04 February 2024, 18:59:19 UTC
0be5097 update DL models 04 February 2024, 18:58:23 UTC
8c4f19a some training fixes 04 February 2024, 18:57:56 UTC
c943239 review training parsing and selection for DL models 04 February 2024, 18:57:28 UTC
2ab91c2 add documentation for parameter includeRawCopyrights 04 February 2024, 16:57:30 UTC
ac6944e too quick 04 February 2024, 16:07:10 UTC
8b6a6e4 update models 04 February 2024, 16:01:15 UTC
af5a20b fix a missing serialization case; add option includeRawCopyrights in service 04 February 2024, 16:00:43 UTC
4359b40 change copyrights owner attribute to @rest, with a comment to explain 01 February 2024, 18:38:48 UTC
0f9a1dc fix copyright class naming 01 February 2024, 12:10:13 UTC
a74c05c rolling back ODD file to use ROMA 01 February 2024, 10:28:45 UTC
08d5fce add some unit tests 01 February 2024, 06:46:10 UTC
59f15e0 split labelling and decoding 01 February 2024, 01:12:40 UTC
0d31645 updated grobid ODD 01 February 2024, 00:32:40 UTC
d8fe160 default config for tests 29 January 2024, 18:27:52 UTC
2d90904 fix tests 29 January 2024, 17:53:32 UTC
23917c4 cleaning 29 January 2024, 17:43:16 UTC
268186d copyrights+licenses models integrated; TEI serialization 29 January 2024, 17:31:02 UTC
75ec437 fix indentation 28 January 2024, 19:58:05 UTC
b4f29a3 start integrating copyright and license model and classes 28 January 2024, 19:48:22 UTC
4816a7a add tini back to docker images to avoid zombie apocalypse with kubernetes 26 January 2024, 09:11:24 UTC
e14ce33 Merge pull request #1076 from kermitt2/bugfix/paragraph-coords Fix missing coordinates in paragraphs continuation 21 January 2024, 15:13:23 UTC
0d7913d visualise paragraphs or sentences mutually exclusively if the segmentation is enabled 18 January 2024, 08:58:32 UTC
4d0acef add missing coordinates when the paragraph continues after a reference callout 18 January 2024, 08:57:53 UTC
cbc77d5 Merge pull request #1075 from kermitt2/fix-note-page Fix OOBE when processing large quantities of notes 14 January 2024, 19:24:40 UTC
46913da add test to prevent a rare NPE when called from Pub2TEI 14 January 2024, 18:28:26 UTC
69fcecb add automatic build + push for the CRF light image 12 January 2024, 07:46:48 UTC
30780ef move dephypenisation at the end of the note process 11 January 2024, 11:44:28 UTC
6a3a0d5 avoid OOBE in note getPageNumber() 06 January 2024, 10:40:21 UTC
b50c9aa Merge pull request #1068 from kermitt2/feature/paragraphs-coordinates Add paragraphs coordinates 29 December 2023, 20:22:36 UTC
a506eb1 add coordinates for <p> in batch mode for consistency 29 December 2023, 20:05:21 UTC
7b7b5ce update documentation, fix typo 27 December 2023, 13:16:24 UTC
4620b63 cleanup 26 December 2023, 12:16:04 UTC
3ef0915 Merge branch 'master' into feature/paragraphs-coordinates # Conflicts: # grobid-service/src/main/resources/web/grobid/grobid.js 26 December 2023, 10:47:10 UTC
2ca3f35 Merge pull request #1069 from kermitt2/update-affiliation Update affiliation process 26 December 2023, 10:07:53 UTC
f84e855 indicate in doc coordinates for affiliation 25 December 2023, 19:57:37 UTC
54106db add one training example 23 December 2023, 10:31:42 UTC
5000f15 review test 22 December 2023, 11:59:16 UTC
0d310f2 fix wrong field name 22 December 2023, 11:24:37 UTC
200f626 cleaning 21 December 2023, 21:59:36 UTC
d137e21 seome trainng data 21 December 2023, 19:34:32 UTC
e235c88 fix for training data generation with updated affiliation-address parser 21 December 2023, 19:34:09 UTC
a9b6826 fix conflicts 21 December 2023, 16:38:02 UTC
4b77e73 Merge pull request #1070 from kermitt2/bugfix/fix-coord-name-title correct <title> coordinates attribute name 21 December 2023, 16:35:36 UTC
6eaa6f4 other typo for coords; exclude p 21 December 2023, 16:23:28 UTC
c776e3c fix typo, add affiliation coord element in demo 21 December 2023, 11:12:40 UTC
back to top