https://github.com/kermitt2/grobid

sort by:
Revision Author Date Message Commit Date
4168261 minor rephrase/typos 21 October 2022, 17:39:41 UTC
c0b27cf Merge branch 'master' into option-consolidate-with-doi-only 20 October 2022, 11:23:41 UTC
1837b3b Merge branch 'master' into option-consolidate-with-doi-only 20 October 2022, 11:18:44 UTC
3440cf8 training typos 20 October 2022, 09:46:11 UTC
a62fc12 Merge branch 'master' of github.com:kermitt2/grobid 20 October 2022, 07:50:58 UTC
0136530 clean/correct training 20 October 2022, 07:50:41 UTC
02b6e2c Unit tests I forgot to commit 20 October 2022, 02:09:04 UTC
da7bb0c quick refresh authors in references 19 October 2022, 20:00:19 UTC
dab259e Merge pull request #959 from kermitt2/feature/funding-statement Add funding statement in TEI output 19 October 2022, 07:36:47 UTC
6c8b888 remove field from PMC - make a general method for it 19 October 2022, 02:37:37 UTC
5f08df2 Merge branch 'master' into feature/funding-statement 19 October 2022, 02:04:23 UTC
9fdec9c cleaning useless hack ; generalizing post-processing for short texts 18 October 2022, 14:06:12 UTC
b528647 update name and add unit test 18 October 2022, 08:26:13 UTC
4ac0339 add funding XPaths in end to end evaluation 18 October 2022, 06:47:50 UTC
8e53a7d additional training data and model update 17 October 2022, 18:15:06 UTC
1836680 apply post processing of tei sections of text without considering tables and figures labels 17 October 2022, 10:56:59 UTC
0bd482c skip availability statement eval for e2e PMC set 16 October 2022, 16:56:53 UTC
134ac7a Merge pull request #838 from kermitt2/prioritize_crossref_author_meta prefer author meta from consolidation , crossref is considered more r… 15 October 2022, 13:41:21 UTC
197ecd0 Merge pull request #961 from kermitt2/martin-citation-annotations Corrected ref data from PR #864 15 October 2022, 13:35:30 UTC
f82daa5 corrected ref data from PR #864 15 October 2022, 13:22:50 UTC
ad1ee7f add back training data generation for raw reference strings 15 October 2022, 13:03:49 UTC
52352a8 add new hack 15 October 2022, 08:42:59 UTC
27a25ad roll back first hack 15 October 2022, 08:42:49 UTC
53ace9d cleaning; review basic keyword segmentation 15 October 2022, 07:04:36 UTC
14f5c5a avoid loosing text when processShort tag text as figure or table 13 October 2022, 05:43:08 UTC
b1c0cfd output funding statement in the back of the TEI output 13 October 2022, 03:24:43 UTC
06a526f Add mention to WSL mode in documentation related to #954 12 October 2022, 05:14:39 UTC
0f6c1b4 try to rephrase more clearly :) 08 October 2022, 06:56:21 UTC
2a16015 Merge pull request #951 from kermitt2/feature/data-availability-statement Data and code availability statement zone 07 October 2022, 06:25:50 UTC
1925296 set default timeout and max blocks higher 06 October 2022, 19:46:09 UTC
7bb0c8f update segmentation model prior to merge 06 October 2022, 19:38:15 UTC
08b176f more training data for segmentation model 05 October 2022, 18:44:20 UTC
bd3de67 fix errors in latest training data 05 October 2022, 15:49:42 UTC
0f48242 new training data segmentation for availability statements 03 October 2022, 18:35:22 UTC
3c8c9e5 add a script to select interesting training cases from JATS/PDF pairs 03 October 2022, 11:57:15 UTC
74e29c2 fix test 03 October 2022, 08:44:07 UTC
332daf1 better foot note identifier 02 October 2022, 18:05:47 UTC
15e8565 fix xml id for foot notes 02 October 2022, 17:55:23 UTC
2605750 remove unstable integration test 27 September 2022, 14:43:32 UTC
2278be1 fix conflict 27 September 2022, 14:21:55 UTC
4e96757 minor optional diff report 27 September 2022, 13:37:08 UTC
f9dc68f Merge pull request #944 from kermitt2/features/footnotes Link footnotes in the text 27 September 2022, 13:35:08 UTC
2f2241e minor for trace 26 September 2022, 17:11:30 UTC
278ee1d review eval, remove redundant normalize-space 26 September 2022, 16:17:57 UTC
7b0aa0c avoid regression; cleaning 26 September 2022, 10:59:12 UTC
68efda8 better field name for reporting 25 September 2022, 16:00:06 UTC
14cc516 remove non-Grobid TEI path 25 September 2022, 15:18:00 UTC
98aface do not restrict availability statement to data availability 25 September 2022, 14:39:09 UTC
e098b50 write header availability statement in the final TEI 25 September 2022, 14:31:37 UTC
24ad1f1 add new labels in the created training data for header and segmentation models 25 September 2022, 13:11:45 UTC
f427ad7 fix tests 25 September 2022, 12:39:42 UTC
cdf6cd5 cleaning/minor 25 September 2022, 12:06:23 UTC
feff30b remove useless layout token storage (already done by generalResultMapping) 25 September 2022, 12:05:55 UTC
9eaa522 avoid useless breaking change for all other grobid modeules 25 September 2022, 11:14:39 UTC
40331eb Merge branch 'master' into feature/data-availability-statement 25 September 2022, 10:42:43 UTC
655ccdf case no ref but note 25 September 2022, 07:21:34 UTC
e2ac939 fix test 24 September 2022, 18:17:44 UTC
f71fe72 cover bibliographicla callout which are in fact foot note callout 24 September 2022, 17:30:38 UTC
77185b0 rewrite footnote callout serialization; support multiple footnote callout in same paragraph; fix missing paragraph content 24 September 2022, 15:47:51 UTC
063f559 fix tests 24 September 2022, 11:49:44 UTC
3cca788 rename Footnote object to Note everywhere to avoid confusion (margin note are the same) 24 September 2022, 11:11:14 UTC
1207e0b clean not used and redundant code; factorize margin note and foot note; review footnote object 24 September 2022, 10:59:33 UTC
01699b7 Merge branch 'master' into features/footnotes 24 September 2022, 09:34:06 UTC
54d1c29 make evaluation pages more visible 12 September 2022, 12:44:47 UTC
0fbc152 add integration test with sample document 09 September 2022, 06:32:16 UTC
8bf7e96 add unit tests 09 September 2022, 05:57:34 UTC
8cd6c05 doc typo/minor update 06 September 2022, 12:55:37 UTC
761a1f6 cleanup 28 August 2022, 02:00:52 UTC
6403d12 link footnotes to superscript tokens only 28 August 2022, 01:58:03 UTC
7df6a01 fix @target attribute for footnotes 27 August 2022, 19:37:46 UTC
04b3500 link footnotes with heuristics 27 August 2022, 15:17:49 UTC
d22a1ff Compile the xpath at the instantiation of the patterns 26 August 2022, 07:22:04 UTC
fc1445b Merge branch 'feature/data-availability-statement' of github.com:kermitt2/grobid into feature/data-availability-statement 18 August 2022, 08:51:49 UTC
59d2602 follow annotation guidelines 18 August 2022, 08:51:37 UTC
d3bf062 Add retrained models with revised training data 16 August 2022, 15:10:06 UTC
1863bb4 review added affiliation files 15 August 2022, 05:22:10 UTC
8b1a674 review added header files 15 August 2022, 03:37:26 UTC
b55065e review additional segmentation files 15 August 2022, 02:08:43 UTC
0326b28 separate program and project names 15 August 2022, 00:09:20 UTC
c566ce2 revised segmentation and header files 14 August 2022, 07:03:52 UTC
cfe8869 update training data acknowledgement 14 August 2022, 00:45:35 UTC
1a92293 update training data acknowledgement 13 August 2022, 00:32:59 UTC
25ced8d update training data 12 August 2022, 23:15:18 UTC
e77115c re-annotate acknowledgements 12 August 2022, 19:54:33 UTC
5f3f863 add affilliation corrected training data 12 August 2022, 15:37:59 UTC
27eea22 add additional header training data 12 August 2022, 15:37:46 UTC
c8bbcab add additional segmentation training data 12 August 2022, 15:37:26 UTC
1ab0779 add reviewed files from PR #813 12 August 2022, 08:48:53 UTC
443fc56 rename raw file for kononova2019 12 August 2022, 07:22:42 UTC
56a8bd8 regenerate header files 12 August 2022, 06:59:02 UTC
c3352a2 extend the field specification to <p> + content-type 11 August 2022, 11:22:36 UTC
dee0f04 add trained model and update tagging label to "availability" 11 August 2022, 11:22:08 UTC
bc0add8 support additional attribute values 10 August 2022, 20:50:24 UTC
8c83a16 cosmetics 10 August 2022, 20:50:01 UTC
6e8c6b7 get training data "availability" label from <note> 10 August 2022, 11:29:32 UTC
2cb2010 update training labels, add funding in segmentation, revert data availability label temporarly 10 August 2022, 11:29:32 UTC
d9f4a82 Revert "Re-trained segmentation model with revised training data" This reverts commit 90c10a70df74fe38e337db612b423d20514356e9. 10 August 2022, 10:56:41 UTC
90c10a7 Re-trained segmentation model with revised training data 10 August 2022, 09:29:37 UTC
587a8b1 add reference to sub-sections 10 August 2022, 09:07:09 UTC
346af3a improve wording 10 August 2022, 04:35:23 UTC
back to top