4168261 | lopez | 21 October 2022, 17:39:41 UTC | minor rephrase/typos | 21 October 2022, 17:39:41 UTC |
c0b27cf | Achraf | 20 October 2022, 11:23:41 UTC | Merge branch 'master' into option-consolidate-with-doi-only | 20 October 2022, 11:23:41 UTC |
1837b3b | Achraf | 20 October 2022, 11:18:44 UTC | Merge branch 'master' into option-consolidate-with-doi-only | 20 October 2022, 11:18:44 UTC |
3440cf8 | lopez | 20 October 2022, 09:46:11 UTC | training typos | 20 October 2022, 09:46:11 UTC |
a62fc12 | lopez | 20 October 2022, 07:50:58 UTC | Merge branch 'master' of github.com:kermitt2/grobid | 20 October 2022, 07:50:58 UTC |
0136530 | lopez | 20 October 2022, 07:50:41 UTC | clean/correct training | 20 October 2022, 07:50:41 UTC |
02b6e2c | Luca Foppiano | 20 October 2022, 02:09:04 UTC | Unit tests I forgot to commit | 20 October 2022, 02:09:04 UTC |
da7bb0c | lopez | 19 October 2022, 20:00:19 UTC | quick refresh authors in references | 19 October 2022, 20:00:19 UTC |
dab259e | Patrice Lopez | 19 October 2022, 07:36:47 UTC | Merge pull request #959 from kermitt2/feature/funding-statement Add funding statement in TEI output | 19 October 2022, 07:36:47 UTC |
6c8b888 | Luca Foppiano | 19 October 2022, 02:37:37 UTC | remove field from PMC - make a general method for it | 19 October 2022, 02:37:37 UTC |
5f08df2 | Luca Foppiano | 19 October 2022, 02:04:23 UTC | Merge branch 'master' into feature/funding-statement | 19 October 2022, 02:04:23 UTC |
9fdec9c | lopez | 18 October 2022, 14:06:12 UTC | cleaning useless hack ; generalizing post-processing for short texts | 18 October 2022, 14:06:12 UTC |
b528647 | Luca Foppiano | 18 October 2022, 08:26:13 UTC | update name and add unit test | 18 October 2022, 08:26:13 UTC |
4ac0339 | Luca Foppiano | 18 October 2022, 06:47:50 UTC | add funding XPaths in end to end evaluation | 18 October 2022, 06:47:50 UTC |
8e53a7d | lopez | 17 October 2022, 18:15:06 UTC | additional training data and model update | 17 October 2022, 18:15:06 UTC |
1836680 | Luca Foppiano | 17 October 2022, 10:56:59 UTC | apply post processing of tei sections of text without considering tables and figures labels | 17 October 2022, 10:56:59 UTC |
0bd482c | lopez | 16 October 2022, 16:56:53 UTC | skip availability statement eval for e2e PMC set | 16 October 2022, 16:56:53 UTC |
134ac7a | Patrice Lopez | 15 October 2022, 13:41:21 UTC | Merge pull request #838 from kermitt2/prioritize_crossref_author_meta prefer author meta from consolidation , crossref is considered more r… | 15 October 2022, 13:41:21 UTC |
197ecd0 | Patrice Lopez | 15 October 2022, 13:35:30 UTC | Merge pull request #961 from kermitt2/martin-citation-annotations Corrected ref data from PR #864 | 15 October 2022, 13:35:30 UTC |
f82daa5 | lopez | 15 October 2022, 13:22:50 UTC | corrected ref data from PR #864 | 15 October 2022, 13:22:50 UTC |
ad1ee7f | lopez | 15 October 2022, 13:03:49 UTC | add back training data generation for raw reference strings | 15 October 2022, 13:03:49 UTC |
52352a8 | lopez | 15 October 2022, 08:42:59 UTC | add new hack | 15 October 2022, 08:42:59 UTC |
27a25ad | lopez | 15 October 2022, 08:42:49 UTC | roll back first hack | 15 October 2022, 08:42:49 UTC |
53ace9d | lopez | 15 October 2022, 07:04:36 UTC | cleaning; review basic keyword segmentation | 15 October 2022, 07:04:36 UTC |
14f5c5a | Luca Foppiano | 13 October 2022, 05:43:08 UTC | avoid loosing text when processShort tag text as figure or table | 13 October 2022, 05:43:08 UTC |
b1c0cfd | Luca Foppiano | 13 October 2022, 03:24:43 UTC | output funding statement in the back of the TEI output | 13 October 2022, 03:24:43 UTC |
06a526f | Luca Foppiano | 12 October 2022, 05:14:39 UTC | Add mention to WSL mode in documentation related to #954 | 12 October 2022, 05:14:39 UTC |
0f6c1b4 | lopez | 08 October 2022, 06:56:21 UTC | try to rephrase more clearly :) | 08 October 2022, 06:56:21 UTC |
2a16015 | Patrice Lopez | 07 October 2022, 06:25:50 UTC | Merge pull request #951 from kermitt2/feature/data-availability-statement Data and code availability statement zone | 07 October 2022, 06:25:50 UTC |
1925296 | lopez | 06 October 2022, 19:46:09 UTC | set default timeout and max blocks higher | 06 October 2022, 19:46:09 UTC |
7bb0c8f | lopez | 06 October 2022, 19:38:15 UTC | update segmentation model prior to merge | 06 October 2022, 19:38:15 UTC |
08b176f | lopez | 05 October 2022, 18:44:20 UTC | more training data for segmentation model | 05 October 2022, 18:44:20 UTC |
bd3de67 | lopez | 05 October 2022, 15:49:42 UTC | fix errors in latest training data | 05 October 2022, 15:49:42 UTC |
0f48242 | lopez | 03 October 2022, 18:35:22 UTC | new training data segmentation for availability statements | 03 October 2022, 18:35:22 UTC |
3c8c9e5 | lopez | 03 October 2022, 11:57:15 UTC | add a script to select interesting training cases from JATS/PDF pairs | 03 October 2022, 11:57:15 UTC |
74e29c2 | lopez | 03 October 2022, 08:44:07 UTC | fix test | 03 October 2022, 08:44:07 UTC |
332daf1 | lopez | 02 October 2022, 18:05:47 UTC | better foot note identifier | 02 October 2022, 18:05:47 UTC |
15e8565 | lopez | 02 October 2022, 17:55:23 UTC | fix xml id for foot notes | 02 October 2022, 17:55:23 UTC |
2605750 | lopez | 27 September 2022, 14:43:32 UTC | remove unstable integration test | 27 September 2022, 14:43:32 UTC |
2278be1 | lopez | 27 September 2022, 14:21:55 UTC | fix conflict | 27 September 2022, 14:21:55 UTC |
4e96757 | lopez | 27 September 2022, 13:37:08 UTC | minor optional diff report | 27 September 2022, 13:37:08 UTC |
f9dc68f | Patrice Lopez | 27 September 2022, 13:35:08 UTC | Merge pull request #944 from kermitt2/features/footnotes Link footnotes in the text | 27 September 2022, 13:35:08 UTC |
2f2241e | lopez | 26 September 2022, 17:11:30 UTC | minor for trace | 26 September 2022, 17:11:30 UTC |
278ee1d | lopez | 26 September 2022, 16:17:57 UTC | review eval, remove redundant normalize-space | 26 September 2022, 16:17:57 UTC |
7b0aa0c | lopez | 26 September 2022, 10:59:12 UTC | avoid regression; cleaning | 26 September 2022, 10:59:12 UTC |
68efda8 | lopez | 25 September 2022, 16:00:06 UTC | better field name for reporting | 25 September 2022, 16:00:06 UTC |
14cc516 | lopez | 25 September 2022, 15:18:00 UTC | remove non-Grobid TEI path | 25 September 2022, 15:18:00 UTC |
98aface | lopez | 25 September 2022, 14:39:09 UTC | do not restrict availability statement to data availability | 25 September 2022, 14:39:09 UTC |
e098b50 | lopez | 25 September 2022, 14:31:37 UTC | write header availability statement in the final TEI | 25 September 2022, 14:31:37 UTC |
24ad1f1 | lopez | 25 September 2022, 13:11:45 UTC | add new labels in the created training data for header and segmentation models | 25 September 2022, 13:11:45 UTC |
f427ad7 | lopez | 25 September 2022, 12:39:42 UTC | fix tests | 25 September 2022, 12:39:42 UTC |
cdf6cd5 | lopez | 25 September 2022, 12:06:23 UTC | cleaning/minor | 25 September 2022, 12:06:23 UTC |
feff30b | lopez | 25 September 2022, 12:05:55 UTC | remove useless layout token storage (already done by generalResultMapping) | 25 September 2022, 12:05:55 UTC |
9eaa522 | lopez | 25 September 2022, 11:14:39 UTC | avoid useless breaking change for all other grobid modeules | 25 September 2022, 11:14:39 UTC |
40331eb | lopez | 25 September 2022, 10:42:43 UTC | Merge branch 'master' into feature/data-availability-statement | 25 September 2022, 10:42:43 UTC |
655ccdf | lopez | 25 September 2022, 07:21:34 UTC | case no ref but note | 25 September 2022, 07:21:34 UTC |
e2ac939 | lopez | 24 September 2022, 18:17:44 UTC | fix test | 24 September 2022, 18:17:44 UTC |
f71fe72 | lopez | 24 September 2022, 17:30:38 UTC | cover bibliographicla callout which are in fact foot note callout | 24 September 2022, 17:30:38 UTC |
77185b0 | lopez | 24 September 2022, 15:47:51 UTC | rewrite footnote callout serialization; support multiple footnote callout in same paragraph; fix missing paragraph content | 24 September 2022, 15:47:51 UTC |
063f559 | lopez | 24 September 2022, 11:49:44 UTC | fix tests | 24 September 2022, 11:49:44 UTC |
3cca788 | lopez | 24 September 2022, 11:11:14 UTC | rename Footnote object to Note everywhere to avoid confusion (margin note are the same) | 24 September 2022, 11:11:14 UTC |
1207e0b | lopez | 24 September 2022, 10:59:33 UTC | clean not used and redundant code; factorize margin note and foot note; review footnote object | 24 September 2022, 10:59:33 UTC |
01699b7 | lopez | 24 September 2022, 09:34:06 UTC | Merge branch 'master' into features/footnotes | 24 September 2022, 09:34:06 UTC |
54d1c29 | lopez | 12 September 2022, 12:44:47 UTC | make evaluation pages more visible | 12 September 2022, 12:44:47 UTC |
0fbc152 | Luca Foppiano | 09 September 2022, 06:32:16 UTC | add integration test with sample document | 09 September 2022, 06:32:16 UTC |
8bf7e96 | Luca Foppiano | 09 September 2022, 05:57:34 UTC | add unit tests | 09 September 2022, 05:57:34 UTC |
8cd6c05 | lopez | 06 September 2022, 12:55:37 UTC | doc typo/minor update | 06 September 2022, 12:55:37 UTC |
761a1f6 | Luca Foppiano | 28 August 2022, 02:00:52 UTC | cleanup | 28 August 2022, 02:00:52 UTC |
6403d12 | Luca Foppiano | 28 August 2022, 01:58:03 UTC | link footnotes to superscript tokens only | 28 August 2022, 01:58:03 UTC |
7df6a01 | Luca Foppiano | 27 August 2022, 19:37:46 UTC | fix @target attribute for footnotes | 27 August 2022, 19:37:46 UTC |
04b3500 | Luca Foppiano | 27 August 2022, 14:42:21 UTC | link footnotes with heuristics | 27 August 2022, 15:17:49 UTC |
d22a1ff | Luca Foppiano | 26 August 2022, 07:20:40 UTC | Compile the xpath at the instantiation of the patterns | 26 August 2022, 07:22:04 UTC |
fc1445b | lopez | 18 August 2022, 08:51:49 UTC | Merge branch 'feature/data-availability-statement' of github.com:kermitt2/grobid into feature/data-availability-statement | 18 August 2022, 08:51:49 UTC |
59d2602 | lopez | 18 August 2022, 08:51:37 UTC | follow annotation guidelines | 18 August 2022, 08:51:37 UTC |
d3bf062 | lfoppiano | 16 August 2022, 15:08:23 UTC | Add retrained models with revised training data | 16 August 2022, 15:10:06 UTC |
1863bb4 | lopez | 15 August 2022, 05:22:10 UTC | review added affiliation files | 15 August 2022, 05:22:10 UTC |
8b1a674 | lopez | 15 August 2022, 03:37:26 UTC | review added header files | 15 August 2022, 03:37:26 UTC |
b55065e | lopez | 15 August 2022, 02:08:43 UTC | review additional segmentation files | 15 August 2022, 02:08:43 UTC |
0326b28 | lopez | 15 August 2022, 00:09:20 UTC | separate program and project names | 15 August 2022, 00:09:20 UTC |
c566ce2 | Luca Foppiano | 14 August 2022, 07:03:52 UTC | revised segmentation and header files | 14 August 2022, 07:03:52 UTC |
cfe8869 | lopez | 14 August 2022, 00:45:35 UTC | update training data acknowledgement | 14 August 2022, 00:45:35 UTC |
1a92293 | lopez | 13 August 2022, 00:32:59 UTC | update training data acknowledgement | 13 August 2022, 00:32:59 UTC |
25ced8d | lopez | 12 August 2022, 23:15:18 UTC | update training data | 12 August 2022, 23:15:18 UTC |
e77115c | lopez | 12 August 2022, 19:54:33 UTC | re-annotate acknowledgements | 12 August 2022, 19:54:33 UTC |
5f3f863 | Luca Foppiano | 12 August 2022, 15:37:59 UTC | add affilliation corrected training data | 12 August 2022, 15:37:59 UTC |
27eea22 | Luca Foppiano | 12 August 2022, 15:37:46 UTC | add additional header training data | 12 August 2022, 15:37:46 UTC |
c8bbcab | Luca Foppiano | 12 August 2022, 15:37:26 UTC | add additional segmentation training data | 12 August 2022, 15:37:26 UTC |
1ab0779 | Luca Foppiano | 12 August 2022, 08:48:28 UTC | add reviewed files from PR #813 | 12 August 2022, 08:48:53 UTC |
443fc56 | Luca Foppiano | 12 August 2022, 07:22:42 UTC | rename raw file for kononova2019 | 12 August 2022, 07:22:42 UTC |
56a8bd8 | Luca Foppiano | 12 August 2022, 06:59:02 UTC | regenerate header files | 12 August 2022, 06:59:02 UTC |
c3352a2 | Luca Foppiano | 11 August 2022, 11:22:36 UTC | extend the field specification to <p> + content-type | 11 August 2022, 11:22:36 UTC |
dee0f04 | Luca Foppiano | 11 August 2022, 11:22:08 UTC | add trained model and update tagging label to "availability" | 11 August 2022, 11:22:08 UTC |
bc0add8 | Luca Foppiano | 10 August 2022, 20:50:24 UTC | support additional attribute values | 10 August 2022, 20:50:24 UTC |
8c83a16 | Luca Foppiano | 10 August 2022, 20:50:01 UTC | cosmetics | 10 August 2022, 20:50:01 UTC |
6e8c6b7 | Luca Foppiano | 10 August 2022, 11:26:50 UTC | get training data "availability" label from <note> | 10 August 2022, 11:29:32 UTC |
2cb2010 | Luca Foppiano | 10 August 2022, 11:26:03 UTC | update training labels, add funding in segmentation, revert data availability label temporarly | 10 August 2022, 11:29:32 UTC |
d9f4a82 | Luca Foppiano | 10 August 2022, 10:56:41 UTC | Revert "Re-trained segmentation model with revised training data" This reverts commit 90c10a70df74fe38e337db612b423d20514356e9. | 10 August 2022, 10:56:41 UTC |
90c10a7 | lfoppiano | 10 August 2022, 09:29:37 UTC | Re-trained segmentation model with revised training data | 10 August 2022, 09:29:37 UTC |
587a8b1 | Luca Foppiano | 10 August 2022, 09:07:09 UTC | add reference to sub-sections | 10 August 2022, 09:07:09 UTC |
346af3a | lopez | 10 August 2022, 04:35:23 UTC | improve wording | 10 August 2022, 04:35:23 UTC |