8751dcb | Patrice Lopez | 11 February 2024, 23:11:59 UTC | update model | 11 February 2024, 23:11:59 UTC |
212d822 | Patrice Lopez | 11 February 2024, 21:21:30 UTC | review model | 11 February 2024, 21:21:30 UTC |
84d4191 | Patrice Lopez | 11 February 2024, 20:30:25 UTC | update model | 11 February 2024, 20:30:25 UTC |
2c9ac67 | lopez | 11 February 2024, 19:12:11 UTC | update training data | 11 February 2024, 19:12:11 UTC |
c893cb4 | Patrice Lopez | 11 February 2024, 19:10:35 UTC | update model | 11 February 2024, 19:10:35 UTC |
62655f4 | Patrice Lopez | 11 February 2024, 17:28:33 UTC | better filtering of infrastructure | 11 February 2024, 17:28:33 UTC |
f6fa0ec | lopez | 11 February 2024, 16:43:47 UTC | Merge branch 'research-infrastructures' of github.com:kermitt2/grobid into research-infrastructures | 11 February 2024, 16:43:47 UTC |
041903f | lopez | 11 February 2024, 16:43:42 UTC | some training | 11 February 2024, 16:43:42 UTC |
a9b0f3f | Patrice Lopez | 11 February 2024, 15:26:32 UTC | update training data | 11 February 2024, 15:26:32 UTC |
a66f6ab | Patrice Lopez | 11 February 2024, 15:24:58 UTC | funding number fix | 11 February 2024, 15:24:58 UTC |
a352cdd | Patrice Lopez | 11 February 2024, 14:03:31 UTC | adapt resources; update features | 11 February 2024, 14:03:31 UTC |
6f933bd | Patrice Lopez | 10 February 2024, 20:46:31 UTC | update model to recognize research infrastructures | 10 February 2024, 20:46:31 UTC |
c20f668 | Patrice Lopez | 10 February 2024, 20:46:02 UTC | update training data | 10 February 2024, 20:46:02 UTC |
0076518 | Patrice Lopez | 10 February 2024, 20:45:27 UTC | adapt trainer and parser | 10 February 2024, 20:45:27 UTC |
4a37538 | lopez | 10 February 2024, 18:18:54 UTC | add resources; training data | 10 February 2024, 18:18:54 UTC |
cab0947 | lopez | 10 February 2024, 15:32:37 UTC | prepare for research infrastructure results | 10 February 2024, 15:32:37 UTC |
5a4b9b8 | lopez | 10 February 2024, 13:19:40 UTC | add research infrastructure annotations | 10 February 2024, 13:19:40 UTC |
ed9fef7 | Patrice Lopez | 10 February 2024, 11:40:22 UTC | Merge pull request #1078 from kermitt2/copyrights-licenses Copyrights owner and licenses identification models | 10 February 2024, 11:40:22 UTC |
b829eff | Patrice Lopez | 10 February 2024, 10:32:32 UTC | review eval set | 10 February 2024, 10:32:32 UTC |
261f975 | lopez | 09 February 2024, 19:57:20 UTC | comments | 09 February 2024, 19:57:20 UTC |
6b890fa | lopez | 09 February 2024, 17:51:51 UTC | fix conflicts with master | 09 February 2024, 17:51:51 UTC |
bcce229 | lopez | 09 February 2024, 16:43:16 UTC | update XML schema | 09 February 2024, 16:43:16 UTC |
e1ecac9 | Patrice Lopez | 09 February 2024, 14:34:23 UTC | fix tests | 09 February 2024, 14:34:23 UTC |
d480e12 | Patrice Lopez | 09 February 2024, 14:11:50 UTC | Merge branch 'master' of github.com:kermitt2/grobid | 09 February 2024, 14:11:50 UTC |
ecff63a | Patrice Lopez | 09 February 2024, 14:11:21 UTC | update crf model patent-citation | 09 February 2024, 14:11:21 UTC |
16b9abb | lopez | 09 February 2024, 09:15:55 UTC | update XML schema | 09 February 2024, 09:15:55 UTC |
86b03e8 | Patrice Lopez | 08 February 2024, 17:49:22 UTC | minor XML fix | 08 February 2024, 17:49:22 UTC |
269c897 | Patrice Lopez | 07 February 2024, 18:45:14 UTC | Merge pull request #1082 from kermitt2/review-patent Review patent process | 07 February 2024, 18:45:14 UTC |
1ebc6c8 | Patrice Lopez | 06 February 2024, 20:20:01 UTC | review serialization | 06 February 2024, 20:20:01 UTC |
8282dad | Patrice Lopez | 06 February 2024, 17:21:07 UTC | fix usage of parameters | 06 February 2024, 17:21:07 UTC |
5750ad7 | Patrice Lopez | 06 February 2024, 16:07:09 UTC | add tests | 06 February 2024, 16:07:09 UTC |
92d3c1d | Patrice Lopez | 06 February 2024, 14:51:28 UTC | add tests, cleaning | 06 February 2024, 14:51:28 UTC |
6357f98 | lopez | 06 February 2024, 09:47:11 UTC | avoid adding bert models | 06 February 2024, 09:47:11 UTC |
08c0405 | Patrice Lopez | 05 February 2024, 22:53:32 UTC | review sequence segmentation following max sequence length | 05 February 2024, 22:53:32 UTC |
017bc28 | Patrice Lopez | 05 February 2024, 20:49:10 UTC | extend default config | 05 February 2024, 20:49:10 UTC |
0d524f7 | Patrice Lopez | 05 February 2024, 20:47:07 UTC | review method profile and fix test | 05 February 2024, 20:47:07 UTC |
53f8c1d | Patrice Lopez | 05 February 2024, 20:46:24 UTC | add additional tokenizer mode | 05 February 2024, 20:46:24 UTC |
80ca203 | Patrice Lopez | 05 February 2024, 20:45:45 UTC | fix language code mismatch for Korean | 05 February 2024, 20:45:45 UTC |
a01d5ec | Patrice Lopez | 05 February 2024, 16:04:53 UTC | cleaning used model | 05 February 2024, 16:04:53 UTC |
1550756 | Patrice Lopez | 05 February 2024, 16:04:19 UTC | update model | 05 February 2024, 16:04:19 UTC |
d66896e | Patrice Lopez | 05 February 2024, 16:04:01 UTC | review process and serialization | 05 February 2024, 16:04:01 UTC |
a0b5bc2 | Patrice Lopez | 05 February 2024, 16:03:27 UTC | update US patent application mapping to year | 05 February 2024, 16:03:27 UTC |
b0145fa | lopez | 05 February 2024, 12:34:02 UTC | cleaning; review instance | 05 February 2024, 12:34:02 UTC |
fbb238c | Patrice Lopez | 04 February 2024, 21:50:22 UTC | refactor process for DL models batch | 04 February 2024, 21:50:22 UTC |
a4eb584 | Patrice Lopez | 04 February 2024, 18:59:19 UTC | remove outdated xml parser | 04 February 2024, 18:59:19 UTC |
0be5097 | Patrice Lopez | 04 February 2024, 18:58:23 UTC | update DL models | 04 February 2024, 18:58:23 UTC |
8c4f19a | Patrice Lopez | 04 February 2024, 18:57:56 UTC | some training fixes | 04 February 2024, 18:57:56 UTC |
c943239 | Patrice Lopez | 04 February 2024, 18:57:28 UTC | review training parsing and selection for DL models | 04 February 2024, 18:57:28 UTC |
2ab91c2 | lopez | 04 February 2024, 16:57:30 UTC | add documentation for parameter includeRawCopyrights | 04 February 2024, 16:57:30 UTC |
ac6944e | lopez | 04 February 2024, 16:07:10 UTC | too quick | 04 February 2024, 16:07:10 UTC |
8b6a6e4 | lopez | 04 February 2024, 16:01:15 UTC | update models | 04 February 2024, 16:01:15 UTC |
af5a20b | lopez | 04 February 2024, 16:00:43 UTC | fix a missing serialization case; add option includeRawCopyrights in service | 04 February 2024, 16:00:43 UTC |
4359b40 | lopez | 01 February 2024, 18:38:48 UTC | change copyrights owner attribute to @rest, with a comment to explain | 01 February 2024, 18:38:48 UTC |
0f9a1dc | lopez | 01 February 2024, 12:10:13 UTC | fix copyright class naming | 01 February 2024, 12:10:13 UTC |
a74c05c | lopez | 01 February 2024, 10:28:45 UTC | rolling back ODD file to use ROMA | 01 February 2024, 10:28:45 UTC |
08d5fce | Luca Foppiano | 01 February 2024, 06:46:10 UTC | add some unit tests | 01 February 2024, 06:46:10 UTC |
59f15e0 | Luca Foppiano | 01 February 2024, 01:12:40 UTC | split labelling and decoding | 01 February 2024, 01:12:40 UTC |
0d31645 | Luca Foppiano | 01 February 2024, 00:32:40 UTC | updated grobid ODD | 01 February 2024, 00:32:40 UTC |
d8fe160 | lopez | 29 January 2024, 18:27:52 UTC | default config for tests | 29 January 2024, 18:27:52 UTC |
2d90904 | lopez | 29 January 2024, 17:53:32 UTC | fix tests | 29 January 2024, 17:53:32 UTC |
23917c4 | lopez | 29 January 2024, 17:43:16 UTC | cleaning | 29 January 2024, 17:43:16 UTC |
268186d | lopez | 29 January 2024, 17:31:02 UTC | copyrights+licenses models integrated; TEI serialization | 29 January 2024, 17:31:02 UTC |
75ec437 | lopez | 28 January 2024, 19:58:05 UTC | fix indentation | 28 January 2024, 19:58:05 UTC |
b4f29a3 | lopez | 28 January 2024, 19:48:22 UTC | start integrating copyright and license model and classes | 28 January 2024, 19:48:22 UTC |
4816a7a | Patrice Lopez | 26 January 2024, 09:11:24 UTC | add tini back to docker images to avoid zombie apocalypse with kubernetes | 26 January 2024, 09:11:24 UTC |
e14ce33 | Patrice Lopez | 21 January 2024, 15:13:23 UTC | Merge pull request #1076 from kermitt2/bugfix/paragraph-coords Fix missing coordinates in paragraphs continuation | 21 January 2024, 15:13:23 UTC |
0d7913d | Luca Foppiano | 18 January 2024, 08:58:32 UTC | visualise paragraphs or sentences mutually exclusively if the segmentation is enabled | 18 January 2024, 08:58:32 UTC |
4d0acef | Luca Foppiano | 18 January 2024, 08:57:53 UTC | add missing coordinates when the paragraph continues after a reference callout | 18 January 2024, 08:57:53 UTC |
cbc77d5 | Patrice Lopez | 14 January 2024, 19:24:40 UTC | Merge pull request #1075 from kermitt2/fix-note-page Fix OOBE when processing large quantities of notes | 14 January 2024, 19:24:40 UTC |
46913da | lopez | 14 January 2024, 18:28:26 UTC | add test to prevent a rare NPE when called from Pub2TEI | 14 January 2024, 18:28:26 UTC |
69fcecb | Luca Foppiano | 12 January 2024, 07:46:48 UTC | add automatic build + push for the CRF light image | 12 January 2024, 07:46:48 UTC |
30780ef | Luca Foppiano | 11 January 2024, 11:44:28 UTC | move dehyphenisation to the end of the note process | 11 January 2024, 11:44:28 UTC |
6a3a0d5 | Luca Foppiano | 06 January 2024, 10:40:21 UTC | avoid OOBE in note getPageNumber() | 06 January 2024, 10:40:21 UTC |
b50c9aa | Patrice Lopez | 29 December 2023, 20:22:36 UTC | Merge pull request #1068 from kermitt2/feature/paragraphs-coordinates Add paragraphs coordinates | 29 December 2023, 20:22:36 UTC |
a506eb1 | lopez | 29 December 2023, 20:05:21 UTC | add coordinates for <p> in batch mode for consistency | 29 December 2023, 20:05:21 UTC |
7b7b5ce | Luca Foppiano | 27 December 2023, 13:16:24 UTC | update documentation, fix typo | 27 December 2023, 13:16:24 UTC |
4620b63 | Luca Foppiano | 26 December 2023, 12:16:04 UTC | cleanup | 26 December 2023, 12:16:04 UTC |
3ef0915 | Luca Foppiano | 26 December 2023, 10:47:10 UTC | Merge branch 'master' into feature/paragraphs-coordinates # Conflicts: # grobid-service/src/main/resources/web/grobid/grobid.js | 26 December 2023, 10:47:10 UTC |
2ca3f35 | Patrice Lopez | 26 December 2023, 10:07:53 UTC | Merge pull request #1069 from kermitt2/update-affiliation Update affiliation process | 26 December 2023, 10:07:53 UTC |
f84e855 | lopez | 25 December 2023, 19:57:37 UTC | indicate in doc coordinates for affiliation | 25 December 2023, 19:57:37 UTC |
54106db | lopez | 23 December 2023, 10:31:42 UTC | add one training example | 23 December 2023, 10:31:42 UTC |
5000f15 | lopez | 22 December 2023, 11:59:16 UTC | review test | 22 December 2023, 11:59:16 UTC |
0d310f2 | lopez | 22 December 2023, 11:24:37 UTC | fix wrong field name | 22 December 2023, 11:24:37 UTC |
200f626 | lopez | 21 December 2023, 21:59:36 UTC | cleaning | 21 December 2023, 21:59:36 UTC |
d137e21 | lopez | 21 December 2023, 19:34:32 UTC | some training data | 21 December 2023, 19:34:32 UTC |
e235c88 | lopez | 21 December 2023, 19:34:09 UTC | fix for training data generation with updated affiliation-address parser | 21 December 2023, 19:34:09 UTC |
a9b6826 | lopez | 21 December 2023, 16:38:02 UTC | fix conflicts | 21 December 2023, 16:38:02 UTC |
4b77e73 | Patrice Lopez | 21 December 2023, 16:35:36 UTC | Merge pull request #1070 from kermitt2/bugfix/fix-coord-name-title correct <title> coordinates attribute name | 21 December 2023, 16:35:36 UTC |
6eaa6f4 | lopez | 21 December 2023, 16:23:28 UTC | other typo for coords; exclude p | 21 December 2023, 16:23:28 UTC |
c776e3c | lopez | 21 December 2023, 11:12:40 UTC | fix typo, add affiliation coord element in demo | 21 December 2023, 11:12:40 UTC |
4423079 | Luca Foppiano | 20 December 2023, 05:12:54 UTC | add missing coordinates elements in batch mode | 20 December 2023, 05:12:54 UTC |
442d219 | Luca Foppiano | 20 December 2023, 05:09:17 UTC | add title in the list of coordinates | 20 December 2023, 05:10:07 UTC |
7f0e0ca | Luca Foppiano | 20 December 2023, 05:09:17 UTC | add title in the list of coordinates | 20 December 2023, 05:09:17 UTC |
e769a52 | Luca Foppiano | 20 December 2023, 04:40:59 UTC | correct <title> coordinates attribute name | 20 December 2023, 04:42:39 UTC |
bf76b4e | Luca Foppiano | 20 December 2023, 04:40:59 UTC | correct <title> coordinates attribute name | 20 December 2023, 04:40:59 UTC |
4241e14 | Luca Foppiano | 18 December 2023, 04:06:40 UTC | Revert "add docker grobid development build" This reverts commit 44df68fa5b6c3318d936259c23a993e3a299f32a. | 18 December 2023, 04:06:40 UTC |
44df68f | Luca Foppiano | 18 December 2023, 03:44:43 UTC | add docker grobid development build | 18 December 2023, 03:44:43 UTC |
315c340 | Luca Foppiano | 18 December 2023, 03:27:25 UTC | Add paragraphs coordinates | 18 December 2023, 03:27:25 UTC |
fb621d6 | lopez | 14 December 2023, 19:56:19 UTC | bug fixing for updated affiliation parser | 14 December 2023, 19:56:19 UTC |
d7b2ff1 | lopez | 13 December 2023, 17:24:33 UTC | proper affiliation coordinates | 13 December 2023, 17:24:33 UTC |