91d1b7c | lopez | 12 May 2023, 10:12:50 UTC | Merge branch 'master' into 3em-dash-support | 12 May 2023, 10:12:50 UTC |
699bd15 | Patrice Lopez | 10 May 2023, 14:29:44 UTC | minor tei formatting improvements | 10 May 2023, 14:29:44 UTC |
7c723b4 | Patrice Lopez | 08 May 2023, 14:29:47 UTC | fix extra indent after book title | 08 May 2023, 14:29:47 UTC |
b104f74 | lopez | 01 May 2023, 18:24:52 UTC | rephrase | 01 May 2023, 18:24:52 UTC |
39a6f1b | lopez | 01 May 2023, 18:04:05 UTC | more update on deep learning models | 01 May 2023, 18:04:05 UTC |
bc27c2e | lopez | 01 May 2023, 10:02:45 UTC | Merge branch 'master' into 3em-dash-support | 01 May 2023, 10:02:45 UTC |
687bd3c | Patrice Lopez | 30 April 2023, 19:17:14 UTC | markdown syntax fix | 30 April 2023, 19:17:14 UTC |
d17574d | Patrice Lopez | 30 April 2023, 19:10:32 UTC | update doc index | 30 April 2023, 19:10:32 UTC |
0d50be7 | Patrice Lopez | 30 April 2023, 18:56:25 UTC | add benchmarking for next release | 30 April 2023, 18:56:25 UTC |
4358f74 | Patrice Lopez | 28 April 2023, 10:55:21 UTC | add our evaluation datasets | 28 April 2023, 10:55:21 UTC |
572bc07 | Patrice Lopez | 27 April 2023, 14:42:48 UTC | more model options | 27 April 2023, 14:42:48 UTC |
a2b1795 | Patrice Lopez | 24 April 2023, 17:13:39 UTC | add more benchmarks | 24 April 2023, 17:13:39 UTC |
9606fea | Patrice Lopez | 23 April 2023, 12:28:53 UTC | fix tests | 23 April 2023, 12:28:53 UTC |
8d0ae64 | Patrice Lopez | 23 April 2023, 10:38:08 UTC | update doc for <note> coordinate option | 23 April 2023, 10:38:08 UTC |
3dbdb08 | Patrice Lopez | 23 April 2023, 10:36:05 UTC | refresh some models | 23 April 2023, 10:36:05 UTC |
45de58c | lopez | 23 April 2023, 09:51:30 UTC | review note text cleaning | 23 April 2023, 09:51:30 UTC |
e496d95 | lopez | 23 April 2023, 08:41:42 UTC | add optional coordinates to note element | 23 April 2023, 08:41:42 UTC |
b75415c | lopez | 22 April 2023, 15:48:03 UTC | fix for #995 | 22 April 2023, 15:48:03 UTC |
189d3b4 | Patrice Lopez | 12 April 2023, 12:00:10 UTC | update eval | 12 April 2023, 12:00:10 UTC |
b515620 | Patrice Lopez | 12 April 2023, 11:58:47 UTC | Merge pull request #998 from kermitt2/bugfix/fix_hypen_break_number_parsing Extend DASH_PATTERN to cover a particular minus sign | 12 April 2023, 11:58:47 UTC |
1761402 | Luca Foppiano | 11 April 2023, 06:08:15 UTC | Extend DASH_PATTERN to cover a particular minus sign #980 | 11 April 2023, 06:08:15 UTC |
cdd604f | lopez | 29 March 2023, 18:54:27 UTC | Merge branch 'master' of github.com:kermitt2/grobid | 29 March 2023, 18:54:27 UTC |
bcddaab | lopez | 29 March 2023, 18:53:47 UTC | fix demo link | 29 March 2023, 18:53:47 UTC |
8d981a8 | Patrice Lopez | 21 March 2023, 21:23:28 UTC | covering new end-to-end evaluation datasets for PLOS and eLife | 21 March 2023, 21:23:28 UTC |
c6b0f5a | Patrice Lopez | 21 March 2023, 21:21:48 UTC | Merge pull request #993 from kermitt2/addElementID Add parameter to generate automatically xml:id for elements in the TEI XML result for the batch command line | 21 March 2023, 21:21:48 UTC |
783637e | Patrice Lopez | 21 March 2023, 21:14:23 UTC | document the parameter | 21 March 2023, 21:14:23 UTC |
e4362d5 | Patrice Lopez | 21 March 2023, 21:12:23 UTC | add parameter to generate xml:id automatically via batch command | 21 March 2023, 21:12:23 UTC |
73031c8 | Patrice Lopez | 17 March 2023, 13:14:38 UTC | update models; benchmark; default settings | 17 March 2023, 13:14:38 UTC |
e426bfe | Patrice Lopez | 17 March 2023, 10:53:12 UTC | update header model (this is the best header model so far) | 17 March 2023, 10:53:12 UTC |
01e2b74 | lopez | 09 March 2023, 08:36:36 UTC | document the additional PLOS and eLife evaluation datasets; update benchmarks | 09 March 2023, 08:36:36 UTC |
c1fa76d | Patrice Lopez | 07 March 2023, 15:48:15 UTC | update citation model | 07 March 2023, 15:48:15 UTC |
4b23b57 | Patrice Lopez | 06 March 2023, 18:27:47 UTC | Merge branch 'master' of github.com:kermitt2/grobid | 06 March 2023, 18:27:47 UTC |
e24b35c | Patrice Lopez | 06 March 2023, 18:26:09 UTC | update models with added training data | 06 March 2023, 18:26:09 UTC |
e368fa2 | lopez | 04 March 2023, 17:15:48 UTC | some citation cases | 04 March 2023, 17:15:48 UTC |
c3e06f0 | Patrice Lopez | 04 March 2023, 08:48:56 UTC | with latest training | 04 March 2023, 08:48:56 UTC |
fbd0315 | lopez | 03 March 2023, 19:29:41 UTC | fix new training example segmentation | 03 March 2023, 19:29:41 UTC |
e8f8d39 | lopez | 03 March 2023, 19:25:15 UTC | second try add some biorxiv training reference segmentation | 03 March 2023, 19:25:15 UTC |
70086c3 | lopez | 03 March 2023, 19:24:31 UTC | add some biorxiv training reference segmentation | 03 March 2023, 19:24:31 UTC |
a6e5330 | lopez | 03 March 2023, 19:21:16 UTC | add some biorxiv training headers | 03 March 2023, 19:21:16 UTC |
5837c0c | lopez | 02 March 2023, 19:12:29 UTC | some arXiv trainng data | 02 March 2023, 19:12:29 UTC |
f39b982 | lopez | 02 March 2023, 16:23:32 UTC | add group/collaboration in the extracted reference results | 02 March 2023, 16:23:32 UTC |
122f717 | lopez | 01 March 2023, 20:07:30 UTC | remaining training PMC 1500 | 01 March 2023, 20:07:30 UTC |
5c86a89 | lopez | 01 March 2023, 20:03:35 UTC | add PMC 1500 set citation training | 01 March 2023, 20:03:35 UTC |
1f815ef | lopez | 01 March 2023, 20:02:37 UTC | review logger | 01 March 2023, 20:02:37 UTC |
3e31f3a | lopez | 01 March 2023, 20:02:09 UTC | add PMC 1500 set reference segmentation training | 01 March 2023, 20:02:09 UTC |
8e2ea6c | lopez | 01 March 2023, 19:58:41 UTC | add PMC 1500 set segmentation training | 01 March 2023, 19:58:41 UTC |
b847753 | Patrice Lopez | 28 February 2023, 16:50:21 UTC | Merge pull request #990 from kermitt2/review-analyzers Review analyzers | 28 February 2023, 16:50:21 UTC |
a63e5eb | Patrice Lopez | 28 February 2023, 16:45:39 UTC | refresh model | 28 February 2023, 16:45:39 UTC |
25490f9 | Patrice Lopez | 28 February 2023, 08:48:30 UTC | more analyser tests | 28 February 2023, 08:48:30 UTC |
7cbdc9e | Patrice Lopez | 27 February 2023, 12:38:48 UTC | test for Korean anlyser | 27 February 2023, 12:38:48 UTC |
53a02f8 | Patrice Lopez | 27 February 2023, 12:38:29 UTC | review Korean analyzer | 27 February 2023, 12:38:29 UTC |
69e6429 | Patrice Lopez | 27 February 2023, 12:38:02 UTC | update wipo analyser for korean | 27 February 2023, 12:38:02 UTC |
f68a793 | Patrice Lopez | 26 February 2023, 20:38:25 UTC | apply digit/alpha unicode separation for reference strings | 26 February 2023, 20:38:25 UTC |
ec89505 | Patrice Lopez | 26 February 2023, 20:37:45 UTC | review CJK analysers; add digit/alpha unicode separation | 26 February 2023, 20:37:45 UTC |
45d027a | lopez | 25 February 2023, 17:10:17 UTC | correct annotations | 25 February 2023, 17:10:17 UTC |
2f87f1c | lopez | 25 February 2023, 13:31:14 UTC | Merge branch 'master' of github.com:kermitt2/grobid | 25 February 2023, 13:31:14 UTC |
16f9c07 | lopez | 25 February 2023, 13:30:45 UTC | review training | 25 February 2023, 13:30:45 UTC |
ae7cc1f | Patrice Lopez | 22 February 2023, 20:33:38 UTC | update citation model | 22 February 2023, 20:33:38 UTC |
fa3c0ff | lopez | 22 February 2023, 15:45:09 UTC | re-annotate | 22 February 2023, 15:45:09 UTC |
187446c | lopez | 22 February 2023, 13:13:53 UTC | fix annotations | 22 February 2023, 13:13:53 UTC |
5839cbb | Patrice Lopez | 22 February 2023, 11:27:14 UTC | update citation parser model | 22 February 2023, 11:27:14 UTC |
59bbe46 | lopez | 21 February 2023, 20:44:21 UTC | training data | 21 February 2023, 20:44:21 UTC |
90a6dc9 | Patrice Lopez | 21 February 2023, 13:03:15 UTC | update with additional training | 21 February 2023, 13:03:15 UTC |
5d51d49 | lopez | 20 February 2023, 22:08:45 UTC | add training | 20 February 2023, 22:08:45 UTC |
e63aaa8 | lopez | 20 February 2023, 00:07:24 UTC | update demo url | 20 February 2023, 00:07:24 UTC |
0fc08d8 | lopez | 20 February 2023, 00:04:57 UTC | point now to HuggingFace spaces for demo | 20 February 2023, 00:04:57 UTC |
6317b73 | lopez | 18 February 2023, 21:30:02 UTC | fix console/demo client page to run as embedded huggingface app | 18 February 2023, 21:30:02 UTC |
c5f252f | lopez | 16 February 2023, 18:31:08 UTC | missing closing label; review annotations | 16 February 2023, 18:31:08 UTC |
dc1b75f | lopez | 12 February 2023, 21:09:50 UTC | review annotations | 12 February 2023, 21:09:50 UTC |
7007aee | lopez | 12 February 2023, 15:12:01 UTC | emphasize using DL models in the doc | 12 February 2023, 15:12:01 UTC |
e6c1a0a | lopez | 12 February 2023, 15:11:14 UTC | update DeLFT version | 12 February 2023, 15:11:14 UTC |
d2cae60 | Patrice Lopez | 11 February 2023, 15:43:45 UTC | Merge pull request #986 from marktab/patch-1 Update Readme.md | 11 February 2023, 15:43:45 UTC |
4d2e2e3 | MarkTab marktab.net | 10 February 2023, 04:13:21 UTC | Update Readme.md Formatting improvements -- spelling fixes | 10 February 2023, 04:13:21 UTC |
542e7a2 | lopez | 09 February 2023, 16:52:51 UTC | fix wrong content type in doc for processCitation | 09 February 2023, 16:52:51 UTC |
c9351fc | lopez | 09 February 2023, 12:10:43 UTC | Merge branch 'master' of github.com:kermitt2/grobid | 09 February 2023, 12:10:43 UTC |
ece5e57 | lopez | 09 February 2023, 12:10:37 UTC | minor | 09 February 2023, 12:10:37 UTC |
6dd48d7 | lopez | 09 February 2023, 12:09:56 UTC | add eval end2end for PLOS and eLife test sets | 09 February 2023, 12:09:56 UTC |
4d86a79 | Luca Foppiano | 09 February 2023, 08:24:34 UTC | Merge pull request #975 from kermitt2/feature/support-mac-arm Support for Apple ARM M1 | 09 February 2023, 08:24:34 UTC |
904f40e | Luca Foppiano | 09 February 2023, 03:06:01 UTC | Update documentation | 09 February 2023, 03:06:01 UTC |
650da39 | lopez | 07 February 2023, 13:46:38 UTC | support PLOS JATS; quieter handling of crossref 404 | 07 February 2023, 13:46:38 UTC |
93d2983 | lopez | 17 January 2023, 15:44:37 UTC | minor doc updates | 17 January 2023, 15:44:37 UTC |
732d5a1 | lopez | 10 January 2023, 10:36:43 UTC | review ref expansion | 10 January 2023, 10:36:43 UTC |
2661cae | Patrice Lopez | 04 January 2023, 18:33:19 UTC | update log levels | 04 January 2023, 18:33:19 UTC |
713c35f | Patrice Lopez | 04 January 2023, 17:48:15 UTC | allow deep learning training for segmentation model | 04 January 2023, 17:48:15 UTC |
87476e1 | Patrice Lopez | 04 January 2023, 17:47:27 UTC | add model | 04 January 2023, 17:47:27 UTC |
4812074 | Patrice Lopez | 04 January 2023, 17:45:31 UTC | more robustness in case of reference segmenter deficiency | 04 January 2023, 17:45:31 UTC |
2d98cc5 | Patrice Lopez | 04 January 2023, 17:44:55 UTC | less warn log, more info | 04 January 2023, 17:44:55 UTC |
ee18014 | Luca Foppiano | 06 December 2022, 06:09:33 UTC | remove useless jep library | 06 December 2022, 06:09:33 UTC |
de6016d | Patrice Lopez | 05 December 2022, 17:58:58 UTC | more log debug; model update | 05 December 2022, 17:58:58 UTC |
c5ed7bc | Patrice Lopez | 03 December 2022, 19:50:01 UTC | update models/scores | 03 December 2022, 19:50:01 UTC |
ac6146c | lopez | 03 December 2022, 14:42:13 UTC | update logs | 03 December 2022, 14:42:13 UTC |
7d717b8 | lopez | 03 December 2022, 12:12:33 UTC | Merge branch 'master' of github.com:kermitt2/grobid | 03 December 2022, 12:12:33 UTC |
579939b | lopez | 03 December 2022, 12:12:23 UTC | review shared libraries got lin-64 JNI | 03 December 2022, 12:12:23 UTC |
bdefc2c | Patrice Lopez | 02 December 2022, 20:39:52 UTC | update model | 02 December 2022, 20:39:52 UTC |
2ecfd38 | Patrice Lopez | 02 December 2022, 18:33:07 UTC | review incremental training | 02 December 2022, 18:33:07 UTC |
9b0b4ad | Luca Foppiano | 01 December 2022, 07:54:51 UTC | Add rebuilt binaries for Mac ARM architecture + fix path construction | 01 December 2022, 07:54:51 UTC |
27d2917 | lopez | 30 November 2022, 13:35:42 UTC | update lin64 wapiti binaries built with release flags | 30 November 2022, 13:35:42 UTC |
64ad6fb | lopez | 30 November 2022, 13:00:56 UTC | fix author rank | 30 November 2022, 13:00:56 UTC |
8330b02 | lopez | 30 November 2022, 11:49:13 UTC | update lin64 wapiti binaries built with release flags | 30 November 2022, 11:49:13 UTC |
cbbc09a | Patrice Lopez | 25 November 2022, 17:02:42 UTC | try to fix circleci config | 25 November 2022, 17:02:42 UTC |