https://github.com/kermitt2/grobid

sort by:
Revision Author Date Message Commit Date
91d1b7c Merge branch 'master' into 3em-dash-support 12 May 2023, 10:12:50 UTC
699bd15 minor tei formatting improvements 10 May 2023, 14:29:44 UTC
7c723b4 fix extra indent after book title 08 May 2023, 14:29:47 UTC
b104f74 rephrase 01 May 2023, 18:24:52 UTC
39a6f1b more update on deep learning models 01 May 2023, 18:04:05 UTC
bc27c2e Merge branch 'master' into 3em-dash-support 01 May 2023, 10:02:45 UTC
687bd3c markdown syntax fix 30 April 2023, 19:17:14 UTC
d17574d update doc index 30 April 2023, 19:10:32 UTC
0d50be7 add benchmarking for next release 30 April 2023, 18:56:25 UTC
4358f74 add our evaluation datasets 28 April 2023, 10:55:21 UTC
572bc07 more model options 27 April 2023, 14:42:48 UTC
a2b1795 add more benchmarks 24 April 2023, 17:13:39 UTC
9606fea fix tests 23 April 2023, 12:28:53 UTC
8d0ae64 update doc for <note> coordinate option 23 April 2023, 10:38:08 UTC
3dbdb08 refresh some models 23 April 2023, 10:36:05 UTC
45de58c review note text cleaning 23 April 2023, 09:51:30 UTC
e496d95 add optional coordinates to note element 23 April 2023, 08:41:42 UTC
b75415c fix for #995 22 April 2023, 15:48:03 UTC
189d3b4 update eval 12 April 2023, 12:00:10 UTC
b515620 Merge pull request #998 from kermitt2/bugfix/fix_hypen_break_number_parsing Extend DASH_PATTERN to cover a particular minus sign 12 April 2023, 11:58:47 UTC
1761402 Extend DASH_PATTERN to cover a particular minus sign #980 11 April 2023, 06:08:15 UTC
cdd604f Merge branch 'master' of github.com:kermitt2/grobid 29 March 2023, 18:54:27 UTC
bcddaab fix demo link 29 March 2023, 18:53:47 UTC
8d981a8 covering new end-to-end evaluation datasets for PLOS and eLife 21 March 2023, 21:23:28 UTC
c6b0f5a Merge pull request #993 from kermitt2/addElementID Add parameter to generate automatically xml:id for elements in the TEI XML result for the batch command line 21 March 2023, 21:21:48 UTC
783637e document the parameter 21 March 2023, 21:14:23 UTC
e4362d5 add parameter to generate xml:id automatically via batch command 21 March 2023, 21:12:23 UTC
73031c8 update models; benchmark; default settings 17 March 2023, 13:14:38 UTC
e426bfe update header model (this is the best header model so far) 17 March 2023, 10:53:12 UTC
01e2b74 document the additional PLOS and eLife evaluation datasets; update benchmarks 09 March 2023, 08:36:36 UTC
c1fa76d update citation model 07 March 2023, 15:48:15 UTC
4b23b57 Merge branch 'master' of github.com:kermitt2/grobid 06 March 2023, 18:27:47 UTC
e24b35c update models with added training data 06 March 2023, 18:26:09 UTC
e368fa2 some citation cases 04 March 2023, 17:15:48 UTC
c3e06f0 with latest training 04 March 2023, 08:48:56 UTC
fbd0315 fix new training example segmentation 03 March 2023, 19:29:41 UTC
e8f8d39 second try add some biorxiv training reference segmentation 03 March 2023, 19:25:15 UTC
70086c3 add some biorxiv training reference segmentation 03 March 2023, 19:24:31 UTC
a6e5330 add some biorxiv training headers 03 March 2023, 19:21:16 UTC
5837c0c some arXiv trainng data 02 March 2023, 19:12:29 UTC
f39b982 add group/collaboration in the extracted reference results 02 March 2023, 16:23:32 UTC
122f717 remaining training PMC 1500 01 March 2023, 20:07:30 UTC
5c86a89 add PMC 1500 set citation training 01 March 2023, 20:03:35 UTC
1f815ef review logger 01 March 2023, 20:02:37 UTC
3e31f3a add PMC 1500 set reference segmentation training 01 March 2023, 20:02:09 UTC
8e2ea6c add PMC 1500 set segmentation training 01 March 2023, 19:58:41 UTC
b847753 Merge pull request #990 from kermitt2/review-analyzers Review analyzers 28 February 2023, 16:50:21 UTC
a63e5eb refresh model 28 February 2023, 16:45:39 UTC
25490f9 more analyser tests 28 February 2023, 08:48:30 UTC
7cbdc9e test for Korean anlyser 27 February 2023, 12:38:48 UTC
53a02f8 review Korean analyzer 27 February 2023, 12:38:29 UTC
69e6429 update wipo analyser for korean 27 February 2023, 12:38:02 UTC
f68a793 apply digit/alpha unicode separation for reference strings 26 February 2023, 20:38:25 UTC
ec89505 review CJK analysers; add digit/alpha unicode separation 26 February 2023, 20:37:45 UTC
45d027a correct annotations 25 February 2023, 17:10:17 UTC
2f87f1c Merge branch 'master' of github.com:kermitt2/grobid 25 February 2023, 13:31:14 UTC
16f9c07 review training 25 February 2023, 13:30:45 UTC
ae7cc1f update citation model 22 February 2023, 20:33:38 UTC
fa3c0ff re-annotate 22 February 2023, 15:45:09 UTC
187446c fix annotations 22 February 2023, 13:13:53 UTC
5839cbb update citation parser model 22 February 2023, 11:27:14 UTC
59bbe46 training data 21 February 2023, 20:44:21 UTC
90a6dc9 update with additional training 21 February 2023, 13:03:15 UTC
5d51d49 add training 20 February 2023, 22:08:45 UTC
e63aaa8 update demo url 20 February 2023, 00:07:24 UTC
0fc08d8 point now to HuggingFace spaces for demo 20 February 2023, 00:04:57 UTC
6317b73 fix console/demo client page to run as embedded huggingface app 18 February 2023, 21:30:02 UTC
c5f252f missing closing label; review annotations 16 February 2023, 18:31:08 UTC
dc1b75f review annotations 12 February 2023, 21:09:50 UTC
7007aee emphasize using DL models in the doc 12 February 2023, 15:12:01 UTC
e6c1a0a update DeLFT version 12 February 2023, 15:11:14 UTC
d2cae60 Merge pull request #986 from marktab/patch-1 Update Readme.md 11 February 2023, 15:43:45 UTC
4d2e2e3 Update Readme.md Formatting improvements -- spelling fixes 10 February 2023, 04:13:21 UTC
542e7a2 fix wrong content type in doc for processCitation 09 February 2023, 16:52:51 UTC
c9351fc Merge branch 'master' of github.com:kermitt2/grobid 09 February 2023, 12:10:43 UTC
ece5e57 minor 09 February 2023, 12:10:37 UTC
6dd48d7 add eval end2end for PLOS and eLife test sets 09 February 2023, 12:09:56 UTC
4d86a79 Merge pull request #975 from kermitt2/feature/support-mac-arm Support for Apple ARM M1 09 February 2023, 08:24:34 UTC
904f40e Update documentation 09 February 2023, 03:06:01 UTC
650da39 support PLOS JATS; quieter handling of crossref 404 07 February 2023, 13:46:38 UTC
93d2983 minor doc updates 17 January 2023, 15:44:37 UTC
732d5a1 review ref expansion 10 January 2023, 10:36:43 UTC
2661cae update log levels 04 January 2023, 18:33:19 UTC
713c35f allow deep learning training for segmentation model 04 January 2023, 17:48:15 UTC
87476e1 add model 04 January 2023, 17:47:27 UTC
4812074 more robustness in case of reference segmenter deficiency 04 January 2023, 17:45:31 UTC
2d98cc5 less warn log, more info 04 January 2023, 17:44:55 UTC
ee18014 remove useless jep library 06 December 2022, 06:09:33 UTC
de6016d more log debug; model update 05 December 2022, 17:58:58 UTC
c5ed7bc update models/scores 03 December 2022, 19:50:01 UTC
ac6146c update logs 03 December 2022, 14:42:13 UTC
7d717b8 Merge branch 'master' of github.com:kermitt2/grobid 03 December 2022, 12:12:33 UTC
579939b review shared libraries got lin-64 JNI 03 December 2022, 12:12:23 UTC
bdefc2c update model 02 December 2022, 20:39:52 UTC
2ecfd38 review incremental training 02 December 2022, 18:33:07 UTC
9b0b4ad Add rebuilt binaries for Mac ARM architecture + fix path construction 01 December 2022, 07:54:51 UTC
27d2917 update lin64 wapiti binaries built with release flags 30 November 2022, 13:35:42 UTC
64ad6fb fix author rank 30 November 2022, 13:00:56 UTC
8330b02 update lin64 wapiti binaries built with release flags 30 November 2022, 11:49:13 UTC
cbbc09a try to fix circleci config 25 November 2022, 17:02:42 UTC
back to top