sort by:
Revision Author Date Message Commit Date
bc20dc0 adding grobid home to artifacts Former-commit-id: c74edbec283fc6c76bf42e3ed00170abd49f3434 22 September 2017, 09:42:57 UTC
3aa12c3 packaging grobid home Former-commit-id: 53af88685fae1f402304565637008f538a57369e 22 September 2017, 09:21:10 UTC
b72f94c application plugin for grobid service Former-commit-id: 4086df5d3ee7249d21e0f870d7c8696846a8b77c 22 September 2017, 09:10:29 UTC
a5398f8 Adding possibility to generate GrobidProperties instance using a customised grobidHomeFinder Former-commit-id: f28502555f52e55d08be18228b4a7ad7da2b4665 22 September 2017, 08:50:38 UTC
85d360f Minor cosmetic cleanup, removing hardcoded path in favour of streams resources Former-commit-id: 187d09f4fad54e239e36837b64bb6bc7ff119a2c 22 September 2017, 06:56:04 UTC
a2f4f2f gradle support Former-commit-id: d4d4b7c99bf1834b6f33260b470b5452f3d0299c 21 September 2017, 16:20:26 UTC
2a7952d Merge branch 'master' into dropwizard-service # Conflicts: # doc/Grobid-batch.md # doc/training/fulltext.md # grobid-core/pom.xml # grobid-core/src/main/resources/log4j.xml # grobid-core/src/test/java/org/grobid/core/engines/EngineTest.java # grobid-core/src/test/java/org/grobid/core/lexicon/FastMatcherTest.java # grobid-core/src/test/java/org/grobid/core/lexicon/LexiconIntegrationTest.java # grobid-service/src/main/java/org/grobid/service/GrobidRestService.java # grobid-service/src/main/java/org/grobid/service/process/GrobidRestProcessFiles.java # grobid-service/src/main/java/org/grobid/service/process/GrobidRestProcessGeneric.java # grobid-service/src/main/java/org/grobid/service/process/GrobidRestProcessString.java # grobid-service/src/main/resources/log4j-jetty.properties # grobid-trainer/src/main/java/org/grobid/trainer/NameCitationTrainer.java # pom.xml Former-commit-id: e594e2e527be09a19ef0ac069e21b83d43c5161e 20 September 2017, 21:06:47 UTC
14f3d45 putting back the slf4j/log4j implementation with scope runtime Former-commit-id: 815167bb682ee6052bdd141e4aa884cebbb6897d 20 September 2017, 07:27:02 UTC
79211a2 ignoring tests depending on an external service Former-commit-id: f71015f8b4672f1756340ec41898feb414195885 19 September 2017, 17:03:32 UTC
01fe0b8 Removing logs but enabling only for tests, libraries using grobid-core they have to provide a log implementation Former-commit-id: d801dc49d16dd9f855ec9f2a689dc5af619d0fd4 19 September 2017, 17:02:09 UTC
20d57ed uniforming method names Former-commit-id: 277cd1d576d701bf1abbcd07e8f40f7948584c92 19 September 2017, 16:59:55 UTC
b042c0a Merge stuff and make it compile Former-commit-id: 511570d4e7114014dec13981e5e0003766945769 19 September 2017, 16:12:21 UTC
5786491 Merge branch 'master' of https://github.com/kermitt2/grobid Former-commit-id: bc34d6a744a34bc0318674bfed3c25b80c7e0565 19 September 2017, 16:02:22 UTC
2b167ae Deactivate test on crossref rest api, as it appears unreliable Former-commit-id: 0084f370a8638d4889c4dfc68121b3cd70a0eb3d 19 September 2017, 16:02:07 UTC
7d4ebd9 cosmetics Former-commit-id: 4e492a84522f4cf7571aa0a0b241abb069c8e5e9 19 September 2017, 15:30:45 UTC
e449d3a fixed method name Former-commit-id: 701d17f9624f895d5f84d0846e48f4deb05bbcf5 19 September 2017, 15:12:48 UTC
b4b11f7 Merge branch 'master' of https://github.com/kermitt2/grobid Former-commit-id: 555f8f0321901cc528b546e58945bd8931083d12 19 September 2017, 14:10:25 UTC
9f05cb8 Keep for each LayoutToken a list of TaggingLabel associated Former-commit-id: a7fc77f25f2a5a1843d09ff395bc2970558cf189 19 September 2017, 14:10:13 UTC
0d6100e Corrections for new citation training and updated model Former-commit-id: f00c51e2a70f7f1fb0303428aa1731c64e1fc4b3 14 September 2017, 15:48:19 UTC
d8a5d8b Merge pull request #239 from iorala/inspire-reference-training-2 Add CERN training data, update date in grobid-service webapp Former-commit-id: e15ea0d28c3d908527a6ccb6e645c505574e1c34 13 September 2017, 14:43:07 UTC
0884cea Update maven git-commit-id plugin version Former-commit-id: 06d69edf4914747a9934297ea58cdfc551f3bdf9 13 September 2017, 14:41:19 UTC
e06f025 Add CERN training data, update date in grobid-service webapp Former-commit-id: 0f11be99948d65383fe44afed7606072d5118b2d 13 September 2017, 12:09:00 UTC
8d2c77c update crossref consolidation, add batch PDF annotations, update service and documentation Former-commit-id: d9e47d2176fc5ff4e01bda97dc3354fae974a5f9 12 September 2017, 21:19:55 UTC
578c0a3 multithread support of CrossRef request pool Former-commit-id: 0b2a52db5b1e7b22b507855afee75621ea925a64 08 September 2017, 12:49:57 UTC
442a822 Add special case for crossref first name normalisation Former-commit-id: f6f00129566390ef032f5485a5d41bfecdc8580b 07 September 2017, 16:47:52 UTC
e1ae122 merge FastMatcher Former-commit-id: f2c2e4cf5b36244d281581bfed8e88c8f60467db 07 September 2017, 13:55:27 UTC
13839d6 Refresh FastMatcher and Lexicon pattern matcher Former-commit-id: d505fbb03706365d161d496c59cd6ad132a41d28 07 September 2017, 12:52:59 UTC
a140bf6 Merge pull request #238 from Vi-dot/master Debug crossref pool, better rate limiter Former-commit-id: d5f691b609c9d20ebad65f2995a078ca86cab374 07 September 2017, 12:52:02 UTC
75f2812 Debug crossref pool, better rate limiter Former-commit-id: 5679135515f7344d313f53102d34ce4a2aea3dea 07 September 2017, 09:19:25 UTC
96e293e Add method addTerm that by default ignores delimiters Former-commit-id: 1725ce61660d96a09ea9657f6ed62add58bd414a 06 September 2017, 20:20:33 UTC
f3ee416 modification of TEI output for issue #232 Former-commit-id: 1538a8a58f3d2f5f15ff7d348a9dbefa0cf52b21 06 September 2017, 20:19:10 UTC
8440596 Merge pull request #236 from iorala/inspire-reference-training-2 Add CERN evaluation data for arXive.org and INSPIRE-HEP Former-commit-id: 1c233ea2cf0eb7e72f2c6c6102f4f061411be255 06 September 2017, 17:20:14 UTC
7d671ba Fix #235, retrain citation model, crossref Former-commit-id: fe582d50958fd991464955af34465fb1f5287a75 06 September 2017, 17:17:24 UTC
ecd61d3 Add CERN evaluation data for arXive.org and INSPIRE-HEP Former-commit-id: 3ce232adda491afd83974168e6e3fb0cc3732863 06 September 2017, 14:58:27 UTC
985858d Merge pull request #234 from iorala/inspire-reference-training-2 Review errata, fix excluded training file, add new training file Former-commit-id: 6b1bb25e75486bb4c1b6bc58531c6945e89b8209 04 September 2017, 15:57:57 UTC
9f230c3 Review errata, fix excluded training file, add new training file Former-commit-id: facaa80260b3b5eb8fed93cc5cb290741b9725b9 04 September 2017, 15:37:21 UTC
57ed820 Fix overlooked compilation problems Former-commit-id: 76d5cd63f39b0b5cc94217c9695e9d87cc044251 04 September 2017, 06:41:35 UTC
2d7cf55 Review citation training data and retrain model Former-commit-id: 4bbe3a31e86f74815a7f45445d7fd938dc7ea115 04 September 2017, 06:09:33 UTC
435c385 Merge pull request #231 from iorala/inspire-reference-training-2 add CERN arXiv and journal training data for citation, reference-segm… Former-commit-id: 9eebcace1f9d3eaa56077c84cb83c619764a201e 03 September 2017, 06:27:09 UTC
dad633f Merge branch 'master' of https://github.com/kermitt2/grobid Former-commit-id: edd8dd50c2420dfbece0352450a5736cd1dbfc90 03 September 2017, 06:22:16 UTC
4256df0 Complete integration of public crossref json api for header consolidation Former-commit-id: 54f007bdc75ad3eb21637c7b04c80b03cc623cef 03 September 2017, 06:22:08 UTC
172d22d add CERN arXiv and journal training data for citation, reference-segmenter and segmentation models fix errors in training data Former-commit-id: 1f1c76c10400a580dd5bb3ed2e17636b61be7095 01 September 2017, 16:49:21 UTC
0d560d6 Merge pull request #228 from iorala/improve_documentation update tag explanations of bibliographic reference documentation, fix typos Former-commit-id: 7d90aa5a9b87bf384eb9e69098ffb0b05c806244 30 August 2017, 15:47:03 UTC
13a5147 add explanations to bibliographic reference documentation, fix typos Former-commit-id: 70aa313f4ddcc47a3bf05560a1e99142c0bd7834 30 August 2017, 14:48:40 UTC
dc72d2e update readFileToString Former-commit-id: c438d3a277529fe04e1a3fe2436ef92822688f52 29 August 2017, 07:28:29 UTC
9d56737 Start integrating new CrossRef API for consolidation Former-commit-id: 414a609d3a4ccbe574b58c9a8c25684b49afa658 29 August 2017, 06:31:40 UTC
cdff698 Generic field-level evaluation: Cleaning and typos Former-commit-id: ba6d5a2d07faf1e8bdf1ced82578180991594e23 27 August 2017, 02:38:39 UTC
160bfda Finalize generic field-level evaluation review and unit tests Former-commit-id: 0ea3879162abac42bc735b586caa3ae78df119b4 27 August 2017, 02:36:03 UTC
756a528 Enable CORS for all web services Former-commit-id: 819b9b0424022859c1317da0e5e8d73d79707209 26 August 2017, 19:52:59 UTC
87b25f6 Review standard model evaluation at field level, refactorize and add tests Former-commit-id: f28b5ce41a8fd1ddd831f9718108815cae4d0180 26 August 2017, 02:35:11 UTC
3f429cd revert change in TEI citation parser Former-commit-id: 8e4ce03fc91a75d0185fd61f3a480444df21933e 25 August 2017, 15:00:15 UTC
81a06c7 Merge branch 'master' of https://github.com/kermitt2/grobid Former-commit-id: b32c708f34de5102d253c59e2be05e680a610338 25 August 2017, 14:53:27 UTC
0571057 review latest INSPIRe training data, retrain ref. models, debug Former-commit-id: 85efa4c42e0d2d374225f127b74b49c83cff5483 25 August 2017, 14:53:15 UTC
8b2bfa2 TEI Citation Parser, support text around the citations - adding some assertions Former-commit-id: 98391ad9c65ddbf1848964bc0d969bb10027b6b8 25 August 2017, 10:27:49 UTC
f05a329 TEI Citation Parser, support text around the citations Former-commit-id: 83d8b77926411ccdc358970545da713712120545 25 August 2017, 10:21:44 UTC
2a8ee8a Merge branch 'master' of https://github.com/kermitt2/grobid Former-commit-id: 73c877e226caba36a9dd7a82527f27d38bd8d7fb 24 August 2017, 17:49:10 UTC
588e51a review date parser, inpire collaboration list, retrain Former-commit-id: 44caa33db98007e73bdbb426d9e6d6397ceccd0f 24 August 2017, 17:49:03 UTC
53f63ae Merge pull request #224 from iorala/inspire-reference-training-2 Added CERN arXiv and journal training data / change arXiv <idno>-elements Former-commit-id: 4b40370f39bbe470ae2dbc5bb02fb82ea2fdbd39 24 August 2017, 17:46:46 UTC
dd95480 add CERN arXiv and journal training data / change arXiv <idno>-elements Former-commit-id: ede5aa8b00ecf13117ddc6189c771a49420a5026 24 August 2017, 15:25:32 UTC
670eeca use now clusteror for result extraction of author names in citation and header; some refined person name normalisation Former-commit-id: 677f594f59c9788f28bb516b3e79238989b17589 22 August 2017, 04:08:04 UTC
5e06ac6 review and update citation and header author features, training and models Former-commit-id: 5eacc8739b21365ae5aadb81005908e77fa3966a 21 August 2017, 06:58:09 UTC
c39e5b8 Update and debug citation parser with features dedicated to identifier Former-commit-id: 42c390ab3e04e895781e7fd87f99c3212d1fe713 20 August 2017, 18:36:02 UTC
f7bddb1 Update citation model and features Former-commit-id: 67dd39f08de8527f2cf12abc9f7a70ca16b30d94 19 August 2017, 17:37:29 UTC
b84cc27 Reworked features for citation model, review FastMatcher, add new CERN training data, thanks @iorala ! Former-commit-id: 24ef4e1f0898729ac05120216b98aae5c74ba305 19 August 2017, 00:16:14 UTC
1996e0f Slightly better cleaning Former-commit-id: ff69e5a77d46db8b05a094f2242e3d538c3a7d9b 16 August 2017, 15:55:58 UTC
94dfa10 Rewritten citation parser with clusteror and LayoutToken Former-commit-id: e738777bb5e38066605746557c5ee6e27ff80b01 16 August 2017, 15:26:59 UTC
2dd777d Add collaboration to citation parser results Former-commit-id: ad30b0902aee65b786363c89c4558dfa7a8bdf30 15 August 2017, 21:25:21 UTC
0bf89cb declare labels for citation model, including new label collaboration Former-commit-id: a3545822a242148131e8c3b3d6fdbe7bae7f9fa3 15 August 2017, 21:03:10 UTC
e713efc Update <bibliScope> syntax in documentation Former-commit-id: 139e11b885e3044ae7d8ac8442f14d758eed0fa0 15 August 2017, 15:47:41 UTC
aacda26 Clarifications regarding refs, spaces, lists Former-commit-id: 852ab57010923a73ba320cdb1688c31687f4efc6 15 August 2017, 09:14:52 UTC
25c49b4 some more typos in the training annotation guidelines Former-commit-id: 6ece164be4c3430f2460d40f3de118e674f40cbc 15 August 2017, 03:45:05 UTC
dc45f78 some typos in the training annotation guidelines Former-commit-id: b9113960beeda50110693eaaa7a87ed2d1399abb 15 August 2017, 03:36:51 UTC
dee8294 review training guidelines for segmentation and fulltext models Former-commit-id: 9a5312b433bf4dc780eda37a23c6f1efdac65519 15 August 2017, 03:15:39 UTC
c4a68ee Review documentation for training data for citation model Former-commit-id: ca223f18aacd08fae8f72dc12ff6ed045474f447 15 August 2017, 01:26:49 UTC
5721901 Documentation typos Former-commit-id: 1558d6f3eace5c6159802c626d5b14a313e62cde 14 August 2017, 22:13:17 UTC
b1ccdc9 Documentation on affiliation-address model Former-commit-id: addd8782c8ad13cbeb5c44d0822b317ca3664004 14 August 2017, 21:54:23 UTC
31b4ebc list items need to be inside list elements Former-commit-id: 99705e08404fd77437434bd2c80aeda1f29f0d55 12 August 2017, 16:45:47 UTC
42f2006 list items need to be inside list elements Former-commit-id: e05f023a3b2ffa638160a482f9db9996b368232d 12 August 2017, 16:33:53 UTC
abbb6e8 just a missing space Former-commit-id: 28d3f512e632b643dfd29328cc1a982f5313f781 12 August 2017, 16:28:24 UTC
ed9ffc9 Merge remote-tracking branch 'origin/master' into dropwizard-service Former-commit-id: 9afcd9091e553e8dd4f49a2b2b0a15391b18434b 09 August 2017, 21:34:04 UTC
ddcb316 Merge branch 'master' of https://github.com/kermitt2/grobid Former-commit-id: f0744e66c20e25b89a9427616c810fcdfbb3479a 09 August 2017, 20:58:21 UTC
49d08d3 typos Former-commit-id: 05eeee6a6b275f2ba2b60c05e824bd9d6ba0d1f2 09 August 2017, 20:56:38 UTC
2706018 removing lexicon already in grobid-home submodule Former-commit-id: 68b0013c275affcb8bd8f6f965a42a9266257914 09 August 2017, 20:41:22 UTC
e8f3447 add documentation on general principles for adding training data Former-commit-id: bfa59be85899a0a6bf857ca429071407b87fdc50 09 August 2017, 19:51:30 UTC
8a11a01 Merge branch 'master' of https://github.com/kermitt2/grobid Former-commit-id: 6fabff08ee8ce513a0833862197f2a001aba422d 09 August 2017, 17:41:40 UTC
643e6cd Update text utilities to support entity-fishing, update citation model with unicode normalisation Former-commit-id: c90059844d792daacbb03e97073483065631c982 09 August 2017, 17:41:23 UTC
f20f0b3 Merge pull request #220 from aoboturov/patch-1 Added an editorconfig Former-commit-id: b21d3b2c6e3f1cd028d56df4be467a1d6f74d819 09 August 2017, 16:12:18 UTC
27a26cf Added an editorconfig Former-commit-id: d611011fc2eb60a76ea78c727863205b7ef46741 09 August 2017, 14:39:18 UTC
9f13f02 Removing module model committed by mistake Former-commit-id: 288fe58f19194765d334c78fcbd58f8e8186341c 09 August 2017, 09:39:48 UTC
7a3133c Merge pull request #219 from aoboturov/fix/cleanup Refactored to extract unicode normalization into a normaliseTextAndRe… Former-commit-id: b1b1cd690c0a079db642fc5b04c62ce526b9a315 09 August 2017, 09:07:11 UTC
e21c003 Refactored to extract unicode normalization into a normaliseTextAndRemoveSpaces method. Former-commit-id: e1ba59b9024a1d9e57f1b8dbac68ecfe02e11d8f 09 August 2017, 08:26:06 UTC
dd21b3b typos in annotation guidelines Former-commit-id: 2cc73ae4071a38a02a6434fd0e90c74f1de5a97c 09 August 2017, 02:40:21 UTC
0f3becd adding annotation guidelines for date Former-commit-id: 93a58bdcad18f614dd16c69f54688c1c6e5777bd 09 August 2017, 02:15:55 UTC
1daccc5 structure documentation on training data Former-commit-id: 208896fcbf71b9f09a12f48d6b9d9500f05f9640 09 August 2017, 02:02:55 UTC
54fa78f Merge pull request #217 from jfix/training_doc Training doc Former-commit-id: 465b5535c8772864b0f5cf3ee271767edca73cdb 09 August 2017, 00:47:15 UTC
89bd19d Unicode normalisation (for safety) in training tokens Former-commit-id: 4db7ba8b0fdb25c1b16a96eb0d32d7c91f74d680 09 August 2017, 00:37:50 UTC
30f2b07 Merge branch 'partial-merge-pr-216' Former-commit-id: 4d68e2d1f151a68cc7cd40d5e61ba7028ddfa414 08 August 2017, 19:30:19 UTC
ef050f9 Merge pull request #218 from aoboturov/partial-merge-pr-216 Fix for the loop end conditions in the ReferenceSegmenterParser. Former-commit-id: e795f1848324028bcb2a17dcc33f62755a919ab5 08 August 2017, 13:33:24 UTC
3b484ba Fix for the loop end conditions in the ReferenceSegmenterParser. Former-commit-id: 8a8688447990d1c06ce2665e1b64378740ddd237 08 August 2017, 12:26:08 UTC
2b71e9a forget one file Former-commit-id: 7f0eea314c6eeb257e4d402a81b59180162b8828 08 August 2017, 09:47:46 UTC
back to top