f8844f4 | lopez | 05 July 2022, 22:06:20 UTC | fix tests | 05 July 2022, 22:06:20 UTC |
1b9f89d | lopez | 05 July 2022, 21:35:05 UTC | cleaning | 05 July 2022, 21:35:05 UTC |
16addcd | lopez | 30 June 2022, 15:05:54 UTC | add lemmatizer models, add tests | 30 June 2022, 15:05:54 UTC |
44b2f13 | lopez | 28 June 2022, 20:15:25 UTC | start adding spark nlp tokenizers | 28 June 2022, 20:15:25 UTC |
4a991a9 | lopez | 27 June 2022, 16:46:01 UTC | add yet another console screen shot | 27 June 2022, 16:46:01 UTC |
e8556cc | lopez | 24 June 2022, 23:20:07 UTC | fix provided chunks with just offset | 24 June 2022, 23:20:07 UTC |
df9b1df | lopez | 23 June 2022, 10:55:14 UTC | add link to lmdn pt and fa | 23 June 2022, 10:55:14 UTC |
4f4535b | lopez | 22 June 2022, 13:56:12 UTC | add demo badge | 22 June 2022, 13:56:12 UTC |
d01a774 | lopez | 22 June 2022, 13:55:00 UTC | doc typos | 22 June 2022, 13:55:00 UTC |
6ed7eb6 | lopez | 22 June 2022, 13:40:46 UTC | update rest api response for latest version | 22 June 2022, 13:40:46 UTC |
d390a79 | lopez | 22 June 2022, 13:40:27 UTC | fix default confidence score for fixed language | 22 June 2022, 13:40:27 UTC |
6240e3e | lopez | 22 June 2022, 12:37:36 UTC | fisnih update of runtimes with latest version | 22 June 2022, 12:37:36 UTC |
1c9c10d | lopez | 22 June 2022, 12:26:34 UTC | update runtime with latest version | 22 June 2022, 12:26:34 UTC |
96183b5 | lopez | 22 June 2022, 10:43:26 UTC | update supported languages | 22 June 2022, 10:43:26 UTC |
ec12bbb | lopez | 21 June 2022, 12:03:14 UTC | cleaning | 21 June 2022, 12:03:14 UTC |
d5c02cb | lopez | 21 June 2022, 11:47:02 UTC | fix #144 and update Sweble version | 21 June 2022, 11:47:02 UTC |
4f80308 | lopez | 20 June 2022, 13:38:10 UTC | load pt and fa lower knowledge bases | 20 June 2022, 13:38:10 UTC |
c7399ec | lopez | 20 June 2022, 13:37:24 UTC | add more pt fa resources and models | 20 June 2022, 13:37:24 UTC |
6f4f3fb | lopez | 19 June 2022, 23:18:48 UTC | add test for pt article | 19 June 2022, 23:18:48 UTC |
fbb5f01 | lopez | 18 June 2022, 05:47:13 UTC | additional mediawiki parser config | 18 June 2022, 05:47:13 UTC |
7af44ce | lopez | 18 June 2022, 05:13:38 UTC | add portuguese and farsi resources | 18 June 2022, 05:13:38 UTC |
f0869c1 | lopez | 12 May 2022, 12:25:26 UTC | update for newly supported languages | 12 May 2022, 12:25:26 UTC |
05e11bf | lopez | 10 May 2022, 18:40:16 UTC | update query parameters | 10 May 2022, 18:40:16 UTC |
de164be | lopez | 10 May 2022, 17:33:08 UTC | default maxTermFrequency override in query | 10 May 2022, 17:33:08 UTC |
464e9d5 | lopez | 09 May 2022, 17:37:15 UTC | update French model | 09 May 2022, 17:37:15 UTC |
8dd9151 | lopez | 07 May 2022, 15:43:37 UTC | review frequency threshold | 07 May 2022, 15:43:37 UTC |
2c6d4b8 | lopez | 06 May 2022, 21:40:29 UTC | tune max frequency threshold for French | 06 May 2022, 21:40:29 UTC |
b04e906 | Patrice Lopez | 06 May 2022, 10:15:17 UTC | Merge pull request #142 from kermitt2/0.0.5 Add word frequency db and related fix | 06 May 2022, 10:15:17 UTC |
0ce05c2 | lopez | 06 May 2022, 10:04:51 UTC | add tests | 06 May 2022, 10:04:51 UTC |
265ff83 | lopez | 05 May 2022, 21:34:36 UTC | counting concepts without wikipedia pages | 05 May 2022, 21:34:36 UTC |
25ca2ca | lopez | 04 May 2022, 20:37:55 UTC | add statement variant loading | 04 May 2022, 20:37:55 UTC |
45b1019 | lopez | 04 May 2022, 20:37:29 UTC | update lang config | 04 May 2022, 20:37:29 UTC |
3b45f20 | lopez | 03 May 2022, 23:27:30 UTC | update French models | 03 May 2022, 23:27:30 UTC |
e3a8833 | lopez | 03 May 2022, 19:54:31 UTC | tune language config | 03 May 2022, 19:54:31 UTC |
e6615de | lopez | 03 May 2022, 19:06:43 UTC | fix mention extraction in ja/zh | 03 May 2022, 19:06:43 UTC |
a67df44 | lopez | 03 May 2022, 15:53:43 UTC | add term frequency option and associated control mechanism | 03 May 2022, 15:53:43 UTC |
092451a | lopez | 03 May 2022, 15:52:26 UTC | review statement, label and concept lmdb loading | 03 May 2022, 15:52:26 UTC |
672d461 | lopez | 03 May 2022, 15:51:22 UTC | ru config | 03 May 2022, 15:51:22 UTC |
6aa2ecc | lopez | 03 May 2022, 15:49:00 UTC | add basic tests for jz zh | 03 May 2022, 15:49:00 UTC |
35fe86c | lopez | 30 April 2022, 17:46:01 UTC | missing json encoding | 30 April 2022, 17:46:01 UTC |
d057ba6 | lopez | 30 April 2022, 16:14:47 UTC | add traininf and models for zh, ja, ru | 30 April 2022, 16:14:47 UTC |
85be1ed | lopez | 29 April 2022, 11:32:08 UTC | add word frequency db; extend supported languages | 29 April 2022, 11:32:08 UTC |
2e87363 | lopez | 24 April 2022, 21:48:20 UTC | add missing configs and test article in zh | 24 April 2022, 21:48:20 UTC |
5f9e7a8 | lopez | 24 April 2022, 14:49:12 UTC | update grobid version | 24 April 2022, 14:49:12 UTC |
c54312a | lopez | 23 April 2022, 23:56:53 UTC | selected stopwords for extra languages | 23 April 2022, 23:56:53 UTC |
4d0b70f | lopez | 23 April 2022, 21:38:59 UTC | add language support | 23 April 2022, 21:38:59 UTC |
a30202b | lopez | 23 April 2022, 21:36:18 UTC | add ru media wiki parser config | 23 April 2022, 21:36:18 UTC |
b999b41 | lopez | 23 April 2022, 19:56:48 UTC | finalize ja media wiki parser config | 23 April 2022, 19:56:48 UTC |
64054fe | lopez | 23 April 2022, 19:25:53 UTC | finalize zh media wiki parser config | 23 April 2022, 19:25:53 UTC |
40e671d | lopez | 11 April 2022, 09:11:37 UTC | Merge branch 'master' of github.com:kermitt2/entity-fishing | 11 April 2022, 09:11:37 UTC |
220fa10 | lopez | 11 April 2022, 09:11:23 UTC | update to gradle 7 | 11 April 2022, 09:11:23 UTC |
8f68fa6 | Patrice Lopez | 07 April 2022, 07:05:02 UTC | adjust default selector threshold for Arabic | 07 April 2022, 07:05:02 UTC |
4363777 | lopez | 04 March 2022, 18:02:32 UTC | review and update entity embeddings creation | 04 March 2022, 18:02:32 UTC |
544ff17 | lopez | 03 March 2022, 15:08:40 UTC | update resource url to new kb version | 03 March 2022, 15:08:40 UTC |
cf508a2 | lopez | 03 March 2022, 09:43:20 UTC | support images in console demo for new languages | 03 March 2022, 09:43:20 UTC |
b954901 | lopez | 17 February 2022, 12:14:46 UTC | load Arabic lower KB; support for queries in Arabic | 17 February 2022, 12:14:46 UTC |
1030ac0 | lopez | 15 February 2022, 07:30:28 UTC | add config for Arabic | 15 February 2022, 07:30:28 UTC |
fd877d7 | lopez | 14 February 2022, 14:48:55 UTC | update doc formatting | 14 February 2022, 14:48:55 UTC |
1a1f29c | Patrice Lopez | 14 February 2022, 14:42:00 UTC | Merge pull request #140 from kermitt2/long-text Segment long text automatically and manage sliding context | 14 February 2022, 14:42:00 UTC |
f846754 | lopez | 14 February 2022, 14:40:58 UTC | update version | 14 February 2022, 14:40:58 UTC |
b64dcd2 | lopez | 14 February 2022, 07:27:10 UTC | segmentation of pathologically long sentences; update tests | 14 February 2022, 07:27:10 UTC |
ca7a797 | lopez | 14 February 2022, 07:25:47 UTC | add Arabic media wiki page parser | 14 February 2022, 07:25:47 UTC |
c8778a8 | lopez | 09 February 2022, 12:59:36 UTC | add full document propagation (for long text); update doc | 09 February 2022, 12:59:36 UTC |
05e5c42 | lopez | 09 February 2022, 05:38:19 UTC | add configurable parameters in the query | 09 February 2022, 05:38:19 UTC |
a42b625 | lopez | 08 February 2022, 05:50:07 UTC | fix wrong query | 08 February 2022, 05:50:07 UTC |
cb532c2 | lopez | 07 February 2022, 08:15:12 UTC | update gradle dependencies | 07 February 2022, 08:15:12 UTC |
eff973c | lopez | 07 February 2022, 06:42:30 UTC | fix sentence offsets when segmenting long text | 07 February 2022, 06:42:30 UTC |
f228767 | lopez | 06 February 2022, 16:18:07 UTC | add server-side management of long text, with segmentation and sliding context | 06 February 2022, 16:18:07 UTC |
7a9b87c | lopez | 06 February 2022, 16:16:58 UTC | some precision in the documentation on target disambiguation | 06 February 2022, 16:16:58 UTC |
192d1db | Patrice Lopez | 04 February 2022, 10:26:45 UTC | Merge pull request #130 from kermitt2/dependabot/maven/org.apache.httpcomponents-httpclient-4.5.13 Bump httpclient from 4.5.3 to 4.5.13 | 04 February 2022, 10:26:45 UTC |
81f555c | Patrice Lopez | 04 February 2022, 10:26:34 UTC | Merge pull request #112 from kermitt2/dependabot/maven/junit-junit-4.13.1 Bump junit from 4.12 to 4.13.1 | 04 February 2022, 10:26:34 UTC |
3580bd4 | Patrice Lopez | 04 February 2022, 10:26:17 UTC | Merge pull request #119 from kermitt2/dependabot/maven/com.google.guava-guava-29.0-jre Bump guava from 25.1-jre to 29.0-jre | 04 February 2022, 10:26:17 UTC |
582cb22 | Patrice Lopez | 04 February 2022, 10:25:44 UTC | Merge pull request #122 from kermitt2/dependabot/maven/commons-io-commons-io-2.7 Bump commons-io from 2.5 to 2.7 | 04 February 2022, 10:25:44 UTC |
568ad1a | Patrice Lopez | 04 February 2022, 10:25:21 UTC | Merge pull request #137 from kermitt2/dependabot/maven/com.fasterxml.jackson.core-jackson-databind-2.9.10.8 Bump jackson-databind from 2.9.10.1 to 2.9.10.8 | 04 February 2022, 10:25:21 UTC |
72fd4e3 | Patrice Lopez | 04 February 2022, 10:24:49 UTC | Merge pull request #138 from kermitt2/dependabot/maven/com.thoughtworks.xstream-xstream-1.4.19 Bump xstream from 1.4.11 to 1.4.19 | 04 February 2022, 10:24:49 UTC |
b48dd7e | dependabot[bot] | 01 February 2022, 00:56:59 UTC | Bump xstream from 1.4.11 to 1.4.19 Bumps [xstream](https://github.com/x-stream/xstream) from 1.4.11 to 1.4.19. - [Release notes](https://github.com/x-stream/xstream/releases) - [Commits](https://github.com/x-stream/xstream/commits) --- updated-dependencies: - dependency-name: com.thoughtworks.xstream:xstream dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> | 01 February 2022, 00:56:59 UTC |
3d70d80 | dependabot[bot] | 09 December 2021, 20:00:18 UTC | Bump jackson-databind from 2.9.10.1 to 2.9.10.8 Bumps [jackson-databind](https://github.com/FasterXML/jackson) from 2.9.10.1 to 2.9.10.8. - [Release notes](https://github.com/FasterXML/jackson/releases) - [Commits](https://github.com/FasterXML/jackson/commits) --- updated-dependencies: - dependency-name: com.fasterxml.jackson.core:jackson-databind dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> | 09 December 2021, 20:00:18 UTC |
bd6db6d | lopez | 27 September 2021, 14:18:37 UTC | missing update to grobid 0.7.0 | 27 September 2021, 14:18:37 UTC |
beb1f50 | lopez | 10 July 2021, 11:33:39 UTC | rephrase readme | 10 July 2021, 11:33:39 UTC |
eb1a704 | lopez | 10 July 2021, 11:29:11 UTC | typo | 10 July 2021, 11:29:11 UTC |
24e5bba | lopez | 10 July 2021, 11:13:53 UTC | add some usage examples | 10 July 2021, 11:13:53 UTC |
c428bfe | lopez | 13 June 2021, 16:29:28 UTC | update to latest grobid and grobid ner; review init of grobid-ner; update doc | 13 June 2021, 16:29:28 UTC |
04ec196 | lopez | 09 June 2021, 21:32:30 UTC | discard jcenter repo, clarify current grobid version to use 0.6.2 | 09 June 2021, 21:32:30 UTC |
b0246a8 | dependabot[bot] | 04 June 2021, 00:53:05 UTC | Bump httpclient from 4.5.3 to 4.5.13 Bumps httpclient from 4.5.3 to 4.5.13. --- updated-dependencies: - dependency-name: org.apache.httpcomponents:httpclient dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> | 04 June 2021, 00:53:05 UTC |
4969330 | lopez | 05 May 2021, 01:09:40 UTC | types | 05 May 2021, 01:09:40 UTC |
bcb9c0c | lopez | 05 May 2021, 01:07:00 UTC | Merge branch 'master' of https://github.com/kermitt2/nerd | 05 May 2021, 01:07:00 UTC |
2917424 | lopez | 05 May 2021, 01:06:36 UTC | add 2017 scores in the readme | 05 May 2021, 01:06:36 UTC |
c335d17 | dependabot[bot] | 26 April 2021, 16:45:31 UTC | Bump commons-io from 2.5 to 2.7 Bumps commons-io from 2.5 to 2.7. Signed-off-by: dependabot[bot] <support@github.com> | 26 April 2021, 16:45:31 UTC |
f2f0adb | Patrice Lopez | 14 April 2021, 16:33:22 UTC | Merge pull request #121 from Slashdacoda/master Update filenames and wikipipadia.xml @docName | 14 April 2021, 16:33:22 UTC |
32d3c94 | Sascha Schüller | 14 April 2021, 09:16:25 UTC | Merge pull request #1 from Slashdacoda/fix-path-issue-with-invalid-files update filenames and wikipipadia.xml @docName | 14 April 2021, 09:16:25 UTC |
dbc8af8 | schuellersa | 14 April 2021, 09:10:39 UTC | https://github.com/kermitt2/entity-fishing/issues/120 -> update @docName to fix filename and make useable in Windows enviroment | 14 April 2021, 09:10:39 UTC |
2a1f1cf | schuellersa | 13 April 2021, 13:57:32 UTC | https://github.com/kermitt2/entity-fishing/issues/120 -> fix filename to make usebla in Windows enviroment | 13 April 2021, 13:57:32 UTC |
2ca0603 | Sascha Schüller | 13 April 2021, 13:40:22 UTC | Delete Chippenham_United_F.C. https://github.com/kermitt2/entity-fishing/issues/120 -> try to fix | 13 April 2021, 13:40:22 UTC |
d9d97bd | Sascha Schüller | 13 April 2021, 13:34:49 UTC | Delete Alfred_Conkling_Coxe,_Sr. https://github.com/kermitt2/entity-fishing/issues/120 -> try to fix | 13 April 2021, 13:34:49 UTC |
e8b0a86 | dependabot[bot] | 31 March 2021, 19:14:56 UTC | Bump guava from 25.1-jre to 29.0-jre Bumps [guava](https://github.com/google/guava) from 25.1-jre to 29.0-jre. - [Release notes](https://github.com/google/guava/releases) - [Commits](https://github.com/google/guava/commits) Signed-off-by: dependabot[bot] <support@github.com> | 31 March 2021, 19:14:56 UTC |
6364840 | lopez | 22 March 2021, 02:51:44 UTC | update to GROBID version 0.6.2 | 22 March 2021, 02:51:44 UTC |
a6148a1 | dependabot[bot] | 12 October 2020, 22:06:54 UTC | Bump junit from 4.12 to 4.13.1 Bumps [junit](https://github.com/junit-team/junit4) from 4.12 to 4.13.1. - [Release notes](https://github.com/junit-team/junit4/releases) - [Changelog](https://github.com/junit-team/junit4/blob/main/doc/ReleaseNotes4.12.md) - [Commits](https://github.com/junit-team/junit4/compare/r4.12...r4.13.1) Signed-off-by: dependabot[bot] <support@github.com> | 12 October 2020, 22:06:54 UTC |
af02c40 | lopez | 25 September 2020, 03:07:13 UTC | clarification on training models | 25 September 2020, 03:07:13 UTC |
6c17912 | lopez | 12 August 2020, 21:37:31 UTC | Update to grobid 0.6.1 | 12 August 2020, 21:37:31 UTC |
c861f16 | lopez | 05 July 2020, 18:12:33 UTC | add activation for jdk > 1.8 | 05 July 2020, 18:12:33 UTC |