db97eb9 | Vyacheslav Zholudev | 17 January 2015, 15:47:23 UTC | [maven-release-plugin] prepare release grobid-parent-0.3.0 Former-commit-id: 7d6db71a75df6177bedb6c212e34f61f0abfe3b8 | 17 January 2015, 15:47:23 UTC |
1756a0b | Vyacheslav Zholudev | 17 January 2015, 15:05:04 UTC | preparing for release Former-commit-id: 325d0310175c6d199388b090a8de66b25e344db3 | 17 January 2015, 15:05:04 UTC |
e825827 | Vyacheslav Zholudev | 17 January 2015, 14:58:46 UTC | fixing exception re-throwing include cybozu language detection profiles into grobid-home Former-commit-id: 74f6c06127d5ee5129c7dfe7e5fea365afdfb4fc | 17 January 2015, 14:58:46 UTC |
a9a3c66 | kermitt2 | 05 January 2015, 14:09:17 UTC | Error in citation training data. Former-commit-id: de8979fcb08cfa161cd081bc52d1032cfb7f9392 | 05 January 2015, 14:09:17 UTC |
54163c8 | kermitt2 | 02 January 2015, 02:43:46 UTC | Fix some TEI encoding issues for the full text. Former-commit-id: 1b5364f758fec793147f58c76fa44a65736e5e1c | 02 January 2015, 02:43:46 UTC |
e4989bf | kermitt2 | 30 December 2014, 01:38:48 UTC | Merge branch 'master' into segmentation Former-commit-id: 6ad647358ccb4f1c753a726a5e53eec7878bcfe2 | 30 December 2014, 01:38:48 UTC |
d123e62 | kermitt2 | 30 December 2014, 01:35:00 UTC | Two small bugs. Last commit for this branch! Former-commit-id: 18898ee1a6deef7cd0a2bb3b474a51a531d593f8 | 30 December 2014, 01:35:00 UTC |
7d8283b | kermitt2 | 27 December 2014, 18:41:02 UTC | Update of some models Former-commit-id: af34c320e09f6a247cc2c5edc387daa2d4d85ad8 | 27 December 2014, 18:41:02 UTC |
76f40b3 | kermitt2 | 27 December 2014, 03:54:08 UTC | Fix a problem in the evaluation of CRF++ models. Former-commit-id: b0fb1a5b1fe59c518ba582ea988a750db0c7d01d | 27 December 2014, 03:54:08 UTC |
361ebe8 | kermitt2 | 24 December 2014, 17:24:49 UTC | Add some training data for the TermiTH corpus. Former-commit-id: aa5fe212a8301f31731dae0b89dea4a579f3b2c5 | 24 December 2014, 17:24:49 UTC |
b7579e3 | kermitt2 | 24 December 2014, 03:31:59 UTC | Add training data for name sequence in citations Former-commit-id: c9f2d61b08b78fca1bc254f42f6407f474f63585 | 24 December 2014, 03:31:59 UTC |
9bf8a5b | kermitt2 | 24 December 2014, 01:47:57 UTC | Improve training format generation for citation authors. Former-commit-id: 2fc13cae2921f28dc66902919be972c4c2c3f2df | 24 December 2014, 01:47:57 UTC |
f988541 | kermitt2 | 23 December 2014, 16:58:39 UTC | Improve language identification for Header processing. Former-commit-id: 0710e01991f0340100fd904c093d2f57b6169916 | 23 December 2014, 16:58:39 UTC |
7c89d18 | kermitt2 | 23 December 2014, 03:23:09 UTC | Currently do not use the segmentation model when only the header is to be processed. Former-commit-id: 6deedf28e92b0395a0bafe72837a726935550add | 23 December 2014, 03:23:09 UTC |
9778c6a | kermitt2 | 23 December 2014, 01:33:03 UTC | Adapt restricted segmentation for header processing Former-commit-id: 3b5ba53066f19f98e89b066db7a6d5c4235fb7e2 | 23 December 2014, 01:33:03 UTC |
be5defe | kermitt2 | 22 December 2014, 02:04:39 UTC | Fix an error in the training of the segmentation model. Former-commit-id: 071b8107a6aedb1b8aea57768e1cf4c7c7613c78 | 22 December 2014, 02:04:39 UTC |
7bc1297 | kermitt2 | 21 December 2014, 17:19:38 UTC | Improve robustness, use the segmentation model for the header parser by default Former-commit-id: 555cbefec9ea2494d491d620797321e6ee68715d | 21 December 2014, 17:19:38 UTC |
85d6a28 | kermitt2 | 21 December 2014, 17:19:01 UTC | Update the REST service manual Former-commit-id: d88af2fee73c1e4a6e88b49e8137ee780d9ecd12 | 21 December 2014, 17:19:01 UTC |
6de9de8 | kermitt2 | 20 December 2014, 04:19:34 UTC | Forget to remove some garbage ;) Former-commit-id: 4311167151a133ed330f756d2467afa5ee65bbed | 20 December 2014, 04:19:34 UTC |
109a877 | kermitt2 | 20 December 2014, 01:22:13 UTC | Add realignment and robustness to the segmentation parser. Former-commit-id: 03ad9e945387126bad13fff1115a5a21827dfaca | 20 December 2014, 01:22:13 UTC |
87f2184 | kermitt2 | 18 December 2014, 00:51:15 UTC | Create training data for the citation model in combination with the training data for citation We also need to generate the training data for the other models (same as already in createTrainingHeader !) Former-commit-id: 4ef08c9dffa0ba0c786cbc93e9a39de1c1f387dc | 18 December 2014, 00:51:15 UTC |
9d38a75 | kermitt2 | 17 December 2014, 22:25:22 UTC | Add REST service processReferences which gives in TEI all the extracted and parsed references of a PDF file. Former-commit-id: dae4c97d074fde5cecd559820981df4656aeb82d | 17 December 2014, 22:25:22 UTC |
9981efd | kermitt2 | 17 December 2014, 20:23:32 UTC | Add batch processing of the bibliographical sections of a PDF. Former-commit-id: 2c2dce8f486e0ecc7d813ca52c17e1bc908a3607 | 17 December 2014, 20:23:32 UTC |
6126249 | kermitt2 | 17 December 2014, 13:43:40 UTC | Fix TEI serialisation error, improve header processing. Former-commit-id: 48e7a4ce2bfa9fba3c3691e3a0b67607c7e41873 | 17 December 2014, 13:43:40 UTC |
6470d0e | kermitt2 | 13 December 2014, 02:54:47 UTC | Update dependencies. Former-commit-id: 52c93f1c6649c0fd395001fbe6d3faf87c24aee0 | 13 December 2014, 02:54:47 UTC |
e008ea1 | kermitt2 | 11 December 2014, 17:17:20 UTC | Ensure that the vintage mode CRFPP models and library is still well working as well. Former-commit-id: 45a71f2d9e44e05d92fa7ee90c69db6d521bbb24 | 11 December 2014, 17:17:20 UTC |
f106011 | kermitt2 | 10 December 2014, 17:00:28 UTC | Fix cases where the segmenter stops due to rare special sequences of accent/diaresis Former-commit-id: f3d8c34352fc70009132cfe0b44689c75ffdb580 | 10 December 2014, 17:00:28 UTC |
52c83fe | kermitt2 | 10 December 2014, 16:59:39 UTC | Cleaning related to the TEI serialization Former-commit-id: a813ceb5435ebf8185054e71682eee8d784118d4 | 10 December 2014, 16:59:39 UTC |
54cdac1 | kermitt2 | 09 December 2014, 21:20:30 UTC | Update TEI output for biblio references (<biblScope>) Former-commit-id: cdc3a238137ac78b8301ed1babc78b3d5d6ed8fb | 09 December 2014, 21:20:30 UTC |
2de04f7 | kermitt2 | 09 December 2014, 21:20:01 UTC | Add exemples of usage of the services with curl Former-commit-id: ed1cc751f99fb0a545b39f351868c41b0d5df372 | 09 December 2014, 21:20:01 UTC |
f6ad853 | kermitt2 | 09 December 2014, 16:14:56 UTC | Correct type in trainer usage message. Former-commit-id: 9f37263a39182b6d85a57c6e8c662d13a10d535d | 09 December 2014, 16:14:56 UTC |
72e7b84 | kermitt2 | 09 December 2014, 16:14:31 UTC | Update version to pre-release 0.3. Former-commit-id: cddad4ebc281b6b099958ce117199039b0e0d080 | 09 December 2014, 16:14:31 UTC |
2ace76d | kermitt2 | 08 December 2014, 15:47:26 UTC | Avoid reloading of models in grobid-service Former-commit-id: 22b6f841ea4a116528764e63cf4462de2d81f692 | 08 December 2014, 15:47:26 UTC |
e578393 | kermitt2 | 08 December 2014, 12:56:03 UTC | Add robustness for the segmentation model in case of particular PDF accent/diaresis sequences Former-commit-id: 63c99257ddfbf89162fea36d09ec0aec729276f2 | 08 December 2014, 12:56:03 UTC |
4acc7a5 | kermitt2 | 08 December 2014, 01:11:13 UTC | Updating TEI output to reflect P5 changes on <biblScope> Former-commit-id: a405be3eddfa0531937b6c14377845780a1ead66 | 08 December 2014, 01:11:13 UTC |
3e22519 | kermitt2 | 20 October 2014, 02:21:13 UTC | Update of the ant build files for new language detection lib. Former-commit-id: 65f0cca3973753c9b9e77e3fcd381502ffd74fa4 | 20 October 2014, 02:21:13 UTC |
b82ec38 | kermitt2 | 20 October 2014, 01:12:33 UTC | Remove unused class. Former-commit-id: 3090c6e8e2824cf51d6752efb2d1826b7f60b3c6 | 20 October 2014, 01:12:33 UTC |
4dc6fc9 | Vyacheslav Zholudev | 09 October 2014, 09:57:08 UTC | getting rid of lingpipe - cosmetic Former-commit-id: c7d840d12ccb64e6c729ed12f510abdde600074b | 09 October 2014, 09:57:08 UTC |
41f04c7 | Vyacheslav Zholudev | 09 October 2014, 09:55:52 UTC | getting rid of lingpipe Former-commit-id: 5013dfda1862c2b4131bb605e937e0d82a580d44 | 09 October 2014, 09:55:52 UTC |
3ec6a6e | Vyacheslav Zholudev | 09 October 2014, 09:48:07 UTC | Merge branch 'segmentation' of https://github.com/kermitt2/grobid into segmentation Former-commit-id: 6bc76a7cce6ca6d0aacd408c09cb1aeaabab846c | 09 October 2014, 09:48:07 UTC |
bdda257 | kermitt2 | 09 October 2014, 00:52:09 UTC | Change language detector default value for the property test. Former-commit-id: 02819c8b7c108626608422cb0b6340b096df6e26 | 09 October 2014, 00:52:09 UTC |
f6a4456 | Vyacheslav Zholudev | 08 October 2014, 12:17:09 UTC | email sanitization and attachment factored out Former-commit-id: 2b7170d4931a1b735a75326631fce0c793b57046 | 08 October 2014, 12:17:09 UTC |
d7626d0 | Vyacheslav Zholudev | 08 October 2014, 10:03:58 UTC | factored out email assigner Former-commit-id: 68dfce5218418c1c9a1aceaa8f04a6f6f930bdcf | 08 October 2014, 10:03:58 UTC |
37c783d | Vyacheslav Zholudev | 07 October 2014, 12:57:47 UTC | new language detection library and factory (https://code.google.com/p/language-detection/) Former-commit-id: 0ded4382c6d2d63ec4d40dcc91daa50cc281342d | 07 October 2014, 12:57:47 UTC |
54f80ae | kermitt2 | 02 October 2014, 23:55:53 UTC | More training data Former-commit-id: 4190ae6860242999f336d17f350fac0c80198f28 | 02 October 2014, 23:55:53 UTC |
7a56048 | kermitt2 | 02 October 2014, 23:31:02 UTC | Add new gazetteers resources Former-commit-id: aee0297eb0e58b241bb3d1b2aa891312143cd9a0 | 02 October 2014, 23:31:02 UTC |
3e63458 | kermitt2 | 02 October 2014, 22:29:27 UTC | Additional training data for the reference model Former-commit-id: 1fee876f5f65c394f28dc1c7813a328c38bcaadb | 02 October 2014, 22:29:27 UTC |
6d2e4ed | kermitt2 | 02 October 2014, 22:28:50 UTC | Small fixes in the affiliation TEI output Former-commit-id: 71047b1a7ef67a9c156850e7e23b6c28d6ac6791 | 02 October 2014, 22:28:50 UTC |
3e047d5 | kermitt2 | 03 July 2014, 20:03:30 UTC | Additional training data for citations Former-commit-id: 7753438ea49d7964755dc677aa02d81ed6cfbc52 | 03 July 2014, 20:03:30 UTC |
8e4f15b | kermitt2 | 02 July 2014, 00:00:44 UTC | Add some robustness for Issue #16 Former-commit-id: 73cc824d580adf12d9553236a9750fb843058643 | 02 July 2014, 00:00:44 UTC |
bbd1c39 | kermitt2 | 01 July 2014, 23:58:22 UTC | Add some robustness for issue #16 Former-commit-id: 066159d5b9a8abed14f8a652cd628c1ee81bdccf | 01 July 2014, 23:58:22 UTC |
43419fd | kermitt2 | 01 July 2014, 04:37:04 UTC | Modifications in PDF text extraction for issue #16 Former-commit-id: f3346e91f07aa11c06a31e30969bef945d687a26 | 01 July 2014, 04:37:04 UTC |
da22033 | kermitt2 | 01 July 2014, 03:40:39 UTC | Modifications in PDF text extraction for issue #16 Former-commit-id: a28c2a7a7f6b17add8c698c6c15d9d2bb380039e | 01 July 2014, 03:40:39 UTC |
ffbcf95 | kermitt2 | 26 June 2014, 18:25:07 UTC | Add city name matcher Former-commit-id: a655e6d0509b0638690787f0ede66ba29b4fa1d5 | 26 June 2014, 18:25:07 UTC |
4ebff49 | kermitt2 | 26 June 2014, 18:24:53 UTC | Making FastMatcher a little bit more modular Former-commit-id: 0298d7503ff5af66581ef74fcd37b846307504e6 | 26 June 2014, 18:24:53 UTC |
90781cd | kermitt2 | 24 June 2014, 02:02:13 UTC | Update model name Former-commit-id: b4f9269412a8eaad1a71f35e4a206ad41ea46a70 | 24 June 2014, 02:02:13 UTC |
6bf063b | kermitt2 | 19 June 2014, 21:00:59 UTC | Fix issues with empty result from one model to the other. Former-commit-id: dd1d65b4c0ae1fde175d6c38bb037d651c8ad9f8 | 19 June 2014, 21:00:59 UTC |
528f204 | kermitt2 | 19 June 2014, 16:59:25 UTC | Propagate fix for issue #31 and ensure that batch process closes the resources only at the end of the batch Former-commit-id: 4c0adccd1b92dc63f73f84cfb2d96d621870a275 | 19 June 2014, 16:59:25 UTC |
0f7b0f5 | kermitt2 | 19 June 2014, 00:55:48 UTC | Fix issue #31 and small header model for GitHub rule on large files. Former-commit-id: 014ebba07f2b0455a7dd633ca46199d2c4d199a9 | 19 June 2014, 00:55:48 UTC |
7fe9d8d | kermitt2 | 17 June 2014, 15:44:33 UTC | TEI parser for the OpenEdition format Former-commit-id: e96d5c347f9189b0733151301c7c0ff441488259 | 17 June 2014, 15:44:33 UTC |
7a75cc2 | kermitt2 | 16 June 2014, 09:42:57 UTC | Partial citation model to fit the 100MB max rule of Github. Former-commit-id: b8bc77817bd6677bc4e07d2cc7331922846d2456 | 16 June 2014, 09:42:57 UTC |
1b147de | kermitt2 | 15 June 2014, 11:19:11 UTC | Additional training data. Former-commit-id: 8ce34b678dd051f7f7089f85821110783f8cfc29 | 15 June 2014, 11:19:11 UTC |
3fbae2d | kermitt2 | 15 June 2014, 11:17:30 UTC | New training data and add local libraries. Former-commit-id: 10b412ea941bdc191c6297f38cd429ffb6190521 | 15 June 2014, 11:17:30 UTC |
62135f8 | kermitt2 | 02 May 2014, 16:11:21 UTC | Fix an issue with "issue" field ;) Former-commit-id: 5b8ce1fd50a2de07658934a1f02e6ea269fa42a9 | 02 May 2014, 16:11:21 UTC |
3843094 | Vyacheslav Zholudev | 30 April 2014, 15:49:13 UTC | fixing bugs with empty strings Former-commit-id: 92b6aaa3f253523c8867262ad49fdf939aa8547c | 30 April 2014, 15:49:13 UTC |
ce98d2c | Vyacheslav Zholudev | 30 April 2014, 09:23:31 UTC | lingpipe in local maven + getting rid of the lingpipe maven repo in pom.xml Former-commit-id: ea89df16e45dec9a5f2b2a570543b29562e7dffc | 30 April 2014, 09:23:31 UTC |
cce9dff | Vyacheslav Zholudev | 30 April 2014, 09:17:48 UTC | lingpipe in local maven + getting rid of the lingpipe maven repo in pom.xml Former-commit-id: 5f30e2401ea50532e94c61a6fa8729cae456908c | 30 April 2014, 09:17:48 UTC |
793c850 | Vyacheslav Zholudev | 30 April 2014, 08:27:30 UTC | Merge branch 'segmentation' of https://github.com/kermitt2/grobid into segmentation Former-commit-id: 38b3ca9ba765345a09a9608890baedc15d8d7594 | 30 April 2014, 08:27:30 UTC |
44182fd | Vyacheslav Zholudev | 30 April 2014, 08:26:54 UTC | adapting to java 6 + not loading native libs if they not needed Former-commit-id: b32264fb5fb9380c5ca68ae176aecca06a30de23 | 30 April 2014, 08:26:54 UTC |
4002e83 | kermitt2 | 29 April 2014, 23:16:01 UTC | Update build.xml files and jar for ant. Former-commit-id: 604a6f64efddd7829d9aafec9c766654da4de46d | 29 April 2014, 23:16:01 UTC |
4dccfdf | Vyacheslav Zholudev | 29 April 2014, 13:41:33 UTC | updating grobid-core version Former-commit-id: 4f718a6c395dda3c8b650c50918c94839be18b92 | 29 April 2014, 13:41:33 UTC |
3eff280 | Vyacheslav Zholudev | 29 April 2014, 13:23:31 UTC | switching code to java 6 again Former-commit-id: 04c78df234559cb403d04a3bd5dd2bdaa9074c7b | 29 April 2014, 13:23:31 UTC |
b04f1c9 | kermitt2 | 29 April 2014, 13:09:32 UTC | Review of CrossRef consolidation. Avoid reloading the models in grobid service with Wapiti (to be further tested and adapted). Former-commit-id: a27878c461ce1bb6b1caa2ab4367809a64c61fc7 | 29 April 2014, 13:09:32 UTC |
2291741 | kermitt2 | 23 April 2014, 21:15:33 UTC | New training data, small fixes and cleaning. Former-commit-id: 5de31b88420475661393a725a7dc907ca21eb000 | 23 April 2014, 21:15:33 UTC |
07f6f26 | kermitt2 | 22 April 2014, 06:32:15 UTC | Improve the fulltext and the reference segmenter models. Former-commit-id: af16378881a0efb9175ca40a36e501e2a7286e0e | 22 April 2014, 06:32:15 UTC |
4dbb81d | Vyacheslav Zholudev | 15 April 2014, 15:00:34 UTC | propagating EngineParsers to other parsers that need other models Former-commit-id: ee3e8bb5e0254a1df80f0d1da113a28ebd13ce40 | 15 April 2014, 15:00:34 UTC |
27a2c43 | Vyacheslav Zholudev | 15 April 2014, 13:03:47 UTC | factoring our parsers from Engine (to make it more robust with nulls and lazy initialization) Former-commit-id: 56b57ab860930d5f3250814b86a6cab0368bec31 | 15 April 2014, 13:03:47 UTC |
c864417 | Vyacheslav Zholudev | 15 April 2014, 12:26:30 UTC | Merge branch 'segmentation' of https://github.com/kermitt2/grobid into segmentation Former-commit-id: e569fcf592b7f871d218f66d1d3496118b4e06fc | 15 April 2014, 12:26:30 UTC |
5848592 | Vyacheslav Zholudev | 15 April 2014, 12:24:52 UTC | initial version of reference segmenter Former-commit-id: 3d764cc07209cc43dd3e19fc60695cad14e51993 | 15 April 2014, 12:24:52 UTC |
db2d1dc | kermitt2 | 14 April 2014, 17:05:58 UTC | Starting the integration of the segmentation model in the full text processing. Former-commit-id: 8adde8a93e4223b7bae42c5ae6c0e948efbd9776 | 14 April 2014, 17:05:58 UTC |
7dcb974 | kermitt2 | 08 April 2014, 02:08:26 UTC | Use of the segmentation model in the fulltext parser, and lots of cleaning. Former-commit-id: bb874d856d9508088d7167dfa5aedef43edf5cca | 08 April 2014, 02:08:26 UTC |
2af61b6 | kermitt2 | 07 April 2014, 01:04:20 UTC | Add a header parser method using the segmentation model for extracting the header section. Former-commit-id: a53c938f77457ca31dd9a1a89651b9231a35cacb | 07 April 2014, 01:04:20 UTC |
05511a3 | kermitt2 | 05 April 2014, 22:33:38 UTC | Adaptation of patent processing features for Wapiti. Former-commit-id: 1575ed64059d8bfe03d56aa261694de50f4ae474 | 05 April 2014, 22:33:38 UTC |
4fc24f8 | kermitt2 | 05 April 2014, 20:35:24 UTC | Update patent models and patent trainer. Former-commit-id: 96ba0f4fce814b625cea3d65d4c343ce19fcea89 | 05 April 2014, 20:35:24 UTC |
8929a90 | Vyacheslav Zholudev | 03 April 2014, 13:41:49 UTC | cosmetic Former-commit-id: 000606a061e14e099ee39334080727c1b84f3cb3 | 03 April 2014, 13:41:49 UTC |
d5a8fae | Vyacheslav Zholudev | 03 April 2014, 13:23:35 UTC | First version of addressing tokens in a document Former-commit-id: 5b0272d3677c1c415a1b00ba2769a894179ac978 | 03 April 2014, 13:23:35 UTC |
f3fa2b9 | Vyacheslav Zholudev | 01 April 2014, 16:42:47 UTC | polishing code Former-commit-id: 1f80d3491995e1ff1e020dd0ae3dfa9c565abd73 | 01 April 2014, 16:42:47 UTC |
571bdae | Vyacheslav Zholudev | 01 April 2014, 16:07:36 UTC | polishing code Former-commit-id: 36bb0ce39cbd110704dc411359a2d76e71333cf1 | 01 April 2014, 16:07:36 UTC |
2a3f021 | Vyacheslav Zholudev | 01 April 2014, 15:51:44 UTC | minor refactorings Former-commit-id: 7cc2146d3045e5ea5aeeca5189e5f21c31577463 | 01 April 2014, 15:51:44 UTC |
2753523 | Vyacheslav Zholudev | 01 April 2014, 13:12:30 UTC | making private fields in Document. Switching from ArrayLists to Lists Former-commit-id: dc226086dfff53199a287db8bed52c63bba2ee29 | 01 April 2014, 13:12:30 UTC |
94fd19f | Vyacheslav Zholudev | 01 April 2014, 12:05:22 UTC | refactoring to make model stateless Former-commit-id: 6a0ff58e223d85e8d75d396cd7d8b33804bdbf26 | 01 April 2014, 12:05:22 UTC |
a148981 | Vyacheslav Zholudev | 01 April 2014, 09:37:09 UTC | date model for crfpp Former-commit-id: 7f92b8fa05592fedf0dd1f66d5ee78525bb0e3c1 | 01 April 2014, 09:37:09 UTC |
9702a64 | Vyacheslav Zholudev | 31 March 2014, 17:30:34 UTC | grobi property for wapiti crfpp engine Former-commit-id: fc8862477db656f8ea57fd5abccfad47f7ac4c89 | 31 March 2014, 17:30:34 UTC |
b988931 | Vyacheslav Zholudev | 31 March 2014, 15:17:17 UTC | Merge branch 'wapiti' of https://github.com/kermitt2/grobid into wapiti Former-commit-id: edae23d4c115edb696d4b3df09265aaf1e585d0a | 31 March 2014, 15:17:17 UTC |
7887fae | Vyacheslav Zholudev | 31 March 2014, 15:15:48 UTC | minor Former-commit-id: a8100d8d05d8fdfa6ebd9e73858598bf5fbdcf02 | 31 March 2014, 15:15:48 UTC |
79bf76d | Vyacheslav Zholudev | 31 March 2014, 14:56:36 UTC | making document usage thread-safe Former-commit-id: 95fed02f7347e8318e93e1214d2633b16bd31643 | 31 March 2014, 14:56:36 UTC |
cd36062 | kermitt2 | 31 March 2014, 14:26:03 UTC | Improvement of the segmentation model and more training data. Former-commit-id: 468e22d3f4d619ff421537e1f4026b44ad3780c2 | 31 March 2014, 14:26:03 UTC |
23daf85 | Vyacheslav Zholudev | 31 March 2014, 13:50:21 UTC | some comparison numbers of wapiti and crfpp Former-commit-id: aac7ba72e1b8d29e78e3b8c6c1c6f6e66ea036a3 | 31 March 2014, 13:50:21 UTC |
dad3f68 | Vyacheslav Zholudev | 31 March 2014, 13:48:01 UTC | vz training data file for citations Former-commit-id: f64b5c44b886d92b2797a8fa1e41a175e30aeac9 | 31 March 2014, 13:48:01 UTC |
78d0ea9 | kermitt2 | 27 March 2014, 17:20:23 UTC | Correct the number of instances issue. Error with the StringTokenizer applied on the CRF labeled result fixed. Former-commit-id: e71947d363066d7ba83246ac01ab4817847174ec | 27 March 2014, 17:20:23 UTC |