https://github.com/thunlp/THULAC-Python

sort by:
Revision Author Date Message Commit Date
3f1f126 Update Postprocesser.py add python3 encoding 22 June 2018, 02:59:37 UTC
b238077 Merge pull request #61 from lsvih/patch-1 Default value of `rm_space` is False 22 June 2018, 02:56:07 UTC
6187da1 Update README.md change icwb2 path 25 February 2018, 15:01:14 UTC
35bf0c6 Update README.md 19 January 2018, 03:57:11 UTC
0a0fb63 Update SoExtention.py fix fast_cut decode("utf-8") error 16 January 2018, 15:36:55 UTC
e358c8c fix the maxlength’s bug 25 December 2017, 02:29:44 UTC
b30bc59 Merge pull request #56 from linhx13/master Fix __fast_cutline return value for postag. 16 December 2017, 16:33:57 UTC
f8e3ec1 Fix __fast_cutline return value for postag. 16 December 2017, 11:35:43 UTC
0003a5f Create LICENSE 07 December 2017, 12:26:55 UTC
ceb4aa2 Update README.md Add register.doc 11 November 2017, 14:25:41 UTC
1102b95 重选了语气词,避免一些不应该分开的词语被分开 05 November 2017, 00:21:16 UTC
48443ef Update README.md update model_link to new link 21 August 2017, 11:56:19 UTC
548cad5 修改readme中cut函数注解 30 June 2017, 15:21:02 UTC
b22fb70 合并一部分同功能函数 09 June 2017, 13:12:53 UTC
d922e36 将语气助词预处理为S标签 09 June 2017, 12:16:12 UTC
957a872 不再重复按’_’字符切分文本 28 May 2017, 02:52:34 UTC
0eabdbf 解决.so文件重复加载的问题 22 May 2017, 05:17:55 UTC
0dc8ffd 解决使用fast方法出现core dumped 问题 10 May 2017, 08:06:38 UTC
de88d3d 将cut_f的读取从读取整个文件改为按行读取 09 May 2017, 03:49:22 UTC
61375c1 增加rm_space参数;尝试添加多线程 06 April 2017, 08:43:09 UTC
fc385de 给程序入口添加了些注释 29 March 2017, 11:55:35 UTC
5b28da6 Merge pull request #20 from xpgeng/master Modify the format of README 26 March 2017, 12:26:11 UTC
cf8f557 modify the format of README 26 March 2017, 12:21:23 UTC
314c1fb 修复python3的兼容性问题,修复对空格分词报错的问题 15 March 2017, 13:08:40 UTC
2da9383 解决T2S无法使用的问题 13 March 2017, 13:55:11 UTC
8fe1c72 添加测试文件,修改一些细微bug 13 March 2017, 08:51:58 UTC
e9c5d11 Update README.md 修正ReadMe中的链接 10 March 2017, 06:35:21 UTC
8092cc1 Update README.md 修改ReadMe中的链接问题 10 March 2017, 06:32:51 UTC
cc32705 Merge branch 'develop' 10 March 2017, 06:26:37 UTC
b28d163 去除当前系统判断 10 March 2017, 06:14:50 UTC
f01a23b 修改readme,补传SoExtention.py 10 March 2017, 06:10:46 UTC
7ef2188 Merge pull request #15 from Kipsora/master A bug that will produce exception of 'tuple index out of range' when … 09 March 2017, 15:35:58 UTC
2592c91 A bug that will produce exception of 'tuple index out of range' when processing long passage fixed. 09 March 2017, 14:43:59 UTC
9a02efa add “__” as prefix of private variables and functions in __init__.py 06 March 2017, 03:54:26 UTC
1de07af add THULAC.so into python version, add fast_cut, fast_cut_f method 06 March 2017, 03:31:58 UTC
a0cb6fe 修改书名长度只有1的时候的bug 14 February 2017, 14:24:56 UTC
6a7ee99 修复由于python数组长度在循环判断中不改变导致的部分标点符号分词失败问题 14 February 2017, 08:21:41 UTC
64d5917 恢复-deli 参数 19 January 2017, 13:36:48 UTC
1307d93 修改一些格式问题 17 January 2017, 09:13:58 UTC
75062d5 修改ReadMe的一些格式错误 17 January 2017, 09:12:11 UTC
30b9d33 1. 在pip上发布THULAC python版本,重新整理并定义了接口,方便用户使用; 2. 增加用户自定义字典功能; 3. 增加时间词,标点符号的后处理; 4. 与其他语言版本同步。 17 January 2017, 09:09:34 UTC
44bd790 Update README.md 修改历史的显示错误 12 January 2017, 06:18:10 UTC
6c13ece 1、增加了自动分句功能,如果一段话(读入的一行)超过了50000字,会按照常见句子结束符(。!?:!?: 7种)作为分隔符进行切分 2、现在在分词的时候,中文和英文不会被分到一起,会严格分开 11 January 2017, 08:17:29 UTC
ff26492 修改词性标注的bug 29 December 2016, 04:10:07 UTC
9f0d8c1 兼容python3, 修改raw_input 28 December 2016, 06:17:00 UTC
935a420 兼容python3 28 December 2016, 06:05:12 UTC
bd90584 兼容python3 28 December 2016, 05:59:20 UTC
06a9370 增加ReadMe的历史和其他版本的引用 22 December 2016, 04:40:52 UTC
1cad5e7 增加ReadMe中对模型使用的提示说明 22 December 2016, 04:35:12 UTC
df986ed 修补一些使用词表时的bug 21 December 2016, 02:53:50 UTC
a507dd6 修改《》中的空格的bug 30 November 2016, 09:44:07 UTC
cc87f72 将对网址的特殊处理去掉,并增加=当做正常的分隔符加入前处理 23 November 2016, 10:05:01 UTC
0c6d075 将对@(邮箱)的特殊处理去掉,并增加@和;当做正常的分隔符加入前处理 23 November 2016, 05:50:55 UTC
0f467b6 增加README对模型下载的说明 28 October 2016, 11:59:11 UTC
46cc198 修改 读取模型的代码问题 24 October 2016, 13:31:08 UTC
7a3f2dc 修改readme部分内容 29 September 2016, 10:53:48 UTC
22c0b34 修改说明和去掉不用的引用~ 28 September 2016, 10:54:55 UTC
c568f39 add first version 17 May 2016, 05:14:21 UTC
c87c9b7 Initial commit 17 May 2016, 05:05:05 UTC
back to top