https://github.com/jdf/cue.language

sort by:
Revision Author Date Message Commit Date
cebbd3d Merge pull request #3 from fuhrmanator/patch-1 More complete stop words (from Veronis) 23 September 2019, 01:29:36 UTC
b744f8c Remove underscored words 21 September 2019, 20:38:46 UTC
9340e95 More complete stop words (from Veronis) See http://clu.uni.no/corpora/1999-1/0042.html 15 December 2015, 20:06:33 UTC
7a97a56 Armenian! 04 July 2011, 16:42:37 UTC
7d5dd37 Merge branch 'master' of github.com:jdf/cue.language 03 July 2011, 18:41:03 UTC
e482546 formatting 03 July 2011, 18:40:42 UTC
3fc572b run through alphetical sort 04 January 2011, 14:42:04 UTC
9079613 some more german corrections 04 January 2011, 14:40:57 UTC
6a1423e spelling: unse/unsem etc. do not exist. 03 January 2011, 11:10:31 UTC
fff1fde fix most common cases for broken abbreviation detection in SentenceIterator 04 December 2009, 21:41:53 UTC
5a142ec get all noted items from counter 03 December 2009, 20:26:43 UTC
0a15d8e Decided cannot do stream-based sentence iterator 03 December 2009, 04:41:57 UTC
e6f2e6b remove bogus main method from NGramIterator 03 December 2009, 03:43:15 UTC
8f8d43e use unencoded characters in urls 02 December 2009, 22:17:23 UTC
6c02d5d make license link a real link 02 December 2009, 22:03:48 UTC
9ecfa1f added license to readme 02 December 2009, 22:03:18 UTC
c9296d3 added docs for stopwords 02 December 2009, 21:59:03 UTC
ef0526d finished doc draft 02 December 2009, 21:53:41 UTC
013d643 Add Counter method to fetch all counted things, ordered by frequency descending 02 December 2009, 21:53:22 UTC
43ced87 add "whose" to english stop words 02 December 2009, 20:41:20 UTC
1262af9 NGramIterator now takes optional StopWords to exclude n-grams containing stop words 02 December 2009, 20:32:36 UTC
3e7e111 Have SentenceIterator normalize whitespace 02 December 2009, 20:31:44 UTC
d4b30a4 readme 02 December 2009, 19:30:33 UTC
4078d7e ignores 02 December 2009, 19:02:43 UTC
b006c67 license 02 December 2009, 15:21:00 UTC
65a3da4 initial import 02 December 2009, 15:11:39 UTC
back to top