https://github.com/RaRe-Technologies/gensim

Revision Author Date Message Commit Date
e889fa3 update CHANGELOG for v3.8.3 02 May 2020, 23:52:10 UTC
00d27b9 fix import collision 01 May 2020, 14:45:55 UTC
c3d95ab Merge remote-tracking branch 'upstream/develop' into release-3.8.3 01 May 2020, 12:18:31 UTC
29d1092 [MRG] Update README instructions + clean up testing (#2814) * update README instructions * WIP: enable test deps * unpin old tensorflow in tests - old versions not present in newer Pythons * looking into segfault in py3.6 - https://travis-ci.org/github/RaRe-Technologies/gensim/jobs/681096362 * put back pyemd * put keras back * put back tensorflow * investigate segfault in py3.6 * address review comments * avoid py3.6 segfault in Travis tests 01 May 2020, 02:13:05 UTC
f006987 add missing six import 28 April 2020, 08:19:04 UTC
9fdc600 treat missing corpusfile non-fatal 28 April 2020, 08:04:23 UTC
4ed3c22 flake8 fixes 28 April 2020, 08:02:17 UTC
0f06ace avoid circular import of utils seems to break the Py2.7 build under AppVeyor, but not Travis CI 28 April 2020, 08:01:25 UTC
b514a04 disable corpusfile extensions under Py2 on Windows 28 April 2020, 07:07:16 UTC
510551f do not install numpy via appveyor.yml, no need 28 April 2020, 05:28:24 UTC
b74fa68 clean up setup.py 28 April 2020, 05:21:55 UTC
a8e771d bump versions to 3.8.3 28 April 2020, 05:16:58 UTC
0e7c2aa Prepare gensim v3.8.2 to pin smart_open for Py2.7 compatibility (#2787) * pin version of smart-open to py2 compatible * add a comment Co-Authored-By: Radim Řehůřek <me@radimrehurek.com> * make pin conditional (py27 only) * prepare version 3.8.2 * add deprecations to README * get rid rackcdn, bump scipy to 1.0.0 * bump scipy to 1.0.0 in setup.py Co-authored-by: Yury Brukau <brukau@ebu.ch> Co-authored-by: Radim Řehůřek <me@radimrehurek.com> Co-authored-by: Michael Penkov <misha.penkov@gmail.com> Co-authored-by: Ivan Menshikh <menshikh.iv@gmail.com> 28 April 2020, 05:16:58 UTC
afaf76f get tests to pass on Py27 (#2812) * get tests to pass on Py27 * enable Py27 builds under Appveyor and TravisCI * fix utils.open invocation 28 April 2020, 05:06:40 UTC
996801b Merge pull request #2777 from tirkarthi/fix-xml cElementTree has been deprecated since Python 3.3 and removed in Python 3.9 24 April 2020, 19:54:32 UTC
47357de Merge pull request #2799 from Chenxin-Guo/develop Fix typo in comments 24 April 2020, 18:15:33 UTC
585b0c0 Merge branch 'develop' into fix-xml 24 April 2020, 17:35:13 UTC
d70b129 Merge pull request #2806 from RaRe-Technologies/piskvorky-patch-1 Cache README badges 24 April 2020, 08:34:55 UTC
fd3537a Use HTML comments instead of Markdown comment - simpler & easier to read and maintain 23 April 2020, 14:11:23 UTC
68bd860 Cache badges - use google's caching proxy for img.shields.io badges - fixes #2805 23 April 2020, 14:01:57 UTC
ca726c6 Merge pull request #2772 from RaRe-Technologies/piskvorky-patch-1 Add "downloads" badge to README 22 April 2020, 21:09:13 UTC
2a04825 Merge branch 'develop' into piskvorky-patch-1 22 April 2020, 18:55:42 UTC
8791bb7 Make twitter badge blue, too 22 April 2020, 10:03:05 UTC
63dc990 Remove conda-forge badge 22 April 2020, 10:01:17 UTC
188a590 Use blue in badges 22 April 2020, 09:59:42 UTC
db11c14 Use downloads badge from shields.io 22 April 2020, 09:55:35 UTC
7f194c9 Use nicer twitter badge 22 April 2020, 09:45:44 UTC
5b5b545 Add osx+py38 case to avoid multiprocessing issue (#2800) * add osx+py38 case to avoid multiprocessing issue * add comment, fix warning * extend comment Co-Authored-By: Radim Řehůřek <me@radimrehurek.com> * Update gensim/utils.py * Update gensim/utils.py Co-Authored-By: Michael Penkov <m@penkov.dev> Co-authored-by: Radim Řehůřek <me@radimrehurek.com> Co-authored-by: Michael Penkov <m@penkov.dev> 20 April 2020, 06:23:42 UTC
10cec93 Fix typo in comments The rows of the corpus are actually documents, fix the comment to reduce confusion 17 April 2020, 06:06:32 UTC
a2ec4c3 Fix FastText RAM usage in tests (+ fixes for wheel building) (#2791) * pin `bucket` parameter (to avoid RAM issues on CI system) + get rid win32 skip * fix flake8 * partially fix doc building * better workaround for docs build * fix sphinx-gallery * avoid test error * get back loading of old model (because large buckets) * Update setup.py Co-Authored-By: Radim Řehůřek <me@radimrehurek.com> * Update gensim/test/test_fasttext.py Co-Authored-By: Radim Řehůřek <me@radimrehurek.com> * define missing buckets & fix formatting Co-authored-by: Ivan Menshikh <imenshikh@embedika.ru> Co-authored-by: Radim Řehůřek <me@radimrehurek.com> 13 April 2020, 06:38:57 UTC
a6247af cElementTree has been deprecated since Python 3.3 and removed in Python 3.9. 30 March 2020, 16:23:03 UTC
ec222e8 fix build, use KeyedVectors class (#2774) 24 March 2020, 04:55:01 UTC
d952a51 fix broken english in tests (#2773) 23 March 2020, 10:06:17 UTC
a4894bb link downloads badge to pepy instead of pypi 23 March 2020, 08:17:58 UTC
de0ef26 Get rid of "wheels" badge 22 March 2020, 18:09:24 UTC
8a2e2a7 Add downloads badge to README - idea from https://github.com/RaRe-Technologies/smart_open/issues/440 22 March 2020, 18:03:51 UTC
a811a23 fixed get_keras_embedding, now accepts word mapping (#2676) * fixed get_keras_embedding, now accepts word mapping * skip tests if keras not installed * removed unnecessary comment from test_keyed_vectors * fixed indentation * fixed flake import error * moved skip test decorator to class * Update gensim/models/keyedvectors.py Co-Authored-By: Michael Penkov <m@penkov.dev> * Update gensim/models/keyedvectors.py Co-Authored-By: Michael Penkov <m@penkov.dev> * Update gensim/models/keyedvectors.py Co-Authored-By: Michael Penkov <m@penkov.dev> * renamed keras_installed flag to upper case, removed unneeded comment Co-authored-by: Zhicharevich <Alex_Zhicharevich@intuit.com> Co-authored-by: Michael Penkov <m@penkov.dev> 21 March 2020, 09:01:44 UTC
1b3ad81 Fix out of range issue in gensim.summarization.keywords (#2738) * Fixed out of range error in keywords.py * Now using min() function to improve readability * Added a test to make sure that keywords does not fail when words param is greater than number of words in string * Fixing travisCI build error from not having 2 lines after class definition * Fixed whitespace issue for flake8 Co-authored-by: Carter Olsen <olsencar@oregonstate.edu> 21 March 2020, 08:23:53 UTC
f767e1e use collections.abc for Mapping (#2750) * use collections.abc.Mapping when available * ignore py2, tox -e py27-linux revealed setup.py requires python 3.5 * use collections.abc.Iterable 21 March 2020, 08:15:40 UTC
30ca5b3 Fix KeyedVectors.add matrix type (#2761) * add type test * cast internal state to passed type * ekv -> kv * parametrize datatype & cast embeddings passed to `add` to KV datatype * set f32 as default type Co-authored-by: Ivan Menshikh <imenshikh@embedika.ru> Co-authored-by: Michael Penkov <m@penkov.dev> 21 March 2020, 06:53:46 UTC
493e52f remove mention of py27 (#2751) on 25 oct 2019, setup.py was updated to require python 3.5. this change removes the suggestion of testing against py27. 21 March 2020, 04:49:23 UTC
cb3d87c Fix fastText word_vec() for OOV words with use_norm=True (#2764) * add a test for oov similarity * fix a test for oov similarity * fix it once more * prepare the real fix * remove a redundant variable * less accurate comparison Co-authored-by: David Dale <ddale@yandex-team.ru> 21 March 2020, 04:34:40 UTC
0d75f2d Improve documentation in run_similarity_queries example (#2770) 21 March 2020, 04:02:34 UTC
68ec5b8 Merge pull request #2739 from gojomo/rm-cpp-files rm autogenerated *.cpp files that shouldn't be in source control 24 February 2020, 22:55:34 UTC
b92e087 del cython.sh 31 January 2020, 18:37:42 UTC
8d79794 Fix TypeError when using the -m flag (#2734) Currently, if you attempt to use the script with the --min-article-character flag, you get an error because the value gets parsed as a string and the functions expect an int. This fix addresses the issue. ``` Traceback (most recent call last): File "/usr/lib/python3.6/runpy.py", line 193, in _run_module_as_main "__main__", mod_spec) File "/usr/lib/python3.6/runpy.py", line 85, in _run_code exec(code, run_globals) File "/usr/local/lib/python3.6/dist-packages/gensim/scripts/segment_wiki.py", line 385, in <module> include_interlinks=args.include_interlinks File "/usr/local/lib/python3.6/dist-packages/gensim/scripts/segment_wiki.py", line 141, in segment_and_write_all_articles for idx, article in enumerate(article_stream): File "/usr/local/lib/python3.6/dist-packages/gensim/scripts/segment_wiki.py", line 100, in segment_all_articles for article in wiki_sections_text: File "/usr/local/lib/python3.6/dist-packages/gensim/scripts/segment_wiki.py", line 332, in get_texts_with_sections if sum(len(body.strip()) for (_, body) in sections) < self.min_article_character: TypeError: '<' not supported between instances of 'int' and 'str'``` 30 January 2020, 06:56:25 UTC
47a0675 rm autogenerated *.cpp files that shouldn't be in source control 29 January 2020, 02:13:31 UTC
9352dad Merge pull request #2715 from gojomo/py38-plus-build-tuning closes #2713 28 January 2020, 00:10:58 UTC
0e624c1 improved comments via piskvorky review Co-Authored-By: Radim Řehůřek <me@radimrehurek.com> 27 January 2020, 12:43:33 UTC
f5e05d0 better CI logs w/ gdb after core dump 27 January 2020, 08:12:57 UTC
b8346c1 drop py35, add py38 (travis), update explicit dependency versions 27 January 2020, 08:07:22 UTC
d05259a better replacement of deprecated .clock() 27 January 2020, 07:56:31 UTC
4710308 Use time.time instead of time.clock in gensim/models/hdpmodel.py (#2730) * Use time.process_time() instead of time.clock() * time.process_time() -> time.time() 23 January 2020, 07:28:59 UTC
4d22327 Implement saving to Facebook format (#2712) * Add writing header for binary FB format (#2611) * Adding writing vocabulary, vectors, output layer for FB format (#2611) * Clean up writing to binary FB format (#2611) * Adding tests for saving FastText models to binary FB format (#2611) * Extending tests for saving FastText models to binary FB format (#2611) * Clean up (flake8) writing to binary FB format (#2611) * Word count bug fix + including additional test (#2611) * Removing f-strings for Python 3.5 compatibility + clean-up(#2611) * Clean up the comments (#2611) * Removing forgotten f-string for Python 3.5 compatibility (#2611) * Correct tests failing @ CI (#2611) * Another attempt to correct tests failing @ CI (#2611) * Yet another attempt to correct tests failing @ CI (#2611) * New attempt to correct tests failing @ CI (#2611) * Fix accidentally broken test (#2611) * Include Radim remarks to saving models in binary FB format (#2611) * Correcting loss bug (#2611) * Completed correcting loss bug (#2611) * Correcting breaking doc building bug (#2611) * Include first batch of Michael remarks * Refactoring SaveFacebookFormatRoundtripModelToModelTest according to Michael remarks (#2611) * Refactoring remaining tests according to Michael remarks (#2611) * Cleaning up the test refactoring (#2611) * Refactoring handling tuple result from struct.unpack (#2611) * Removing unused import (#2611) * Refactoring variable name according to Michael review (#2611) * Removing redundant saving in test for Facebook binary saving (#2611) * Minimizing context manager blocks span (#2611) * Remove obsolete comment (#2611) * Shortening method name (#2611) * Moving model parameters to _check_roundtrip function (#2611) * Finished moving model parameters to _check_roundtrip function (#2611) * Clean-up FT_HOME behaviour (#2611) * Simplifying vectors equality check (#2611) * Unifying testing method names (#2611) * Refactoring _create_and_save_fb_model method name (#2611) * Refactoring 
test names (#2611) * Refactoring flake8 errors (#2611) * Correcting fasttext invocation handling (#2611) * Removing _parse_wordvectors function (#2611) * Correcting whitespace and simplifying test assertion (#2611) * Removing redundant anonymous variable (#2611) * Moving assertion outside of a context manager (#2611) * Function rename (#2611) * Cleaning doc strings and comments in FB binary format saving functionality (#2611) * Cleaning doc strings in end-user API for FB binary format saving (#2611) * Correcting FT_CMD execution in SaveFacebookByteIdentityTest (#2611) 23 January 2020, 07:25:20 UTC
fbc7d09 Fix overflow error for `*Vec` corpusfile-based training (#2700) * long long types for expected_examples & total_documents * regenerate .cpp files 08 January 2020, 03:53:05 UTC
3abcb9f Refactor bm25 to include model parametrization (cont.) (#2722) * Refactor bm25 to include model parametrization * Refactor constants back and fix typo * Refactor parameters order and description * Add BM25 tests This closes #2597 and closes #2606 * Simplify asserts in BM25 tests * Refactor BM25.get_score Co-authored-by: Marcelo d'Almeida <md@id.uff.br> 08 January 2020, 03:51:13 UTC
3d129de Fix ValueError when instantiating SparseTermSimilarityMatrix (#2689) * force python int before calling islice. islice doesn't accept numpy int * add test to check islice error * it makes test to fail * make sure that islice receives a python int * fix typo 06 January 2020, 14:02:51 UTC
f022028 Initialize self.cfs in Dictionary.compatify method (#2618) * Fix for #2574 * Fix for #2574 05 January 2020, 02:18:23 UTC
74a375d Change similarity strategy when finding n best (#2720) * Find largest by absolute value * Add helper function to simplify code & add unit test for it 05 January 2020, 01:41:31 UTC
12897cb Fix appveyor builds (#2706) * move install_wheels script * git add continuous_integration/check_wheels.py * bump versions for numpy and scipy * update old requirements.txt * add file header * get rid of install_wheels.py hack * fixup: update travis.yml * Update continuous_integration/check_wheels.py Co-Authored-By: Radim Řehůřek <me@radimrehurek.com> * Update continuous_integration/check_wheels.py Co-Authored-By: Radim Řehůřek <me@radimrehurek.com> Co-authored-by: Radim Řehůřek <me@radimrehurek.com> 01 January 2020, 10:10:37 UTC
cc8188c Fix simple typo: voacab -> vocab (#2719) Closes #2718 01 January 2020, 10:05:33 UTC
36ae46f Rerun Soft Cosine Measure tutorial notebook (#2691) 21 December 2019, 03:46:53 UTC
de0dcc3 Warn when BM25.average_idf < 0 (#2687) Closes #2684 02 December 2019, 20:41:01 UTC
e391f0c [Issue-2670] Bug fix: Initialize doc_no2 because it is not set when 'corpus' is empty (#2672) * [Issue-2670] Bug fix: Initialize doc_no2 because it is not set when 'corpus' is empty * [Issue-2670] Add: unittests should fail on invalid input (generator and empty corpus) * [Issue-2670] Add: Fix unittest for generator * [Issue-2670] Fix unittest tox:flake8 errors * [Issue-2670] Fix: empty corpus def in unittest * [Issue-2670] Fix: empty corpus and generator unittests * [Issue-2670] Fix: empty corpus and generator unittests 23 November 2019, 11:50:17 UTC
e7c9f0e Fix local import degrading the performance of word2vec model loading (#2671) (#2682) 21 November 2019, 23:07:38 UTC
1052b9b Speed up word2vec model loading (#2671) * Speed up word2vec binary model loading (#2642) * Add correctness tests for optimized word2vec model loading (#2642) * Include remarks of Radim to code speeding up vectors loading (#2671) * Include remarks of Michael to code speeding up vectors loading (#2671) * Refactor _load_word2vec_format into a few functions for better readability * Clean-up _add_word_to_result function 18 November 2019, 19:01:31 UTC
f72a55d Update word2vec2tensor.py (#2678) 18 November 2019, 16:04:05 UTC
3d65961 Comment out Hacktober Fest from README (#2677) - uncomment next year 11 November 2019, 23:05:20 UTC
44ea793 Improve explanation of top_chain_var parameter in Dynamic Topic Model (DTM) documentation * improve & corrected gensim documentation (#2637) * more descriptive explanation of top_chain_var 03 November 2019, 07:12:15 UTC
ee61691 Document accessing model's vocabulary (#2661) * document accessing model's vocabulary * update images 01 November 2019, 01:13:44 UTC
34ee98b replacing deleted notebooks with placeholders (#2654) 29 October 2019, 08:42:31 UTC
e859c11 Remove native Python implementations of Cython extensions (#2630) * Remove native Python implementations of Cython extensions Fix #2511 * remove print statement in tox.ini * remove print statement in tox.ini * fix flake8 issues * fix missing imports * adjust exception message * bring back FAST_VERSION variable * fixup: missing parens * disable progress bar for tox * respond to review comments * remove C/C++ sources generated from Cython files * update setup.py * remove duplicate line in setup.py * fix numpy bootstrapping * update tox.ini * handle cython dependency in setup.py * fixup in setup.py: lowercase c * more cython sourcery * fix tox.ini * Fix merge artifact in setup.py * fix merge artifact * disable pip progress bar under CircleCI 25 October 2019, 12:54:24 UTC
1894339 Delete requirements.txt (#2648) 24 October 2019, 15:02:51 UTC
e228a93 Fix links to documentation in README.md (#2646) * Fix links to documentation in README.md * Update README.md 24 October 2019, 15:00:25 UTC
86ed0d8 fix setup.py to get documentation to build under CircleCI (#2650) 24 October 2019, 14:59:57 UTC
bcee414 Improve gensim documentation (numfocus) (#2591) * Update makefile to point to new subdirectory * Update layout.html to show new documentation sections * introduce sphinx gallery * reorganize gallery * trim tut3.rst * git add docs/to_python.py * git add gallery/010_tutorials/run_doc2vec_lee.py * minor layout tweak * add downloader api howto * add fasttext tutorial and howto * use pprint in fasttext tutorial * add summarization tutorial * git add gallery/020_howtos/run_howto_compare_lda.py * add fasttext thumbnails * adding core concepts tutorial * add summarization plot * update notebook to use 20newsgroups * update notebook * improve notebook * update howtos * fix distance metrics tutorial * improve distance_metrics.ipynb * git add gallery/010_tutorials/run_distance_metrics.py * git add gallery/020_howtos/run_news_classification.py * move downloader API to tutorials section * add docs/src/auto_examples so bindr can pick up the notebooks * minor changes * git add gallery/010_tutorials/run_lda.py * more minor changes * More minor changes * git add gallery/010_tutorials/run_word2vec.py * updated notebooks * git add gallery/010_tutorials/run_wmd.py * add image * move parts of intro.rst to core concepts tutorial * move README.txt to wiki * get rid of fasttext wrapper tutorial * update top-level heading * more minor changes * minor updates * improve Doc2Vec tutorial, move explanations from IMDB * git add gallery/020_howtos/run_doc2vec_imdb.py * git st * fix notebook paths for bindr * rename gallery to documentation * git add binder/requirements.txt * git add auto_examples/000_core/requirements.txt * adding requirements.txt for binder * removing requirements files added in desperation * update conf.py * remove temporary files from git branch * rm images * merge "getting started" into "core concepts" * add some clarifying text * add Jupyter notebook * Revert "get rid of fasttext wrapper tutorial" This reverts commit 3ec0a468263af6b3e3e647bb6547d496a19483a4. 
* get rid of fasttext wrapper guide * git add auto_examples/ * minor fixes * fix typo * add listing of corpora and models * get rid of binder * git add gallery/020_howtos/run_doc.py * more instructions for authorship * improve linkage between core tutorials * add highlighting * move downloader to howto * restore support and about sections * sync toolbars * Add installation instructions to top page * clean up html * add wordcloud-based thumbnails * updated notebooks * update script * add sphinx-gallery to doc dependencies * include memory_profiler in docs_testenv * git add README.rst * use proper temporary file * reorganize tutorials section * clarify version control in README.rst * git rm 020_howtos/saved_model_wrapper * move pivoted document normalization to tutorials section * fix ordering in howto section * add images * add annoy to doc dependencies * update gitignore * disable tox spinner * turn off progress bar for pip * fix labels * naming fixes * git rm docs/notebooks/gensim\ Quick\ Start.ipynb * git rm docs/notebooks/Corpora_and_Vector_Spaces.ipynb * git rm gensim\ Quick\ Start.ipynb * git rm docs/notebooks/Topics_and_Transformations.ipynb * git rm docs/notebooks/Similarity_Queries.ipynb * git rm docs/notebooks/summarization_tutorial.ipynb * git rm docs/notebooks/distance_metrics.ipynb * git rm docs/notebooks/word2vec.ipynb * git rm docs/notebooks/doc2vec-lee.ipynb * git rm docs/notebooks/gensim_news_classification.ipynb * git rm docs/notebooks/lda_training_tips.ipynb * git rm docs/notebooks/doc2vec-IMDB.ipynb * git rm docs/notebooks/annoytutorial.ipynb * git rm tutorial.rst tut1.rst tut2.rst tut3.rst * minor update to layout.html * git rm changes_080.rst * minor tweaks to gallery and surrounding docs * remove cruft from run_doc2vec_imdb.py * update doc howto * fixup * git add requirements_docs.txt * more dependencies in requirements_docs.txt * re-enable LDA howto * add missing images * add built LDA howto * port tutorials.md to gallery * WIP: cleaning up 
docs * language clean up + pin exact versions in doc requirements * git add redirects.csv test_redirects.py * remove gensim_numfocus namespace qualifier * doc cleanup in Other resources * fix redirects * regenerated tutorials * Added tools/check_gallery.py * committing unsuccessful attempt to fix a tutorial before deleting it * remove tutorials that don't work * index page fixes * add install anchor * Update redirects.csv * link fixes from local testing * replace easy_install with pip * renamed run_040_compare_lda.py to run_compare_lda.py * minor fixes * more fixes from website testing * updating wordcloud images * add pandas to requirements_docs.txt * !! * more dependency + code fixes * update upload path to "live" website * update test_redirects.py * git rm redirects.csv test_redirects.py 21 October 2019, 21:52:57 UTC
e102574 Change find_interlinks return type to list of tuples (#2636) * Change interlinks format to list of tuples. Fixes #2635 This commit fixes the issue in RaRe-Technologies/gensim#2635 This commit changes the interlinks storage in the `segment_wiki` script from dictionary to a list of tuples. We can process the test wikidata used in the test suite of gensim to inspect the new behavior. ``` python gensim/scripts/segment_wiki.py -i \ -f ~/Downloads/enwiki-latest-pages-articles1.xml-p000000010p000030302-shortened.bz2 \ -o ~/Downloads/enwiki-latest.json.gz ``` We get the following output: ``` $ cat ~/Downloads/enwiki-latest.json.gz | zcat | head -1 | jq -r '.interlinks[] | [.[0], .[1]] | @tsv' | sort | head -ism -ism 1848 Revolution 1848 Revolution 1917 October Revolution 1917 October Revolution 6 February 1934 crisis February 1934 riots A. S. Neill A. S. Neill AK Press AK Press Abu Hanifa Abu Hanifa Adolf Brand Adolf Brand Adolf Brand Adolf Brand Adolf Hitler Hitler ``` All tests pass for the related test file. ``` python -m unittest gensim.test.test_scripts /Users/smishra/miniconda3/envs/TwitterNER/lib/python3.7/bz2.py:131: ResourceWarning: unclosed file <_io.BufferedReader name='/Users/smishra/workspace/codes/python/gensim/gensim/test/test_data/enwiki-latest-pages-articles1.xml-p000000010p000030302-shortened.bz2'> self._buffer = None ResourceWarning: Enable tracemalloc to get the object allocation traceback ..... ---------------------------------------------------------------------- Ran 5 tests in 6.298s OK ``` * Updated docstrings * Fixed flake8 issue of long line in docsrtring * Fixed comments and replaces assertTrue with assertEqual * Fixed unittest comment and checks for wikicorpus 19 October 2019, 16:09:41 UTC
3e027c2 Move Py2 deprecation warning to top of changelog (#2627) It belongs at the top. People should see it immediately without having to scroll down to an older release. 11 October 2019, 05:54:59 UTC
289a6ca Handling for iterables without 0-th element, fixes #2556 (#2629) * Handling for iterables without 0-th element, fixes #2556 * Improved accessing the first element for the case of big datasets 10 October 2019, 06:54:51 UTC
a7713aa Disable Py2.7 builds under Travis, CircleCI and AppVeyor (#2601) * Disable Py2.7 builds under Travis and AppVeyor * use Py3.7.4 image under CircleCI * tweak circleci config.yml * patch tox.ini * more fixes to get docs building under tox * s/python3.7/python3/ * delay annoy ImportError until actual use * bring back Pattern * simplify invocation of pip command * add install_numpy_scipy.py * fixup * use sys.executable * adjust version in install_wheels.py * adjust travis.yml * adjust version in install_wheels.py back * add logging statements * use version_info instead of sys.version * fixup 10 October 2019, 06:47:17 UTC
2131e3a Properly install Pattern library for documentation build (#2626) * Probably fixes #2534 * Uppercase P * Added comment 08 October 2019, 10:50:08 UTC
25f8a42 Fixed #2554 (#2619) 04 October 2019, 02:39:03 UTC
26f1e81 Add hacktoberfest-related documentation (#2616) * git add HACKTOBERFEST.md * clarify contributions * respond to review comments * add link to HACKTOBERFEST.md from README.md * typo * include comments from Gordon 02 October 2019, 01:50:31 UTC
2fac325 added release/check_wheels.py (#2610) * added release/check_wheels.py * added preamble * Update release/check_wheels.py Co-Authored-By: Radim Řehůřek <me@radimrehurek.com> * respond to review comments 29 September 2019, 10:08:48 UTC
f89808d Merge branch 'master' into develop 23 September 2019, 09:19:37 UTC
8420768 Merge branch 'release-3.8.1' 23 September 2019, 09:19:34 UTC
526b6b4 updated CHANGELOG.md for version 3.8.1 23 September 2019, 09:17:33 UTC
bec0d3d bumped version to 3.8.1 23 September 2019, 09:15:31 UTC
aed72c4 Use BASE_DIR instead of base_dir in _load_info in downloader. (#2605) 20 September 2019, 01:55:01 UTC
a47eed8 Optimize Poincare model training (#2589) * fix list operation * fix variable name * fix list comprehension 07 September 2019, 09:12:45 UTC
4c8be8b Fix bug where a module import set up logging, pin doctools for Py2 (#2552) * Fix bug with inadvertent logging setup - module imports must never issue logging events (logging not set up yet in main app, triggers a default config init) - warnings better, but in this case, we can more simply re-use the existing functionality from `gensim.utils` * Update textcleaner.py * get Py2.7 docs to build * bugfix in setup.py 26 August 2019, 23:24:24 UTC
4ab2167 Added unicode_errors arg in loading a vocab file. (#2570) * Added unicode_errors arg in loading a vocab file. * add a vocab file decoding error handling as same as a model file * add test and test data of loading a vocab file which has decoding error * add 'rstrip()' to load vocab file to make it same as model file * reverted except adding unicode_errors arg. * handle key values as unicode for Python2. 26 August 2019, 09:21:17 UTC
decd4a0 catch loading older TfidfModels without smartirs (#2559) * catch loading older TfidfModels with smartirs * add tfidf model made with version 3.2.0 * add test for tfidf backwards compatibility to version 3.2.0 * simplify __getitem__ test 26 August 2019, 09:15:32 UTC
0d0a908 Fix usage example in phrases.py (#2575) * Update phrases.py * Update phrases.py * Update phrases.py * Fix trailing whitespace in gensim/models/phrases.py * Update gensim/models/phrases.py More trailing whitespace 26 August 2019, 09:13:27 UTC
f2ad5c2 Update the version of smart_open in the setup.py file (#2582) Fixes #2576 23 August 2019, 08:07:18 UTC
f97d0e7 Merge branch 'master' into develop 09 July 2019, 00:36:19 UTC
fdc0195 Merge branch 'release-3.8.0' 09 July 2019, 00:36:16 UTC
2247450 more fixes to release/prepare.sh 09 July 2019, 00:30:30 UTC
4b673b4 adjust behavior of bump_version.py 09 July 2019, 00:21:44 UTC
d0a4394 minor fixes in prepare.sh 09 July 2019, 00:20:30 UTC
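
Note on commit 8d79794 (Fix TypeError when using the -m flag): the failure it describes, a command-line value arriving as a string where an int is expected, is easy to reproduce in isolation. The sketch below is a minimal illustration, assuming the option is declared with argparse; `build_parser` and `article_length` are hypothetical names, and this is not the actual segment_wiki.py code or necessarily the fix that was committed.

```python
import argparse


def build_parser(coerce_to_int):
    """Hypothetical parser for illustration; not the segment_wiki.py code."""
    parser = argparse.ArgumentParser()
    # Without type=int, argparse returns command-line values as str.
    extra = {"type": int} if coerce_to_int else {}
    parser.add_argument("-m", "--min-article-character", default=200, **extra)
    return parser


argv = ["-m", "300"]
raw = build_parser(coerce_to_int=False).parse_args(argv).min_article_character
fixed = build_parser(coerce_to_int=True).parse_args(argv).min_article_character

article_length = 500  # stand-in for the summed section lengths

# Comparing an int with a str raises TypeError on Python 3.
try:
    _ = article_length < raw
    comparison_failed = False
except TypeError:
    comparison_failed = True

# With the value coerced to int, the same comparison works.
too_short = article_length < fixed
```

In this sketch the non-string default (200) is left untouched by argparse, so the comparison only breaks when the flag is actually passed, which is consistent with the commit message saying the error appears when the option is used.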