https://github.com/RaRe-Technologies/gensim

sort by:
Revision Author Date Message Commit Date
30179c8 Merge branch 'release-4.3.2' 23 August 2023, 15:27:31 UTC
2e5f3d2 git add docs/src/conf.py 23 August 2023, 15:27:18 UTC
b6d35e6 updated CHANGELOG.md for version 4.3.2 23 August 2023, 15:18:15 UTC
fa1f817 bumped version to 4.3.2 23 August 2023, 15:10:47 UTC
c4a7acd docs: Fix a few typos (#3366) * docs: Fix a few typos There are small typos in: - docs/src/auto_examples/howtos/run_doc2vec_imdb.py - docs/src/auto_examples/howtos/run_doc2vec_imdb.rst - docs/src/gallery/howtos/run_doc2vec_imdb.py - gensim/test/test_corpora.py Fixes: - Should read `output` rather than `ouput`. - Should read `count` rather than `counnt`. Signed-off-by: Tim Gates <tim.gates@iress.com> * make -C docs/src html rebuilt docs that were updated by this PR with Sphinx 5.1.1 * updated documentation --------- Signed-off-by: Tim Gates <tim.gates@iress.com> Co-authored-by: Michael Penkov <m@penkov.dev> 23 August 2023, 14:33:12 UTC
431bd4a point curl to develop branch 23 August 2023, 12:09:34 UTC
84a53c0 disable wheel tests for py3.10 on windows 23 August 2023, 11:20:51 UTC
c401ca3 Omit NumPy deprecation warnings from SCM autoexample (#3305) 23 August 2023, 11:00:39 UTC
0a601ca update time-slice in documentation (#3475) 23 August 2023, 10:53:42 UTC
760dbc7 make test more tolerant Radim thinks that the sign doesn't matter here, only the magnitude 23 August 2023, 10:38:51 UTC
560c947 Update README.md 23 August 2023, 10:23:25 UTC
73e8b6a make gensim CI great again (#3488) * pin numpy and scipy versions for running tests I suspect the latest numpy and/or scipy wheels are the cause of the recent slowdown, because the tests that ran on 2023/06/01 against the versions that I'm pinning in this commit were much faster. * pin cython as well * add some more diagnostic steps to workflow * adjust tests.yml * git add test_wheel.py * increase pytest verbosity * fixup * fixup * fixup * adjust cython version in pin * output cython output as artifacts * update tests.yml, show numpy libs for all platforms * show Cython version during build_ext step * add more diagnostics to setup.py and tests.yml * force cythonization * still more diagnostics * increase test scope to entire suite * clean up, add helpful comments * update test.yml * remove cython magic from setup.py, that gets handled by pyproject.toml * fixup * git mv test_wheel.py .github/workflows/ * reduce timeout to 15 minutes * make that 20 minutes 23 August 2023, 09:50:20 UTC
de016e8 further reduce test scope 22 August 2023, 12:50:47 UTC
94abd9c temporarily limit unit test scope while investigating slow tests 22 August 2023, 11:25:05 UTC
791347e output test durations 22 August 2023, 11:20:14 UTC
d181cf8 output FAST_VERSION in tests workflow 22 August 2023, 10:51:19 UTC
ec6a0af fix typo in word2vec_inner.pyx 22 August 2023, 10:51:11 UTC
147e23a disable remaining Py3.11 tests for CIBW 22 August 2023, 08:06:34 UTC
c0c4e42 increase test timeout while investigating failures 22 August 2023, 06:04:56 UTC
360ac10 Bump pypa/cibuildwheel from 2.12.1 to 2.13.1 (#3483) Bumps [pypa/cibuildwheel](https://github.com/pypa/cibuildwheel) from 2.12.1 to 2.13.1. - [Release notes](https://github.com/pypa/cibuildwheel/releases) - [Changelog](https://github.com/pypa/cibuildwheel/blob/main/docs/changelog.md) - [Commits](https://github.com/pypa/cibuildwheel/compare/v2.12.1...v2.13.1) --- updated-dependencies: - dependency-name: pypa/cibuildwheel dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> 21 August 2023, 15:22:03 UTC
bf19210 git rm gensim/models/flsamodel.py (#3470) 21 August 2023, 15:19:35 UTC
d735004 deal with emergent flake8 errors 08 August 2023, 13:31:03 UTC
7eb07f8 dummy commit to trigger CI 08 August 2023, 13:25:06 UTC
3ae286e Doc fixes and separate workflow for building docs via CI (#3462) * Use new smart_open compression parameter instead of ignore_ext when possible Keep compatibility with old versions of smart_open just in case. 1.8.1 is required by the deps, 5.1.0 introduced the compression parameter, 6.0.0 dropped the ignore_ext parameter. * Download the NLTK WordNet data before using the WordNet lemmatizer Fixes the docs build for the run_lda.py and run_ensemblelda.py tutorials. * Add missing documentation build dependency scikit-learn The run_compare_lda.py howto and run_word2vec.py tutorial import it. It was removed from docs_testenv when the scikit-learn wrapper was removed. Fixes: commit a21d9cc768598640f38e4bd03d368f8712a9aa77 * Update the order in which the tutorials will be placed Add the ensemblelda and scm tutorials added in 2021. Remove the summarization tutorial as it was removed in 2020. Use the order from the existing prebuilt docs files. Without a defined order they will be placed non-deterministically, which means commits not changing docs will change prebuilt docs. Fixes: commit 76579b3fd33d6a59fb397e8d101bb4326951afa8 Fixes: commit ddeeb1274c996b6992e11f7cbe7ca44c776f6daf Fixes: commit 2dcaaf80f4fb8023acc2f118b0966d92fca9500e * Improve the display of the stale prebuilt docs files Print the .md5 file when it is the stale file. Print the source path for each stale file. Print paths relative to the source tree. Print only one stale file pair per line. * Upload the changes to the generated docs to GitHub artifacts Building the docs often takes too long locally, so this allows pull request submitters to build on GitHub, download the changes and incorporate the changes to the generated docs in a commit, then update their pull request with the generated docs commit. Also check that the changes to the prebuilt docs are committed, except for docs that change for every single rebuild. * Update duplicate copies of howtos and tutorials * move action to separate workflow --------- Co-authored-by: Michael Penkov <m@penkov.dev> 25 May 2023, 15:50:03 UTC
eb98bf3 Build wheels with oldest supported numpy (#3467) * Use oldest-supported-numpy for build * add workflow step to test wheels against older numpy * download installwheel.py * fix installwheel command * patch installwheel.py to take processor into account * patch installwheel.py * patch installwheel.py * add step to debug test environment * work around numpy bug https://github.com/numpy/numpy/issues/23104 * add pip freeze prior to wheel build * git add .github/workflows/pipfreezedammit.py * work around cibuildwheel not outputting pip-freeze * update build-wheels.yml * Revert previous commits This reverts commit 69c33bf7c51a9c5c65942e3ff171d228c7aa2a31. This reverts commit a6cdaa0b109da1a62058ebad2e847a40665c9003. This reverts commit eef4177f753997a5b32e0134d10c01ecc9cc5d37. This reverts commit 8f63e7b57d501d68bfad209bc6b1077b38e7830c. --------- Co-authored-by: Michael Penkov <m@penkov.dev> 21 May 2023, 12:26:11 UTC
7dadd70 Add tox environments to setup.cfg, fix napoleon import (#3392) * add tox environments to setup.cfg * Using sphinx.ext.napoleon instead of sphinxcontrib.napoleon in conf.py * roll back changes to setup.cfg --------- Co-authored-by: Michael Penkov <m@penkov.dev> 14 May 2023, 13:21:27 UTC
bd22879 Merge branch 'develop' of github.com:piskvorky/gensim into develop 29 April 2023, 19:41:03 UTC
6ceb662 change sponsorship notice 29 April 2023, 19:38:59 UTC
4ca0653 Merge pull request #3459 from Holmes5/httpsfix Replace HTTP with HTTPS in enwiki URLs 29 April 2023, 14:57:30 UTC
525f67a Merge pull request #3456 from pabs3/update-links Update broken/redirecting/unencrypted links 29 April 2023, 14:56:39 UTC
714a333 Merge pull request #3464 from RaRe-Technologies/dependabot/github_actions/pypa/cibuildwheel-2.12.1 Bump pypa/cibuildwheel from 2.12.0 to 2.12.1 29 April 2023, 14:55:05 UTC
5cc2cad increase tolerance in flaky test 29 April 2023, 14:43:44 UTC
bb7dfc5 Bump pypa/cibuildwheel from 2.12.0 to 2.12.1 Bumps [pypa/cibuildwheel](https://github.com/pypa/cibuildwheel) from 2.12.0 to 2.12.1. - [Release notes](https://github.com/pypa/cibuildwheel/releases) - [Changelog](https://github.com/pypa/cibuildwheel/blob/main/docs/changelog.md) - [Commits](https://github.com/pypa/cibuildwheel/compare/v2.12.0...v2.12.1) --- updated-dependencies: - dependency-name: pypa/cibuildwheel dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com> 01 April 2023, 12:59:50 UTC
73825d6 Update duplicate copies of howtos and tutorials 19 March 2023, 07:56:15 UTC
5207fab replacing urls 14 March 2023, 12:30:26 UTC
c110730 Update links to Google Groups Support users without JavaScript, avoid redirects, use https. 13 March 2023, 23:41:53 UTC
27f27b1 Update GitHub links to https 13 March 2023, 23:41:53 UTC
38ff69a Update SourceForge URLs to https 13 March 2023, 23:41:53 UTC
1c9fc75 Update Wikipedia URLs to https 13 March 2023, 23:41:52 UTC
e50776e Link to the SciPy install page instead of download page The download page no longer exists. Also update the URL to https. 13 March 2023, 23:41:52 UTC
b898dd0 Update links to RaRe Technologies website to https Avoids a redirect and protects the content. 13 March 2023, 23:41:52 UTC
0581678 Update PyPI classifiers link to pypi.org, https and HTML 13 March 2023, 23:41:51 UTC
8415062 Update PyPI module links to pypi.org and https Avoids redirects and protects the network data. 13 March 2023, 23:41:51 UTC
8e1faa1 Update links to the website to use https This avoids a redirect and protects the content on the network. 13 March 2023, 23:41:50 UTC
5397b0c Add trailing slash to links to the website front page Avoids a redirect to URL including the slash. Also update URLs to https. 13 March 2023, 23:41:50 UTC
3c1490f Update link to gensim install instructions The install page now redirects to a section of the index page. Also update the link to https. Fixes: commit e17bcf2bd14be4f99e5efca03caaa6b17d70fa03 13 March 2023, 23:41:50 UTC
0ee958f Fix links that point to the website about page This page was removed from the website for Gensim 4. Link instead to the right subsections of the intro page. Also update the links to https. 13 March 2023, 23:41:49 UTC
32604bc Update links to the GNU LGPL v2.1 license The license was superseded by the LGPL v3 and consequently the URL of the LGPL v2.1 has changed. Also update existing correct links to https. Fixes: commit beb04ea1f8f9c438b0a40aa4cbbd955ea065f84f 13 March 2023, 23:41:49 UTC
ccffa84 Tell git to ignore C code generated by Cython from fastss.pyx (#3455) Fixes: commit 05284d1f73ef4de03204f3e6e94438efcddb1001 Fixes: commit 67f45da599865c4351e9a66100658fb45af27f99 13 March 2023, 05:30:56 UTC
3ab5371 ci: move wheels upload into its own job (#3454) 10 March 2023, 13:13:34 UTC
8c81abc git rm .travis.yml 10 March 2023, 06:24:31 UTC
91bb1be update check_wheels.py 10 March 2023, 06:24:17 UTC
f57d4ac bump version for development 10 March 2023, 06:24:06 UTC
f4dfda4 Merge branch 'master' into develop 09 March 2023, 06:49:52 UTC
a60edb3 Merge branch 'release-4.3.1' 09 March 2023, 06:49:50 UTC
35c586e bump versions 09 March 2023, 06:48:19 UTC
faca509 bump date in change log 09 March 2023, 06:46:47 UTC
6b35d00 bumped version to 4.3.1 09 March 2023, 06:45:57 UTC
e7d77a1 update change log for 3.4.1 (#3451) * update change log for 3.4.1 * Update CHANGELOG.md Co-authored-by: Radim Řehůřek <radimrehurek@seznam.cz> --------- Co-authored-by: Radim Řehůřek <radimrehurek@seznam.cz> 09 March 2023, 06:42:54 UTC
cfb1a62 remove unused dependency, handle ImportError (#3447) 07 March 2023, 12:37:38 UTC
7002a60 Bump codecov/codecov-action from 2 to 3 (#3450) Bumps [codecov/codecov-action](https://github.com/codecov/codecov-action) from 2 to 3. - [Release notes](https://github.com/codecov/codecov-action/releases) - [Changelog](https://github.com/codecov/codecov-action/blob/main/CHANGELOG.md) - [Commits](https://github.com/codecov/codecov-action/compare/v2...v3) --- updated-dependencies: - dependency-name: codecov/codecov-action dependency-type: direct:production update-type: version-update:semver-major ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> 06 March 2023, 08:17:54 UTC
0f63f35 ci: enable arm64/aarch64 wheel builds (#3448) * ci: move linters into reusable workflow Move lint steps into its own workflow that can be called from other workflows (`on.workflow_dispatch`). * ci: enable builds for macos-arm64 and linux-aarch64 * ci: replace multibuild by qemu+cibuildwheel * ci: configure dependabot for github-actions This allow dependabot to create PR to update thrid-party GitHub Actions. * ci: run wheel test only on cp311 and native archs 06 March 2023, 07:19:23 UTC
b4f0e0b Update Python version in docs (#3446) 03 March 2023, 12:03:30 UTC
63fce18 fix: conversion of cosine distance to cosine similarity is incorrect (#3441) Co-authored-by: Mona Shaheen <mona.shaheen@dynamicyield.com> 03 March 2023, 12:02:43 UTC
f260d1e check hs and negative. add tests (#3443) 17 February 2023, 14:45:00 UTC
f35faae Merge branch 'travis' into develop 21 December 2022, 00:41:18 UTC
23f5488 bump develop version 21 December 2022, 00:39:22 UTC
7ae8bdc correct DOCKER_TEST_IMAGE env var 19 December 2022, 11:23:09 UTC
f67ab87 continue using xenial to build 3.8 aarch64 wheels 19 December 2022, 11:17:45 UTC
f3b32fc let multibuild pick DOCKER_TEST_IMAGE as advised here: https://github.com/multi-build/multibuild/issues/492#issuecomment-1357307275 19 December 2022, 09:18:00 UTC
38e7273 get rid of visdom from .travis.yml it's causing the Py3.10 wheel graviton build to fail 19 December 2022, 07:57:10 UTC
6cfeb32 get rid of nmslib test dependency in travis.yml 19 December 2022, 07:31:52 UTC
9c1183d disable nsmlib tests for Py3.10 and .11 19 December 2022, 07:23:10 UTC
5c5bd60 ensure importlib-metadata is up-to-date 18 December 2022, 12:44:24 UTC
e524047 adjust numpy and scipy versions for Py3.10 and 3.11 in travis.yml 18 December 2022, 11:46:39 UTC
8c96063 Merge branch 'master' into develop 17 December 2022, 16:03:10 UTC
adf393c Merge branch 'release-4.3.0' 17 December 2022, 16:03:04 UTC
8a56c5f bumped version to 4.3.0 17 December 2022, 16:02:14 UTC
8e673e4 update multibuild version 17 December 2022, 15:43:52 UTC
97a9793 Merge branch 'master' into develop 17 December 2022, 15:06:17 UTC
8de15b8 Merge branch 'release-4.3.0' 17 December 2022, 15:06:14 UTC
99c6c7b bump version in conf.py 17 December 2022, 15:06:05 UTC
1a5ee4b bumped version to 4.3.0 17 December 2022, 15:02:10 UTC
f571f33 updated changelog for next release (#3412) * updated changelog for next release * updated changelog 17 December 2022, 13:30:44 UTC
01758ce Merge pull request #3416 from RaRe-Technologies/numpy_hack fix numpy hack in setup.py 17 December 2022, 13:08:13 UTC
2fd3e89 fix numpy hack in setup.py fix #3225 16 December 2022, 13:28:32 UTC
db90a6a Fix backwards compatibility bug in Word2Vec (#3415) * add missing model file * update test 16 December 2022, 13:12:45 UTC
45d35ee Flsamodel (#3398) * added flsamodel * added FuzzyTM to dependencies * flake8 * less dependencies and not imported from FuzzyTM * added flsamodel Co-authored-by: Michael Penkov <m@penkov.dev> 12 December 2022, 13:46:40 UTC
50a9e6b Fixed issue when using add_vector with FastTextKeyedVectors (#3389) Since Gensim 4.0, 'key' in FastTextKeyedVectors always returns True by design. The proper way to check if a key already exists is with 'key' in FastTextKeyedVectors.key_to_index. Co-authored-by: dcarron <dcarron@idiap.ch> 12 December 2022, 13:40:02 UTC
ca8e4e8 refactor wheel building and testing workflow (#3410) handle windows separately - it's cleaner that way 07 December 2022, 03:09:49 UTC
e656d77 Fix bug that prevents loading old models (#3359) * fix loading old gensim model by new model * Revert "fix loading old gensim model by new model" This reverts commit d398e92291e3de4d6d2a73016150cd9aa2e52714. * fix loading old gensim model by new model * Update word2vec.py * Update word2vec.py * Update word2vec.py Co-authored-by: Michael Penkov <m@penkov.dev> 07 December 2022, 02:50:47 UTC
68fdbf9 clarify runtime expectations (#3381) * clarify runtime expectations * update doc files made stale by this PR Co-authored-by: Michael Penkov <m@penkov.dev> 07 December 2022, 02:50:22 UTC
985a3bd fix pip invokation Seems to consistently fail for MacOS Py3.11 06 December 2022, 14:13:30 UTC
a9b9714 Add support for Python 3.11 and drop support for Python 3.7 (#3402) * add support for python 3.11 * change to oldest version numpy and scipy that support 3.11 * add build for python3.11 * disable nmslib test for python 3.11 * fix formatting * fix formatting * change build for 3.8 * drop Py3.7 support and builds Co-authored-by: Michael Penkov <m@penkov.dev> 06 December 2022, 12:24:42 UTC
3f536fb Coherence Model - work on documents without tokens (#3406) 06 December 2022, 07:38:16 UTC
cc70a6c Fixed the broken link in readme.md (#3409) 06 December 2022, 00:07:26 UTC
3331b82 Implement numpy hack in setup.py to enable install under Poetry (#3363) * Closes #3362: Install issue poetry * get rid of redundant exception handling this code can never raise an exception, so we shouldn't be expecting them Co-authored-by: Michael Penkov <m@penkov.dev> 03 December 2022, 15:34:57 UTC
b6ea788 Switch to Cython language level 3 (#3344) * Switch to Cython language level 3 Python 2 is not supported by gensim, so switching to language level 3 should be fine. Silences a warning from Cython: FutureWarning: Cython directive 'language_level' not set, using 2 for now (Py2). This will change in a later release! * fix cythonization with language_level=3 Co-authored-by: Radim Řehůřek <radimrehurek@seznam.cz> 03 December 2022, 14:11:22 UTC
b17e6af fix deprecation warning from pytest (#3354) gensim/matutils.py:22: DeprecationWarning: Please use `triu` from the `scipy.linalg` namespace, the `scipy.linalg.special_matrices` namespace is deprecated. from scipy.linalg.special_matrices import triu 03 December 2022, 13:49:18 UTC
c93eb0b Fixed bug in loss computation for Word2Vec with hierarchical softmax (#3397) * fixed loss computation for sg, hs * fixed loss computation for cbow, hs 03 November 2022, 13:49:57 UTC
back to top