https://github.com/RaRe-Technologies/gensim

Revision Message Commit Date
a60edb3 Merge branch 'release-4.3.1' 09 March 2023, 06:49:50 UTC
35c586e bump versions 09 March 2023, 06:48:19 UTC
faca509 bump date in change log 09 March 2023, 06:46:47 UTC
6b35d00 bumped version to 4.3.1 09 March 2023, 06:45:57 UTC
e7d77a1 update change log for 3.4.1 (#3451) * update change log for 3.4.1 * Update CHANGELOG.md Co-authored-by: Radim Řehůřek <radimrehurek@seznam.cz> --------- Co-authored-by: Radim Řehůřek <radimrehurek@seznam.cz> 09 March 2023, 06:42:54 UTC
cfb1a62 remove unused dependency, handle ImportError (#3447) 07 March 2023, 12:37:38 UTC
7002a60 Bump codecov/codecov-action from 2 to 3 (#3450) Bumps [codecov/codecov-action](https://github.com/codecov/codecov-action) from 2 to 3. - [Release notes](https://github.com/codecov/codecov-action/releases) - [Changelog](https://github.com/codecov/codecov-action/blob/main/CHANGELOG.md) - [Commits](https://github.com/codecov/codecov-action/compare/v2...v3) --- updated-dependencies: - dependency-name: codecov/codecov-action dependency-type: direct:production update-type: version-update:semver-major ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> 06 March 2023, 08:17:54 UTC
0f63f35 ci: enable arm64/aarch64 wheel builds (#3448) * ci: move linters into reusable workflow Move lint steps into its own workflow that can be called from other workflows (`on.workflow_dispatch`). * ci: enable builds for macos-arm64 and linux-aarch64 * ci: replace multibuild by qemu+cibuildwheel * ci: configure dependabot for github-actions This allows dependabot to create PRs to update third-party GitHub Actions. * ci: run wheel test only on cp311 and native archs 06 March 2023, 07:19:23 UTC
b4f0e0b Update Python version in docs (#3446) 03 March 2023, 12:03:30 UTC
63fce18 fix: conversion of cosine distance to cosine similarity is incorrect (#3441) Co-authored-by: Mona Shaheen <mona.shaheen@dynamicyield.com> 03 March 2023, 12:02:43 UTC
f260d1e check hs and negative. add tests (#3443) 17 February 2023, 14:45:00 UTC
f35faae Merge branch 'travis' into develop 21 December 2022, 00:41:18 UTC
23f5488 bump develop version 21 December 2022, 00:39:22 UTC
7ae8bdc correct DOCKER_TEST_IMAGE env var 19 December 2022, 11:23:09 UTC
f67ab87 continue using xenial to build 3.8 aarch64 wheels 19 December 2022, 11:17:45 UTC
f3b32fc let multibuild pick DOCKER_TEST_IMAGE as advised here: https://github.com/multi-build/multibuild/issues/492#issuecomment-1357307275 19 December 2022, 09:18:00 UTC
38e7273 get rid of visdom from .travis.yml it's causing the Py3.10 wheel graviton build to fail 19 December 2022, 07:57:10 UTC
6cfeb32 get rid of nmslib test dependency in travis.yml 19 December 2022, 07:31:52 UTC
9c1183d disable nmslib tests for Py3.10 and .11 19 December 2022, 07:23:10 UTC
5c5bd60 ensure importlib-metadata is up-to-date 18 December 2022, 12:44:24 UTC
e524047 adjust numpy and scipy versions for Py3.10 and 3.11 in travis.yml 18 December 2022, 11:46:39 UTC
8c96063 Merge branch 'master' into develop 17 December 2022, 16:03:10 UTC
adf393c Merge branch 'release-4.3.0' 17 December 2022, 16:03:04 UTC
8a56c5f bumped version to 4.3.0 17 December 2022, 16:02:14 UTC
8e673e4 update multibuild version 17 December 2022, 15:43:52 UTC
97a9793 Merge branch 'master' into develop 17 December 2022, 15:06:17 UTC
8de15b8 Merge branch 'release-4.3.0' 17 December 2022, 15:06:14 UTC
99c6c7b bump version in conf.py 17 December 2022, 15:06:05 UTC
1a5ee4b bumped version to 4.3.0 17 December 2022, 15:02:10 UTC
f571f33 updated changelog for next release (#3412) * updated changelog for next release * updated changelog 17 December 2022, 13:30:44 UTC
01758ce Merge pull request #3416 from RaRe-Technologies/numpy_hack fix numpy hack in setup.py 17 December 2022, 13:08:13 UTC
2fd3e89 fix numpy hack in setup.py fix #3225 16 December 2022, 13:28:32 UTC
db90a6a Fix backwards compatibility bug in Word2Vec (#3415) * add missing model file * update test 16 December 2022, 13:12:45 UTC
45d35ee Flsamodel (#3398) * added flsamodel * added FuzzyTM to dependencies * flake8 * less dependencies and not imported from FuzzyTM * added flsamodel Co-authored-by: Michael Penkov <m@penkov.dev> 12 December 2022, 13:46:40 UTC
50a9e6b Fixed issue when using add_vector with FastTextKeyedVectors (#3389) Since Gensim 4.0, 'key' in FastTextKeyedVectors always returns True by design. The proper way to check if a key already exists is with 'key' in FastTextKeyedVectors.key_to_index. Co-authored-by: dcarron <dcarron@idiap.ch> 12 December 2022, 13:40:02 UTC
ca8e4e8 refactor wheel building and testing workflow (#3410) handle windows separately - it's cleaner that way 07 December 2022, 03:09:49 UTC
e656d77 Fix bug that prevents loading old models (#3359) * fix loading old gensim model by new model * Revert "fix loading old gensim model by new model" This reverts commit d398e92291e3de4d6d2a73016150cd9aa2e52714. * fix loading old gensim model by new model * Update word2vec.py * Update word2vec.py * Update word2vec.py Co-authored-by: Michael Penkov <m@penkov.dev> 07 December 2022, 02:50:47 UTC
68fdbf9 clarify runtime expectations (#3381) * clarify runtime expectations * update doc files made stale by this PR Co-authored-by: Michael Penkov <m@penkov.dev> 07 December 2022, 02:50:22 UTC
985a3bd fix pip invocation Seems to consistently fail for MacOS Py3.11 06 December 2022, 14:13:30 UTC
a9b9714 Add support for Python 3.11 and drop support for Python 3.7 (#3402) * add support for python 3.11 * change to oldest version numpy and scipy that support 3.11 * add build for python3.11 * disable nmslib test for python 3.11 * fix formatting * fix formatting * change build for 3.8 * drop Py3.7 support and builds Co-authored-by: Michael Penkov <m@penkov.dev> 06 December 2022, 12:24:42 UTC
3f536fb Coherence Model - work on documents without tokens (#3406) 06 December 2022, 07:38:16 UTC
cc70a6c Fixed the broken link in readme.md (#3409) 06 December 2022, 00:07:26 UTC
3331b82 Implement numpy hack in setup.py to enable install under Poetry (#3363) * Closes #3362: Install issue poetry * get rid of redundant exception handling this code can never raise an exception, so we shouldn't be expecting them Co-authored-by: Michael Penkov <m@penkov.dev> 03 December 2022, 15:34:57 UTC
b6ea788 Switch to Cython language level 3 (#3344) * Switch to Cython language level 3 Python 2 is not supported by gensim, so switching to language level 3 should be fine. Silences a warning from Cython: FutureWarning: Cython directive 'language_level' not set, using 2 for now (Py2). This will change in a later release! * fix cythonization with language_level=3 Co-authored-by: Radim Řehůřek <radimrehurek@seznam.cz> 03 December 2022, 14:11:22 UTC
b17e6af fix deprecation warning from pytest (#3354) gensim/matutils.py:22: DeprecationWarning: Please use `triu` from the `scipy.linalg` namespace, the `scipy.linalg.special_matrices` namespace is deprecated. from scipy.linalg.special_matrices import triu 03 December 2022, 13:49:18 UTC
c93eb0b Fixed bug in loss computation for Word2Vec with hierarchical softmax (#3397) * fixed loss computation for sg, hs * fixed loss computation for cbow, hs 03 November 2022, 13:49:57 UTC
fdf40eb PERF: pyemd to POT for EMD computation in `wmdistance` (#3327) * PERF: switch from pyemd to POT for EMD computation * Adapt citations * Adapt dependency * Adapt tests * Update cache for gallery Co-authored-by: TLouf <loufthomas@gmail.com> 03 November 2022, 13:44:56 UTC
a435f24 Giving missing credit to Alex in docs (#3393) 02 October 2022, 15:05:30 UTC
5dbfb1e Implement Okapi BM25 variants in Gensim (#3304) * Add and unit-test gensim.models.bm25model.OkapiBM25Model * Document gensim.models.bm25 * Add and unit-test gensim.models.bm25model.{Lucene,Atire}BM25Model * Add normalize_{queries,documents} params to gensim.similarities.docsim * Add example of BM25 to gensim.similarities.docsim.SparseMatrixSimilarity * Refresh stale gallery cache * Update gensim/models/bm25model.py Co-authored-by: Radim Řehůřek <me@radimrehurek.com> Co-authored-by: Radim Řehůřek <me@radimrehurek.com> 08 September 2022, 00:50:27 UTC
ff3531b Copyedit and fix outdated statements (#3375) * print statements replaced with print function for Python 3 * gensim.utils.smart_open replaced with smart_open.open for newer gensim Co-authored-by: Michael Penkov <m@penkov.dev> 22 August 2022, 12:56:01 UTC
77c3a7f Replace np.multiply with np.square and copyedit in translation_matrix.py (#3374) * Replace np.multiply with np.square and copyedit * Copyedit translation_matrix.py Co-authored-by: Michael Penkov <m@penkov.dev> 22 August 2022, 12:54:57 UTC
7f314ee Fix typo in word2vec and KeyedVectors docstrings (#3365) * Add missing word in word2vec docstring * Fix docstring typo in KeyedVectors distances() word_or_vector, not word_or_vectors 22 August 2022, 12:48:56 UTC
2350e92 update release/hijack_pr.py 22 August 2022, 06:38:29 UTC
99e43c4 fix: fix error 'for loop initial declaration' (#3378) Co-authored-by: Michael Penkov <m@penkov.dev> 22 August 2022, 01:45:35 UTC
62669ae pin sphinx versions, add explicit gallery_top label (#3383) * pin sphinx versions, add explicit gallery_top label * make flake8 happy again * make flake8 happy again * make flake8 happy again * make flake8 happy again 21 August 2022, 15:15:45 UTC
ded7877 Disable the Gensim 3=>4 warning in docs (#3346) * disable 3=>4 docs notification * replace migration docs by sponsorship link 06 May 2022, 00:53:28 UTC
eeb7e86 Clean up references to `Morfessor`, `tox` and `gensim.models.wrappers` (#3345) * Drop reference to Morfessor It was dropped from usage in 2022. Fixes: commit acce8a21377d1f599f8b9ec56b10321c91d4109d * Drop references to tox Usage of tox was dropped in 2022. Where necessary, replace tox commands with the current equivalent. Fixes: commit 3ce81a44cc59b3d77edee043ee14050f611008df * Drop direct and indirect references to gensim.modules.wrapper(s) They were all removed in 2021. TestDtmModel escaped being noticed via test failures because all the tests it runs are skipped without the DTM_PATH environment variable being set. Fixes: commit a21d9cc768598640f38e4bd03d368f8712a9aa77 05 May 2022, 13:41:54 UTC
5a19dd0 Update Python module MANIFEST (#3343) Drop COPYING.LESSER as it was moved to COPYING in 2016. Drop ez_setup.py as it was removed in 2018. Switch doc2vec_inner.c to doc2vec_inner.cpp, as it has always been using language=c++. Suggested-by: setup.py build Fixes: commit beb04ea1f8f9c438b0a40aa4cbbd955ea065f84f Fixes: commit 2891861d77f9eff2dc703214c099240ef227b7da Fixes: commit 1aa11bbaa7beba9b3068cb2e2d04a500ebd31f30 05 May 2022, 06:45:02 UTC
400d906 Allow overriding the Cython version requirement (#3323) Use an environment variable for this since it is often easier to set in a build wrapper rather than trying to override command-line options in the right layer of a multi-layer build wrapper and it also requires a lot less code to do the override. This will be useful for using alpha versions of Cython or old versions of Cython provided by the distros or specific versions that fix certain bugs. 05 May 2022, 06:43:33 UTC
9b316bc disable py3.6 builds in travis.yml 02 May 2022, 12:31:03 UTC
8e9e09c bump up dev version 01 May 2022, 16:25:37 UTC
aac9a7a Merge branch 'master' into develop 01 May 2022, 07:57:02 UTC
acbba2f Merge branch 'release-4.2.0' 01 May 2022, 07:57:01 UTC
239792e update CHANGELOG 01 May 2022, 07:56:42 UTC
97cef99 bump timeout for wheel building step it seems to occasionally go over 10 minutes, e.g. https://github.com/RaRe-Technologies/gensim/runs/6231204549 29 April 2022, 23:22:46 UTC
a5fd65c bumped version to 4.2.0 29 April 2022, 16:04:54 UTC
20a16e2 Merge pull request #3338 from RaRe-Technologies/changelog_5.0 Changelog 5.0 29 April 2022, 15:52:07 UTC
d570cae rename release to 4.2.0 29 April 2022, 15:51:41 UTC
f5bc192 get rid of "bad issues" section in CHANGELOG 29 April 2022, 12:49:30 UTC
7d942b2 Update CHANGELOG.md 29 April 2022, 02:55:34 UTC
ed8122e fix FastText Docs (#3339) 29 April 2022, 02:54:36 UTC
97f63e0 more work on sorting issues and PRs 29 April 2022, 02:38:41 UTC
efd5b77 git add release/generate_changelog.py 26 April 2022, 13:07:54 UTC
2f09b77 started work on CHANGELOG for new release 26 April 2022, 13:07:48 UTC
742fb18 fixup 26 April 2022, 12:28:12 UTC
8483502 update changelog script 26 April 2022, 12:25:53 UTC
995ad8a update release/README.md 26 April 2022, 11:41:37 UTC
7cb443b Merge pull request #3197 from silviatti/fix_3181 Fix computation of topic coherence 25 April 2022, 08:20:57 UTC
533da75 Fix issue probably-meant-fstring found at https://codereview.doctor (#3332) 24 April 2022, 05:55:03 UTC
93ed2fb Fix broken external link for LDA implementation (#3190) * Fix broken external link * Fix external link to be more permanent * update documentation * rebuild on Linux dev machine Co-authored-by: Michael Penkov <m@penkov.dev> 22 April 2022, 11:53:39 UTC
9bbf12c Check gallery up to date as part of CI (#3329) * Check gallery up to date as part of CI Fix #2916 * tweak check_gallery.py * update CI workflow * update stale doc cache * update stale docs 22 April 2022, 10:11:47 UTC
5fe3bbc Merge pull request #3307 from RaRe-Technologies/fix_docs [MRG] Documentation fixes 19 April 2022, 18:36:19 UTC
2982148 add pytest info to index.html 19 April 2022, 16:30:11 UTC
be960a3 Merge remote-tracking branch 'origin/develop' into fix_docs 19 April 2022, 15:56:43 UTC
d872c02 retrained nb on Linux 19 April 2022, 06:20:27 UTC
91175dd Merge pull request #3309 from alhoo/feature/respect-encoding-when-reading-binary-keyedvectors Respect encoding when reading binary keyed vectors 15 April 2022, 14:27:16 UTC
3eaca71 Merge branch 'develop' into fix_docs 15 April 2022, 13:44:00 UTC
edaeee9 Added encoding='utf-8' keyword argument to TextDirectoryCorpus. Used smart_open to replace builtin open. (#3317) 15 April 2022, 12:26:04 UTC
cea25a3 Merge pull request #3301 from pabs3/cleanup-unused-files Remove unused Jupyter screenshots 15 April 2022, 12:24:33 UTC
17c6cf0 Merge branch 'develop' into feature/respect-encoding-when-reading-binary-keyedvectors 15 April 2022, 12:23:43 UTC
662e380 Update test_translation_matrix.py 15 April 2022, 12:23:04 UTC
998074e Update keyedvectors.py 15 April 2022, 12:21:36 UTC
de9ee81 Merge pull request #3299 from pabs3/path-to-python-binary Enable test_word2vec_stand_alone_script by using sys.executable for python 15 April 2022, 12:18:31 UTC
317e447 Merge pull request #3297 from pabs3/use-datapath Use gensim.test.utils datapath() to construct paths to the test data 15 April 2022, 12:16:38 UTC
05ca318 Merge pull request #3318 from RaRe-Technologies/fix_evaluate_pairs [MRG] Clean up evaluate_word_pairs code 02 April 2022, 12:49:29 UTC
4c9671a allow non-utf8 encoding in evaluate_word_pairs 02 April 2022, 08:34:20 UTC
dfe740f fixes #3315: clean up evaluate_word_pairs 02 April 2022, 08:28:05 UTC
168a9ef Use gensim.test.utils datapath() to construct paths to the test data Makes the code more readable and consistent with other tests. 02 April 2022, 06:59:40 UTC
3ad9841 Enable test_word2vec_stand_alone_script by using sys.executable for python 02 April 2022, 06:43:49 UTC
0b1b3aa Remove unused Jupyter screenshots Fixes: commit 34ee98b3ca29054c7c20139ff20922c2efdcf834 02 April 2022, 06:15:17 UTC
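
Commits de9ee81 and 3ad9841 enable test_word2vec_stand_alone_script "by using sys.executable for python". The sketch below illustrates that general technique only; it is not the gensim test. Launching a child process through `sys.executable` runs it under the same interpreter as the parent, instead of whichever `python` happens to come first on PATH. The helper name `run_with_current_python` and the inline snippets are hypothetical, chosen for illustration.

```python
import subprocess
import sys


def run_with_current_python(code):
    """Run a code snippet in a child process started from the current interpreter.

    Hypothetical helper for illustration; it is not part of gensim.
    """
    completed = subprocess.run(
        [sys.executable, "-c", code],  # path of the interpreter running this script
        capture_output=True,           # collect stdout/stderr instead of inheriting them
        text=True,                     # decode the output to str
        check=True,                    # raise CalledProcessError on a non-zero exit status
    )
    return completed.stdout.strip()


if __name__ == "__main__":
    # The child reports its own major.minor version, which should match the parent's.
    child_version = run_with_current_python(
        "import sys; print('%d.%d' % sys.version_info[:2])"
    )
    parent_version = "%d.%d" % sys.version_info[:2]
    print(child_version == parent_version)
```

A hard-coded `"python"` in the argument list would instead depend on the caller's PATH, which may point to a different interpreter or to none at all (for example on systems that only ship `python3`).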