https://github.com/google/sentencepiece

sort by:
Revision Author Date Message Commit Date
7dc9a76 Merge pull request #995 from google/dependabot/pip/dot-github/workflows/requirements/build-time-deps-6d60a12ad5 Bump the build-time-deps group in /.github/workflows/requirements with 3 updates 05 April 2024, 13:12:34 UTC
404882d Bump the build-time-deps group Bumps the build-time-deps group in /.github/workflows/requirements with 3 updates: [cibuildwheel](https://github.com/pypa/cibuildwheel), [wheel](https://github.com/pypa/wheel) and [setuptools](https://github.com/pypa/setuptools). Updates `cibuildwheel` from 2.16.5 to 2.17.0 - [Release notes](https://github.com/pypa/cibuildwheel/releases) - [Changelog](https://github.com/pypa/cibuildwheel/blob/main/docs/changelog.md) - [Commits](https://github.com/pypa/cibuildwheel/compare/v2.16.5...v2.17) Updates `wheel` from 0.42.0 to 0.43.0 - [Release notes](https://github.com/pypa/wheel/releases) - [Changelog](https://github.com/pypa/wheel/blob/main/docs/news.rst) - [Commits](https://github.com/pypa/wheel/compare/0.42.0...0.43.0) Updates `setuptools` from 69.1.1 to 69.2.0 - [Release notes](https://github.com/pypa/setuptools/releases) - [Changelog](https://github.com/pypa/setuptools/blob/main/NEWS.rst) - [Commits](https://github.com/pypa/setuptools/compare/v69.1.1...v69.2.0) --- updated-dependencies: - dependency-name: cibuildwheel dependency-type: direct:production update-type: version-update:semver-minor dependency-group: build-time-deps - dependency-name: wheel dependency-type: direct:production update-type: version-update:semver-minor dependency-group: build-time-deps - dependency-name: setuptools dependency-type: direct:production update-type: version-update:semver-minor dependency-group: build-time-deps ... Signed-off-by: dependabot[bot] <support@github.com> 01 April 2024, 15:29:14 UTC
4d6a1f4 Merge pull request #985 from google/dependabot/pip/dot-github/workflows/requirements/build-time-deps-0b1593cb10 Bump the build-time-deps group in /.github/workflows/requirements with 3 updates 03 March 2024, 00:53:10 UTC
1d7ce29 Bump the build-time-deps group Bumps the build-time-deps group in /.github/workflows/requirements with 3 updates: [twine](https://github.com/pypa/twine), [pip](https://github.com/pypa/pip) and [setuptools](https://github.com/pypa/setuptools). Updates `twine` from 4.0.2 to 5.0.0 - [Release notes](https://github.com/pypa/twine/releases) - [Changelog](https://github.com/pypa/twine/blob/main/docs/changelog.rst) - [Commits](https://github.com/pypa/twine/compare/4.0.2...5.0.0) Updates `pip` from 23.3.2 to 24.0 - [Changelog](https://github.com/pypa/pip/blob/main/NEWS.rst) - [Commits](https://github.com/pypa/pip/compare/23.3.2...24.0) Updates `setuptools` from 69.0.3 to 69.1.1 - [Release notes](https://github.com/pypa/setuptools/releases) - [Changelog](https://github.com/pypa/setuptools/blob/main/NEWS.rst) - [Commits](https://github.com/pypa/setuptools/compare/v69.0.3...v69.1.1) --- updated-dependencies: - dependency-name: twine dependency-type: direct:production update-type: version-update:semver-major dependency-group: build-time-deps - dependency-name: pip dependency-type: direct:production update-type: version-update:semver-major dependency-group: build-time-deps - dependency-name: setuptools dependency-type: direct:production update-type: version-update:semver-minor dependency-group: build-time-deps ... Signed-off-by: dependabot[bot] <support@github.com> 01 March 2024, 15:09:24 UTC
725952d makes the return value of --help same as official abseil library 26 February 2024, 13:01:58 UTC
52a7f15 increment version v0.2.1 26 February 2024, 05:56:28 UTC
9082653 use ::testing::TempDir/SrcDir 26 February 2024, 05:30:23 UTC
3b2ea62 fix build error 25 February 2024, 16:19:08 UTC
0ba5069 add nfc, nfd normalization tsv files 25 February 2024, 15:47:08 UTC
a216bd0 Merge pull request #981 from google/dependabot/pip/dot-github/workflows/requirements/cryptography-42.0.4 Bump cryptography from 42.0.2 to 42.0.4 in /.github/workflows/requirements 22 February 2024, 07:56:11 UTC
c7b4cd5 Bump cryptography in /.github/workflows/requirements Bumps [cryptography](https://github.com/pyca/cryptography) from 42.0.2 to 42.0.4. - [Changelog](https://github.com/pyca/cryptography/blob/main/CHANGELOG.rst) - [Commits](https://github.com/pyca/cryptography/compare/42.0.2...42.0.4) --- updated-dependencies: - dependency-name: cryptography dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com> 21 February 2024, 20:45:34 UTC
1d91514 Merge pull request #979 from h-vetinari/libdir move setting of default CMAKE_INSTALL_{BIN,INCLUDE,LIB}DIR before first use 21 February 2024, 05:26:56 UTC
b2863fc also change CMAKE_INSTALL_INCDIR->CMAKE_INSTALL_INCLUDEDIR in src/CMakeLists.txt 21 February 2024, 01:08:41 UTC
2701208 unify spelling of CMAKE_INSTALL_INCLUDEDIR Following GNUInstallDirs defaults, see also CMake docs: https://cmake.org/cmake/help/latest/command/install.html 20 February 2024, 10:20:08 UTC
26f9f58 move setting of default CMAKE_INSTALL_{BIN,INCLUDE,LIB}DIR before first use 20 February 2024, 10:13:20 UTC
17d7580 suppress warnings in testharnress 19 February 2024, 08:06:52 UTC
4a3cd1c Merge pull request #975 from google/dependabot/pip/dot-github/workflows/requirements/cryptography-42.0.2 Bump cryptography from 42.0.0 to 42.0.2 in /.github/workflows/requirements 19 February 2024, 04:52:25 UTC
670d2e7 Bump cryptography in /.github/workflows/requirements Bumps [cryptography](https://github.com/pyca/cryptography) from 42.0.0 to 42.0.2. - [Changelog](https://github.com/pyca/cryptography/blob/main/CHANGELOG.rst) - [Commits](https://github.com/pyca/cryptography/compare/42.0.0...42.0.2) --- updated-dependencies: - dependency-name: cryptography dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com> 17 February 2024, 00:55:36 UTC
2b8772a Merge pull request #974 from xunkai55/patch-1 Fix a typo in api.md 14 February 2024, 12:50:54 UTC
ffd8e9e Fix a typo in api.md 12 February 2024, 10:57:30 UTC
03243af Merge pull request #970 from google/dependabot/pip/dot-github/workflows/requirements/build-time-deps-bd99d7bc59 Bump the build-time-deps group in /.github/workflows/requirements with 1 update 06 February 2024, 09:12:26 UTC
d0fe405 Merge pull request #972 from google/dependabot/pip/dot-github/workflows/requirements/cryptography-42.0.0 Bump cryptography from 41.0.7 to 42.0.0 in /.github/workflows/requirements 06 February 2024, 09:12:06 UTC
a8a618f Bump cryptography in /.github/workflows/requirements Bumps [cryptography](https://github.com/pyca/cryptography) from 41.0.7 to 42.0.0. - [Changelog](https://github.com/pyca/cryptography/blob/main/CHANGELOG.rst) - [Commits](https://github.com/pyca/cryptography/compare/41.0.7...42.0.0) --- updated-dependencies: - dependency-name: cryptography dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com> 06 February 2024, 03:24:50 UTC
bbbe548 Bump the build-time-deps group Bumps the build-time-deps group in /.github/workflows/requirements with 1 update: [cibuildwheel](https://github.com/pypa/cibuildwheel). Updates `cibuildwheel` from 2.16.2 to 2.16.5 - [Release notes](https://github.com/pypa/cibuildwheel/releases) - [Changelog](https://github.com/pypa/cibuildwheel/blob/main/docs/changelog.md) - [Commits](https://github.com/pypa/cibuildwheel/compare/v2.16.2...v2.16.5) --- updated-dependencies: - dependency-name: cibuildwheel dependency-type: direct:production update-type: version-update:semver-patch dependency-group: build-time-deps ... Signed-off-by: dependabot[bot] <support@github.com> 01 February 2024, 16:00:54 UTC
53de765 allows to load precomputed seed sentencepieces for unigram from a file. 28 January 2024, 16:17:08 UTC
0fe7add fixed crash bug in unigram model training 28 January 2024, 05:19:00 UTC
41c4b7f returns unicode characetr offsets in normalize method 22 January 2024, 07:19:04 UTC
6b468a0 support bytes output in decode method 20 January 2024, 08:16:17 UTC
7b9ee4c Merge pull request #962 from Halmoni100/external-absl-2 Additional external absl fixes 18 January 2024, 06:31:43 UTC
0ea22c0 Fix build for external absl 18 January 2024, 00:33:51 UTC
a34fb40 Add idempotency to external absl mod 18 January 2024, 00:32:48 UTC
4ce471c Update cross_build.yml runs apt-get update to update the local index 16 January 2024, 16:02:45 UTC
de1747b added functionality to override normalizer spec 16 January 2024, 04:06:05 UTC
0018af1 better exteranl abseil and protobuf support 16 January 2024, 02:15:46 UTC
acf8ebe build universal osx binary 14 January 2024, 01:51:01 UTC
ed76ecc add more advanced SentencePieceNormalizer class 13 January 2024, 17:19:50 UTC
f5c7363 remove absl/random and absl/memory, add absl::btree_map 07 January 2024, 10:48:48 UTC
adf9e81 move SharedBitGen to random namespace 06 January 2024, 15:56:51 UTC
49afc4c Merge pull request #959 from google/revert-957-dependabot/github_actions/github-actions-bcafe21e81 Revert "Bump the github-actions group with 2 updates" 06 January 2024, 15:24:40 UTC
fb490c5 Revert "Bump the github-actions group with 2 updates" 06 January 2024, 13:55:18 UTC
06eee09 Added Normalization API 04 January 2024, 09:04:20 UTC
e7b5260 Merge pull request #955 from pnacht/pinned-pip Hash-pin Python dependencies in CI/CD release workflows 03 January 2024, 03:29:39 UTC
2909148 Merge pull request #957 from google/dependabot/github_actions/github-actions-bcafe21e81 Bump the github-actions group with 2 updates 03 January 2024, 03:29:14 UTC
b888bd7 Bump the github-actions group with 2 updates Bumps the github-actions group with 2 updates: [actions/upload-artifact](https://github.com/actions/upload-artifact) and [actions/setup-python](https://github.com/actions/setup-python). Updates `actions/upload-artifact` from 3.1.3 to 4.0.0 - [Release notes](https://github.com/actions/upload-artifact/releases) - [Commits](https://github.com/actions/upload-artifact/compare/a8a3f3ad30e3422c9c7b888a15615d19a852ae32...c7d193f32edcb7bfad88892161225aeda64e9392) Updates `actions/setup-python` from 4.7.1 to 5.0.0 - [Release notes](https://github.com/actions/setup-python/releases) - [Commits](https://github.com/actions/setup-python/compare/65d7f2d534ac1bc67fcd62888c5f4f3d2cb2b236...0a5c61591373683505ea898e09a3ea4f39ef2b9c) --- updated-dependencies: - dependency-name: actions/upload-artifact dependency-type: direct:production update-type: version-update:semver-major dependency-group: github-actions - dependency-name: actions/setup-python dependency-type: direct:production update-type: version-update:semver-major dependency-group: github-actions ... Signed-off-by: dependabot[bot] <support@github.com> 01 January 2024, 15:40:30 UTC
eecae39 Set up dependabot to keep them updated Signed-off-by: Pedro Kaj Kjellerup Nacht <pnacht@google.com> 27 December 2023, 23:33:26 UTC
e31c147 Hash-pin CI/CD pip dependencies Signed-off-by: Pedro Kaj Kjellerup Nacht <pnacht@google.com> 27 December 2023, 23:33:26 UTC
3c2fc66 Update common.h 23 December 2023, 17:00:56 UTC
96aabae add set_min_log_level function to python to change the loglevel from python wrapper. 23 December 2023, 09:28:40 UTC
bd3925a Merge pull request #936 from google/dependabot/github_actions/github-actions-a69c9d1746 Bump the github-actions group with 1 update 23 December 2023, 08:17:09 UTC
b937146 Merge pull request #938 from pnacht/token-permissions Set minimal permissions for GitHub workflows 23 December 2023, 08:16:39 UTC
a5262b5 Merge pull request #947 from chenqy4933/master fix(cmake): fix android build error 23 December 2023, 08:16:25 UTC
6b32c01 merges internal changes to github exteranl repos 23 December 2023, 07:20:11 UTC
fc1584e fix(cmake): fix android build error 05 December 2023, 02:17:59 UTC
3ced0ec wheel.yml: Update cibuildwheel to pass in MacOS Signed-off-by: Pedro Kaj Kjellerup Nacht <pnacht@google.com> 28 November 2023, 13:46:20 UTC
ad56e74 Set minimal permissions for workflows Signed-off-by: Pedro Kaj Kjellerup Nacht <pnacht@google.com> 17 November 2023, 21:38:13 UTC
14f5e57 Bump the github-actions group with 1 update Bumps the github-actions group with 1 update: [slsa-framework/slsa-github-generator](https://github.com/slsa-framework/slsa-github-generator). - [Release notes](https://github.com/slsa-framework/slsa-github-generator/releases) - [Changelog](https://github.com/slsa-framework/slsa-github-generator/blob/main/CHANGELOG.md) - [Commits](https://github.com/slsa-framework/slsa-github-generator/compare/v1.2.0...v1.9.0) --- updated-dependencies: - dependency-name: slsa-framework/slsa-github-generator dependency-type: direct:production update-type: version-update:semver-minor dependency-group: github-actions ... Signed-off-by: dependabot[bot] <support@github.com> 17 November 2023, 03:49:37 UTC
022f8c3 Merge pull request #934 from pnacht/pinned-gha Hash-pin GitHub Actions, add dependabot 17 November 2023, 03:49:04 UTC
02ea8ed Add dependabot to monitor GHA Signed-off-by: Pedro Kaj Kjellerup Nacht <pnacht@google.com> 13 November 2023, 20:50:39 UTC
eaf71c2 Hash-pin GHA Signed-off-by: Pedro Kaj Kjellerup Nacht <pnacht@google.com> 13 November 2023, 20:49:59 UTC
8cbdf13 Improves the thread utilization in batch encoding/decoding 05 August 2023, 09:01:02 UTC
635fe84 Upgrade the sentencepiece_model_pb2.py and sentencepiece.py 01 July 2023, 02:38:33 UTC
7b694e4 Merge pull request #867 from vmarkovtsev/patch-1 Fix nasty bug in BPE position encoding 25 May 2023, 07:31:30 UTC
cb22883 Merge pull request #870 from ryandesign/ryandesign-protobuf-lite Fix overlinking with protobuf 24 May 2023, 16:27:11 UTC
4183597 Fix pkg-config file to avoid overlinking When configured to use external protobuf, remove -lprotobuf-lite from pkg-config file's Libs section. The sentencepiece libraries link with libprotobuf-lite; it is not necessary for users of the sentencepiece libraries to link with protobuf unless they are using static linking so this should have been in Libs.private. Use Requires.private so that we use protobuf's pkg-config file to get the flags. 21 May 2023, 18:38:08 UTC
e081c67 Remove empty placeholders in pkg-config file Remove @pkgconfiglibs@ and @pkgconfigcflags@ which were being replaced with the empty string because these variables were never set anywhere. 21 May 2023, 18:31:47 UTC
3805cbb Fix nasty bug in BPE position encoding Example: sid=1313136 left=37596 right=37597 produce 18446744071878513373, which is incorrect. 18 May 2023, 17:39:30 UTC
f2219b5 prepare for 0.2.00 14 May 2023, 14:35:14 UTC
0b344d0 Added arm architecture 14 May 2023, 11:21:38 UTC
2f66fbf Added arm architecture 14 May 2023, 11:17:25 UTC
6693e7e Fixes test workpath 14 May 2023, 10:57:55 UTC
b857ba9 Split build and test 14 May 2023, 10:51:31 UTC
fad8ae6 Added fail first flag 14 May 2023, 10:36:13 UTC
f2fcd85 Fixes cross build yaml 14 May 2023, 10:29:06 UTC
6c901b0 Fixes build test errors in big-endian machines 14 May 2023, 09:54:52 UTC
17f9c6b Fixes build test errors in big-endian machines 14 May 2023, 09:53:35 UTC
827591a Fixes build test errors in big-endian machines 14 May 2023, 09:08:39 UTC
3863f76 increases the max number of threads 30 April 2023, 17:37:15 UTC
25b64fc Fix the test error on windows 28 April 2023, 06:20:50 UTC
bb0b610 Fix the ULM training bugs 27 April 2023, 17:32:57 UTC
ba44ab1 Fix bugs in the handling of duplicated bigrams 24 April 2023, 07:25:10 UTC
69d34c7 prepare for v0.1.99 15 April 2023, 06:33:01 UTC
d9a2b21 Fix bugs the seed score computation. 15 April 2023, 05:59:52 UTC
518c57c build wheel from sdist for testing 12 April 2023, 07:41:58 UTC
fabfe30 build wheel from sdist for testing 12 April 2023, 07:35:45 UTC
d6e597b build wheel from sdist for testing 12 April 2023, 07:24:31 UTC
f2884a1 test loacl sdist build on github actions 12 April 2023, 04:42:27 UTC
609a2b7 test loacl sdist build on github actions 12 April 2023, 02:17:27 UTC
8fd5c6b test loacl sdist build on github actions 12 April 2023, 01:43:39 UTC
e07ebf7 support pretokenization in BPE mode. 11 April 2023, 06:48:08 UTC
119e58d Fixes include path when using external protobuf 10 April 2023, 10:15:46 UTC
2b07137 fixes IS_BIGENDIAN macro places 10 April 2023, 02:28:20 UTC
e58bb68 add pretokenization_delimiter options. Initialize seed pieces more accurately. 10 April 2023, 02:11:37 UTC
6c9fd79 Merge pull request #845 from chris-ha458/patch-1 Update sentencepiece_python_module_example.ipynb 09 April 2023, 08:13:58 UTC
9b53e21 Update sentencepiece_python_module_example.ipynb fix typo 08 April 2023, 14:26:13 UTC
c032c26 automatically detect -latomic linker option 05 April 2023, 00:01:15 UTC
5489c0a add -latomic in static linking 04 April 2023, 17:26:29 UTC
c945229 updated set-output commands 04 April 2023, 15:51:06 UTC
799c025 creates sdist with build_sdist.sh 04 April 2023, 05:05:44 UTC
59d84ba Ubuntu 18.04 to 20.04 migration 04 April 2023, 03:53:27 UTC
f54d8ba includes the sentencepiece source files in python source package 04 April 2023, 03:15:11 UTC
back to top