https://github.com/SoftwareHeritage/swh-storage

Revision Author Date Message Commit Date
63858e5 Add method to storage for searching origins This adds method 'origin_search' to storage enabling to search for origins whose urls contain a string pattern or match a regular expression Related T848 05 December 2017, 10:47:54 UTC
a4abad0 schemata: Add missing __init__.py for build package purposes 24 November 2017, 10:13:48 UTC
c33ad51 sql/upgrades: add 112 -> 113 Related T851 17 November 2017, 14:05:53 UTC
8311a4f swh.storage: Open indexer_configuration_add endpoint Related T851 17 November 2017, 13:46:39 UTC
5a0ddd6 swh.storage.tests: Fix broken content_mimetype tests Due to a new value in db for the same tool. Related 35253443fe0bd792e84f3ce939ee20e7eed52f9b Related T849 17 November 2017, 10:28:49 UTC
3525344 swh-data: Add new content mimetype's indexer configuration Related T849 15 November 2017, 14:36:19 UTC
775a1b8 add __init__.py for tests 10 November 2017, 13:16:40 UTC
c5740fd origin_visit_get: make order repeatable 10 November 2017, 13:16:40 UTC
c4ec821 Add missing visit key for occurrence_add documentation 10 November 2017, 13:16:40 UTC
9b29324 Make unique indices actually unique and vice versa 09 November 2017, 17:11:11 UTC
f87a530 Add 110->111 SQL upgrade script 06 November 2017, 12:46:56 UTC
e7a7b56 Remove unused content provenance cache tables 03 November 2017, 15:31:36 UTC
75f4edd sql/upgrades/110: add metadata tables to SQL schema 03 November 2017, 11:41:28 UTC
477bd44 docs: add absolute anchor to documentation index 02 November 2017, 10:09:27 UTC
d13751a Refactor entry points to origin_metadata table with get_by function deleting entry points get_all and get_by_provider creating one unique entry point origin_metadata_get_by adding entry point provider_get_by name+url 24 October 2017, 12:30:01 UTC
48fc6f7 Refactor origin_metadata and adding provider table and logic added documentation and new version for schema 23 October 2017, 14:33:26 UTC
76c5326 Create origin_metadata tables and logic Summary: adding add and get entry points and tests for origin_metadata test for origin_metadata add and get functions pass Added entry points get_all and get_by_provenance for origin_metadata Closes T737 References P168 Test Plan: tests for add, get, get_all and get_by_provenance Reviewers: ardumont, #reviewers! Maniphest Tasks: T737 Differential Revision: https://forge.softwareheritage.org/D254 23 October 2017, 14:33:26 UTC
17882a9 test for origin_metadata add and get functions pass 23 October 2017, 14:33:26 UTC
3b425af Create origin_metadata tables and logic adding add and get entry points and tests for origin_metadata 23 October 2017, 14:33:26 UTC
7e2e9a9 docs: integrate postgres DB schema in dev docs 21 October 2017, 15:19:40 UTC
cc8a618 sql doc: also generate SVG version of the schema chart 21 October 2017, 15:04:23 UTC
6ecf3d6 doc: convert archiver blueprint to rst and link it from doc index 21 October 2017, 14:47:03 UTC
8915caf db schema chart: add metadata and statistic clusters to place stray tables where they belong 21 October 2017, 13:54:23 UTC
89b9ce4 db schema chart: use more current titles for some clusters 21 October 2017, 13:54:13 UTC
d35e739 Make swh.storage.schemata work on SQLalchemy 1.0 12 October 2017, 17:51:00 UTC
91e1e5e Drop doctests from build as they mess up flask 12 October 2017, 16:41:33 UTC
ecdb993 Cleanup packaging 12 October 2017, 15:16:59 UTC
92e46df Move kafka_python to extra requirements 11 October 2017, 16:51:37 UTC
f56e812 swh.storage.converters: Fix typo in docstring 11 October 2017, 15:42:53 UTC
b8f5018 Flask and doctest are unhappy with each other See also https://github.com/pallets/flask/issues/1680 11 October 2017, 15:39:11 UTC
1964eaf Cleanup kafka-related requirements 11 October 2017, 15:33:01 UTC
13a62b1 swh.storage.listener: drop cyclic dependency on swh.journal 11 October 2017, 15:30:25 UTC
d910b67 add python3-kafka to build-depends 11 October 2017, 15:25:06 UTC
6cf4fc9 Cleanup tests during debian package build 11 October 2017, 15:22:46 UTC
e36a36b Bump dependency on swh.model 11 October 2017, 15:18:34 UTC
2022da2 schemata.distribution: update for reuse by the Debian loader 10 October 2017, 13:59:31 UTC
6a97d92 test_storage: update tests to use DentryPerms instead of raw values 09 October 2017, 10:23:40 UTC
c1bdce3 db: properly handle IntEnums (e.g. DentryPerms) 09 October 2017, 10:15:05 UTC
e67765b schemata: add a new package for ancillary schemata This package is inaugurated by the distribution schemata 14 September 2017, 15:30:39 UTC
82ca9bb debian/control: wrap-and-sort 14 September 2017, 15:29:18 UTC
118a962 sanitize docstrings for sphinx 07 September 2017, 08:21:34 UTC
e14e72d sql: add origin_visit to swh_stat_counters() 04 September 2017, 18:17:03 UTC
4fbc237 sql/upgrades: add 107 to 108 script 01 September 2017, 12:14:21 UTC
20f47dd sql/swh-func: keep a cache of exact object counts as a table Close T719 (cc @rdicosmo) 01 September 2017, 08:18:42 UTC
e1ec2d8 test_storage: move tests that were inadvertently "local-only" to the base class 01 September 2017, 08:18:42 UTC
45b3426 docs/: add sphinx apidoc generation skeleton change cherry picked from python module template commit 71b117ba0cf9f1251b1cac26d0994df03a4c787d 30 August 2017, 10:26:04 UTC
464d5bc storage_testing: leverage reset_db_tables from db_testing 03 August 2017, 16:16:16 UTC
9d416d3 Added revision_metadata table and methods into storage Summary: - testing missing, add and get methods on revision_metadata Reviewers: ardumont, #reviewers! Differential Revision: https://forge.softwareheritage.org/D235 28 July 2017, 10:08:27 UTC
806f511 tests: move teardown reset_tables logic to storage_testing 19 July 2017, 14:56:27 UTC
db4bee4 storage_testing: allow reuse of storage test config in subclasses 17 July 2017, 15:03:11 UTC
6d063ac tests: use new DbTestFixture, add new storage test fixture 17 July 2017, 12:18:12 UTC
cd9513a tests: server: don't print run_app line on startup 17 July 2017, 12:18:01 UTC
af4d3da Merge branch 'content_metadata_logic' 13 July 2017, 09:18:38 UTC
bb17076 Refactor to keep only content_metadata logic using the metadata_indexer with NPM tool added tests for add with duplicate or without and tests for missing content_metadata deleted redundant conversion for cdb_copy 12 July 2017, 08:32:02 UTC
8465e7b Added content_metadata logic to the storage Summary: Keeping the indexed content_metadata in the storage with a content_metadata table. Possibility to add with and without duplications and getting the content_metadata with tool information. works only for NPM context also because tool configuration. TODO: - add json schema in json/ (should we keep CodeMeta schema as is or put PURl to it for the translated_metadata_property?) Test Plan: - content_metadata_missing - content_metadata_add skipping duplicate - content_metadata_add dropping duplicate - content_metadata_get - db_to_metadata conveter Reviewers: ardumont Differential Revision: https://forge.softwareheritage.org/D219 07 July 2017, 13:27:14 UTC
b8c480f Remove more references to the old swh.storage.archiver module 30 June 2017, 12:44:01 UTC
fb39e1c Move archiver to its own package 26 June 2017, 12:33:29 UTC
0a86965 archiver: remove useless test method 22 June 2017, 15:24:39 UTC
4a40af7 archiver: update tests 22 June 2017, 15:23:13 UTC
5e34b84 archiver.director: properly initialize start_id 22 June 2017, 15:17:22 UTC
afce552 archiver.director: add throttling for delayed workers 19 June 2017, 16:59:26 UTC
93283d3 archiver.worker: add filtering for sources This allows us to avoid copying data from sources where we would be charged for accesses. 19 June 2017, 14:22:19 UTC
96697de archiver.worker: adjust ongoing behavior for new backend API The backend API now returns proper datetime objects rather than plain timestamps. 19 June 2017, 14:07:36 UTC
a2ebef4 archiver.director: add a start-id option to allow parallelism and restarts 19 June 2017, 14:07:07 UTC
55d7b73 archiver.copier: don't eat exceptions when logging them 19 June 2017, 14:06:29 UTC
9b2d191 requirements: remove python-fastimport (migrated from the vault) 14 June 2017, 13:51:55 UTC
9c8455e bump requirements for new version of scheduler 12 June 2017, 13:38:13 UTC
c5be4dc tasks: update to new swh.scheduler.task api 12 June 2017, 13:35:07 UTC
cb855e0 Merge tag 'v0.0.85' v0.0.85 - Improve license endpoint's unknown license policy 06 June 2017, 16:42:01 UTC
0ae9592 swh.storage: Clean up dead code Related T721 06 June 2017, 16:36:38 UTC
cbbce73 archiver: minimal update for new schema 06 June 2017, 16:14:58 UTC
0965a85 swh.storage: Clean up dead code Related T721 06 June 2017, 15:52:02 UTC
0a7fdb0 sql/upgrades: create db upgrade 106->107 Related T721 06 June 2017, 13:33:32 UTC
80944cb archiver: Make package work for actual objstorage dependency version The head version of the objstorage migrated. Thus the archiver tests broke locally. The test code was fixed to make the local tests pass with the actual head version of the objstorage. But we did not deploy the new objstorage yet, thus the archiver tests break during the packaging phase (of any swh.storage.* package). This commit is a temporary fix to make everyone happy (since we do need to package other storage version for example). This needs to be reverted as soon as we bump the dependency to the new objstorage (> 0.17). Related f681f83 Related b4dfe06 Related 08827dc Related D205 06 June 2017, 13:29:23 UTC
0cf935e swh.storage: Improve license endpoint's unknown license policy Prior to this commit, the content_fossology_license_add endpoint was not symmetric with other endpoints since it returned the list of licenses it was unable to deal with. Now, it simply adds those unknown licenses in the fossology_license table. And it no longer returns anything (as other endpoints). Related T721 06 June 2017, 13:28:32 UTC
f681f83 swh.storage.archiver: Fix tests Related D205 02 June 2017, 14:20:45 UTC
08827dc Revert "swh.storage.archiver: Fix tests" for packaging reason This reverts commit b4dfe06f9f06bab2ce4b6f44ca9c20dec2e7f28f. 02 June 2017, 14:16:05 UTC
4bcd830 sql/upgrades: create db upgrade 105->106 Related T722 02 June 2017, 12:43:42 UTC
5ff3979 swh.storage: Update db schema to new version Related T722 01 June 2017, 14:55:59 UTC
a95a3c4 Add new entry for language indexer tool Related T722 01 June 2017, 14:29:57 UTC
8f7a5c5 swh.storage: fossology license endpoints: use idx_configuration_id Related T722 01 June 2017, 13:31:03 UTC
7738a76 swh.storage: indexer endpoints: Fix filtering missing data issue The filtering on missing data was only using the content's sha1 and not the indexer configuration id. Nothing to fix in production though. We do not use yet the filtering nor have multiple tools for the same indexer though. Related T722 01 June 2017, 12:22:35 UTC
f18e2df swh.storage: ctags endpoints: use indexer_configuration_id Related T722 01 June 2017, 12:12:08 UTC
a8ce0d9 swh.storage.tests: Refactor reading the indexer tools Related T722 01 June 2017, 11:22:03 UTC
f3600de swh.storage: language endpoints: use indexer_configuration_id Related T722 01 June 2017, 10:32:22 UTC
45a923b swh.storage: mimetype endpoints: use indexer_configuration_id Related T722 01 June 2017, 10:16:10 UTC
b4dfe06 swh.storage.archiver: Fix tests Related D205 01 June 2017, 08:55:48 UTC
85fdb1c swh-storage: Add indexer_configuration_get endpoint 31 May 2017, 16:26:49 UTC
8e97754 sql/swh-func: use pg_stat_user_tables for more accurate tuple statistics Close T719 30 May 2017, 11:09:33 UTC
55495ec vault: remove from storage repository 09 May 2017, 13:17:37 UTC
76ed458 swh.storage: Use aggregate key to filter on missing skipped contents Related T703 Closes D200 31 March 2017, 10:26:44 UTC
0b5e938 sql/upgrades: add 103 -> 104 Adapt according to review on D200. Use sql/bin/db-upgrade to generate the upgrade script. Related T703 30 March 2017, 12:13:00 UTC
abeb873 Revert "swh.storage: Use upsert scheme on (skipped_)content_add function" This reverts commit 8dddc91a7860f9db04026132c874d809a581f5db. 27 March 2017, 12:21:57 UTC
8dddc91 swh.storage: Use upsert scheme on (skipped_)content_add function Related T703 27 March 2017, 12:21:23 UTC
ada557c swh.storage: Extract key variable for insertion Related T703 27 March 2017, 12:21:23 UTC
4ff4307 swh.storage: Add checksum blake2s256 Related T703 24 March 2017, 23:07:51 UTC
c94ba89 d/control: Add python3-swh.journal dependency with version 24 March 2017, 11:57:18 UTC
3fe8924 swh.storage.listener: Use swh.journal.serializers.key_to_kafka function Related T529 23 March 2017, 14:49:40 UTC
47cb71b swh.storage.listener: Send notify data as dict of composite primary key All data are sent as dictionary of composite primary key for each object type. This also aligns the content type data with the skipped_content type data (sending sha1, sha1_git, sha256). Related T529 23 March 2017, 14:06:05 UTC
9be297b vault: gitfast: add documentation, remove more useless directory explorations 22 March 2017, 13:02:02 UTC