https://github.com/SoftwareHeritage/swh-storage

sort by:
Revision Author Date Message Commit Date
329f4cf SQL: nicer API for swh_content_find_* functions the functions no longer fail with internal errors (due to "insert into ... strict"), but rather return NULL in those cases, because no match has been found. 30 September 2015, 10:14:29 UTC
7fe460f bump SQL schema version to 17 include upgrade script from 16 30 September 2015, 09:04:30 UTC
be72b19 sql-func refactoring: single dir*_entry_add function The function is parametric in the type of entries to be added (file/dir/rev). Get rid of the previous 3 (99% overlapping) functions swh_directory_{file,rev,dir}_entry_add functions. Make storage.py use the new function. 30 September 2015, 09:04:30 UTC
4612bc6 Refactor - pep8 compliance - Remove unused import 30 September 2015, 08:57:57 UTC
c956386 Improve output response for find occurrence api 30 September 2015, 08:56:54 UTC
141d392 Implement content_find_occurrence up to the db 30 September 2015, 08:43:30 UTC
3e2fe23 Refactor - db.py returns None or the content found The triplet of None should be sandboxed to db.py 30 September 2015, 08:43:30 UTC
91debb1 Refactor - Permit reuse of the content_find api 30 September 2015, 08:43:30 UTC
6944ab8 Refactor - Return result directory Remove the needless yield 30 September 2015, 08:42:51 UTC
311de1b Refactor - Reuse 30 September 2015, 08:42:51 UTC
df4bdfa Open content-find-occurrence 30 September 2015, 08:42:51 UTC
bdfc021 test_storage: add collision detection test 30 September 2015, 07:40:40 UTC
9aa18e4 test_api_client: use absolute import to allow rm __init__.py 30 September 2015, 07:25:06 UTC
b4e4b20 SQL swh_content_missing: build result incrementally by uniting on contents that appear to be missing when looking at a single checksum algorithm. This is semantically equivalent to the previous approach, but allows to exploit single-column indexes; which are non-redundant, smaller, and hence more likely to be used. This is the "metro ligne 6" approach. 30 September 2015, 07:18:22 UTC
7a33494 db_testing.py: allow to customize DB restore 29 September 2015, 16:15:56 UTC
f88331c Bump dependencies on swh.core to >= 0.0.6 29 September 2015, 14:51:45 UTC
bfe9016 debian/control: indent with spaces 29 September 2015, 14:51:25 UTC
e424885 Bump dependency on swh.core 29 September 2015, 14:41:40 UTC
bb8cdff move db_test.py from swh-storage to swh-core 29 September 2015, 14:39:10 UTC
260c379 Add swh.storage.api package to setup.py 29 September 2015, 14:26:51 UTC
2b46e69 swh.storage.api: Serialize/deserialize exceptions This currently uses pickle which breaks the msgpack + dict "abstraction" we settled on. This should be fixed eventually. 29 September 2015, 13:04:43 UTC
58f821c Update api client and server to msgpack 29 September 2015, 11:55:22 UTC
79d6539 gitignore: ignore .eggs/ dir 29 September 2015, 09:16:41 UTC
b041526 Refactor - Migrate content_present to content_find with unified api 29 September 2015, 08:28:05 UTC
1ddeff4 Move remote_storage and api to api.client and api.server Closes T44. 29 September 2015, 08:21:38 UTC
1cb39a2 sql/swh-schema: unix_path to bytea 28 September 2015, 15:36:28 UTC
f660268 sql/upgrades/015: drop function before readding it 28 September 2015, 15:35:16 UTC
d371898 SQL schema diagram: add skipped_content 28 September 2015, 14:19:37 UTC
eb7493c sql/upgrades/015: bump database version 28 September 2015, 14:01:25 UTC
6a4da5c sql/swh-func: fix outdated comments 28 September 2015, 14:00:02 UTC
13315d2 sql/sql-func: Make tests pass again 28 September 2015, 14:00:02 UTC
be3910e Merge branch 'merge-dir-entries' 28 September 2015, 13:46:39 UTC
8870ff4 swh-func.sql: add refactoring TODO item 28 September 2015, 13:46:28 UTC
3c17d73 swh_directory_entry_*_add: port to merged dir entries 28 September 2015, 13:46:28 UTC
345293a swh_directory_walk_one: port to merged dir entries 28 September 2015, 13:46:28 UTC
f4b8b40 swh_content_find_dir: port to merged dir entries 28 September 2015, 13:46:28 UTC
c08ee26 upgrades/015.sql: migration script for dir_list_* table merge 28 September 2015, 13:46:28 UTC
f068673 SQL schema: merge dir entry arrays into directory This removes tables directory_list_{file,dir,rev}, and add corresponding arrays {dir,file,rev}_entries to the directory table, which was just a set up to now. Indexes migrated accordingly. 28 September 2015, 13:46:28 UTC
945bdfb sql/sql-func: copy origin from tmp_skipped_content 28 September 2015, 13:41:14 UTC
573b8d4 swh.storage.storage: Only add missing skipped_content 28 September 2015, 13:33:20 UTC
e16565e swh.storage.storage: add origin to skipped_content 28 September 2015, 13:32:16 UTC
9188980 sql/swh-func.sql: remove useless declaration 28 September 2015, 12:49:39 UTC
8ff3109 swh.storage.storage: Allow storing "skipped" contents 28 September 2015, 12:49:12 UTC
09193a5 swh.storage.db: map stored procedures for skipped_content 28 September 2015, 12:43:27 UTC
9073916 sql schema: update skipped_content and add stored procedures for update 28 September 2015, 12:42:39 UTC
f1f62bd sql/swh-schema: add index on content(sha256) 28 September 2015, 09:44:15 UTC
d53dc22 sql/swh-schema: add skipped_content table 28 September 2015, 09:43:48 UTC
0dd1ece SQL: add swh_content_find, to lookup content by checksum(s) Intente use is content-by-content lookup (e.g., in the web app), not mass addition or check for missing contents. We already have swh_content_missing for that. 27 September 2015, 13:35:38 UTC
7de609d SQL: improve indentation of directory_walk_one to avoid very long lines 27 September 2015, 12:43:25 UTC
1ce7f21 swh_content_find_occ. query: improve doc string 25 September 2015, 09:52:02 UTC
c4e2e78 all-in-1 query from sha1 to occurrence information query name: swh_content_find_occurrence given the sha1 of some content, returns info about its origin (type, url) and context (branch, revision, path) 25 September 2015, 09:52:02 UTC
785f69b find queries: comment SQL code 25 September 2015, 09:52:02 UTC
8c5d7f5 new queries to find dir/occurrence of some content queries: - swh_content_find_directory(content_id sha1): content_dir - swh_revision_find_occurrence(revision_id sha1_git): occurrence Sample use: zack-linuxgit-arrays=> select sha1 from content offset 100000 limit 1; sha1 -------------------------------------------- \xdd63ccef43879c3d7ea47c3f637ef2453f7a58a1 (1 row) Time: 21,751 ms zack-linuxgit-arrays=> select * from swh_content_find_directory('\xdd63ccef43879c3d7ea47c3f637ef2453f7a58a1'::sha1); directory | path --------------------------------------------+---------------------------- \x24f14db9cc7aa0b106dcb09c9927dfa6712dfd1e | drivers/md/dm-log-writes.c (1 row) Time: 729,122 ms zack-linuxgit-arrays=> select id from revision where directory = '\x24f14db9cc7aa0b106dcb09c9927dfa6712dfd1e'::sha1_git; id -------------------------------------------- \x0450b2d120ed9e6d4ac7a6eade0ad116f69b88f7 (1 row) Time: 395,754 ms zack-linuxgit-arrays=> select * from swh_revision_find_occurrence('\x0450b2d120ed9e6d4ac7a6eade0ad116f69b88f7'::sha1_git); origin | branch | revision --------+--------------------+-------------------------------------------- 1 | refs/tags/v4.1-rc3 | \x030bbdbf4c833bc69f502eae58498bc5572db736 (1 row) Time: 22,006 ms zack-linuxgit-arrays=> 25 September 2015, 09:52:02 UTC
372ee30 Add debian metadata 25 September 2015, 09:12:50 UTC
1ea9a1d Add missing file README.dev 25 September 2015, 07:28:00 UTC
bfb0274 Add missing dependency on swh.core 25 September 2015, 07:27:50 UTC
5c72222 swh.storage.storage: update docs of origin_add_one 24 September 2015, 13:49:52 UTC
9b78bfb swh.storage.storage: make origin_add_one idempotent 24 September 2015, 13:48:15 UTC
ccbc216 Simplify the api content_present Only True if found, False otherwise. And raises ValueError in case of bad inputs (empty input or no sha1, sha256 keys) 24 September 2015, 13:02:19 UTC
f2dac75 Fix: Escape is only for copy_to as psycopg2 do not escape for copy_to as it does for other queries 24 September 2015, 12:14:44 UTC
a8ddcd5 Improve formatting of the sql query 24 September 2015, 12:06:34 UTC
bf19550 Simplify command that triggers the storage server 24 September 2015, 12:05:14 UTC
e1a2df2 Implement the content_present api 24 September 2015, 11:40:11 UTC
4348ff4 Add a README.dev on how to run 24 September 2015, 11:40:11 UTC
0444468 Open /content/present api 24 September 2015, 11:40:11 UTC
95e159c query swh_revision_list: add TODOs about ordering 23 September 2015, 15:35:42 UTC
f9621ae new SQL queries: revision rev-list & revision log 23 September 2015, 15:28:42 UTC
d85261c swh-func.sql: use ENUM type directory_entry_type instead of TEXT. Adapt swh_directory_walk_one accordingly 23 September 2015, 14:03:43 UTC
f61605e SQL diagram: add deps occurrences -> revision 23 September 2015, 13:15:55 UTC
d88a775 swh.storage.api: Add logging to the wsgi app 23 September 2015, 12:55:18 UTC
df97c2b swh.storage.api: Attempt closing temporary files 23 September 2015, 12:54:54 UTC
a6e88b2 remote_storage: use proper filename for metadata 23 September 2015, 09:55:13 UTC
4287922 remote_storage: Post contents as multipart files 23 September 2015, 09:51:38 UTC
6c49702 Add missing requirements 23 September 2015, 09:51:17 UTC
83a5963 swh.storage.api: add run_from_webserver wrapper 23 September 2015, 08:42:30 UTC
20f043b swh.storage.api: Properly escape arbitrary byte sequences in arguments 22 September 2015, 15:02:34 UTC
9185046 Add setup.py and friends 22 September 2015, 14:02:37 UTC
42946e1 SQL diagram: add missing edges for releases/revisions 22 September 2015, 10:17:00 UTC
49387fb SQL diagram: fix target of dir entry -> content arrow 22 September 2015, 10:12:33 UTC
51272e2 tests: add documentation for Test{,Remote}Storage 22 September 2015, 09:57:14 UTC
4ae8648 DB schema graph: logical placement of dir entries/lists 22 September 2015, 09:24:40 UTC
d9f2995 clusters.dot: document how to reference individual columns 22 September 2015, 09:19:01 UTC
aa2fbb9 SQL schema: add implicit FK dir_entry_dir -> dir 22 September 2015, 09:17:58 UTC
687f2c2 SQL schema graph: more granular implicit FK arrows in particular: make both arrow sources and arrow destinations point to specific table fields, rather than to whole tables 22 September 2015, 09:15:03 UTC
6b36837 SQL schema: add implicit FK on directories i.e., port the schema graph after switching to arrays for encoding directories 22 September 2015, 09:07:05 UTC
d9b14ab tests: make both storage tests inherit from an abstract class This prevents tests from the base class from running twice 22 September 2015, 09:04:44 UTC
f70bcd5 test_remote_storage: Add code to run the server automatically 22 September 2015, 09:04:44 UTC
c005660 swh.storage.api: add / route for server testing 22 September 2015, 09:04:44 UTC
19cc2d8 add requirements.txt, listing psycopg2 as dependency 21 September 2015, 19:07:57 UTC
caedb46 Add a remote API server and a client for remote storage 21 September 2015, 16:10:25 UTC
cc87c6f test_storage: don't break if the storage has no objstorage In prevision of the remote storage tests. 21 September 2015, 16:10:25 UTC
e07902a db.py: drop unused csv module (pep8) this makes "make check" pass again on the whole module 21 September 2015, 15:24:02 UTC
37e7281 db.py: use single quotes consistently (style) 21 September 2015, 15:22:44 UTC
16aca48 db: uniform case-ing in SQL query (cosmetic) 21 September 2015, 15:21:16 UTC
ba5443a sql/swh-schema.sql: add indexes on directory lists 21 September 2015, 14:02:29 UTC
b8aa5c4 Merge branch 'master' into directory-listing-arrays 18 September 2015, 13:10:35 UTC
e9f08de swh.storage.storage: allow listing missing contents per hash 18 September 2015, 10:58:11 UTC
662c8ee swh.storage.storage: add the contents that _are_ missing 18 September 2015, 10:57:03 UTC
04c1db8 swh.storage.storage: content ids are sha1, not sha1_git 18 September 2015, 10:42:14 UTC
8f97c01 Merge branch 'master' into directory-listing-arrays 18 September 2015, 10:31:25 UTC
back to top