https://github.com/SoftwareHeritage/swh-storage

Revision Message Commit Date
fb3722d storage: Fix edge case when searching symbols When the query was syntactically wrong, the server broke before this commit. Now it raises a bad input (400) error. 28 November 2016, 15:07:05 UTC
7f27e14 Add pagination to content_ctags_search api endpoint Related T605 24 November 2016, 10:11:19 UTC
1fc21e6 storage: Open content_ctags_search for full-text search Related T605 23 November 2016, 16:30:54 UTC
c690359 storage: Add fulltext search function on ctags Related T605 23 November 2016, 16:12:56 UTC
3dafd17 storage: Fix error in function which reads licenses Related T602 22 November 2016, 15:55:17 UTC
f5ece61 storage: Add indexer_configuration table json schema Related T596 18 November 2016, 13:55:18 UTC
f454e44 storage: Update recognized fossology licenses Related T596 18 November 2016, 12:25:19 UTC
bc7f776 storage: Fix divergent schema upgrade 15 November 2016, 17:04:10 UTC
99b09d4 Fix pep8 violation 15 November 2016, 17:04:01 UTC
3a4616c storage: ctags - Align conflict update policy with license endpoints When updating, we first delete all ctags symbols for the impacted contents, then add the ctags information. Otherwise, we simply add new entries and, in case of conflict, do nothing. 10 November 2016, 16:25:48 UTC
9a079c5 storage: Update fossology_license to latest design Related T596 10 November 2016, 16:17:54 UTC
7528033 Update known licenses from fossology's master branch Related T596 Related 09923374e0f321da78faa0b37b2814fea9c5f1c1 10 November 2016, 09:44:16 UTC
3dd9b0f storage: Return unknown licenses Related T596 09 November 2016, 15:46:47 UTC
a63cbc7 storage: Open content_license endpoint (add/get) Related T596 09 November 2016, 11:30:39 UTC
2fffbd4 storage: Add license and content_license tables Related T596 08 November 2016, 14:06:12 UTC
04f2b2d storage: Add comments on enum 08 November 2016, 14:06:12 UTC
022e985 storage: Move enums to new swh-enums.sql namespace 08 November 2016, 14:06:11 UTC
51c4896 storage: add check_config method The check_config method allows a dynamic check of the configuration for a running storage. We can make sure that we have proper permissions on the object storage as well as the database before running things. 03 November 2016, 14:39:38 UTC
5e8bba5 storage: Improve index on content_ctags Work on the suggestion message from postgresql psycopg2.OperationalError: index row size 3992 exceeds maximum 2712 for index "content_ctags_id_name_kind_line_lang_idx" HINT: Values larger than 1/3 of a buffer page cannot be indexed. Consider a function index of an MD5 hash of the value, or use full text indexing. Related T589 20 October 2016, 13:55:44 UTC
33043b1 storage: ctags - Improve schema Related T589 20 October 2016, 13:25:28 UTC
a74a141 storage: Open ctags entry points (missing, add, get) Related T589 19 October 2016, 16:33:19 UTC
71b4a88 Remove noisy test attribute 'one' 19 October 2016, 16:33:18 UTC
4bd537f storage: allow adding several origins at once 19 October 2016, 14:49:08 UTC
141afef common: allow passing in the cursor for the transaction decorators 19 October 2016, 14:42:19 UTC
f7becde Add the means to pipe contents to another queue once copied Related T575 13 October 2016, 13:24:00 UTC
d2eb077 indexer: Unify function names according to conventions 13 October 2016, 12:18:21 UTC
97f610a Add tests around the content_{mimetype/language}_add endpoints Related T582 13 October 2016, 12:18:13 UTC
1373667 indexer: Open mimetype/language get endpoints 13 October 2016, 12:18:12 UTC
54efa89 indexer: open drop/skip policy update on duplicates (language/mimetype) This adds the optional conflict_update parameter, which specifies what to do when conflicts on sha1 occur. conflict_update is false by default, which ignores duplicates. Otherwise, with conflict_update set to true, existing data is overwritten. Related T582 13 October 2016, 12:18:12 UTC
dddbc4c Fix: Remove nose test attribute 'one' 13 October 2016, 12:08:00 UTC
3fcc628 Fix provenance storage init function 12 October 2016, 00:23:25 UTC
2fd7f72 provenance: Rework configuration setup 11 October 2016, 23:33:57 UTC
30f7883 Open language_mimetype_{missing,add} endpoints Related T578 07 October 2016, 18:30:55 UTC
859860c sql/schema: Add content_language table Related T578 07 October 2016, 16:53:38 UTC
fd717f3 Open content_mimetype_add endpoint to add missing mimetypes Related T577 07 October 2016, 15:08:05 UTC
a77c187 Open content_mimetype_missing endpoint to list missing mimetypes Related T577 07 October 2016, 15:08:05 UTC
5e9244c sql/schema: Add content_mimetype table Towards starting computing information on contents Related T577 07 October 2016, 15:08:04 UTC
7add2cd DB schema graph: add new "provenance" cluster it includes the cache_* tables that are currently being populated 07 October 2016, 14:53:49 UTC
0f29092 DB schema graph: add stray origin_visit table 07 October 2016, 14:53:14 UTC
a1aa8be Align implementation with docstring's contract 29 September 2016, 18:31:23 UTC
6c505cc Fix: Missing incremented version 5 for archiver.dbversion 29 September 2016, 16:57:40 UTC
1afea82 Retrieve information on a content cached 29 September 2016, 16:45:10 UTC
a43b962 Rename to swh_cache_content_get_all 29 September 2016, 14:55:06 UTC
f12d9ef Fix copyright range 29 September 2016, 12:42:57 UTC
1b4aa4f archiver: Remove print statement 29 September 2016, 12:42:57 UTC
4b5287e upgrades/085: add upgrade script 23 September 2016, 11:39:43 UTC
005710e sql/swh-func: content cache populates lines in deterministic order This should reduce lock contention when parallelizing the operation 23 September 2016, 11:38:11 UTC
4d6d3bd archiver: Pass the destination as parameter of the worker to backend 23 September 2016, 10:28:32 UTC
394bb4d archiver: Add missing property for worker to backend 23 September 2016, 10:28:32 UTC
718dda6 archiver: Complete docstring's information 23 September 2016, 10:28:32 UTC
f29c207 archiver: Simplify update on content 23 September 2016, 10:28:32 UTC
a67aa26 archiver: Improve 'unknown sha1' and 'force copy' policies The 'unknown sha1 path' cannot happen in the default archiver since it reads from the archive db (so the fallback code is not necessary in the worker). To the contrary, since 'archiver to backend' reads from stdin (for now), we could have unregistered sha1s from that source. This commit makes the director deal with that before sending sha1 to workers. It's also the director's job to set the state to 'missing' when the force_copy is true before sending sha1 to worker. 23 September 2016, 10:28:32 UTC
9b04941 archiver: Fix random.choice input to a list 23 September 2016, 10:28:31 UTC
7332c31 sql/archiver/schema: Filter unknown sha1s from content_archive endpoint 23 September 2016, 10:28:31 UTC
de67eb7 provenance: fix typo: we have hex in the message, not hashes 22 September 2016, 18:37:57 UTC
ff87ac5 swh-func: content-revision cache population now takes a list of revs 22 September 2016, 16:51:14 UTC
30f5645 swh-func: less churn in the cache_content_revision table 22 September 2016, 12:42:43 UTC
4c3623c Archiver: Fix to copy only to targeted destination Before that, it could, for example, push copies to other mirrors where the content was missing. 22 September 2016, 11:43:45 UTC
cdf11d5 d/control: Bump dependency version to latest python3-swh.core 22 September 2016, 10:37:43 UTC
57053fe Refactor: Align source/destination configuration property names 22 September 2016, 10:37:43 UTC
f163c2a Handle copies of not registered contents in archiver db Closes T569 22 September 2016, 10:37:42 UTC
df2e00a Refactor logging warning/critical message 21 September 2016, 16:18:11 UTC
88e0a05 Improve on cooking code and docstrings - Fix docstring typos - Some function calls were not renamed. 21 September 2016, 16:14:44 UTC
0879adf Refactor the vault cooker to add new bundle types Summary: Make some updates to the vault in order to prepare the next arrival of the revision cooker. Reviewers: #reviewers, ardumont Reviewed By: #reviewers, ardumont Subscribers: ardumont Maniphest Tasks: T531 Differential Revision: https://forge.softwareheritage.org/D115 Closes D115 21 September 2016, 16:14:44 UTC
6900685 Be defensive against potential not found content 20 September 2016, 18:10:16 UTC
189e9c1 d/control: Fix typo in version 20 September 2016, 14:37:38 UTC
9ad3aef d/control: Bump dependency version to latest python3-swh.objstorage 20 September 2016, 14:35:34 UTC
0476b9c Fix objstorage instantiation in tests 20 September 2016, 14:31:23 UTC
2b18a7a Remove optional dependency on swh.objstorage.cloud If you want the archiver to have cloud abilities, install the package python3-swh.objstorage.cloud on the server as well. 20 September 2016, 14:31:23 UTC
f7665b3 sql/swh-schema: content->revision cache only has one line per content 19 September 2016, 12:31:03 UTC
941c52c Unify configuration property between director/worker 17 September 2016, 10:47:53 UTC
76d13b0 Deal with potential missing contents in the archiver db Logging an entry about it 17 September 2016, 10:47:50 UTC
5281ad1 Improve get_contents_error implementation - Only read the storage key once. - Improve the logging error. 17 September 2016, 10:47:33 UTC
d26477a Remove dead code already moved in archiver/db 17 September 2016, 10:47:32 UTC
80b21aa Adapt archiver director to read sha1 from stdin Also, adds a force_copy flag in the configuration file to avoid checking preexistence of sha1. This is to be efficient for the first time copy in a new backend. 16 September 2016, 20:14:38 UTC
ee4ecd7 archiver: Unify configuration file between director/worker The initial director and worker had split configuration files. Now it's unified to be both archiver/worker.yml file 16 September 2016, 20:12:53 UTC
5db327a Archiver: Adapt ArchiverToBackendDirector to latest storage api 15 September 2016, 14:30:03 UTC
57ee3b6 content_archive_get: api entry point to list cache contents 15 September 2016, 14:12:59 UTC
7d0b963 archiver: Add missing instruction about 003 upgrade 15 September 2016, 13:07:10 UTC
1228c3f Remove print statement 15 September 2016, 09:06:04 UTC
315a8eb cache_content_get: Fix broken test Related ae99623a1eb7e959944f420c9418ad519ce5bc6e 15 September 2016, 08:55:50 UTC
ce2f4e6 Improve choose backup contents to use multiple sources 15 September 2016, 08:21:40 UTC
02e1f1e Add logging ability to copier 14 September 2016, 18:18:26 UTC
ae99623 Archiver: Filter missing contents before archival 14 September 2016, 17:48:25 UTC
791444a archive - sql/upgrades/004: Insert new archive id + clean up Drop unused archive.url column. Simplify associated tests setup on the archiver with retention policy. 14 September 2016, 10:06:21 UTC
d0cbf0a Archiver Director/Worker: Add copy to backend worker implementation Actions storage: - Open cache_content_get to retrieve contents in cache - sql/upgrades/080: Add stored procedure to read contents from cache Actions archiver: - d/control: Add dependency to archiver on python3-swh.objstorage.cloud - Renamed Archiver(Director|Worker) to ArchiverWithRetentionPolicy(Director|Worker) - Add ArchiverToBackend(Director|Worker) - Add new celery task dedicated for new workers - Update docstring details Related T555 14 September 2016, 09:31:56 UTC
a71109b Http API to access the SWH vault Summary: This API currently only concerns the directories as it uses the first draft of the cooker. Ref T532 Depends on D102 Reviewers: #reviewers! Maniphest Tasks: T532 Differential Revision: https://forge.softwareheritage.org/D108 12 September 2016, 12:23:38 UTC
b11cfe3 First version of the directory cooker & cache Summary: This first version does create a compressed folder of an archive directory but is not linked to any API or notification system. This diff is submitted for architecture and code review and will evolve. Reviewers: #reviewers! Differential Revision: https://forge.softwareheritage.org/D102 12 September 2016, 10:51:20 UTC
1c90cd4 Refactor: Rename adequately swh.storage.db.Db.origin_visit functions In swh.storage.db: - origin_visit_get -> origin_visit_get_all - origin_visit_get_by -> occurrence_by_origin_visit - origin_visit_info -> origin_visit_get 08 September 2016, 12:13:08 UTC
598f38c origin_visit_get_by: Fix origin_visit output data + format change This endpoint was wrongly returning only 1 result. 08 September 2016, 11:49:04 UTC
8c10704 Fix typo and remove unused parameter 06 September 2016, 12:27:27 UTC
6feddb3 Fix and explicit docstring's meaning 06 September 2016, 12:27:27 UTC
cf7ff9e origin_visit_get_by: Update to retrieve associated occurrence info Closed T559 05 September 2016, 09:12:41 UTC
157c129 Open origin_visit_get_by api to retrieve information on origin, visit Related T559 02 September 2016, 13:09:11 UTC
fc0f589 Simplify content provenance information retrieval After further discussion, we only retrieve partial information (revision, origin, visit, path) for now. Related T553 01 September 2016, 16:41:19 UTC
05548a8 Fix: drop a function and not a type Related T553 01 September 2016, 13:24:27 UTC
963634d provenance.tasks: add the revision -> origin cache task 01 September 2016, 12:26:13 UTC
fc3a8b5 test_storage: properly pipeline origin and content provenance caches 01 September 2016, 10:02:02 UTC
2c70c08 storage: make the origin cache properly return revisions 01 September 2016, 10:01:40 UTC
346ccd2 tests.server_testing: sleep for shorter amounts of time to speed up tests 01 September 2016, 09:28:38 UTC
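
Several entries in this log (54efa89 and 3a4616c in particular) describe a conflict_update policy: duplicates on sha1 are ignored by default and overwritten when the flag is true. The sketch below illustrates that idea only; it is not the swh-storage implementation. It assumes an in-memory SQLite table in place of the project's PostgreSQL schema, and the helper names (`make_db`, `add_mimetypes`, `get_mimetype`) and the two-column table layout are hypothetical.

```python
import sqlite3


def make_db():
    """Create an in-memory database with a toy content_mimetype table.

    Assumption: the real schema differs; sha1 is simply the primary key here.
    """
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE content_mimetype ("
        "sha1 TEXT PRIMARY KEY, mimetype TEXT NOT NULL)"
    )
    return conn


def add_mimetypes(conn, rows, conflict_update=False):
    """Insert (sha1, mimetype) pairs (hypothetical helper).

    conflict_update=False keeps the existing row on a duplicate sha1;
    conflict_update=True overwrites the existing row.
    """
    # SQLite's OR IGNORE drops the conflicting new row,
    # while OR REPLACE replaces the row already stored.
    policy = "REPLACE" if conflict_update else "IGNORE"
    conn.executemany(
        f"INSERT OR {policy} INTO content_mimetype (sha1, mimetype) "
        "VALUES (?, ?)",
        rows,
    )
    conn.commit()


def get_mimetype(conn, sha1):
    """Return the stored mimetype for sha1, or None if it is unknown."""
    row = conn.execute(
        "SELECT mimetype FROM content_mimetype WHERE sha1 = ?", (sha1,)
    ).fetchone()
    return row[0] if row else None


if __name__ == "__main__":
    db = make_db()
    add_mimetypes(db, [("abc", "text/plain")])
    add_mimetypes(db, [("abc", "text/x-python")])  # duplicate is ignored
    print(get_mimetype(db, "abc"))
    add_mimetypes(db, [("abc", "text/x-python")], conflict_update=True)
    print(get_mimetype(db, "abc"))
```

Defaulting to the ignore policy means re-running an indexer over contents that were already processed leaves stored results untouched unless the caller explicitly asks for an overwrite.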