swh:1:snp:eb70f1f85391e4b077c211bec36af0061c4bf937

sort by:
Revision Author Date Message Commit Date
e85dacd New upstream version 0.0.68 13 October 2016, 13:30:20 UTC
f7becde Add the means to pipe contents to another queue once copied Related T575 13 October 2016, 13:24:00 UTC
d2eb077 indexer: Unify function names according to conventions 13 October 2016, 12:18:21 UTC
97f610a Add tests around the content_{mimetype/language}_add endpoints Related T582 13 October 2016, 12:18:13 UTC
1373667 indexer: Open mimetype/language get endpoints 13 October 2016, 12:18:12 UTC
54efa89 indexer: open drop/skip policy update on duplicates (language/mimetype) This adds the optional conflict_update parameter which specifies what to do when conflicts on sha1 occurs. conflict_update by default is false which ignores duplicates. Otherwise, conflich_update to true, overwrite existing data. Related T582 13 October 2016, 12:18:12 UTC
dddbc4c Fix: Remove nose test attribute 'one' 13 October 2016, 12:08:00 UTC
11cacb6 New upstream version 0.0.67 12 October 2016, 00:24:12 UTC
3fcc628 Fix provenance storage init function 12 October 2016, 00:23:25 UTC
50c91aa New upstream version 0.0.66 11 October 2016, 23:39:26 UTC
2fd7f72 provenance: Rework configuration setup 11 October 2016, 23:33:57 UTC
bd47651 New upstream version 0.0.65 08 October 2016, 08:00:30 UTC
30f7883 Open language_mimetype_{missing,add} endpoints Related T578 07 October 2016, 18:30:55 UTC
859860c sql/schema: Add content_language table Related T578 07 October 2016, 16:53:38 UTC
fd717f3 Open content_mimetype_add endpoint to add missing mimetypes Related T577 07 October 2016, 15:08:05 UTC
a77c187 Open content_mimetype_missing endpoint to list missing mimetypes Related T577 07 October 2016, 15:08:05 UTC
5e9244c sql/schema: Add content_mimetype table Towards starting computing information on contents Related T577 07 October 2016, 15:08:04 UTC
7add2cd DB schema graph: add new "provenance" cluster it includes the cache_* tables that are currently being populated 07 October 2016, 14:53:49 UTC
0f29092 DB schema graph: add stray origin_visit table 07 October 2016, 14:53:14 UTC
e2dd645 New upstream version 0.0.64 29 September 2016, 19:50:59 UTC
a1aa8be Align implementation with docstring's contract 29 September 2016, 18:31:23 UTC
6c505cc Fix: Missing incremented version 5 for archiver.dbversion 29 September 2016, 16:57:40 UTC
1afea82 Retrieve information on a content cached 29 September 2016, 16:45:10 UTC
a43b962 Rename to swh_cache_content_get_all 29 September 2016, 14:55:06 UTC
f12d9ef Fix copyright range 29 September 2016, 12:42:57 UTC
1b4aa4f archiver: Remove print statement 29 September 2016, 12:42:57 UTC
4b5287e upgrades/085: add upgrade script 23 September 2016, 11:39:43 UTC
005710e sql/swh-func: content cache populates lines in deterministic order This should reduce lock contention when parallelizing the operation 23 September 2016, 11:38:11 UTC
8219b44 New upstream version 0.0.63 23 September 2016, 10:29:50 UTC
4d6d3bd archiver: Pass the destination as parameter of the worker to backend 23 September 2016, 10:28:32 UTC
394bb4d archiver: Add missing property for worker to backend 23 September 2016, 10:28:32 UTC
718dda6 archiver: Complete docstring's information 23 September 2016, 10:28:32 UTC
f29c207 archiver: Simplify update on content 23 September 2016, 10:28:32 UTC
a67aa26 archiver: Improve 'unknown sha1' and 'force copy' policies The 'unknown sha1 path' cannot happen in the default archiver since it reads from the archive db (so the fallback code is not necessary in the worker). To the contrary, since 'archiver to backend' reads from stdin (for now), we could have unregistered sha1s from that source. This commit makes the director deal with that before sending sha1 to workers. It's also the director's job to set the state to 'missing' when the force_copy is true before sending sha1 to worker. 23 September 2016, 10:28:32 UTC
9b04941 archiver: Fix random.choice input to a list 23 September 2016, 10:28:31 UTC
7332c31 sql/archiver/schema: Filter unknown sha1s from content_archive endpoint 23 September 2016, 10:28:31 UTC
de67eb7 provenance: fix typo: we have hex in the message, not hashes 22 September 2016, 18:37:57 UTC
29eefbe New upstream version 0.0.62 22 September 2016, 16:54:52 UTC
ff87ac5 swh-func: content-revision cache population now takes a list of revs 22 September 2016, 16:51:14 UTC
30f5645 swh-func: less churn in the cache_content_revision table 22 September 2016, 12:42:43 UTC
5c9b562 New upstream version 0.0.61 22 September 2016, 11:44:05 UTC
4c3623c Archiver: Fix to copy only to targeted destination Before that, it could for example pushed copies to other mirrors where the content was missing. 22 September 2016, 11:43:45 UTC
cdf11d5 d/control: Bump dependency version to latest python3-swh.core 22 September 2016, 10:37:43 UTC
57053fe Refactor: Align source/destination configuration property names 22 September 2016, 10:37:43 UTC
f163c2a Handle copies of not registered contents in archiver db Closes T569 22 September 2016, 10:37:42 UTC
df2e00a Refactor logging warning/critical message 21 September 2016, 16:18:11 UTC
88e0a05 Improve on cooking code and docstrings - Fix docstring typos - Some function calls were not renamed. 21 September 2016, 16:14:44 UTC
0879adf Refactor the vault cooker to add new bundle types Summary: Make some updates to the vault in order to prepare the next arrival of the revision cooker. Reviewers: #reviewers, ardumont Reviewed By: #reviewers, ardumont Subscribers: ardumont Maniphest Tasks: T531 Differential Revision: https://forge.softwareheritage.org/D115 Closes D115 21 September 2016, 16:14:44 UTC
6900685 Be defensive against potential not found content 20 September 2016, 18:10:16 UTC
0b72673 New upstream version 0.0.60 20 September 2016, 14:46:48 UTC
189e9c1 d/control: Fix typo in version 20 September 2016, 14:37:38 UTC
9ad3aef d/control: Bump dependency version to latest python3-swh.objstorage 20 September 2016, 14:35:34 UTC
0476b9c Fix objstorage instanciation in tests 20 September 2016, 14:31:23 UTC
2b18a7a Remove optional dependency on swh.objstorage.cloud If you want the archiver to have cloud abilities, install the package python3-swh.objstorage.cloud on the server as well. 20 September 2016, 14:31:23 UTC
f7665b3 sql/swh-schema: content->revision cache only has one line per content 19 September 2016, 12:31:03 UTC
42cd25c New upstream version 0.0.59 17 September 2016, 10:50:14 UTC
941c52c Unify configuration property between director/worker 17 September 2016, 10:47:53 UTC
76d13b0 Deal with potential missing contents in the archiver db Logging an entry about it 17 September 2016, 10:47:50 UTC
5281ad1 Improve get_contents_error implementation - Only read the storage key once. - Improve the logging error. 17 September 2016, 10:47:33 UTC
d26477a Remove dead code already moved in archiver/db 17 September 2016, 10:47:32 UTC
64c7441 New upstream version 0.0.58 16 September 2016, 20:17:14 UTC
80b21aa Adapt archiver director to read sha1 from stdin Also, adds a force_copy flag in the configuration file to avoid checking preexistence of sha1. This is to be efficient for the first time copy in a new backend. 16 September 2016, 20:14:38 UTC
ee4ecd7 archiver: Unify configuration file between director/worker The initial director and worker had split configuration files. Now it's unified to be both archiver/worker.yml file 16 September 2016, 20:12:53 UTC
c636e55 New upstream version 0.0.57 15 September 2016, 14:30:11 UTC
5db327a Archiver: Adapt ArchiverToBackendDirector to latest storage api 15 September 2016, 14:30:03 UTC
57ee3b6 content_archive_get: api entry point to list cache contents 15 September 2016, 14:12:59 UTC
7d0b963 archiver: Add missing instruction about 003 upgrade 15 September 2016, 13:07:10 UTC
1228c3f Remove print statement 15 September 2016, 09:06:04 UTC
53a366d New upstream version 0.0.56 15 September 2016, 08:56:35 UTC
315a8eb cache_content_get: Fix broken test Related ae99623a1eb7e959944f420c9418ad519ce5bc6e 15 September 2016, 08:55:50 UTC
ce2f4e6 Improve choose backup contents to use multiple sources 15 September 2016, 08:21:40 UTC
02e1f1e Add logging ability to copier 14 September 2016, 18:18:26 UTC
ae99623 Archiver: Filter missing contents before archival 14 September 2016, 17:48:25 UTC
791444a archive - sql/upgrades/004: Insert new archive id + clean up Drop unused archive.url column. Simplify associated tests setup on the archiver with retention policy. 14 September 2016, 10:06:21 UTC
d0cbf0a Archiver Director/Worker: Add copy to backend worker implementation Actions storage: - Open cache_content_get to retrieve contents in cache - sql/upgrades/080: Add stored procedure to read contents from cache Actions archiver: - d/control: Add dependency to archiver on python3-swh.objstorage.cloud - Renamed Archiver(Director|Worker) to ArchiverWithRetentionPolicy(Director|Worker) - Add ArchiverToBackend(Director|Worker) - Add new celery task dedicated for new workers - Update docstring details Related T555 14 September 2016, 09:31:56 UTC
a71109b Http API to access the SWH vault Summary: This API currently only concern the directories as it uses the first draft of the cooker. Ref T532 Depends on D102 Reviewers: #reviewers! Maniphest Tasks: T532 Differential Revision: https://forge.softwareheritage.org/D108 12 September 2016, 12:23:38 UTC
b11cfe3 First version of the directory cooker & cache Summary: This first version does create a compressed folder of an archive directory but is not linked to any API or notification system. This diff is submitted for architecture and code review and will evolve. Reviewers: #reviewers! Differential Revision: https://forge.softwareheritage.org/D102 12 September 2016, 10:51:20 UTC
de46caf New upstream version 0.0.55 08 September 2016, 13:21:28 UTC
1c90cd4 Refactor: Rename adequately swh.storage.db.Db.origin_visit functions In swh.storage.db: - origin_visit_get -> origin_visit_get_all - origin_visit_get_by -> occurrence_by_origin_visit - origin_visit_info -> origin_visit_get 08 September 2016, 12:13:08 UTC
598f38c origin_visit_get_by: Fix origin_visit output data + format change This endpoint was wrongly returning only 1 result. 08 September 2016, 11:49:04 UTC
8c10704 Fix typo and remove unused parameter 06 September 2016, 12:27:27 UTC
6feddb3 Fix and explicit docstring's meaning 06 September 2016, 12:27:27 UTC
65757a7 New upstream version 0.0.54 05 September 2016, 10:36:34 UTC
cf7ff9e origin_visit_get_by: Update to retrieve associated occurrence info Closed T559 05 September 2016, 09:12:41 UTC
157c129 Open origin_visit_get_by api to retrieve information on origin, visit Related T559 02 September 2016, 13:09:11 UTC
d0400dc New upstream version 0.0.53 02 September 2016, 09:14:09 UTC
fc0f589 Simplify content provenance information retrieval After further discussion, we only retrieve partial information (revision, origin, visit, path) for now. Related T553 01 September 2016, 16:41:19 UTC
05548a8 Fix: drop a function and not a type Related T553 01 September 2016, 13:24:27 UTC
963634d provenance.tasks: add the revision -> origin cache task 01 September 2016, 12:26:13 UTC
fc3a8b5 test_storage: properly pipeline origin and content provenance caches 01 September 2016, 10:02:02 UTC
2c70c08 storage: make the origin cache properly return revisions 01 September 2016, 10:01:40 UTC
346ccd2 tests.server_testing: sleep for shorter amounts of time to speed up tests 01 September 2016, 09:28:38 UTC
8c47b78 db: pull the data from the swh_cache_revision_origin_add operation 01 September 2016, 09:28:13 UTC
72d27a8 Open api endpoint to read content provenance information Related T553 31 August 2016, 18:20:29 UTC
1a237ae Open api entry point to populate cache_revision_origin Related T554 31 August 2016, 18:19:03 UTC
088e4f0 sql/upgrades/079: Add stored procedure to read content provenance Drop obsolete content_find_occurrence stored procedure Related T554 Related T553 31 August 2016, 17:14:37 UTC
0885000 Intermediary - Refactor - Add content provenance information reading ability Remove unneeded stored procedure and use CTE instead. Related T553 Note: Common Table Expression https://www.postgresql.org/docs/current/static/queries-with.html 31 August 2016, 16:54:08 UTC
5d0ac29 Intermediary - Add content provenance information reading ability This is only stored procedure for now. They will be simplified soon. Related T553 Note: docs on 7.2.1.5. LATERAL Subqueries: https://www.postgresql.org/docs/current/static/queries-table-expressions.html This permits to let the `from table` data be accessible to ulterior function call. 31 August 2016, 16:54:08 UTC
74c80b7 sql/swh-func: Add population functions for cache_revision_origin Refs T554 31 August 2016, 14:59:47 UTC
6915737 d/control: Add package python3-swh.storage.provenance As for python3-swh.storage.archiver, the runtime dependency on python3-swh.core is the implicit one from python3-swh.storage. The same note about splitting repository and adding explicit dependency applies. Related T557 31 August 2016, 09:50:14 UTC
back to top