a5fa26c | Nicolas Dandrimont | 14 February 2017, 18:25:02 UTC | debian/control: update swh-model requirement | 14 February 2017, 18:25:25 UTC |
ec8cebf | Nicolas Dandrimont | 14 February 2017, 18:14:13 UTC | converters: normalize timestamps using swh.model To make sure corruptions such as T680 don't happen again, use the same normalization function as swh.model before inserting timestamps into our database. This makes swh.storage reject non-integer timestamp values as well. Update tests to reflect this change. | 14 February 2017, 18:17:33 UTC |
c6abed2 | Nicolas Dandrimont | 09 February 2017, 17:40:51 UTC | sql/archiver: get the count of objects in each archive Close T672 | 09 February 2017, 17:40:51 UTC |
8ce681c | Nicolas Dandrimont | 09 February 2017, 17:40:24 UTC | sql/archiver: move function defs to the functions file | 09 February 2017, 17:40:24 UTC |
08b0802 | Antoine Pietri | 09 February 2017, 11:12:09 UTC | requirements: split internal and external requirements in two separate files | 09 February 2017, 14:09:28 UTC |
94d72cc | Antoine Pietri | 06 February 2017, 16:47:14 UTC | requirements.txt: s/dateutil/python-dateutil/ | 09 February 2017, 14:09:28 UTC |
76ea627 | Antoine Pietri | 06 February 2017, 16:47:05 UTC | style: test_storage.py: wrap >80 cols line | 09 February 2017, 14:09:28 UTC |
598114c | Nicolas Dandrimont | 07 February 2017, 17:28:05 UTC | sql/archiver: keep archive counts using a bucketed list The buckets are using the last two bytes of the object id, so that we spread the load across different lines on sequential archivings. | 07 February 2017, 17:28:05 UTC |
b5cd7f0 | Nicolas Dandrimont | 01 February 2017, 14:40:06 UTC | sql/upgrades: add 99 → 100 | 01 February 2017, 14:40:06 UTC |
48df525 | Nicolas Dandrimont | 01 February 2017, 14:37:32 UTC | sql/swh-func: in occurrence_get_by: only return data pertaining to one visit By default, return data from the latest visit instead of returning data from all visits, which doesn't make much sense. | 01 February 2017, 14:37:32 UTC |
b5b8fd0 | Nicolas Dandrimont | 01 February 2017, 14:20:01 UTC | sql/swh-func: actually filter swh_visit_find_by_date by origin... | 01 February 2017, 14:21:35 UTC |
fed8afb | Antoine R. Dumont (@ardumont) | 26 January 2017, 13:35:05 UTC | d/control: Update dependencies Closes T646 | 26 January 2017, 14:07:46 UTC |
54cb088 | Antoine R. Dumont (@ardumont) | 26 January 2017, 13:34:43 UTC | Refactor: Unify redundant behavior in api server instantiation Related T646 | 26 January 2017, 13:40:17 UTC |
c92af7a | Antoine R. Dumont (@ardumont) | 26 January 2017, 13:13:30 UTC | Refactor: Unify redundant behavior in SWHRemoteAPI Related T646 | 26 January 2017, 13:16:45 UTC |
4213a0c | Antoine R. Dumont (@ardumont) | 19 January 2017, 13:19:24 UTC | Return page of results for origin visits endpoints Related T636 | 19 January 2017, 13:40:13 UTC |
a8c0e13 | Nicolas Dandrimont | 17 January 2017, 12:11:46 UTC | sql/swh-schema: reorder fields according to production database | 17 January 2017, 12:13:40 UTC |
4893a52 | Nicolas Dandrimont | 11 January 2017, 13:29:01 UTC | sql: refactor to split out indexes and triggers This is the first step towards having different sets of indexes between the master and read-only replicas. | 11 January 2017, 13:43:29 UTC |
e4d5aec | Nicolas Dandrimont | 03 January 2017, 15:30:20 UTC | archiver.worker: fix typo | 03 January 2017, 15:30:20 UTC |
2ff562f | Antoine R. Dumont (@ardumont) | 31 December 2016, 13:46:41 UTC | test: Fix wrong key from base_url to url | 31 December 2016, 13:47:14 UTC |
4bb4246 | Antoine R. Dumont (@ardumont) | 20 December 2016, 08:38:34 UTC | d/control: Update to latest objstorage | 20 December 2016, 08:38:34 UTC |
d5f4640 | Antoine R. Dumont (@ardumont) | 15 December 2016, 15:07:27 UTC | Unify objstorage and storage configuration Related T613 | 15 December 2016, 17:25:53 UTC |
afbdb14 | Antoine R. Dumont (@ardumont) | 15 December 2016, 14:44:19 UTC | Adapt storage's objstorage parameter as a setup property Related T613 | 15 December 2016, 17:25:15 UTC |
ebd1797 | Antoine R. Dumont (@ardumont) | 05 December 2016, 11:00:46 UTC | storage: Fix missing function definition change Related T610 | 05 December 2016, 11:01:57 UTC |
cc7a1ca | Antoine R. Dumont (@ardumont) | 05 December 2016, 10:57:47 UTC | storage: Move hash-function before schema install Related T610 | 05 December 2016, 11:01:40 UTC |
ec0893a | Antoine R. Dumont (@ardumont) | 05 December 2016, 09:12:37 UTC | storage: Adapt ctags' sql schema migration to be faster Related T610 | 05 December 2016, 10:43:43 UTC |
c9e2b15 | Antoine R. Dumont (@ardumont) | 02 December 2016, 16:52:53 UTC | storage: Cleanup only conflictual data for ctags, fossology_license Related T610 | 02 December 2016, 16:52:53 UTC |
648d071 | Antoine R. Dumont (@ardumont) | 02 December 2016, 16:28:12 UTC | storage: Adapt missing endpoint filtering for content_fossology_license Related T610 | 02 December 2016, 16:28:12 UTC |
7ef1700 | Antoine R. Dumont (@ardumont) | 02 December 2016, 15:17:22 UTC | storage: Add tool information on language api endpoints Related T610 | 02 December 2016, 15:17:22 UTC |
6a26397 | Antoine R. Dumont (@ardumont) | 02 December 2016, 14:24:16 UTC | storage: Add tool information on mimetype api endpoints Related T610 | 02 December 2016, 14:24:16 UTC |
2b5a2ff | Antoine R. Dumont (@ardumont) | 02 December 2016, 12:06:40 UTC | storage: Format ctags tool output result Related T610 | 02 December 2016, 12:31:32 UTC |
d35a85a | Antoine R. Dumont (@ardumont) | 02 December 2016, 10:49:02 UTC | storage: Add tool information on ctags api endpoints Related T610 | 02 December 2016, 12:31:32 UTC |
43b37dc | Antoine R. Dumont (@ardumont) | 02 December 2016, 09:20:02 UTC | Update indexer configuration - Index/constraints - pass 2 Related T610 | 02 December 2016, 12:31:25 UTC |
12770e2 | Antoine R. Dumont (@ardumont) | 24 November 2016, 13:53:50 UTC | Update indexer configuration data Related T574 Related T610 | 02 December 2016, 12:31:18 UTC |
0ccb413 | Antoine R. Dumont (@ardumont) | 01 December 2016, 09:25:41 UTC | storage: Add missing function in swh-func.sql Since 096 migration update. | 01 December 2016, 09:30:15 UTC |
d11f4e4 | Antoine R. Dumont (@ardumont) | 01 December 2016, 09:21:25 UTC | doc-sql: Add subgraph for content_indexer tables | 01 December 2016, 09:30:07 UTC |
d3ba860 | Antoine R. Dumont (@ardumont) | 01 December 2016, 09:20:39 UTC | doc-sql: Fix arrow from revision(directory) to directory(id) | 01 December 2016, 09:20:39 UTC |
ebcfd49 | Antoine R. Dumont (@ardumont) | 30 November 2016, 13:54:20 UTC | storage: Actually use the index for searching expression | 30 November 2016, 13:54:20 UTC |
ba29d18 | Antoine R. Dumont (@ardumont) | 29 November 2016, 16:20:37 UTC | Add index on ctags' name column | 29 November 2016, 16:20:37 UTC |
641ad5c | Antoine R. Dumont (@ardumont) | 29 November 2016, 16:03:25 UTC | storage: Use strict equality on ctags search | 29 November 2016, 16:03:25 UTC |
fb3722d | Antoine R. Dumont (@ardumont) | 28 November 2016, 15:07:05 UTC | storage: Fix edge case when searching symbols When the query is syntactically wrong, before that commit, we broke the server. Now it raises a bad input (400) request. | 28 November 2016, 15:07:05 UTC |
7f27e14 | Antoine R. Dumont (@ardumont) | 24 November 2016, 10:11:19 UTC | Add pagination to content_ctags_search api endpoint Related T605 | 24 November 2016, 10:11:19 UTC |
1fc21e6 | Antoine R. Dumont (@ardumont) | 23 November 2016, 16:30:54 UTC | storage: Open content_ctags_search for full-text search Related T605 | 23 November 2016, 16:30:54 UTC |
c690359 | Antoine R. Dumont (@ardumont) | 23 November 2016, 15:32:23 UTC | storage: Add fulltext search function on ctags Related T605 | 23 November 2016, 16:12:56 UTC |
3dafd17 | Antoine R. Dumont (@ardumont) | 22 November 2016, 15:55:17 UTC | storage: Fix error in function which reads licenses Related T602 | 22 November 2016, 15:55:17 UTC |
f5ece61 | Antoine R. Dumont (@ardumont) | 18 November 2016, 13:54:43 UTC | storage: Add indexer_configuration table json schema Related T596 | 18 November 2016, 13:55:18 UTC |
f454e44 | Antoine R. Dumont (@ardumont) | 18 November 2016, 12:24:07 UTC | storage: Update recognized fossology licenses Related T596 | 18 November 2016, 12:25:19 UTC |
bc7f776 | Antoine R. Dumont (@ardumont) | 15 November 2016, 17:04:10 UTC | storage: Fix divergent schema upgrade | 15 November 2016, 17:04:10 UTC |
99b09d4 | Antoine R. Dumont (@ardumont) | 15 November 2016, 17:04:01 UTC | Fix pep8 violation | 15 November 2016, 17:04:01 UTC |
3a4616c | Antoine R. Dumont (@ardumont) | 10 November 2016, 16:18:30 UTC | storage: ctags - Align conflict update policy with license endpoints When updating, we first delete all ctags symbols for the impacted contents, then add the ctags information. Otherwise, we simply add new entries and, in case of conflict, do nothing. | 10 November 2016, 16:25:48 UTC |
9a079c5 | Antoine R. Dumont (@ardumont) | 10 November 2016, 15:54:01 UTC | storage: Update fossology_license to latest design Related T596 | 10 November 2016, 16:17:54 UTC |
7528033 | Antoine R. Dumont (@ardumont) | 09 November 2016, 15:58:10 UTC | Update known licenses from fossology's master branch Related T596 Related 09923374e0f321da78faa0b37b2814fea9c5f1c1 | 10 November 2016, 09:44:16 UTC |
3dd9b0f | Antoine R. Dumont (@ardumont) | 09 November 2016, 15:45:12 UTC | storage: Return unknown licenses Related T596 | 09 November 2016, 15:46:47 UTC |
a63cbc7 | Antoine R. Dumont (@ardumont) | 08 November 2016, 16:08:04 UTC | storage: Open content_license endpoint (add/get) Related T596 | 09 November 2016, 11:30:39 UTC |
2fffbd4 | Antoine R. Dumont (@ardumont) | 08 November 2016, 11:42:22 UTC | storage: Add license and content_license tables Related T596 | 08 November 2016, 14:06:12 UTC |
04f2b2d | Antoine R. Dumont (@ardumont) | 08 November 2016, 11:06:28 UTC | storage: Add comments on enum | 08 November 2016, 14:06:12 UTC |
022e985 | Antoine R. Dumont (@ardumont) | 08 November 2016, 11:04:18 UTC | storage: Move enums to new swh-enums.sql namespace | 08 November 2016, 14:06:11 UTC |
51c4896 | Nicolas Dandrimont | 03 November 2016, 14:39:38 UTC | storage: add check_config method The check_config method allows a dynamic check of the configuration for a running storage. We can make sure that we have proper permissions on the object storage as well as the database before running things. | 03 November 2016, 14:39:38 UTC |
5e8bba5 | Antoine R. Dumont (@ardumont) | 20 October 2016, 13:53:19 UTC | storage: Improve index on content_ctags Work on the suggestion message from postgresql: psycopg2.OperationalError: index row size 3992 exceeds maximum 2712 for index "content_ctags_id_name_kind_line_lang_idx". HINT: Values larger than 1/3 of a buffer page cannot be indexed. Consider a function index of an MD5 hash of the value, or use full text indexing. Related T589 | 20 October 2016, 13:55:44 UTC |
33043b1 | Antoine R. Dumont (@ardumont) | 20 October 2016, 12:41:22 UTC | storage: ctags - Improve schema Related T589 | 20 October 2016, 13:25:28 UTC |
a74a141 | Antoine R. Dumont (@ardumont) | 19 October 2016, 16:29:43 UTC | storage: Open ctags entry points (missing, add, get) Related T589 | 19 October 2016, 16:33:19 UTC |
71b4a88 | Antoine R. Dumont (@ardumont) | 19 October 2016, 16:28:48 UTC | Remove noisy test attribute 'one' | 19 October 2016, 16:33:18 UTC |
4bd537f | Nicolas Dandrimont | 19 October 2016, 14:47:10 UTC | storage: allow adding several origins at once | 19 October 2016, 14:49:08 UTC |
141afef | Nicolas Dandrimont | 19 October 2016, 14:42:19 UTC | common: allow passing in the cursor for the transaction decorators | 19 October 2016, 14:42:19 UTC |
f7becde | Antoine R. Dumont (@ardumont) | 11 October 2016, 16:13:45 UTC | Add the means to pipe contents to another queue once copied Related T575 | 13 October 2016, 13:24:00 UTC |
d2eb077 | Antoine R. Dumont (@ardumont) | 13 October 2016, 10:20:24 UTC | indexer: Unify function names according to conventions | 13 October 2016, 12:18:21 UTC |
97f610a | Antoine R. Dumont (@ardumont) | 13 October 2016, 09:28:07 UTC | Add tests around the content_{mimetype/language}_add endpoints Related T582 | 13 October 2016, 12:18:13 UTC |
1373667 | Antoine R. Dumont (@ardumont) | 13 October 2016, 09:18:40 UTC | indexer: Open mimetype/language get endpoints | 13 October 2016, 12:18:12 UTC |
54efa89 | Antoine R. Dumont (@ardumont) | 12 October 2016, 16:50:36 UTC | indexer: open drop/skip policy update on duplicates (language/mimetype) This adds the optional conflict_update parameter which specifies what to do when conflicts on sha1 occur. conflict_update is false by default, which ignores duplicates. Setting conflict_update to true overwrites existing data. Related T582 | 13 October 2016, 12:18:12 UTC |
dddbc4c | Antoine R. Dumont (@ardumont) | 13 October 2016, 08:23:16 UTC | Fix: Remove nose test attribute 'one' | 13 October 2016, 12:08:00 UTC |
3fcc628 | Antoine R. Dumont (@ardumont) | 12 October 2016, 00:23:25 UTC | Fix provenance storage init function | 12 October 2016, 00:23:25 UTC |
2fd7f72 | Antoine R. Dumont (@ardumont) | 11 October 2016, 23:33:57 UTC | provenance: Rework configuration setup | 11 October 2016, 23:33:57 UTC |
30f7883 | Antoine R. Dumont (@ardumont) | 07 October 2016, 17:08:34 UTC | Open language_mimetype_{missing,add} endpoints Related T578 | 07 October 2016, 18:30:55 UTC |
859860c | Antoine R. Dumont (@ardumont) | 07 October 2016, 16:53:38 UTC | sql/schema: Add content_language table Related T578 | 07 October 2016, 16:53:38 UTC |
fd717f3 | Antoine R. Dumont (@ardumont) | 07 October 2016, 12:36:56 UTC | Open content_mimetype_add endpoint to add missing mimetypes Related T577 | 07 October 2016, 15:08:05 UTC |
a77c187 | Antoine R. Dumont (@ardumont) | 07 October 2016, 12:36:20 UTC | Open content_mimetype_missing endpoint to list missing mimetypes Related T577 | 07 October 2016, 15:08:05 UTC |
5e9244c | Antoine R. Dumont (@ardumont) | 07 October 2016, 09:38:40 UTC | sql/schema: Add content_mimetype table Towards starting computing information on contents Related T577 | 07 October 2016, 15:08:04 UTC |
7add2cd | Stefano Zacchiroli | 07 October 2016, 14:53:49 UTC | DB schema graph: add new "provenance" cluster it includes the cache_* tables that are currently being populated | 07 October 2016, 14:53:49 UTC |
0f29092 | Stefano Zacchiroli | 07 October 2016, 14:53:14 UTC | DB schema graph: add stray origin_visit table | 07 October 2016, 14:53:14 UTC |
a1aa8be | Antoine R. Dumont (@ardumont) | 29 September 2016, 17:27:34 UTC | Align implementation with docstring's contract | 29 September 2016, 18:31:23 UTC |
6c505cc | Antoine R. Dumont (@ardumont) | 29 September 2016, 16:57:40 UTC | Fix: Missing incremented version 5 for archiver.dbversion | 29 September 2016, 16:57:40 UTC |
1afea82 | Antoine R. Dumont (@ardumont) | 29 September 2016, 14:55:44 UTC | Retrieve information on a content cached | 29 September 2016, 16:45:10 UTC |
a43b962 | Antoine R. Dumont (@ardumont) | 29 September 2016, 14:55:06 UTC | Rename to swh_cache_content_get_all | 29 September 2016, 14:55:06 UTC |
f12d9ef | Antoine R. Dumont (@ardumont) | 28 September 2016, 08:20:27 UTC | Fix copyright range | 29 September 2016, 12:42:57 UTC |
1b4aa4f | Antoine R. Dumont (@ardumont) | 25 September 2016, 09:59:39 UTC | archiver: Remove print statement | 29 September 2016, 12:42:57 UTC |
4b5287e | Nicolas Dandrimont | 23 September 2016, 11:39:43 UTC | upgrades/085: add upgrade script | 23 September 2016, 11:39:43 UTC |
005710e | Nicolas Dandrimont | 23 September 2016, 11:38:11 UTC | sql/swh-func: content cache populates lines in deterministic order This should reduce lock contention when parallelizing the operation | 23 September 2016, 11:38:11 UTC |
4d6d3bd | Antoine R. Dumont (@ardumont) | 23 September 2016, 10:16:25 UTC | archiver: Pass the destination as parameter of the worker to backend | 23 September 2016, 10:28:32 UTC |
394bb4d | Antoine R. Dumont (@ardumont) | 23 September 2016, 10:07:19 UTC | archiver: Add missing property for worker to backend | 23 September 2016, 10:28:32 UTC |
718dda6 | Antoine R. Dumont (@ardumont) | 23 September 2016, 10:06:55 UTC | archiver: Complete docstring's information | 23 September 2016, 10:28:32 UTC |
f29c207 | Antoine R. Dumont (@ardumont) | 23 September 2016, 10:01:41 UTC | archiver: Simplify update on content | 23 September 2016, 10:28:32 UTC |
a67aa26 | Antoine R. Dumont (@ardumont) | 23 September 2016, 09:50:53 UTC | archiver: Improve 'unknown sha1' and 'force copy' policies The 'unknown sha1 path' cannot happen in the default archiver since it reads from the archive db (so the fallback code is not necessary in the worker). In contrast, since 'archiver to backend' reads from stdin (for now), we could have unregistered sha1s from that source. This commit makes the director deal with that before sending sha1s to workers. It's also the director's job to set the state to 'missing' when force_copy is true before sending sha1s to workers. | 23 September 2016, 10:28:32 UTC |
9b04941 | Antoine R. Dumont (@ardumont) | 23 September 2016, 09:49:44 UTC | archiver: Fix random.choice input to a list | 23 September 2016, 10:28:31 UTC |
7332c31 | Antoine R. Dumont (@ardumont) | 23 September 2016, 09:47:02 UTC | sql/archiver/schema: Filter unknown sha1s from content_archive endpoint | 23 September 2016, 10:28:31 UTC |
de67eb7 | Nicolas Dandrimont | 22 September 2016, 18:37:57 UTC | provenance: fix typo: we have hex in the message, not hashes | 22 September 2016, 18:37:57 UTC |
ff87ac5 | Nicolas Dandrimont | 22 September 2016, 16:49:20 UTC | swh-func: content-revision cache population now takes a list of revs | 22 September 2016, 16:51:14 UTC |
30f5645 | Nicolas Dandrimont | 22 September 2016, 12:35:50 UTC | swh-func: less churn in the cache_content_revision table | 22 September 2016, 12:42:43 UTC |
4c3623c | Antoine R. Dumont (@ardumont) | 22 September 2016, 11:42:19 UTC | Archiver: Fix to copy only to targeted destination Before that, it could, for example, push copies to other mirrors where the content was missing. | 22 September 2016, 11:43:45 UTC |
cdf11d5 | Antoine R. Dumont (@ardumont) | 22 September 2016, 10:25:06 UTC | d/control: Bump dependency version to latest python3-swh.core | 22 September 2016, 10:37:43 UTC |
57053fe | Antoine R. Dumont (@ardumont) | 22 September 2016, 10:31:46 UTC | Refactor: Align source/destination configuration property names | 22 September 2016, 10:37:43 UTC |
f163c2a | Antoine R. Dumont (@ardumont) | 22 September 2016, 10:24:15 UTC | Handle copies of not registered contents in archiver db Closes T569 | 22 September 2016, 10:37:42 UTC |