654a37e | Antoine R. Dumont (@ardumont) | 09 October 2019, 13:28:51 UTC | tox.ini: Fix py3 environment to use packaged tests Related D2082 | 09 October 2019, 13:28:51 UTC |
03d5a2c | Antoine R. Dumont (@ardumont) | 08 October 2019, 12:13:14 UTC | swh.storage.buffer: Add buffering proxy storage implementation Related T1389 | 08 October 2019, 14:40:51 UTC |
c83f1f9 | Antoine R. Dumont (@ardumont) | 08 October 2019, 12:09:23 UTC | swh.storage.filter: Add filtering storage implementation Also add a sample_data fixture to read default test data from. Related T1389 | 08 October 2019, 14:12:49 UTC |
02b25f0 | Antoine R. Dumont (@ardumont) | 08 October 2019, 12:07:54 UTC | swh.storage: Test get_storage implementation | 08 October 2019, 14:09:56 UTC |
3228b18 | Nicolas Dandrimont | 02 October 2019, 15:21:07 UTC | test_storage: Use the db_transaction context manager instead of raw connections This minimizes the risk of dangling db connections, which in turn reduces the chance of cascading test failures. | 02 October 2019, 15:21:07 UTC |
4c2c311 | Nicolas Dandrimont | 02 October 2019, 15:19:04 UTC | test_storage: introduce a db_transaction context manager This wraps the steps needed to get a (properly wrapped) db object and run a transaction on it so we avoid repeating ourselves | 02 October 2019, 15:20:54 UTC |
9bbd5e2 | Nicolas Dandrimont | 02 October 2019, 15:04:31 UTC | Postgres storage: introduce a db() context manager This helper avoids us repeating the get_db/put_db pattern all the time. | 02 October 2019, 15:13:41 UTC |
e8c01a2 | Nicolas Dandrimont | 02 October 2019, 11:51:05 UTC | Replace all 'inmemory' journal writers with 'memory' | 02 October 2019, 15:13:41 UTC |
24ea71f | Nicolas Dandrimont | 02 October 2019, 15:07:43 UTC | test_storage: Close all pool connections on teardown This avoids having a race condition between the tear down and the database removal, when using more than one connection in the pool. | 02 October 2019, 15:13:41 UTC |
c1ce3a1 | Nicolas Dandrimont | 12 April 2019, 14:15:31 UTC | Add test triggering the race condition in content_add This minimal example triggers T2019 | 01 October 2019, 15:59:05 UTC |
70b15c1 | Antoine Pietri | 01 October 2019, 07:41:04 UTC | api server: missing </body> | 01 October 2019, 07:42:00 UTC |
7c92c33 | Stefano Zacchiroli | 28 September 2019, 11:24:21 UTC | tox: anticipate mypy run to just after flake8 | 01 October 2019, 06:25:21 UTC |
4901259 | Stefano Zacchiroli | 28 September 2019, 11:19:53 UTC | mypy.ini: be less flaky w.r.t. the packages installed in tox in particular, ignore django.* (only indirectly shipped by hypothesis) and allow unused type: ignore (on swh.journal) | 01 October 2019, 06:25:21 UTC |
d8fa30b | Stefano Zacchiroli | 28 September 2019, 10:42:34 UTC | storage.py: ignore typing of optional get_journal_writer import the real type is Optional[Callable[..., Any]], but redefinitions of imported stuff in try..except is currently not supported by mypy, see https://github.com/python/mypy/issues/1153 | 01 October 2019, 06:25:21 UTC |
976aab3 | Stefano Zacchiroli | 27 September 2019, 12:08:48 UTC | mypy: ignore swh.journal to work-around dependency loop | 01 October 2019, 06:25:21 UTC |
ef7d2ef | Stefano Zacchiroli | 27 September 2019, 08:37:04 UTC | init.py: switch to documented way of extending path make mypy 0.730 pass cleanly again | 01 October 2019, 06:25:21 UTC |
395e85d | Stefano Zacchiroli | 21 September 2019, 19:15:02 UTC | typing: minimal changes to make a no-op mypy run pass | 01 October 2019, 06:25:21 UTC |
cbc47c4 | Valentin Lorentz | 24 September 2019, 13:13:04 UTC | Write objects to the journal only if they don't exist yet. * spares some bandwidth/processing time * aligns {content,directory,release,revision}_add on what {snapshot,origin}_add already does | 30 September 2019, 10:05:02 UTC |
e239324 | Valentin Lorentz | 30 September 2019, 09:14:23 UTC | Use origin URLs for skipped_content['origin'] instead of origin ids. This commit uses URLs *instead of* IDs, not in addition to. Supporting IDs should not be needed anymore. | 30 September 2019, 09:43:46 UTC |
0919bdf | Valentin Lorentz | 27 September 2019, 13:20:01 UTC | Properly mock get_journal_writer for the remote-pg-storage tests. | 27 September 2019, 13:28:53 UTC |
9aa6b9d | David Douard | 16 September 2019, 15:17:30 UTC | journal_writer: use journal writer from swh.journal | 25 September 2019, 15:00:12 UTC |
be3a025 | Stefano Zacchiroli | 21 September 2019, 19:16:05 UTC | fix typos in docstrings and sample paths courtesy of codespell | 21 September 2019, 19:23:01 UTC |
2ceacb1 | Antoine R. Dumont (@ardumont) | 20 September 2019, 12:16:49 UTC | storage.origin_visit_add: Remove deprecated 'ts' parameter The parameter `date` becomes mandatory. | 20 September 2019, 12:22:55 UTC |
b2e8aa2 | Stefano Zacchiroli | 16 September 2019, 14:57:24 UTC | click "required" param wants bool, not int courtesy of mypy | 16 September 2019, 14:57:24 UTC |
f91671b | Antoine Lambert | 04 September 2019, 13:44:36 UTC | tests/test_storage: Remove failing assertion after swh-model update | 04 September 2019, 13:44:36 UTC |
ddb8226 | Antoine Lambert | 04 September 2019, 12:25:49 UTC | tests/test_storage: Fix tests execution with psycopg2 < 2.8 | 04 September 2019, 13:11:45 UTC |
bf393fd | Valentin Lorentz | 02 September 2019, 09:28:07 UTC | Add support for origin_url in origin_metadata_*. | 02 September 2019, 09:28:07 UTC |
bc3240d | Valentin Lorentz | 21 August 2019, 15:32:22 UTC | Make origin_add/origin_visit_update validate their input. | 29 August 2019, 11:49:48 UTC |
863443b | Valentin Lorentz | 21 August 2019, 11:49:31 UTC | Make snapshot_add validate its input. | 29 August 2019, 11:36:48 UTC |
add5b01 | Valentin Lorentz | 20 August 2019, 15:25:12 UTC | Make revision_add and release_add validate their input. | 29 August 2019, 09:34:17 UTC |
6946edb | Valentin Lorentz | 20 August 2019, 15:17:15 UTC | Make directory_add validate its input. | 29 August 2019, 09:32:54 UTC |
3a6c5e3 | Valentin Lorentz | 19 August 2019, 12:43:39 UTC | Make content_add validate its input using swh-model. The initial goal of this commit was to make the in-mem content_add raise errors on input that would make postgresql error. As a side-effect, the pgsql content_add now validate its input as well, so clients get a nice/meaningful error instead of postgresql's error. | 29 August 2019, 09:31:37 UTC |
70f63c3 | Valentin Lorentz | 21 August 2019, 11:47:50 UTC | Remove 'next_branch' from test input data. It should only be in the output of 'snapshot_get_*' methods, not in the input of 'snapshot_add'. | 23 August 2019, 08:31:35 UTC |
54b974f | Valentin Lorentz | 21 August 2019, 12:08:28 UTC | Eliminate warnings about %TYPE. Using %TYPE showed this warning when tests fail for unrelated reasons: NOTICE: type reference snapshot.id%TYPE converted to sha1_git | 23 August 2019, 08:26:38 UTC |
b775947 | Valentin Lorentz | 23 August 2019, 08:25:05 UTC | Fix off-by-one error when using origin_visit_upsert on with an unknown visit id. | 23 August 2019, 08:25:05 UTC |
e656d6e | Valentin Lorentz | 22 August 2019, 12:19:00 UTC | Use explicit arguments for origin_visit_add. swh-model v0.0.42 adds more fields to OriginVisit. | 22 August 2019, 12:19:00 UTC |
56a031d | Valentin Lorentz | 20 August 2019, 12:08:47 UTC | Stop leaking person ids. They are not used anymore. | 20 August 2019, 12:59:10 UTC |
57f60cd | Valentin Lorentz | 19 August 2019, 14:47:38 UTC | Remove person_get endpoint. swh-web is about to stop using it, and was the only user of this endpoint. | 20 August 2019, 12:57:24 UTC |
2ef75a2 | Valentin Lorentz | 19 August 2019, 12:41:38 UTC | Remove test_content_missing__marked_missing, it makes no sense. status 'missing' does not exist. | 19 August 2019, 12:41:38 UTC |
babded5 | Valentin Lorentz | 23 July 2019, 13:09:05 UTC | Enforce log level for the werkzeug logger. I don't understand why it's not applied by default (since swh.core applied it to the root logger), but this commit fixes the issue. | 23 July 2019, 13:09:05 UTC |
f71f531 | Ishan Bhanuka | 06 July 2019, 07:20:28 UTC | Add support for skipped content in in-memory storage | 22 July 2019, 15:07:05 UTC |
bc2f03c | David Douard | 15 July 2019, 14:00:03 UTC | api: use RPCServerApp and RPCClient instead of deprecated classes SWHServerAPIApp and SWHRemoteAPI. Depends on swh.core 0.0.65. | 18 July 2019, 11:29:07 UTC |
7053da8 | Valentin Lorentz | 12 July 2019, 13:20:35 UTC | Make origin_get ignore the `type` argument | 18 July 2019, 09:38:15 UTC |
b27fcd1 | Stefano Zacchiroli | 11 July 2019, 14:29:51 UTC | add code of conduct document | 11 July 2019, 14:29:51 UTC |
043c57b | Nicolas Dandrimont | 11 July 2019, 11:50:28 UTC | Drop useless dependency on swh.scheduler | 11 July 2019, 11:50:28 UTC |
9e00284 | Valentin Lorentz | 09 July 2019, 13:00:53 UTC | Add tests for the in-mem storage with origin ids disabled. | 10 July 2019, 14:16:18 UTC |
6b68fc4 | Valentin Lorentz | 08 July 2019, 12:28:51 UTC | Remove the legacy behavior of snapshot_add (which updates a visit). We stopped using it a while ago by now. | 08 July 2019, 12:38:47 UTC |
867b3b3 | Valentin Lorentz | 08 July 2019, 12:28:24 UTC | Fix/silence warnings. | 08 July 2019, 12:28:24 UTC |
4a16a10 | Valentin Lorentz | 04 July 2019, 14:59:55 UTC | Add an env var to the in-mem storage to disable origin ids. If it is true (the default), there is no change from the current behavior, but setting it to False and running another package's tests allows to make sure that package does not rely on origin ids at all. | 08 July 2019, 11:53:18 UTC |
4303fe8 | Stefano Zacchiroli | 04 July 2019, 12:34:22 UTC | README: fix typo | 04 July 2019, 12:34:22 UTC |
617f602 | Stefano Zacchiroli | 04 July 2019, 12:34:03 UTC | CONTRIBUTORS: add Quentin Campos | 04 July 2019, 12:34:03 UTC |
79ceb21 | Valentin Lorentz | 04 July 2019, 08:36:56 UTC | Fix examples in the 'type' column | 04 July 2019, 08:40:26 UTC |
eef8cd7 | Valentin Lorentz | 23 May 2019, 13:44:52 UTC | Specify of extrinsic origin metadata and their storage. | 03 July 2019, 14:53:25 UTC |
af2d253 | Valentin Lorentz | 02 July 2019, 14:53:12 UTC | Make the in-memory storage use URLs instead of ids as its 'primary key' internally. This is a first step toward disabling origin-ids entirely. | 03 July 2019, 08:15:41 UTC |
89fc4a3 | Valentin Lorentz | 02 July 2019, 14:50:55 UTC | Make origin_visit_upsert take an origin dict instead of int. That's the sensible format, and the one we already use in the backfiller. | 03 July 2019, 08:15:16 UTC |
268d122 | Valentin Lorentz | 01 July 2019, 16:22:37 UTC | Fix rebase; reset_storage_tables was renamed to reset_storage. | 01 July 2019, 16:22:37 UTC |
d98f065 | Valentin Lorentz | 28 June 2019, 18:53:23 UTC | Make all endpoints support origin urls instead of ids. | 01 July 2019, 13:52:23 UTC |
f0853ac | Valentin Lorentz | 28 June 2019, 18:42:38 UTC | Merge id/url tests using an arg provided by hypothesis. | 01 July 2019, 13:51:49 UTC |
06fe23b | Valentin Lorentz | 27 June 2019, 16:41:48 UTC | Add an 'origin_visit_find_by_date' endpoint. It reuses the 'swh_visit_find_by_date' function that became dead code when occurences were dropped. So swh-web can use it directly instead of reimplementing its logic in 'get_origin_visit'. | 01 July 2019, 13:51:20 UTC |
37396a5 | Valentin Lorentz | 28 June 2019, 18:33:37 UTC | Add a 'reset' endpoint to the in-mem storage. It will be used for API client/server property tests, because they need to remotely reset the state and cannot create a new storage. | 01 July 2019, 13:50:16 UTC |
168ebf9 | Valentin Lorentz | 05 June 2019, 11:29:59 UTC | Add test for content_get_range's boundaries. | 01 July 2019, 13:49:18 UTC |
e73e886 | Stefano Zacchiroli | 28 June 2019, 07:42:00 UTC | CONTRIBUTORS: add Ishan Bhanuka | 28 June 2019, 07:42:00 UTC |
96a80d2 | Valentin Lorentz | 25 June 2019, 09:58:20 UTC | Add support for origin_url in fetch_history_start. Unbreaks swh-loader-core. | 25 June 2019, 09:58:20 UTC |
4696160 | Valentin Lorentz | 05 June 2019, 11:35:41 UTC | Fix key names in the in-mem impl of metadata_provider_{get,add} | 24 June 2019, 09:18:20 UTC |
2ead4ce | Ishan Bhanuka | 14 June 2019, 06:14:39 UTC | Added comments for all tables and columns | 20 June 2019, 07:18:16 UTC |
b139393 | Valentin Lorentz | 14 June 2019, 11:55:22 UTC | Don't use a join in origin_visit_get_latest. This changes the behavior of snapshot_get_latest, to raise an error in case a visit references an unknown snapshot. This should not happen in practice (unless we're replaying from a journal). snapshot_get_latest will be removed in the next monthes anyway. | 19 June 2019, 12:34:34 UTC |
dc7c1e0 | Valentin Lorentz | 05 June 2019, 11:18:14 UTC | Add test for snapshot/release counters. | 17 June 2019, 09:00:55 UTC |
aec9571 | Valentin Lorentz | 05 June 2019, 09:41:50 UTC | Add origin_visit counter to the in-mem storage. | 17 June 2019, 09:00:55 UTC |
c541224 | Valentin Lorentz | 13 June 2019, 14:27:50 UTC | Actually run the remote-pg-storage tests on a remote-pg. They were run on a local pg instead of via the HTTP API. | 17 June 2019, 08:40:27 UTC |
34c2c34 | Antoine R. Dumont (@ardumont) | 15 June 2019, 06:33:43 UTC | 136: Drop the extra 'text' mention which fails the alter query | 15 June 2019, 06:34:26 UTC |
c891a5b | Antoine R. Dumont (@ardumont) | 15 June 2019, 06:28:40 UTC | 135: Dropping production index after new ones replacing it are built Also, the index creations must be done sequentially. Those new indexes need to be built prior to remove the old index. Otherwise, we break the api (db answering too slow leads to the api breaking). | 15 June 2019, 06:34:26 UTC |
2d610ab | Valentin Lorentz | 13 June 2019, 14:31:07 UTC | Add a new origin_visit_get_latest endpoint. snapshot_get_latest did two things: find the latest origin, and get its snapshot. Now, it only calls origin_visit_get_latest, then snapshot_get. snapshot_get_latest should also be deprecated, because it only fetches some of the branches (for large snapshots). Finally, with the new data model (types are associated to visits, not origins), origin_visit_get_latest is required to know the type of the loader that got the snapshot (needed in swh-indexer). Also, note that origin_visit_get_latest only supports URLs as argument, not origin ids. Since we are migrating away from origin ids, I don't think it makes sense to add support for origin ids in a new endpoint. | 14 June 2019, 11:28:49 UTC |
df461d1 | Valentin Lorentz | 14 June 2019, 11:12:48 UTC | Ignore Hypothesis timeout error for test_add_arbitrary. | 14 June 2019, 11:13:10 UTC |
5e16e60 | Valentin Lorentz | 05 June 2019, 09:35:32 UTC | Add test for date parsing of origin_visit_add. | 13 June 2019, 12:10:03 UTC |
16b6dc5 | Valentin Lorentz | 05 June 2019, 11:23:19 UTC | Add tests for content_missing. | 13 June 2019, 12:08:36 UTC |
9afe736 | Valentin Lorentz | 05 June 2019, 11:40:18 UTC | Test date parsing of origin_metadata_add. | 13 June 2019, 12:04:12 UTC |
34b8c54 | Valentin Lorentz | 05 June 2019, 11:33:13 UTC | Add a test checking that directory_ls does not recurse into subdirs by default. | 12 June 2019, 13:44:28 UTC |
21e26e3 | Valentin Lorentz | 05 June 2019, 11:39:23 UTC | Test that content_find filters out contents that don't match. | 12 June 2019, 13:21:23 UTC |
eed16c2 | Valentin Lorentz | 05 June 2019, 11:16:05 UTC | Add test for ctime in content_add. | 12 June 2019, 13:07:44 UTC |
fb88f61 | Valentin Lorentz | 05 June 2019, 11:26:54 UTC | Add test for content_add when one of the content is already known. | 12 June 2019, 12:29:58 UTC |
cb48f63 | Valentin Lorentz | 05 June 2019, 11:21:30 UTC | Add test for snapshot_get_branches with both filter and pagination. Reviewers: #reviewers, douardda Reviewed By: #reviewers, douardda Differential Revision: https://forge.softwareheritage.org/D1543 | 12 June 2019, 12:26:37 UTC |
aef6dbf | Antoine R. Dumont (@ardumont) | 11 June 2019, 13:32:54 UTC | upgrades/136: Upgrade update instructions | 11 June 2019, 13:33:15 UTC |
e4e5b5c | Valentin Lorentz | 11 June 2019, 13:24:49 UTC | Mark network tests, so they can be disabled. | 11 June 2019, 13:24:49 UTC |
5a9f544 | Valentin Lorentz | 06 June 2019, 15:05:03 UTC | Add support for using URL instead of ID in snapshot_get_latest. | 11 June 2019, 10:01:50 UTC |
0386716 | Valentin Lorentz | 06 June 2019, 15:03:22 UTC | Add support for using URL instead of ID in origin_visit_{add,update}. | 11 June 2019, 10:01:08 UTC |
21b5142 | Valentin Lorentz | 06 June 2019, 13:30:13 UTC | Add a 'type' column to origin_visit. This is a step toward dropping the 'type' column of origins. | 11 June 2019, 10:00:00 UTC |
d8f0a7b | David Douard | 11 June 2019, 08:09:55 UTC | storage: small typo in Storage.content_add_metadata's doc string | 11 June 2019, 08:11:58 UTC |
710e0bf | Valentin Lorentz | 06 June 2019, 12:08:27 UTC | Add support for omitting origin type when calling origin_get. | 06 June 2019, 14:21:07 UTC |
3349788 | Valentin Lorentz | 05 June 2019, 09:34:18 UTC | Make origin_visit_upsert parse date strings. | 05 June 2019, 12:31:11 UTC |
75ac20f | Valentin Lorentz | 05 June 2019, 11:25:05 UTC | Add test for content_get on missing content. | 05 June 2019, 11:45:23 UTC |
09666c0 | Valentin Lorentz | 05 June 2019, 11:21:30 UTC | Add test for snapshot_get_branches with both filter and pagination. | 05 June 2019, 11:21:30 UTC |
909c69b | Valentin Lorentz | 04 June 2019, 13:14:40 UTC | Fix recursive entry names in in_memory.py's directory_entry_get_by_path | 04 June 2019, 13:14:40 UTC |
efda98e | Valentin Lorentz | 29 May 2019, 12:52:46 UTC | Fix origin_search in the in-mem storage to search for sub-strings. | 04 June 2019, 11:29:24 UTC |
3556b74 | Valentin Lorentz | 29 May 2019, 12:54:47 UTC | Unify argument names of the pg and in-mem storage. Argument names matter for the remote API. | 29 May 2019, 16:13:03 UTC |
17d4112 | David Douard | 17 May 2019, 12:43:09 UTC | Update the README file especially for latest CLI modifications. Add a slightly more detailed step-by-step 'quick start' section. | 17 May 2019, 13:08:59 UTC |
289b233 | David Douard | 17 May 2019, 12:42:26 UTC | cli: small improvements of 'rpc-serve --help' message | 17 May 2019, 13:07:24 UTC |
39a1c45 | Valentin Lorentz | 17 May 2019, 12:53:46 UTC | Make content_find return an empty list instead of None. | 17 May 2019, 12:53:46 UTC |
ea3f2d3 | Valentin Lorentz | 17 May 2019, 12:51:17 UTC | Update import of swh.core.db.tests.db_testing. | 17 May 2019, 12:51:17 UTC |
01ed375 | Valentin Lorentz | 17 May 2019, 11:32:55 UTC | Update import of swh.core.api.tests.server_testing. | 17 May 2019, 11:32:59 UTC |
fa02fa4 | David Douard | 15 May 2019, 15:08:04 UTC | cli: add a 'storage' cli group with a 'rpc-serve' subcommand we do add this 'storage' group even if it contains only one subcommand (serve) for UI consistency: swh storage rpc-serve We do keep the main swh-storage console_script entry point as is for bw compat, however. | 16 May 2019, 15:28:29 UTC |