56a031d | Valentin Lorentz | 20 August 2019, 12:08:47 UTC | Stop leaking person ids. They are not used anymore. | 20 August 2019, 12:59:10 UTC |
57f60cd | Valentin Lorentz | 19 August 2019, 14:47:38 UTC | Remove person_get endpoint. swh-web is about to stop using it, and was the only user of this endpoint. | 20 August 2019, 12:57:24 UTC |
2ef75a2 | Valentin Lorentz | 19 August 2019, 12:41:38 UTC | Remove test_content_missing__marked_missing, it makes no sense. status 'missing' does not exist. | 19 August 2019, 12:41:38 UTC |
babded5 | Valentin Lorentz | 23 July 2019, 13:09:05 UTC | Enforce log level for the werkzeug logger. I don't understand why it's not applied by default (since swh.core applied it to the root logger), but this commit fixes the issue. | 23 July 2019, 13:09:05 UTC |
f71f531 | Ishan Bhanuka | 06 July 2019, 07:20:28 UTC | Add support for skipped content in in-memory storage | 22 July 2019, 15:07:05 UTC |
bc2f03c | David Douard | 15 July 2019, 14:00:03 UTC | api: use RPCServerApp and RPCClient instead of deprecated classes SWHServerAPIApp and SWHRemoteAPI. Depends on swh.core 0.0.65. | 18 July 2019, 11:29:07 UTC |
c3c435b | Jenkins for Software Heritage | 18 July 2019, 10:16:15 UTC | New upstream version 0.0.147 | 18 July 2019, 10:16:15 UTC |
7053da8 | Valentin Lorentz | 12 July 2019, 13:20:35 UTC | Make origin_get ignore the `type` argument | 18 July 2019, 09:38:15 UTC |
76e9def | Jenkins for Software Heritage | 18 July 2019, 08:52:09 UTC | New upstream version 0.0.146 | 18 July 2019, 08:52:09 UTC |
b27fcd1 | Stefano Zacchiroli | 11 July 2019, 14:29:51 UTC | add code of conduct document | 11 July 2019, 14:29:51 UTC |
043c57b | Nicolas Dandrimont | 11 July 2019, 11:50:28 UTC | Drop useless dependency on swh.scheduler | 11 July 2019, 11:50:28 UTC |
9e00284 | Valentin Lorentz | 09 July 2019, 13:00:53 UTC | Add tests for the in-mem storage with origin ids disabled. | 10 July 2019, 14:16:18 UTC |
6b68fc4 | Valentin Lorentz | 08 July 2019, 12:28:51 UTC | Remove the legacy behavior of snapshot_add (which updates a visit). We stopped using it a while ago by now. | 08 July 2019, 12:38:47 UTC |
867b3b3 | Valentin Lorentz | 08 July 2019, 12:28:24 UTC | Fix/silence warnings. | 08 July 2019, 12:28:24 UTC |
4a16a10 | Valentin Lorentz | 04 July 2019, 14:59:55 UTC | Add an env var to the in-mem storage to disable origin ids. If it is true (the default), there is no change from the current behavior, but setting it to False and running another package's tests allows to make sure that package does not rely on origin ids at all. | 08 July 2019, 11:53:18 UTC |
4303fe8 | Stefano Zacchiroli | 04 July 2019, 12:34:22 UTC | README: fix typo | 04 July 2019, 12:34:22 UTC |
617f602 | Stefano Zacchiroli | 04 July 2019, 12:34:03 UTC | CONTRIBUTORS: add Quentin Campos | 04 July 2019, 12:34:03 UTC |
79ceb21 | Valentin Lorentz | 04 July 2019, 08:36:56 UTC | Fix examples in the 'type' column | 04 July 2019, 08:40:26 UTC |
eef8cd7 | Valentin Lorentz | 23 May 2019, 13:44:52 UTC | Specify of extrinsic origin metadata and their storage. | 03 July 2019, 14:53:25 UTC |
af2d253 | Valentin Lorentz | 02 July 2019, 14:53:12 UTC | Make the in-memory storage use URLs instead of ids as its 'primary key' internally. This is a first step toward disabling origin-ids entirely. | 03 July 2019, 08:15:41 UTC |
89fc4a3 | Valentin Lorentz | 02 July 2019, 14:50:55 UTC | Make origin_visit_upsert take an origin dict instead of int. That's the sensible format, and the one we already use in the backfiller. | 03 July 2019, 08:15:16 UTC |
5715000 | Jenkins for Software Heritage | 02 July 2019, 10:19:17 UTC | New upstream version 0.0.145 | 02 July 2019, 10:19:17 UTC |
268d122 | Valentin Lorentz | 01 July 2019, 16:22:37 UTC | Fix rebase; reset_storage_tables was renamed to reset_storage. | 01 July 2019, 16:22:37 UTC |
d98f065 | Valentin Lorentz | 28 June 2019, 18:53:23 UTC | Make all endpoints support origin urls instead of ids. | 01 July 2019, 13:52:23 UTC |
f0853ac | Valentin Lorentz | 28 June 2019, 18:42:38 UTC | Merge id/url tests using an arg provided by hypothesis. | 01 July 2019, 13:51:49 UTC |
06fe23b | Valentin Lorentz | 27 June 2019, 16:41:48 UTC | Add an 'origin_visit_find_by_date' endpoint. It reuses the 'swh_visit_find_by_date' function that became dead code when occurences were dropped. So swh-web can use it directly instead of reimplementing its logic in 'get_origin_visit'. | 01 July 2019, 13:51:20 UTC |
37396a5 | Valentin Lorentz | 28 June 2019, 18:33:37 UTC | Add a 'reset' endpoint to the in-mem storage. It will be used for API client/server property tests, because they need to remotely reset the state and cannot create a new storage. | 01 July 2019, 13:50:16 UTC |
168ebf9 | Valentin Lorentz | 05 June 2019, 11:29:59 UTC | Add test for content_get_range's boundaries. | 01 July 2019, 13:49:18 UTC |
3f4c117 | Jenkins for Software Heritage | 01 July 2019, 12:38:38 UTC | New upstream version 0.0.143 | 01 July 2019, 12:38:38 UTC |
e73e886 | Stefano Zacchiroli | 28 June 2019, 07:42:00 UTC | CONTRIBUTORS: add Ishan Bhanuka | 28 June 2019, 07:42:00 UTC |
96a80d2 | Valentin Lorentz | 25 June 2019, 09:58:20 UTC | Add support for origin_url in fetch_history_start. Unbreaks swh-loader-core. | 25 June 2019, 09:58:20 UTC |
4696160 | Valentin Lorentz | 05 June 2019, 11:35:41 UTC | Fix key names in the in-mem impl of metadata_provider_{get,add} | 24 June 2019, 09:18:20 UTC |
2ead4ce | Ishan Bhanuka | 14 June 2019, 06:14:39 UTC | Added comments for all tables and columns | 20 June 2019, 07:18:16 UTC |
b139393 | Valentin Lorentz | 14 June 2019, 11:55:22 UTC | Don't use a join in origin_visit_get_latest. This changes the behavior of snapshot_get_latest, to raise an error in case a visit references an unknown snapshot. This should not happen in practice (unless we're replaying from a journal). snapshot_get_latest will be removed in the next monthes anyway. | 19 June 2019, 12:34:34 UTC |
dc7c1e0 | Valentin Lorentz | 05 June 2019, 11:18:14 UTC | Add test for snapshot/release counters. | 17 June 2019, 09:00:55 UTC |
aec9571 | Valentin Lorentz | 05 June 2019, 09:41:50 UTC | Add origin_visit counter to the in-mem storage. | 17 June 2019, 09:00:55 UTC |
c541224 | Valentin Lorentz | 13 June 2019, 14:27:50 UTC | Actually run the remote-pg-storage tests on a remote-pg. They were run on a local pg instead of via the HTTP API. | 17 June 2019, 08:40:27 UTC |
34c2c34 | Antoine R. Dumont (@ardumont) | 15 June 2019, 06:33:43 UTC | 136: Drop the extra 'text' mention which fails the alter query | 15 June 2019, 06:34:26 UTC |
c891a5b | Antoine R. Dumont (@ardumont) | 15 June 2019, 06:28:40 UTC | 135: Dropping production index after new ones replacing it are built Also, the index creations must be done sequentially. Those new indexes need to be built prior to remove the old index. Otherwise, we break the api (db answering too slow leads to the api breaking). | 15 June 2019, 06:34:26 UTC |
2d610ab | Valentin Lorentz | 13 June 2019, 14:31:07 UTC | Add a new origin_visit_get_latest endpoint. snapshot_get_latest did two things: find the latest origin, and get its snapshot. Now, it only calls origin_visit_get_latest, then snapshot_get. snapshot_get_latest should also be deprecated, because it only fetches some of the branches (for large snapshots). Finally, with the new data model (types are associated to visits, not origins), origin_visit_get_latest is required to know the type of the loader that got the snapshot (needed in swh-indexer). Also, note that origin_visit_get_latest only supports URLs as argument, not origin ids. Since we are migrating away from origin ids, I don't think it makes sense to add support for origin ids in a new endpoint. | 14 June 2019, 11:28:49 UTC |
df461d1 | Valentin Lorentz | 14 June 2019, 11:12:48 UTC | Ignore Hypothesis timeout error for test_add_arbitrary. | 14 June 2019, 11:13:10 UTC |
5e16e60 | Valentin Lorentz | 05 June 2019, 09:35:32 UTC | Add test for date parsing of origin_visit_add. | 13 June 2019, 12:10:03 UTC |
16b6dc5 | Valentin Lorentz | 05 June 2019, 11:23:19 UTC | Add tests for content_missing. | 13 June 2019, 12:08:36 UTC |
9afe736 | Valentin Lorentz | 05 June 2019, 11:40:18 UTC | Test date parsing of origin_metadata_add. | 13 June 2019, 12:04:12 UTC |
34b8c54 | Valentin Lorentz | 05 June 2019, 11:33:13 UTC | Add a test checking that directory_ls does not recurse into subdirs by default. | 12 June 2019, 13:44:28 UTC |
21e26e3 | Valentin Lorentz | 05 June 2019, 11:39:23 UTC | Test that content_find filters out contents that don't match. | 12 June 2019, 13:21:23 UTC |
eed16c2 | Valentin Lorentz | 05 June 2019, 11:16:05 UTC | Add test for ctime in content_add. | 12 June 2019, 13:07:44 UTC |
fb88f61 | Valentin Lorentz | 05 June 2019, 11:26:54 UTC | Add test for content_add when one of the content is already known. | 12 June 2019, 12:29:58 UTC |
cb48f63 | Valentin Lorentz | 05 June 2019, 11:21:30 UTC | Add test for snapshot_get_branches with both filter and pagination. Reviewers: #reviewers, douardda Reviewed By: #reviewers, douardda Differential Revision: https://forge.softwareheritage.org/D1543 | 12 June 2019, 12:26:37 UTC |
be6034f | Jenkins for Software Heritage | 11 June 2019, 13:44:18 UTC | New upstream version 0.0.142 | 11 June 2019, 13:44:18 UTC |
aef6dbf | Antoine R. Dumont (@ardumont) | 11 June 2019, 13:32:54 UTC | upgrades/136: Upgrade update instructions | 11 June 2019, 13:33:15 UTC |
e4e5b5c | Valentin Lorentz | 11 June 2019, 13:24:49 UTC | Mark network tests, so they can be disabled. | 11 June 2019, 13:24:49 UTC |
edc279b | Jenkins for Software Heritage | 11 June 2019, 10:36:31 UTC | New upstream version 0.0.141 | 11 June 2019, 10:36:31 UTC |
5a9f544 | Valentin Lorentz | 06 June 2019, 15:05:03 UTC | Add support for using URL instead of ID in snapshot_get_latest. | 11 June 2019, 10:01:50 UTC |
0386716 | Valentin Lorentz | 06 June 2019, 15:03:22 UTC | Add support for using URL instead of ID in origin_visit_{add,update}. | 11 June 2019, 10:01:08 UTC |
21b5142 | Valentin Lorentz | 06 June 2019, 13:30:13 UTC | Add a 'type' column to origin_visit. This is a step toward dropping the 'type' column of origins. | 11 June 2019, 10:00:00 UTC |
d8f0a7b | David Douard | 11 June 2019, 08:09:55 UTC | storage: small typo in Storage.content_add_metadata's doc string | 11 June 2019, 08:11:58 UTC |
710e0bf | Valentin Lorentz | 06 June 2019, 12:08:27 UTC | Add support for omitting origin type when calling origin_get. | 06 June 2019, 14:21:07 UTC |
3349788 | Valentin Lorentz | 05 June 2019, 09:34:18 UTC | Make origin_visit_upsert parse date strings. | 05 June 2019, 12:31:11 UTC |
75ac20f | Valentin Lorentz | 05 June 2019, 11:25:05 UTC | Add test for content_get on missing content. | 05 June 2019, 11:45:23 UTC |
09666c0 | Valentin Lorentz | 05 June 2019, 11:21:30 UTC | Add test for snapshot_get_branches with both filter and pagination. | 05 June 2019, 11:21:30 UTC |
909c69b | Valentin Lorentz | 04 June 2019, 13:14:40 UTC | Fix recursive entry names in in_memory.py's directory_entry_get_by_path | 04 June 2019, 13:14:40 UTC |
efda98e | Valentin Lorentz | 29 May 2019, 12:52:46 UTC | Fix origin_search in the in-mem storage to search for sub-strings. | 04 June 2019, 11:29:24 UTC |
3556b74 | Valentin Lorentz | 29 May 2019, 12:54:47 UTC | Unify argument names of the pg and in-mem storage. Argument names matter for the remote API. | 29 May 2019, 16:13:03 UTC |
17d4112 | David Douard | 17 May 2019, 12:43:09 UTC | Update the README file especially for latest CLI modifications. Add a slightly more detailed step-by-step 'quick start' section. | 17 May 2019, 13:08:59 UTC |
289b233 | David Douard | 17 May 2019, 12:42:26 UTC | cli: small improvements of 'rpc-serve --help' message | 17 May 2019, 13:07:24 UTC |
39a1c45 | Valentin Lorentz | 17 May 2019, 12:53:46 UTC | Make content_find return an empty list instead of None. | 17 May 2019, 12:53:46 UTC |
ea3f2d3 | Valentin Lorentz | 17 May 2019, 12:51:17 UTC | Update import of swh.core.db.tests.db_testing. | 17 May 2019, 12:51:17 UTC |
01ed375 | Valentin Lorentz | 17 May 2019, 11:32:55 UTC | Update import of swh.core.api.tests.server_testing. | 17 May 2019, 11:32:59 UTC |
fa02fa4 | David Douard | 15 May 2019, 15:08:04 UTC | cli: add a 'storage' cli group with a 'rpc-serve' subcommand we do add this 'storage' group even if it contains only one subcommand (serve) for UI consistency: swh storage rpc-serve We do keep the main swh-storage console_script entry point as is for bw compat, however. | 16 May 2019, 15:28:29 UTC |
41d3930 | Jenkins for Software Heritage | 16 May 2019, 12:09:03 UTC | New upstream version 0.0.140 | 16 May 2019, 12:09:03 UTC |
02134a7 | mihir(faux__) | 24 March 2019, 16:17:31 UTC | Changes the output of content_find method to a list in case of hash collisions and makes the sql query on python side and added test duplicate input, colliding sha256 and colliding blake2s256 | 16 May 2019, 09:06:06 UTC |
85b3da9 | David Douard | 07 May 2019, 15:12:11 UTC | bump dependency to swh.core[db,http] >= 0.0.60 so tests are not broken when next swh.core version is published. | 07 May 2019, 15:12:11 UTC |
2e70516 | Antoine Lambert | 23 April 2019, 14:56:29 UTC | algos.revisions_walker: Add methods to query produced history state Add methods is_history_truncated and missing_revisions to check if the produced history got truncated or not. Closes T1677 | 06 May 2019, 10:02:21 UTC |
e9c9dd7 | Jenkins for Software Heritage | 18 April 2019, 16:08:27 UTC | New upstream version 0.0.139 | 18 April 2019, 16:08:27 UTC |
6510b5e | Antoine Lambert | 18 April 2019, 14:17:16 UTC | algos.revisions_walker: Handle truncated/shallow histories It might exist cases where a revision's parent is referenced but doesn't exist in the archive resulting in a truncated/shallow history when walking on the revisions graph. That commit ensures the revision walkers will not end up in errors when encountering such cases. Closes T1675 | 18 April 2019, 15:30:17 UTC |
685b601 | Valentin Lorentz | 15 April 2019, 10:43:31 UTC | Make revision_add and directory_add call `_missing` using the same transaction. | 16 April 2019, 08:27:07 UTC |
7baf562 | Nicolas Dandrimont | 12 April 2019, 10:03:09 UTC | Update coverage gitignore | 12 April 2019, 10:03:09 UTC |
58b49f9 | Nicolas Dandrimont | 11 April 2019, 09:42:19 UTC | Use the proper order to record content byte addition The current way the value is returned creates a metric whose name is backwards. | 11 April 2019, 12:55:43 UTC |
63f50a5 | Nicolas Dandrimont | 11 April 2019, 10:15:49 UTC | Fix backwards-compatibility for snapshot_add in the server | 11 April 2019, 12:55:43 UTC |
2af864c | Jenkins for Software Heritage | 09 April 2019, 16:50:47 UTC | New upstream version 0.0.138 | 09 April 2019, 16:50:47 UTC |
cf99702 | Valentin Lorentz | 09 April 2019, 14:40:49 UTC | Use the db_transaction decorator on all _add() methods. So they gracefully release the connection on error instead of relying on reference-counting to call the Db's `__del__` (which does not happen in Hypothesis tests) because a ref to it is kept via the traceback object. | 09 April 2019, 16:06:24 UTC |
dab5bf8 | Valentin Lorentz | 09 April 2019, 14:36:31 UTC | Rewrite skipped_content_missing to not use a temporary table. | 09 April 2019, 16:05:46 UTC |
5aacb79 | Valentin Lorentz | 09 April 2019, 14:44:03 UTC | Add a test that adds arbitrary objects to the storage. | 09 April 2019, 15:58:28 UTC |
cd0d309 | Valentin Lorentz | 09 April 2019, 14:24:36 UTC | Fix person name clashes in revision_add and release_add. When two revisions/releases have authors with the same fullname but different names (which should never happen in practice, but since name/email parsing is on a best-effort basis, you never know...). Bug found by Hypothesis. | 09 April 2019, 15:49:38 UTC |
70c43ba | Valentin Lorentz | 09 April 2019, 14:42:19 UTC | Fix order of results of person_get. | 09 April 2019, 15:31:07 UTC |
8672a27 | Valentin Lorentz | 09 April 2019, 14:28:38 UTC | Add test for skipped_content_missing. | 09 April 2019, 15:27:12 UTC |
95c0b27 | Valentin Lorentz | 09 April 2019, 14:12:00 UTC | Fix support of releases with no author or date. Bug found by Hypothesis. | 09 April 2019, 15:24:12 UTC |
ba3a9af | Antoine R. Dumont (@ardumont) | 09 April 2019, 09:48:49 UTC | listener: Remove | 09 April 2019, 15:19:54 UTC |
681070f | Valentin Lorentz | 09 April 2019, 14:09:06 UTC | Make test_origin_get_range run faster (less objects). | 09 April 2019, 14:09:06 UTC |
2aa256b | Valentin Lorentz | 08 April 2019, 20:24:10 UTC | Fix hypothesis tests. Why does test_origin_get_range even work? | 08 April 2019, 20:24:10 UTC |
e106912 | Jenkins for Software Heritage | 08 April 2019, 13:56:15 UTC | New upstream version 0.0.137 | 08 April 2019, 13:56:15 UTC |
00e5f33 | Valentin Lorentz | 08 April 2019, 13:40:24 UTC | Make test_origin_get_range run faster. | 08 April 2019, 13:40:24 UTC |
92a575f | Valentin Lorentz | 05 April 2019, 18:00:42 UTC | Move Hypothesis strategies to swh-model. | 08 April 2019, 13:23:51 UTC |
a574fe5 | Valentin Lorentz | 05 April 2019, 14:19:38 UTC | Add a new endpoint origin_visit_upsert to add an origin_visit with a specific id. Also allows skipping an extra origin_visit_update query to set snapshot/metadata/status. | 05 April 2019, 15:40:28 UTC |
de805da | Jenkins for Software Heritage | 05 April 2019, 14:43:27 UTC | New upstream version 0.0.135 | 05 April 2019, 14:43:27 UTC |
d110b58 | Valentin Lorentz | 04 April 2019, 18:42:32 UTC | Make content_add_metadata require a ctime argument. This makes Python set the ctime instead of pgsql. | 05 April 2019, 09:15:18 UTC |
a62b1ad | Valentin Lorentz | 04 April 2019, 18:44:56 UTC | Prevent the 'snapshots' hypothesis strategy from generating loops. | 04 April 2019, 18:44:56 UTC |
ee62e05 | Valentin Lorentz | 04 April 2019, 13:54:03 UTC | Add a new endpoint content_add_metadata. | 04 April 2019, 15:15:51 UTC |
5913718 | Valentin Lorentz | 04 April 2019, 13:52:07 UTC | Refactor content_add's code into smaller functions. | 04 April 2019, 15:10:50 UTC |