https://github.com/SoftwareHeritage/swh-model

Revision Message Commit Date
448eafa parse_persistent_identifier: Add optional parts parsing This commit adds support for parsing optional parts in persistent identifiers. Optional parts are defined as follows: <swh_id>[;key1=val1][;key2=val2] Related T1040 18 May 2018, 14:57:36 UTC
3f72bca docs: Remove Lorem ipsum paragraph in data-model page 07 March 2018, 12:38:39 UTC
d1b975a docs: add snapshot, branches, and visits to Merkle DAG diagram 09 February 2018, 15:50:24 UTC
6fe1595 docs: generate PDF Merkle DAG diagram by default 09 February 2018, 15:49:56 UTC
96114f6 docs: add preliminary version of the Merkle DAG diagram 09 February 2018, 13:32:03 UTC
d788ef1 swh.model.identifiers: Fix typo in snapshot_identifier docstring 31 January 2018, 09:33:34 UTC
a06122e docs: Fix small issue 23 January 2018, 13:16:49 UTC
ab0fb5a Add swh.model.toposort algorithm 19 January 2018, 13:40:52 UTC
bdf26f5 swh.model.identifiers: persistent_identifier takes object as input Related T335 17 January 2018, 10:01:32 UTC
122326d swh.models.hashutil: Add persistent identifier function Related T335 Related T933 15 January 2018, 18:28:22 UTC
b61c666 docs: document the naming scheme for persistent identifiers Closes: T335 14 January 2018, 21:30:16 UTC
a01d81c docs: shorter fulltitle for the data model document 14 January 2018, 21:30:02 UTC
c79c446 swh-hash-file: make sure that paths are passed on as bytes 14 January 2018, 13:54:58 UTC
73d5ffb bin/swh-hash-file: new binary to compute SWH-style content identifiers reincarnation of the old shw.model bin/ script, which is now gone 14 January 2018, 13:47:43 UTC
a5f7d1e improve hash_file() docstring to specify algorithms type 14 January 2018, 13:39:45 UTC
91d74ef swh.model.hashutil.hash_data: Optionally integrate length in result 20 December 2017, 09:46:03 UTC
eff2692 Merge branch 'wip/snapshots' 13 December 2017, 10:30:21 UTC
1b1cc8d hashutil: add `snapshot` object type for git hashes Summary: Add support for snapshot identifiers Close T566. Related to D268. Test Plan: Unit tests included Reviewers: zack, #reviewers! Maniphest Tasks: T566 Differential Revision: https://forge.softwareheritage.org/D277 13 December 2017, 10:30:10 UTC
46ce819 hashutil: add `snapshot` object type for git hashes 12 December 2017, 19:08:30 UTC
94bd8dd docs: add absolute anchor to documentation index 02 November 2017, 10:09:21 UTC
0b7f217 Cleanup packaging 12 October 2017, 15:16:56 UTC
34228c5 test_from_disk: use os.fsencode to consistently get tmpfile names as bytes 05 October 2017, 18:45:42 UTC
2ab8360 mark tests needing the filesystem as such 05 October 2017, 18:38:07 UTC
6c30346 d/control: add breaks on packages depending on removed APIs 05 October 2017, 18:31:27 UTC
f6a4d7e Remove swh.model.git Close T709 05 October 2017, 18:28:11 UTC
c67f012 from_disk: full test coverage 04 October 2017, 20:33:45 UTC
8900a91 from_disk.Directory: fix some random bugs found when using the API 04 October 2017, 20:33:17 UTC
a8e919a test_hashutil: remove temporary file after test 04 October 2017, 18:08:19 UTC
c790b85 from_disk: add a way to save the path to contents This allows loaders to lazily load data: you can read from disk and transfer only the contents that are really missing. 04 October 2017, 17:12:39 UTC
d54c066 from_disk: convert on-disk data to Software Heritage archive objects Summary: This module is a reimplementation of swh.model.git, with the underlying goal of replacing it and fixing T709 in the process. Reviewers: #reviewers! Differential Revision: https://forge.softwareheritage.org/D248 03 October 2017, 17:47:46 UTC
f44949a Add a Merkle tree data structure 03 October 2017, 17:47:46 UTC
ac3df91 docs/: add sub document data-model.rst Currently just a stub (but with a hereby fixed anchor) to allow linking to it from other parts of the development documentation. Will be filled in later with an abstract description of the Software Heritage data model. 19 September 2017, 15:07:36 UTC
8bafb40 hashutil: improve docstrings 15 September 2017, 17:15:38 UTC
bd43a7f docstring: drop useless heading ":py" domain in crossrefs 06 September 2017, 18:27:43 UTC
1d898f7 sanitize docstrings for sphinx 06 September 2017, 18:22:30 UTC
b53f5d8 docs/: add sphinx apidoc generation skeleton change cherry picked from python module template commit 71b117ba0cf9f1251b1cac26d0994df03a4c787d 30 August 2017, 10:25:57 UTC
8aa5c3a Load pyblake2 dynamically instead of hardcoding the Python version Summary: for those people with legacy openssls Reviewers: anlambert, #reviewers! Differential Revision: https://forge.softwareheritage.org/D224 12 July 2017, 14:12:35 UTC
11de644 requirements: make pyblake2 conditional on Python3.5 26 June 2017, 12:41:11 UTC
d281faf d/control: drop pyblake2 if python >= 3.5 19 June 2017, 17:56:07 UTC
6f89adf git: make GitPerm an IntEnum rather than bytes Fix T685 out of spite. While we wait for a cleaner refactoring of this code, this fixes the immediate clogging of the database with bogus data issue. 07 April 2017, 10:02:30 UTC
4d6d748 d/changelog: Fix sbuild warning 24 March 2017, 14:17:13 UTC
9812285 swh.model.hashutil: Add blake2s256 in default algorithms Related T703 24 March 2017, 14:16:36 UTC
a42c75e swh.model.hashutil: Use pyblake2 dependency on python3 <= 3.4 This resolves the caveat mentioned in prior commit about not being able to use blake2 prior to 3.5 Related T692 Closes D192 21 March 2017, 09:35:45 UTC
24f8dd4 swh.model.hashutil: Adapt according to latest discussion - Add module docstring - Add blake2s256 and blake2b512 in supported algorithms - Spawn a new variable DEFAULT_ALGORITHMS as default computed algorithms for the main functions Related T692 17 March 2017, 08:43:15 UTC
f75be5a swh.model.hashutil: Make unknown variable length algo creation break Remove the limit on the python3 version, this should be transparent. If the hash requested is not available, this will raise with an explanation on the error. Related T692 17 March 2017, 08:42:32 UTC
8776435 swh.model.hashutil: Simplify length hash algorithms instantiation The same caveat applies, will only be supported from python3.6 onward. Related T692 17 March 2017, 08:42:31 UTC
9c25f8f swh.model.hashutil: Open variable length hash algorithm support The caveat is that it will only be supported when we will be using python3 >= 3.5. Related T692 17 March 2017, 08:42:15 UTC
3e325ca Migrate from swh.core.hashutil to swh.model.hashutil Related T700 15 March 2017, 15:00:44 UTC
b0f7f06 Update docstring to clarify the ambiguity around symlinks 24 February 2017, 07:29:38 UTC
c40ab03 Consider special files as empty ones when computing content hashes Closes T255 Ref. D179 23 February 2017, 14:35:02 UTC
e0dbae3 identifiers: properly escape newlines in author specifications Found by investigating T75 15 February 2017, 16:46:22 UTC
58c5a24 git: don't use double underscores for function names 14 February 2017, 17:35:14 UTC
7912710 identifiers: force timestamps as integers everywhere The subversion loader (T680) has shown that throwing floating point values around for timestamps is a mess waiting to happen. We now coerce all clients to send us timestamps as integer numbers of seconds and microseconds, avoiding data losses everywhere. 14 February 2017, 17:35:14 UTC
87444d4 requirements: split internal and external requirements in two separate files 09 February 2017, 13:32:05 UTC
2594832 Fix: echo -n to avoid adding an extra line 23 June 2016, 10:30:35 UTC
5c0be62 Open tools to check rev hash 23 June 2016, 09:33:21 UTC
cec445d d/rules: move to build_dir before tests 14 June 2016, 15:00:27 UTC
db20b20 Remove dead comment 13 June 2016, 14:23:36 UTC
b3c17c7 Remove print statement 13 June 2016, 14:22:35 UTC
5f7c931 Add tests on git.compute_hashes_from_directory - default - ignoring empty folders - ignore folder based on pattern in names 12 June 2016, 14:08:54 UTC
aa06697 Fix hash typos + remove print statement 12 June 2016, 13:33:34 UTC
843d814 Add missing tests on new api 12 June 2016, 09:29:05 UTC
1a2b969 Open children_hashes api function 11 June 2016, 00:20:22 UTC
05ac3c4 Remove dead code 10 June 2016, 23:58:18 UTC
8d2bf5a Rename walk_and_compute_sha1_from_directory_2 to compute_hashes_from_directory 10 June 2016, 23:54:08 UTC
87fcced Add objects_per_type api This permits to reuse the same logic for different clients (loader-dir, loader-tar, loader-svn) (Tests were lost) 08 June 2016, 13:43:52 UTC
17f0493 Open a new walk_and_compute_sha1_from_directory_2 api This actually is supposed to replace walk_and_compute_sha1_from_directory. The data structure used here is better at handling updates. (Code that actually got lost and rewritten - Tests are definitely lost though) 08 June 2016, 13:09:18 UTC
1af7aed Improve internal api regarding directory and tree hash computations Keep the old api (since i don't measure the impacts on other modules yet). + Improve docstring (Code that actually got lost and rewritten) 08 June 2016, 13:08:08 UTC
9b9ec94 Optimize walk for edge cases 26 May 2016, 10:56:11 UTC
a91bf69 Add tests about new use cases Combination of: - validation on files - ignore empty folder 26 May 2016, 09:57:58 UTC
22b9fca Try and detect the next existing parent to lookup from In some corner case, the changed paths can reference a previous ignored folder (thus not existing in the data structure) 25 May 2016, 21:36:54 UTC
1f98c67 Add optional clean up round-trip to remove empty folders 25 May 2016, 21:26:57 UTC
ca235a0 d/control: Ignore filesystem tests 24 May 2016, 15:00:54 UTC
aae146d swh.model.git - update - Deal with edge case about empty folder The empty folder was not previously in the objects structure. So we need to add it as child of its parent for the update. 24 May 2016, 11:48:13 UTC
dca0eaf swh.model.git - update - Secure paths removal 23 May 2016, 13:27:44 UTC
0fbf74e identifiers: support authors with only a Full Name field 08 April 2016, 11:53:31 UTC
16155c4 Fix some edge case on git hash update computation Enforce convention on directory name without trailing /. At the moment, the `git.walk_and_compute_sha1_from_directory` injected the rootdir with a possible trailing / (input from client). 05 April 2016, 15:14:39 UTC
3f63877 Add real use cases for the git computation update tests 05 April 2016, 12:09:50 UTC
d5d5bee Improve docstrings 05 April 2016, 12:08:54 UTC
0fc6af8 Add the length to the data returned Since we compute it anyway, better return it along with the result 02 April 2016, 15:32:06 UTC
97b0d9f Improve git hash update behavior Decrease the number of paths to compute to 1 common ancestor (if any): - Scan only that directory and rehash with new results (data changed) - Update the resulting objects with those new hashes. - Update from that directory to the rootdir the existing hashes computation 01 April 2016, 15:26:22 UTC
cd88163 Detect if we need to recompute all from disk anyway (change at the root level for example) 01 April 2016, 09:56:00 UTC
02e2357 Only compute root_tree_key's directory hash when needed 01 April 2016, 07:32:15 UTC
cdf2b70 Refactor - Improve test git class definition 01 April 2016, 07:30:35 UTC
97be2fd Fix: Delete paths below the path removal deletion 31 March 2016, 18:41:14 UTC
eb99dbf Update git hash computation on changed paths only 31 March 2016, 17:23:12 UTC
3cebfce Clean up after test 31 March 2016, 11:18:52 UTC
4bb18b2 Module import order 31 March 2016, 11:18:39 UTC
604f4f0 Update docstring on swh.model.git module 31 March 2016, 08:27:56 UTC
18086c1 identifier: Don't break on None metadata for revisions 30 March 2016, 15:59:07 UTC
4b13d16 test_identifiers: add test for negative UTC 30 March 2016, 13:04:28 UTC
a09e9b4 identifiers: proper support for negative utc offsets Move timestamp normalization to another function to make it more easily movable. 30 March 2016, 12:04:31 UTC
39c61bb test_identifiers: add tests for empty vs. null messages 29 March 2016, 16:07:04 UTC
e56b58a test_identifiers: this gpg signature is not from Linus 29 March 2016, 15:53:30 UTC
faf0840 identifiers: support None messages in revisions and releases 29 March 2016, 15:51:54 UTC
f62bc76 identifiers: enhance documentation of the revision_identifier function This function wasn't in sync with what's supposed to be our revision schema 29 March 2016, 15:35:47 UTC
dd1c4ba test_identifiers: proper revision w/gpgsig test naming and refactor 29 March 2016, 13:06:29 UTC
f7bc587 identifiers: import symbols from hashutil directly 29 March 2016, 13:02:10 UTC
c3d9439 'metadata' entry is expected to be json serializable so no bytes, and we enforce during the checksum computation function 24 March 2016, 09:37:00 UTC
aca1e40 Use of optional gpgsig in git commit sha1 computation 22 March 2016, 17:58:45 UTC
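
Note on revision 448eafa: the message describes optional parts appended to a persistent identifier as <swh_id>[;key1=val1][;key2=val2]. The snippet below is only a minimal sketch of how such a string could be split. It is not the code from that commit. The function name split_optional_parts, the returned dictionary layout, and the assumption that the core identifier has four colon-separated fields (namespace, version, object type, hash) are all illustrative and do not come from this log.

```python
def split_optional_parts(pid: str) -> dict:
    """Split '<swh_id>[;key1=val1][;key2=val2]' into a core id and metadata.

    Hypothetical helper for illustration only; not the swh.model API.
    """
    # Everything before the first ';' is the core identifier;
    # each remaining ';'-separated chunk is one optional part.
    core, *optional = pid.split(";")

    # Assumption: the core id has four colon-separated fields.
    fields = core.split(":")
    if len(fields) != 4:
        raise ValueError(f"expected 4 colon-separated fields, got {len(fields)}")
    namespace, version, object_type, object_id = fields

    metadata = {}
    for part in optional:
        # partition() splits on the first '=' only, so values may contain '='.
        key, sep, value = part.partition("=")
        if not sep or not key:
            raise ValueError(f"malformed optional part: {part!r}")
        metadata[key] = value

    return {
        "namespace": namespace,
        "scheme_version": version,
        "object_type": object_type,
        "object_id": object_id,
        "metadata": metadata,
    }


if __name__ == "__main__":
    # Made-up identifier: the hash and the optional keys are placeholders.
    example = "swh:1:cnt:" + "a" * 40 + ";key1=val1;key2=val2"
    print(split_optional_parts(example))
```

A real parser would also need to validate the object type and the hash length. Those rules are left out here because this log does not state them.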