A significant amount of source code has already been ingested into the Software Heritage archive. It notably includes the following software origins.

Regular crawling

These software origins are continuously discovered and archived by the listers implemented by Software Heritage.

instance                type         count
bitbucket               git        2010604

gnu-savannah            git           1019
git.gnu.org.ua          git             32
git.eclipse.org         git            491
gitweb.torproject.org   git              1
git.alpinelinux.org     git              6
git.openembedded.org    git              7
git.yoctoproject.org    git            165
git.zx2c4.com           git            155
git.kernel.org          git            871
fedorapeople.org        git            838
git.baserock.org        git           1454
code.qt.io              git            272

cran                    cran         18765

Debian-Security         deb            528
Debian                  deb          35441

codeberg.org            git           5515
git.fsfe.org            git            350

github                  git      124669282

gitlab.com              git         835831
gitlab.inria.fr         git           1970
0xacab.org              git           1104
gitlab.freedesktop.org  git           6502
gitlab.common-lisp.net  git            801
gitlab.ow2.org          git           1177
gitlab.gnome.org        git          11038
gite.lirmm.fr           git            487
gitlab.lip6.fr          git             17
framagit.org            git          15464

guix                    nixguix      10923

GNU                     tar            354

launchpad               git          20397

nixos                   nixguix      30598

npm                     npm        1721973

pypi                    pypi        342888

main                    svn         102042
main                    hg           27750
main                    git         182698
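
To check whether a particular origin from one of the instances listed above is already in the archive, the public web API can be queried. The sketch below is a minimal illustration, not an official client: it assumes the API root `https://archive.softwareheritage.org/api/1` and an origin lookup endpoint of the form `origin/<origin_url>/get/`, both of which should be verified against the current API documentation. The helper names `origin_api_url` and `is_archived` are hypothetical.

```python
import urllib.error
import urllib.request

# Assumed public API root; verify against the Software Heritage API docs.
API_ROOT = "https://archive.softwareheritage.org/api/1"


def origin_api_url(origin_url: str) -> str:
    """Build the lookup URL for one origin (assumed endpoint shape)."""
    return f"{API_ROOT}/origin/{origin_url}/get/"


def is_archived(origin_url: str, timeout: float = 10.0) -> bool:
    """Return True if the lookup succeeds, False if the API answers 404."""
    try:
        with urllib.request.urlopen(origin_api_url(origin_url), timeout=timeout) as resp:
            return resp.status == 200
    except urllib.error.HTTPError as err:
        if err.code == 404:
            return False
        raise  # other HTTP errors (rate limiting, server errors) are not "absent"


# Example call (performs a network request, so it is left commented out):
# print(is_archived("https://github.com/python/cpython"))
```

Only a 404 response is treated as "not archived"; any other failure is re-raised so that rate limiting or an outage is not mistaken for a missing origin.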

Discontinued hosting

These hosting services have been discontinued. Their origins were archived by Software Heritage.

instance    type
gitorious   git

googlecode  git
googlecode  hg
googlecode  svn

On demand archival

These origins are directly pushed into the archive by trusted partners using the deposit service of Software Heritage.

instance  type
elife     deposit

hal       deposit

ipol      deposit