A significant amount of source code has already been ingested in the Software Heritage archive. It notably includes the following software origins.

Regular crawling

These software origins get continuously discovered and archived using the listers implemented by Software Heritage.

instance type count search
bitbucket git 460,118
instance type count search
bower git 1,115
instance type count search
eclipse git 19
git-kernel git 1,061
instance type count search
cran cran 16,956
instance type count search
Debian deb 35,037
Ubuntu-Security deb 2,620
instance type count search
github git 1,689,337
instance type count search
inria git 1
instance type count search
golang golang 12,888
instance type count search
forge.extranet.logilab.fr hg 323
heptapod.host hg 376
instance type count search
launchpad bzr 207,172
launchpad git 915
instance type count search
repository.jboss.org maven 1,745
repository.jboss.org svn 8
repo1.maven.org git 152
repo1.maven.org hg 24
repo1.maven.org maven 356,236
repo1.maven.org svn 96
clojars.org hg 2
instance type count search
npm npm 1,346,738
instance type count search
opam.ocaml.org opam 3,997
coq.inria.fr opam 403
instance type count search
packagist git 42,943
packagist hg 11
packagist svn 50
instance type count search
swh git 160
instance type count search
pubdev pubdev 32,503
instance type count search
pypi pypi 403,770
instance type count search
main bzr 1
main cvs 67,704
main git 12,098
main hg 27,179
main svn 101,306
Discontinued hosting

Discontinued hosting services. Those origins have been archived by Software Heritage.

instance type search
gitorious git
instance type search
googlecode git
googlecode hg
googlecode svn
instance type search
bitbucket hg
On demand archival

These origins are directly pushed into the archive by trusted partners using the deposit service of Software Heritage.

instance type search
elife deposit
instance type search
hal deposit
instance type search
ipol deposit
JavaScript license information