A | B | C | D | E | F | G | H | I | J | K | L | M | N | O | P | Q | R | S | T | U | V | W | X | Y | Z | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1 | Image Count | URL | Remark | golan's notes | ||||||||||||||||||||||
2 | 79,302,017 | http://www.historicnewengland.org/collections-archives-exhibitions/collections-access/highlights/wallpaper | 1 | |||||||||||||||||||||||
3 | 67,000,000 | http://www.kleegestaltungslehre.zpk.org/ee/ZPK/BF/2012/01/01/001/ | no bulk download | 0 | ||||||||||||||||||||||
4 | 48,000,000 | http://www.vision.caltech.edu/visipedia/CUB-200.html | 200 species of birds, categorized | 0 | ||||||||||||||||||||||
5 | 14,000,000 | http://www1.cs.columbia.edu/CAVE/software/softlib/coil-100.php | 128x128px images of 100 objects rotated in 5-degree increments | 1 | ||||||||||||||||||||||
6 | 10,298,323 | http://aaronsfiles.com/SheepProcessing/SheepView_Processing.zip | Processing sketch with vectors for handdrawn sheep | |||||||||||||||||||||||
7 | 10,000,000 | http://www.robots.ox.ac.uk/~vgg/data/flowers/ | 1 | |||||||||||||||||||||||
8 | 7,000,000 | https://web.archive.org/web/20150703060412/http://137.189.35.203/WebUI/CatDatabase/catData.html | Original URL on google search is down; go Internet Archive | 1 | ||||||||||||||||||||||
9 | 5,000,000 | http://www.vision.caltech.edu/Image_Datasets/Caltech101/ | This is older and smaller than Caltech 256 | http://research.microsoft.com/pubs/80582/ECCV_CAT_PROC.pdf | 1 | |||||||||||||||||||||
10 | 4,763,691 | https://github.com/KanjiVG/kanjivg/releases | 1 | |||||||||||||||||||||||
11 | 3,000,000 | http://pfid.rit.albany.edu/ | 0 | |||||||||||||||||||||||
12 | 2,500,000 | http://places.csail.mit.edu/ | ||||||||||||||||||||||||
13 | 1,200,000 | http://vis-www.cs.umass.edu/lfw/ | 1 | |||||||||||||||||||||||
14 | 1,100,000 | https://www.flickr.com/photos/projectapolloarchive/albums/with/72157656702724284 | 1 | |||||||||||||||||||||||
15 | 1,000,000 | http://www.robots.ox.ac.uk/~vgg/data/oxbuildings/ | 1 | |||||||||||||||||||||||
16 | 1,000,000 | http://images.ikea.com/assetbank-ikea/action/viewHome | 1 | |||||||||||||||||||||||
17 | 1,000,000 | http://cybertron.cg.tu-berlin.de/eitz/projects/classifysketch/ | 1 | |||||||||||||||||||||||
18 | 1,000,000 | http://www1.cs.columbia.edu/CAVE/databases/facetracer/ | Links to online images of faces, w/metadata, but some links dead. | |||||||||||||||||||||||
19 | 671,628 | http://biometrics.idealtest.org/dbDetailForUser.do?id=7 | 1 | |||||||||||||||||||||||
20 | 600,000 | http://human-pose.mpi-inf.mpg.de/ | 0 | |||||||||||||||||||||||
21 | 561,628 | http://www.ifnenit.com/download.htm | 1 | |||||||||||||||||||||||
22 | 250,000 | http://ballads.bodleian.ox.ac.uk | See also imagematch.bodleian.ox.ac.uk for CBIR on a subset (docs at http://balladsblog.bodleian.ox.ac.uk/blog/570 and http://balladsblog.bodleian.ox.ac.uk/blog/1069) | |||||||||||||||||||||||
23 | 250,000 | https://github.com/WaltersArtMuseum/walters-api | See image API docs: https://github.com/WaltersArtMuseum/walters-api/blob/master/images.md | |||||||||||||||||||||||
24 | 223,128 | http://www.vision.caltech.edu/Image_Datasets/Caltech256/ | ||||||||||||||||||||||||
25 | 202,599 | http://digital.library.pitt.edu/images/pittsburgh/ | 1 | |||||||||||||||||||||||
26 | 201,544 | http://openplaques.org/about/data | commemorative plaques. a mix of close up and context photos | |||||||||||||||||||||||
27 | 200,000 | http://www.bottlecapclub.org/index.php | bottle caps. would need to scrape. | 1 | ||||||||||||||||||||||
28 | 180,000 | https://images.nga.gov/en/page/show_home_page.html | 0 | |||||||||||||||||||||||
29 | 112,039 | http://www.libcrowds.com/data/ | Various languages. Example subset: https://www.flickr.com/photos/132066275@N04/sets/72157657517602031 | |||||||||||||||||||||||
30 | 102,212 | https://github.com/dimatura/getpubfig | ||||||||||||||||||||||||
31 | 93,000 | http://memorability.csail.mit.edu/explore.html | Large-scale Image Memorability. images and memorability metadata | 1 | ||||||||||||||||||||||
32 | 80,000 | http://www.davidrumsey.com/ | no bulk download | |||||||||||||||||||||||
33 | 80,000 | http://www.cs.toronto.edu/~kriz/cifar.html | 0.5 | |||||||||||||||||||||||
34 | 78,000 | https://github.com/tategallery/collection | Museum collection, some works are under copyright protection. | 0 | ||||||||||||||||||||||
35 | 70,000 | http://lipitk.sourceforge.net/datasets/tamilchardata.htm | 1 | |||||||||||||||||||||||
36 | 70,000 | http://yann.lecun.com/exdb/mnist/ | meta data | 0.5 | ||||||||||||||||||||||
37 | 67,000 | https://github.com/cmoa/collection | In particular, check out the Teenie Harris collection—the image resolution is better for those. | |||||||||||||||||||||||
38 | 60,000 | https://staff.fnwi.uva.nl/t.e.j.mensink/rijks/ | Also see https://www.rijksmuseum.nl/en/api | |||||||||||||||||||||||
39 | 60,000 | https://www.flickr.com/photos/biodivlibrary/ | For more info see http://www.biodiversitylibrary.org/ | 0 | ||||||||||||||||||||||
40 | 60,000 | http://www.progettosnaps.net/ | arcade machine game screenshot 'snaps' and marquees | 0 | ||||||||||||||||||||||
41 | 50,000 | drawings | German Signs | 1 | ||||||||||||||||||||||
42 | 50,000 | http://www.nypl.org/research/collections/digital-collections/public-domain | Public Domain part of NYPL labs | 1 | ||||||||||||||||||||||
43 | 45,000 | https://news.artnet.com/art-world/art-uk-public-accessible-online-434489 | no bulk download | 0 | ||||||||||||||||||||||
44 | 37,882 | https://collection.cooperhewitt.org/api/ | ||||||||||||||||||||||||
45 | 36,000 | http://mmlab.ie.cuhk.edu.hk/projects/CelebA.html | large-scale face attributes dataset with more than 200K celebrity images, each with 40 attribute annotations | |||||||||||||||||||||||
46 | 32,750 | http://ukiyo-e.org/ | Created by John Resig. Check out his work using this database | |||||||||||||||||||||||
47 | 30,607 | http://www.metmuseum.org/collection/the-collection-online | ||||||||||||||||||||||||
48 | 30,000 | http://collections.vam.ac.uk/ | ||||||||||||||||||||||||
49 | 30,000 | http://herbarium.bgbm.org/object | Caution: URL is zip file of catalogue | |||||||||||||||||||||||
50 | ~30,000 | http://leafsnap.com/dataset/ | LeafSnap is a dataset built for leaf recognition | |||||||||||||||||||||||
51 | 26,459 | http://ufldl.stanford.edu/housenumbers/ | 1 | |||||||||||||||||||||||
52 | 25,000 | http://digitalcollections.nypl.org/ | Not sure if there is a way to batch download :( Also note that these images may be (will most likely be) copy righted. | 1 | ||||||||||||||||||||||
53 | 20,000 | https://www.flickr.com/photos/britishlibrary/ | http://www.bl.uk/collection-guides/datasets-for-image-analysis | 1 | ||||||||||||||||||||||
54 | 20,000 | https://archive.org/details/audio-covers | 1 | |||||||||||||||||||||||
55 | 20,000 | http://vision.cs.stonybrook.edu/~vicente/sbucaptions/ | script to download from Flickr | 1 | ||||||||||||||||||||||
56 | 15,000 | http://landsat.gsfc.nasa.gov/?page_id=2 | Bulk download: http://landsat.gsfc.nasa.gov/?p=1275 | 0 | ||||||||||||||||||||||
57 | 14,000 | http://www.iapr-tc11.org/mediawiki/index.php/Harbin_Institute_of_Technology_Opening_Recognition_Corpus_for_Chinese_Characters_(HIT-OR3C) | 1 | |||||||||||||||||||||||
58 | 13,000 | https://archive.org/details/geographarchive | aims to collect geographically representative photographs and information for every square kilometre of Great Britain and Ireland | 1 | ||||||||||||||||||||||
59 | 11,076 | https://sites.google.com/view/11khands | 11,076 hand images (1600 x 1200 pixels) of 190 subjects, of varying ages between 18 - 75 years old | |||||||||||||||||||||||
60 | 11,000 | https://www.flickr.com/photos/internetarchivebookimages | Also see https://blog.archive.org/2015/10/23/zoom-in-to-9-3-million-internet-archive-books-and-images-through-iiif/ for API access notes | |||||||||||||||||||||||
61 | 10,000 | http://image-net.org/explore | 1 | |||||||||||||||||||||||
62 | 10,000 | http://chroniclingamerica.loc.gov/about/api/ | Historic newspaper pages from Library of Congress. | |||||||||||||||||||||||
63 | 10,000 | http://lsun.cs.princeton.edu/ | 0.5 | |||||||||||||||||||||||
64 | 10,000 | http://www.nlpr.ia.ac.cn/databases/handwriting/Home.html | Requires permission via email to get ftp link. They gave it to me for dubious purposes, so don't think this is hard. The download is very slow though. | |||||||||||||||||||||||
65 | 8,121 | http://pillbox.nlm.nih.gov/developer.html | 0 | |||||||||||||||||||||||
66 | 7,200 | http://www.europeana.eu/portal/ | Not sure if there is a good way for bulk download | |||||||||||||||||||||||
67 | 6,033 | http://horatio.cs.nyu.edu/mit/tiny/data/index.html | http://groups.csail.mit.edu/vision/TinyImages/ | 1 | ||||||||||||||||||||||
68 | 6,000 | http://webscope.sandbox.yahoo.com/catalog.php?datatype=i&did=67 | 1 | |||||||||||||||||||||||
69 | 5,640 | http://www.robots.ox.ac.uk/~vgg/data/dtd/ | 1 | |||||||||||||||||||||||
70 | 5,600 | http://amos.cse.wustl.edu/dataset | Archive of webcam feeds | 1 | ||||||||||||||||||||||
71 | 5,364 | http://press.liacs.nl/mirflickr/#sec_download | Link page to several face sets | 1 | ||||||||||||||||||||||
72 | 5,353 | http://erikbern.com/2016/01/21/analyzing-50k-fonts-using-deep-neural-networks/ | 1 | |||||||||||||||||||||||
73 | 5,062 | 50k bitmapped font sets, most complete with 62 characters (upper and lower case letters and numbers). download link to HDF5 at the bottom of the page | ||||||||||||||||||||||||
74 | 5,000 | https://www.flickr.com/commons/institutions/ | Starting point for multiple public domain collections | 0 | ||||||||||||||||||||||
75 | 3,915 | https://www.flickr.com/photos/121003427@N03/ | 0 | |||||||||||||||||||||||
76 | 3,900 | https://nik.bot.nu/ | 0 | |||||||||||||||||||||||
77 | 3,500 | http://deeplearning.net/datasets/ | this is a list of datasets, contains some of the above, and loads more | 1 | ||||||||||||||||||||||
78 | 1,814 | http://webscope.sandbox.yahoo.com/catalog.php?datatype=i | 1 | |||||||||||||||||||||||
79 | 1,620 | https://uwdc.library.wisc.edu/collections/WI/BrittinghamImgs/ | not VERY large but different - intimate collection of Wisconsin family | 1 | ||||||||||||||||||||||
80 | 1,620 | https://github.com/rev3rend/instadownload | ||||||||||||||||||||||||
81 | 1,000 | http://openglam.org/open-collections/ | List of open digital collections from GLAM institutions, many European | |||||||||||||||||||||||
82 | http://ddd.unil.ch/ | drawings of "God"/deities by children -- see also http://arxiv.org/pdf/1511.03466v1.pdf | 1 | |||||||||||||||||||||||
83 | https://sites.google.com/site/pornographydatabase/ | Signed Agreement required for access | 0 | |||||||||||||||||||||||
84 | http://www.vision.caltech.edu/html-files/archive.html | 1 | ||||||||||||||||||||||||
85 | http://www.algaterra.org/default.htm | have not figured put how to scrape, but its on the to do list | ||||||||||||||||||||||||
86 | http://eol.jsc.nasa.gov/Tools/ | Batch Download Tools | ||||||||||||||||||||||||
87 | https://aws.amazon.com/nasa/nex/ | 1 | ||||||||||||||||||||||||
88 | http://cs.nyu.edu/~silberman/datasets/nyu_depth_v2.html | RGBD | ||||||||||||||||||||||||
89 | http://www.rrpicturearchives.net/rsRRList.aspx?id=19 | |||||||||||||||||||||||||
90 | https://www.cooldatasets.com/ | |||||||||||||||||||||||||
91 | ||||||||||||||||||||||||||
92 | ||||||||||||||||||||||||||
93 | ||||||||||||||||||||||||||
94 | ||||||||||||||||||||||||||
95 | ||||||||||||||||||||||||||
96 | ||||||||||||||||||||||||||
97 | ||||||||||||||||||||||||||
98 | ||||||||||||||||||||||||||
99 | ||||||||||||||||||||||||||
100 |