English»Data Sets and State-of-the-art (SOTA)»Image

Blog

Directory

100 million Flickr images

3D people data set

50+ million QuickDraw drawings

Amsterdam Library of Object Images (ALOI) - about 100K images

Baidu Apollo - a massive self-driving data set.

Bulkr - A tool to download Flickr images

Caltech101

CIFAR-10 and CIFAR-100 - 60K color images 32x32 (test & training data)

Classification accuracy for various famous image/video datasets

CMU Face Databases

CoPhIR (Content-based Photo Image Retrieval) Test-Collection - >100 million processed images.

Corel data set features

CVonline - A collated list of image and video databases that people have found useful for computer vision research and algorithm evaluation (a great collection of image data sets).

Drawing of 10 Famous Logos From Memory by various people

Face in the wild

Google Landmarks - more than 2 million images depicting 30 thousand unique landmarks from across the world.

Image Annotations collected Using Amazon's Mechanical Turk

ImageNet - ImageNet is an image database organized according to the WordNet hierarchy (currently only the nouns), in which each node of the hierarchy is depicted by hundreds and thousands of images.

iNaturalist competition data

INRIA Holiday's dataset

INRIA Person Dataset

Label Me (MIT)

Matterport3D - 10,800 aligned 3D panoramic views (RGB + depth per pixel) from 194,400 RGB + depth images of 90 building-scale scenes.

MediaEval datasets - A rich set of humanly-annotated multimedia datasets

Microsoft COCO - an image recognition and segmentation dataset.

Microsoft Sequential Image Narrative Dataset (SIND)

MIR Flickr - Flickr images with annotations

MNIST database of handwritten digits

MS-Celeb-1M - Challenge of Recognizing One Million Celebrities in the Real World

One million free pictures from the British Library

Open Images Dataset V4 + Extensions - A large object detection data set.

Open Images V4 - 15.4M bounding-boxes for 600 categories on 1.9M images.

Profiset - a collection of 20M high-quality images with rich and systematic annotations, which were obtained from Profimedia

semantic3d - a large set of point clouds with over one billion of labelled points.

TEXMEX datasets for approximate nearest neighbor search

The PASCAL Object Recognition Database Collection

The Stanford 3D Scanning Repository

The Street View House Numbers (SVHN) Dataset - a real-world image dataset where the task is to recognize house numbers.

UQ_IMH dataset

Visual Dictionary - More than 80 millions tiny images collected by MIT

Visual Genome - an ongoing effort to connect structured image concepts to language

Visual Storytelling Dataset (VIST)

WLFDB : Weakly Labeled Face Databases