Datasets:

Modalities:
Image
Formats:
parquet
Languages:
English
Libraries:
Datasets
pandas
License:
Search is not available for this dataset
img
imagewidth (px)
32
32
label
class label
10 classes
0airplane
6frog
0airplane
2bird
7horse
2bird
1automobile
2bird
4deer
1automobile
5dog
6frog
6frog
3cat
1automobile
3cat
5dog
5dog
8ship
1automobile
4deer
2bird
3cat
2bird
1automobile
2bird
8ship
9truck
5dog
0airplane
7horse
6frog
7horse
6frog
8ship
8ship
7horse
4deer
9truck
1automobile
2bird
6frog
5dog
9truck
4deer
2bird
5dog
1automobile
0airplane
1automobile
1automobile
9truck
0airplane
7horse
5dog
3cat
9truck
6frog
3cat
3cat
3cat
4deer
1automobile
5dog
9truck
7horse
7horse
2bird
9truck
0airplane
2bird
8ship
5dog
9truck
6frog
7horse
8ship
4deer
0airplane
4deer
9truck
2bird
7horse
1automobile
0airplane
5dog
1automobile
8ship
1automobile
6frog
5dog
9truck
7horse
0airplane
4deer
5dog
2bird
3cat
5dog
5dog

Dataset Card for CIFAR-10

Dataset Summary

The CIFAR-10 dataset consists of 60000 32x32 colour images in 10 classes, with 6000 images per class. There are 50000 training images and 10000 test images. The dataset is divided into five training batches and one test batch, each with 10000 images. The test batch contains exactly 1000 randomly-selected images from each class. The training batches contain the remaining images in random order, but some training batches may contain more images from one class than another. Between them, the training batches contain exactly 5000 images from each class.

Supported Tasks and Leaderboards

  • image-classification: The goal of this task is to classify a given image into one of 10 classes. The leaderboard is available here.

Languages

English

Dataset Structure

Data Instances

A sample from the training set is provided below:

{
  'img': <PIL.PngImagePlugin.PngImageFile image mode=RGB size=32x32 at 0x201FA6EE748>,
  'label': 0
}

Data Fields

  • img: A PIL.Image.Image object containing the 32x32 image. Note that when accessing the image column: dataset[0]["image"] the image file is automatically decoded. Decoding of a large number of image files might take a significant amount of time. Thus it is important to first query the sample index before the "image" column, i.e. dataset[0]["image"] should always be preferred over dataset["image"][0]
  • label: 0-9 with the following correspondence 0 airplane 1 automobile 2 bird 3 cat 4 deer 5 dog 6 frog 7 horse 8 ship 9 truck

Data Splits

Train and Test

Dataset Creation

Curation Rationale

[More Information Needed]

Source Data

Initial Data Collection and Normalization

[More Information Needed]

Who are the source language producers?

[More Information Needed]

Annotations

Annotation process

[More Information Needed]

Who are the annotators?

[More Information Needed]

Personal and Sensitive Information

[More Information Needed]

Considerations for Using the Data

Social Impact of Dataset

[More Information Needed]

Discussion of Biases

[More Information Needed]

Other Known Limitations

[More Information Needed]

Additional Information

Dataset Curators

[More Information Needed]

Licensing Information

[More Information Needed]

Citation Information

@TECHREPORT{Krizhevsky09learningmultiple,
    author = {Alex Krizhevsky},
    title = {Learning multiple layers of features from tiny images},
    institution = {},
    year = {2009}
}

Contributions

Thanks to @czabo for adding this dataset.

Downloads last month
42,810

Models trained or fine-tuned on uoft-cs/cifar10

Spaces using uoft-cs/cifar10 24