The Yahoo Flickr Creative Commons 100 Million dataset (YFCC100M) is one of the largest public multimedia databases which contains 99.2 million images and 0.8 million videos. Although its videos take only a small part of the dataset in comparison to its images, the video part is still one of the largest publicly available video databases. However, some essential video metadata and features are missing in the original database. Therefore, we obtained several metadata and features from the videos and release the database here for the research community.


Metadata included

This database includes several essential spatial, temporal, and content features of videos in the YFCC100M dataset. Following features and metadata are included in this database.

For more information, please refer to our paper which is mentioned below.


The database is in Tab-Separated Values (TSV) format with LF (Unix) line endings. Each line contains following tab-separated fields of each video:

License note

The database is available under Creative Commons Attribution 3.0 license. Please cite following paper when you use this database:



Back to top