annotations_creators | language | language_creators | license | multilinguality | pretty_name | size_categories | source_datasets | tags | task_categories | task_ids | |||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
MSCOCO |
|
|
|
|
- Dataset Card Creation Guide
- Homepage: https://cocodataset.org/#home
- Repository: https://github.com/shunk031/huggingface-datasets_MSCOCO
- Paper (Preprint): https://arxiv.org/abs/1405.0312
- Paper (ECCV2014): https://link.springer.com/chapter/10.1007/978-3-319-10602-1_48
- Leaderboard (Detection): https://cocodataset.org/#detection-leaderboard
- Leaderboard (Keypoint): https://cocodataset.org/#keypoints-leaderboard
- Leaderboard (Stuff): https://cocodataset.org/#stuff-leaderboard
- Leaderboard (Panoptic): https://cocodataset.org/#panoptic-leaderboard
- Leaderboard (Captioning): https://cocodataset.org/#captions-leaderboard
- Point of Contact: [email protected]
COCO is a large-scale object detection, segmentation, and captioning dataset. COCO has several features:
- Object segmentation
- Recognition in context
- Superpixel stuff segmentation
- 330K images (>200K labeled)
- 1.5 million object instances
- 80 object categories
- 91 stuff categories
- 5 captions per image
- 250,000 people with keypoints
[More Information Needed]
[More Information Needed]
- captioning dataset
import datasets as ds
dataset = ds.load_dataset(
"shunk031/MSCOCO",
year=2014,
coco_task="captions",
)
- instances dataset
import datasets as ds
dataset = ds.load_dataset(
"shunk031/MSCOCO",
year=2014,
coco_task="instances",
decode_rle=True, # True if Run-length Encoding (RLE) is to be decoded and converted to binary mask.
)
- person keypoints dataset
import datasets as ds
dataset = ds.load_dataset(
"shunk031/MSCOCO",
year=2014,
coco_task="person_keypoints",
decode_rle=True, # True if Run-length Encoding (RLE) is to be decoded and converted to binary mask.
)
- captioning dataset
import datasets as ds
dataset = ds.load_dataset(
"shunk031/MSCOCO",
year=2017,
coco_task="captions",
)
- instances dataset
import datasets as ds
dataset = ds.load_dataset(
"shunk031/MSCOCO",
year=2017,
coco_task="instances",
decode_rle=True, # True if Run-length Encoding (RLE) is to be decoded and converted to binary mask.
)
- person keypoints dataset
import datasets as ds
dataset = ds.load_dataset(
"shunk031/MSCOCO",
year=2017,
coco_task="person_keypoints",
decode_rle=True, # True if Run-length Encoding (RLE) is to be decoded and converted to binary mask.
)
[More Information Needed]
[More Information Needed]
[More Information Needed]
[More Information Needed]
[More Information Needed]
[More Information Needed]
[More Information Needed]
[More Information Needed]
[More Information Needed]
[More Information Needed]
[More Information Needed]
[More Information Needed]
[More Information Needed]
[More Information Needed]
The annotations in this dataset along with this website belong to the COCO Consortium and are licensed under a Creative Commons Attribution 4.0 License.
The COCO Consortium does not own the copyright of the images. Use of the images must abide by the Flickr Terms of Use. The users of the images accept full responsibility for the use of the dataset, including but not limited to the use of any copies of copyrighted images that they may create from the dataset.
Copyright (c) 2015, COCO Consortium. All rights reserved. Redistribution and use software in source and binary form, with or without modification, are permitted provided that the following conditions are met:
- Redistributions of source code must retain the above copyright notice, this list of conditions and the following disclaimer.
- Redistributions in binary form must reproduce the above copyright notice, this list of conditions and the following disclaimer in the documentation and/or other materials provided with the distribution.
- Neither the name of the COCO Consortium nor the names of its contributors may be used to endorse or promote products derived from this software without specific prior written permission.
THIS SOFTWARE AND ANNOTATIONS ARE PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS "AS IS" AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT OWNER OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.
@inproceedings{lin2014microsoft,
title={Microsoft coco: Common objects in context},
author={Lin, Tsung-Yi and Maire, Michael and Belongie, Serge and Hays, James and Perona, Pietro and Ramanan, Deva and Doll{\'a}r, Piotr and Zitnick, C Lawrence},
booktitle={Computer Vision--ECCV 2014: 13th European Conference, Zurich, Switzerland, September 6-12, 2014, Proceedings, Part V 13},
pages={740--755},
year={2014},
organization={Springer}
}
Thanks to COCO Consortium for creating this dataset.