Polysemous Visual-Semantic Embedding for Cross-Modal Retrieval (CVPR 2019)
-
Updated
Mar 15, 2024 - Python
Polysemous Visual-Semantic Embedding for Cross-Modal Retrieval (CVPR 2019)
Summary about Video-to-Text datasets. This repository is part of the review paper *Bridging Vision and Language from the Video-to-Text Perspective: A Comprehensive Review*
Python implementation of extraction of several visual features representations from videos
Add a description, image, and links to the tgif-dataset topic page so that developers can more easily learn about it.
To associate your repository with the tgif-dataset topic, visit your repo's landing page and select "manage topics."