Key points are not available for this paper at this time.
The rise of multi-million-item dataset initiatives has enabled data-hungry machine learning algorithms to reach near-human semantic classification performance at tasks such as visual object and scene recognition. Here we describe the Places Database, a repository of 10 million scene photographs, labeled with scene semantic categories, comprising a large and diverse list of the types of environments encountered in the world. Using the state-of-the-art Convolutional Neural Networks (CNNs), we provide scene classification CNNs (Places-CNNs) as baselines, that significantly outperform the previous approaches. Visualization of the CNNs trained on Places shows that object detectors emerge as an intermediate representation of scene classification. With its high-coverage and high-diversity of exemplars, the Places Database along with the Places-CNNs offer a novel resource to guide future progress on scene recognition problems.
Building similarity graph...
Analyzing shared references across papers
Loading...
Bolei Zhou
University of California, Los Angeles
Àgata Lapedriza
Northeastern University
Aditya Khosla
University of North Carolina at Chapel Hill
IEEE Transactions on Pattern Analysis and Machine Intelligence
Massachusetts Institute of Technology
Universitat Oberta de Catalunya
Building similarity graph...
Analyzing shared references across papers
Loading...
Zhou et al. (Tue,) studied this question.
synapsesocial.com/papers/68ffcc959c622404abed979c — DOI: https://doi.org/10.1109/tpami.2017.2723009