Learning from a few examples and generalizing to markedly different situations are capabilities of human visual intelligence that are yet to be matched by leading machine learning models. By drawing inspiration from systems neuroscience, we introduce a probabilistic generative model for vision in which message-passing-based inference handles recognition, segmentation, and reasoning in a unified way. The model demonstrates excellent generalization and occlusion-reasoning capabilities and outperforms deep neural networks on a challenging scene text recognition benchmark while being 300-fold more data efficient. In addition, the model fundamentally breaks the defense of modern text-based CAPTCHAs (Completely Automated Public Turing test to tell Computers and Humans Apart) by generatively segmenting characters without CAPTCHA-specific heuristics. Our model emphasizes aspects such as data efficiency and compositionality that may be important in the path toward general artificial intelligence.
Dileep George
DeepMind (United Kingdom)
Wolfgang Lehrach
Union University
Ken Kansky
Union University
Science
Union University
Clarion University
George et al. studied this question.
synapsesocial.com/papers/69d97f580d540cafc5835e3f — DOI: https://doi.org/10.1126/science.aag2612