Learning Object-Centric Representations of Visually Complex Scenes

Dec 1, 2022 · 1 min read

Learning distributed vector representations has been a driving force behind the remarkable success of deep neural networks in perceptual tasks on ImageNet-like photos with solitary objects (cars, trunks, etc.) in the center. One emerging paradigm for capturing the compositional nature of complex scenes is to learn slot-based object-centric representations. Nevertheless, existing methods to date are still largely limited to simple synthetic settings with low visual complexity.

Last updated on Dec 1, 2022

Computer Vision

Authors

Linyan Yang

Machine Learning Engineer

← Efficient Unbiased Training of Large-scale Distributed WGAN Mar 1, 2023