Abstract: To mitigate the heavy reliance on semantic information and the unreliability of manual feature extraction in dynamic simultaneous localization and mapping (SLAM) and object tracking systems, ...
Abstract: Programming based approaches to reasoning tasks have substantially expanded the types of questions models can answer about visual scenes. Yet on benchmark visual reasoning data, when models ...