Of all the visual tasks we might ask a computer to perform, analyzing a scene and recognizing all of the constituent objects remains the most challenging. While computers excel at accurately reconstructing the 3D shape of a scene from images taken from different views, they cannot name all the objects and animals present in a picture, even at the level of a twoyear- old child. There is not even any consensus among researchers on when this level of performance might be achieved.
CITATION STYLE
Szeliski, R. (2011). Structure from motion (pp. 303–334). https://doi.org/10.1007/978-1-84882-935-0_7
Mendeley helps you to discover research relevant for your work.