Mental rotation and orientation-dependence in shape recognition

  • Tarr M
  • Pinker S
  • 222

    Readers

    Mendeley users who have this article in their library.
  • 494

    Citations

    Citations of this article.

Abstract

How do we recognize objects despite differences in their retinal projections when they are seen at different orientations? Marr and Nishihara (1978) proposed that shapes are represented in memory as structural descriptions in object-centered coordinate systems, so that an object is represented identically regardless of its orientation. An alternative hypothesis is that an object is represented in memory in a single representation corresponding to a canonical orientation, and a mental rotation operation transforms an input shape into that orientation before input and memory are compared. A third possibility is that shapes are stored in a set of representations, each corresponding to a different orientation. In four experiments, subjects studied several objects each at a single orientation, and were given extensive practice at naming them quickly, or at classifying them as normal or mirror-reversed, at several orientations. At first, response times increased with departure from the study orientation, with a slope similar to those obtained in classic mental rotation experiments. This suggests that subjects made both judgments by mentally transforming the orientation of the input shape to the one they had initially studied. With practice, subjects recognized the objects almost equally quickly at all the familiar orientations. At that point they were probed with the same objects appearing at novel orientations. Response times for these probes increased with increasing disparity from the previously trained orientations. This indicates that subjects had stored representations of the shapes at each of the practice orientations and recognized shapes at the new orientations by rotating them to one of the stored orientations. The results are consistent with a hybrid of the second (mental transformation) and third (multiple view) hypotheses of shape recognition: input shapes are transformed to a stored view, either the one at the nearest orientation or one at a canonical orientation. Interestingly, when mirrorimages of trained shapes were presented for naming, subjects took the same time at all orientations. This suggests that mental transformations of orientation can take the shortest path of rotation that will align an input shape and its memorized counterpart, in this case a rotation in depth about an axis in the picture plane. © 1989.

Get free article suggestions today

Mendeley saves you time finding and organizing research

Sign up here
Already have an account ?Sign in

Find this document

Authors

Cite this document

Choose a citation style from the tabs below

Save time finding and organizing research with Mendeley

Sign up for free