Optimality Theory: Constraint Interaction in Generative Grammar
Studies in Second Language Acquisition (1993)
- ISSN: 02722631
- ISBN: 9780470759400
- DOI: 10.1017/S0272263106220060
- PubMed: 10599628
Available from www.journals.cambridge.org
or
Abstract
When referring to papers, the Archive interface only requires the sequence number (the first three digits). Citation by sequence number (ex. ROA-) is therefore sufficient for bibliographic purposes as well. The citation should also include reference to the Optimality Archive and its URL. For example: Green, Thomas and Michael Kenstowicz. 1995. The Lapse Constraint. Ms. MIT. ROA-101, Rutgers Optimality Archive, http://roa.rutgers.edu/
Available from www.journals.cambridge.org
Page 1
Optimality Theory: Constraint Int...
~ ROA Version, 8/2002. Essentially identical to the Tech Report, with new pagination (but the same footnote and example numbering) correction of typos, oversights & outright errors improved typography and occasional small-scale clarificatory rewordings. Citation should include reference to this version. OPTIMALITY THEORY Constraint Interaction in Generative Grammar First circulated: April, 1993 RuCCS-TR-2 CU-CS-696-93: July, 1993 Minor Corrections: December, 1993 ROA Version: August, 2002 Alan Prince Paul Smolensky Department of Linguistics Department of Cognitive Science Rutgers Cognitive Science Center The Johns Hopkins University Rutgers University [1993: University of Colorado at Boulder] prince@ruccs.rutgers.edu smolensky@cogsci.jhu.edu Everything is possible but not everything is permitted Richard Howard, The Victor Vanquished It is demonstrated, he said, that things cannot be otherwise: for, since everything was made for a purpose, everything is necessarily made for the best purpose. Candide ou loptimisme. Ch. I. Remark. The authors names are arranged in lexicographic order.
Page 2
Acknowledgments Special thanks to John McCarthy for detailed discussion of virtually every issue raised here and for a fine-grained skepsis of the entire first draft of the ms., which resulted in innumerable improvements and would have resulted in innumerably more, were this a better world. We are particularly grateful for his comments and suggestions in r��� Chs. 7 and 9. We also wish to thank Robert Kirchner, Armin Mester, and Junko It for remarks that have had significant impact on the development of this work, as well as David Perlmutter, Vieri Samek-Lodovici, Cheryl Zoll, Henrietta Hung, Mark Hewitt, Jane Grimshaw, Ad Neeleman, Diana Archangeli, Henry Churchyard, Doug Pulleyblank, Moira Yip, Tom Bever, Larry Hyman, Andy Black, Mike Jordan, Lauri Karttunen, Ren�� Kager, Paul Kiparsky, Mike Kenstowicz, Ellis Visch, AndrAEs Kornai, Akin Akinlabi, G��raldine Legendre, Clayton Lewis, Merrill Garrett, Jim Martin, Clara Levelt, Mike Mozer, Maria Bittner, Alison Prince, Dave Rumelhart, Mark Liberman, Jacques Mehler, Steve Pinker, Daniel B��ring, Katharina Hartmann, Joshua Legendre Smolensky, Ray Jackendoff, Bruce Hayes, Geoff Pullum, Gyanam Mahajan, Harry van der Hulst, William Labov, Brian McHugh, Gene Buckley, Will Leben, Jaye Padgett and Loren Billings. None of these individuals can be sensibly charged with responsibility for any errors that may have crept into this work. To Merrill Garrett (Cognitive Science, University of Arizona, Tucson) and to the organizers of the Arizona Phonology Conference we are grateful for providing in April 1991 the first public forums for the presentation of the theory, which proved a significant stimulus to the cohering thereof. We would also like to thank audiences at our 1991 LSA Summer Institute course and at the Feature Workshop there, at WCCFL 1992, at the OTS (Utrecht), University of California at Berkeley (Phonology Laboratory), the University of Colorado at Boulder and the Boulder Connectionist Research Group, Rutgers University (New Brunswick and Piscataway), Brandeis University, the University of Pennsylvania (the Linguistics Department and the Institute for Research in Cognitive Science), Princeton University Cognitive Science Center, Stanford University (Phonology Workshop and Parallel Distributed Processing Seminar), the University of Rochester Cognitive Science Program, and the International Computer Science Institute of Berkeley CA. Financial support was provided by a University of Colorado Faculty Fellowship, by research funds from Rutgers University and from the Rutgers Center for Cognitive Science, and, most crucially, by NSF SGER BNS-90 16806 without which the rigors of long-distance collaboration would have proved daunting indeed. We remember Robert Jeffers with special appreciation for constructing the Rutgers environment that so greatly facilitated the progress of this work.
Page 3
iii Table of Contents 1. Preliminaries . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 1.1 Background and Overview . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 1.2 Optimality . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4 1.3 Overall Structure of the Argument . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7 Part I Optimality and Constraint Interaction Overview of Part I . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10 2. Optimality in Grammar: Core Syllabification in Imdlawn Tashlhiyt Berber . . . . . . . . . . . . . . . . . . . 11 2.1 The Heart of Dell & Elmedlaoui . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11 2.2 Optimality Theory . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17 2.3 Summary of discussion to date . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22 3. Generalization-Forms in Domination Hierarchies I Blocking and Triggering: Profuseness and Economy . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23 3.1 Epenthetic Structure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24 3.2 Do Something Only When: The Failure of Bottom-up Constructionism . . . . . . . . . . . . . . . . . . . . . . . . . . . 28 4. Generalization-Forms in Domination Hierarchies II Do Something Except When: Blocking, or The Theory of Profuseness . . . . . . . . . . . . . . . . . . . . 33 4.1 Edge-Oriented Infixation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 33 4.2 Interaction of Weight Effects with Extrametricality . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 38 4.2.1 Background: Prominence-Driven Stress Systems . . . . . . . . . . . . . . . . . . . . . . . . . 38 4.2.2 The Interaction of Weight and Extrametricality: Kelkars Hindi . . . . . . . . . . . . . 41 4.3 Nonfinality and Nonexhaustiveness . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 44 4.3.1 Nonfinality and the Laws of Foot Form: Raw Minimality . . . . . . . . . . . . . . . . . . 49 4.3.2 Nonfinality and the Laws of Foot Form: Extended Minimality Effects . . . . . . . . . . . . . . . . . . . . . . . . . 54 4.4 Summary of Discussion of the Except When Effect . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 59 4.5 Except meets Only: Triggering and Blocking in a Single Grammar . . . . . . . . . . . . . . . . . . 59 5. The Construction of Grammar in Optimality Theory . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 73 5.1 Construction of Harmonic Orderings from Phonetic and Structural Scales . . . . . . . . . . . . . . . . . . . . . . . . . . 73 5.2 The Theory of Constraint Interaction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 74 5.2.1 Comparison of Entire Candidates by a Single Constraint . . . . . . . . . . . . . . . . . . 74 5.2.1.1 ONS: Binary constraints . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 75 5.2.1.2 HNUC: Non-binary constraints . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 78 5.2.2 Comparison of Entire Candidates by an Entire Constraint Hierarchy . . . . . . . . . 79 5.2.3 Discussion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 83 5.2.3.1 Non-locality of interaction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 83 5.2.3.2 Strictness of domination . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 85 5.2.3.3 Serial vs. Parallel Harmony Evaluation and Gen . . . . . . . . . . . . . . . . . 86 5.2.3.4 Binary vs. Non-binary constraints . . . . . . . . . . . . . . . . . . . . . . . . . . . . 88 5.3 P~��inis Theorem on Constraint Ranking . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 88
Page 4
iv Part II Syllable Theory Overview of Part II . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 92 6. Syllable Structure Typology I: the CV Theory . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 93 6.1 The Jakobson Typology . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 93 6.2 The Faithfulness Interactions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 95 6.2.1 Groundwork . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 95 6.2.2 Basic CV Syllable Theory . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 98 6.2.2.1 Onsets . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 99 6.2.2.2 Codas . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 102 6.2.3 The Theory of Epenthesis Sites . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 104 7. Constraint Interaction in Lardil Phonology . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 107 7.1 The Constraints . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 107 7.2 The Ranking . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 117 7.2.1 Some Ranking Logic . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 117 7.2.2 Ranking the Constraints . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 120 7.3 Verification of Forms . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 127 7.3.1 Consonant-Final Stems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 128 7.3.2 Vowel Final Stems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 132 7.4 Discussion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 135 8. Universal Syllable Theory II: Ordinal Construction of C/V and Onset/Coda Licensing Asymmetry . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 139 8.1 Associational Harmony . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 144 8.1.1 Deconstructing HNUC: Berber, Take 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 144 8.1.2 Restricting to Binary Marks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 147 8.2 Reconstructing the C and V Classes: Emergent Parameter Setting via Constraint Ranking . . . . . . . . . . . . 152 8.2.1 Harmonic Completeness of Possible Onsets and Peaks . . . . . . . . . . . . . . . . . . . 152 8.2.2 Peak- and Margin-Affinity . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 154 8.2.3 Interactions with PARSE . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 156 8.2.4 Restricting Deletion and Epenthesis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 157 8.2.5 Further Necessary Conditions on Possible Onsets and Nuclei . . . . . . . . . . . . . . 158 8.2.6 Sufficient Conditions on Possible Onsets and Nuclei . . . . . . . . . . . . . . . . . . . . 160 8.3 The Typology of Onset, Nucleus, and Coda Inventories . . . . . . . . . . . . . . . . . . . . . . . . . . 165 8.3.1 The Typology of Onset and Nucleus Inventories . . . . . . . . . . . . . . . . . . . . . . . . 165 8.3.2 Onset/Coda Licensing Asymmetries . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 171 8.3.3 An Example: Berber, Take 2 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 178 8.4 Simplifying the Theory by Encapsulating Constraint Packages . . . . . . . . . . . . . . . . . . . . . 183 8.4.1 Encapsulating the Association Hierarchies . . . . . . . . . . . . . . . . . . . . . . . . . . . . 183 8.4.2 An Example: Berber, Take 3 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 185 8.4.3 Sufficiency and Richness of the Encapsulated Theory . . . . . . . . . . . . . . . . . . . 185
Page 5
v Part III Issues and Answers in Optimality Theory 9. Inventory Theory and the Lexicon . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 191 9.1 Language-Particular Inventories . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 191 9.1.1 Harmonic Bounding and Nucleus, Syllable, and Word Inventories . . . . . . . . . 193 9.1.2 Segmental Inventories . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 195 9.2 Universal Inventories . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 202 9.2.1 Segmental Inventories . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 202 9.2.2 Syllabic Inventories . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 208 9.3 Optimality in the Lexicon . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 209 10. Foundational Issues and Theory-Comparisons . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 215 10.1 Thinking about Optimality . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 215 10.1.1 Fear of Optimization . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 215 10.1.2 The Reassurance . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 215 10.2 The Connectionism Connection, and other Computation-based Comparisons . . . . . . . . 217 10.2.1 Why Optimality Theory has nothing to do with connectionism . . . . . . . . . . . . 217 10.2.2 Why Optimality Theory is deeply connected to connectionism . . . . . . . . . . . . 218 10.2.3 Harmony Maximization and Symbolic Cognition . . . . . . . . . . . . . . . . . . . . . . 219 10.3 Analysis of Phonotactics+Repair Theories . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 221 10.3.1 CV Syllable Structure and Repair . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 224 10.3.2 General Structure of the Comparisons: Repair Analysis . . . . . . . . . . . . . . . . . 226 10.3.3 Persistent Rule Theory . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 228 10.3.3.1 English Closed Syllable Shortening . . . . . . . . . . . . . . . . . . . . . . . . . 229 10.3.3.2 Shona Tone Spreading . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 231 10.3.3.3 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 233 10.3.4 The Theory of Constraints and Repair Strategies . . . . . . . . . . . . . . . . . . . . . . . 233 Appendix +incomplete, . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 241 A.1 The Cancellation and Cancellation/Domination Lemmas . . . . . . . . . . . . . . . . . . . . . . . . 241 A.2 CV Syllable Structure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 241 A.3 P~��ini���s Theorem on Constraint-ranking . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 241 References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 243
Page 6
Plain text is unavailable for this page.
Page 7
1. Preliminaries 1.1 Background and Overview As originally conceived, the RULE of grammar was to be built from a Structural Description delimiting a class of inputs and a Structural Change specifying the operations that altered the input (e.g. Chomsky 1962). The central thrust of linguistic investigation would therefore be to explicate the system of predicates used to analyze inputs the possible Structural Descriptions of rules and to define the operations available for transforming inputs the possible Structural Changes of rules. This conception has been jolted repeatedly by the discovery that the significant regularities were to be found not in input configurations, nor in the formal details of structure-deforming operations, but rather in the character of the output structures, which ought by rights to be nothing more than epiphenomenal. We can trace a path by which conditions on well-formedness start out as peripheral annotations guiding the interpretation of rewrite rules, and, metamorphosing by stages into constraints on output structure, end up as the central object of linguistic study. As the theory of representations in syntax has ramified, the theory of operations has dwindled in content, even to triviality and, for some, nonexistence. The parallel development in phonology and morphology has been underway for a number of years, but the outcome is perhaps less clear both in the sense that one view has failed to predominate, and in the sense that much work is itself imperfectly articulate on crucial points. What is clear is that any serious theory of phonology must rely heavily on well-formedness constraints where by serious we mean committed to Universal Grammar. What remains in dispute, or in subformal obscurity, is the character of the interaction among the posited well-formedness constraints, as well as the relation between such constraints and whatever derivational rules they are meant to influence. Given the pervasiveness of this unclarity, and the extent to which it impedes understanding even the most basic functioning of the grammar, it is not excessively dramatic to speak of the issues surrounding the role of well-formedness constraints as involving a kind of conceptual crisis at the center of phonological thought. Our goal is to develop and explore a theory of the way that representational well-formedness determines the assignment of grammatical structure. We aim therefore to ratify and to extend the results of the large body of contemporary research on the role of constraints in phonological grammar. This body of work is so large and various as to defy concise citation, but we would like to point to such important pieces as Kisseberth 1972, Haiman 1972, Pyle 1972, Hale 1973, Sommerstein 1974, where the basic issues are recognized and addressed to Wheeler 1981, 1988, Bach and Wheeler 1981, Broselow 1982, Dressler 1985 Singh 1987, Paradis 1988ab, Paradis & Prunet 1991, Hulst 1984, Kaye & Lowenstamm 1984, Kaye, Lowenstamm & Vergnaud 1985, Calabrese 1988, Myers 1991, Goldsmith 1990, 1991, Bird 1990, Coleman 1991, Scobbie 1991, which all represent important strands in recent work as well as to Vennemann 1972, Bybee 1972, 1985, Liberman 1975, Goldsmith 1976, Liberman & Prince 1977, McCarthy 1979, McCarthy & Prince 1986, Selkirk 1980ab, 1981, Kiparsky 1980, 1982, Kaye & Lowenstamm 1981, McCarthy 1981, 1986, Lapointe & Feinstein 1982, Cairns & Feinstein 1982, Steriade 1982, Prince 1983, 1990, Kager & Visch 1983, Hayes 1984, Hyman 1985, Dressler 1985, Wurzel 1985, Borowsky 1986ab, It 1986, 1989, Mester 1986, 1992, Halle & Vergnaud 1987, Lakoff 1988, in press, Yip 1988, Cairns 1988, Kager 1989, Visch 1989, Clements 1990, Legendre, Miyata, & Smolensky 1990ab , Mohanan 1991, in press, Archangeli & Pulleyblank 1992, Burzio 1992ab, It, Kitagawa & Mester 1992, It & Mester 1992 a sample of work which offers an array of perspectives on the kinds of problems
Page 8
Chapter 1 Prince & Smolensky 2 we will be concerned with some close to, others more distant from our own, and some contributory of fundamental representational notions that will put in appearances throughout this work (for which, see the local references in the text below). Illuminating discussion of fundamental issues and an interesting conception of the historical development is found in Goldsmith 1990 Scobbie 1992 reviews some recent work. The work of Stampe 1973/79, though framed in a very different way, shares central abstract commitments with our own perhaps more distantly related are Chapter 9 of Chomsky & Halle 1968 and Kean 1975. The work of Wertheimer 1923, Lerdahl & Jackendoff 1983 (chs. 3 and 12), Jackendoff 1983 (chs. 7 and 8), 1987, 1991, though not concerned with phonology at all, provides significant conceptual antecedents similarly, the proposals of Chomsky 1986, and especially 1989, 1992, though very different in implementation, have fundamental similarities with our own. Rizzi 1990, Bittner 1993, and Legendre, Raymond, & Smolensky 1993, and Grimshaw in prep., are among recent works in syntax and semantics that resonate with our particular concerns. The basic idea we will explore is that Universal Grammar consists largely of a set of constraints on representational well-formedness, out of which individual grammars are constructed. The representational system we employ, using ideas introduced into generative phonology in the 1970s and 1980s, will be rich enough to support two fundamental classes of constraints: those that assess output configurations per se and those responsible for maintaining the faithful preservation of underlying structures in the output. Departing from the usual view, we do not assume that the constraints in a grammar are mutually consistent, each true of the observable surface or of some level of representation. On the contrary: we assert that the constraints operating in a particular language are highly conflicting and make sharply contrary claims about the well-formedness of most representations. The grammar consists of the constraints together with a general means of resolving their conflicts. We argue further that this conception is an essential prerequisite for a substantive theory of UG. It follows that many of the conditions which define a particular grammar are, of necessity, frequently violated in the actual forms of the language. The licit analyses are those which satisfy the conflicting constraint set as well as possible they constitute the optimal analyses of underlying forms. This, then, is a theory of optimality with respect to a grammatical system rather than of well- formedness with respect to isolated individual constraints. The heart of the proposal is a means for precisely determining which analysis of an input best satisfies (or least violates) a set of conflicting conditions. For most inputs, it will be the case that every possible analysis violates many constraints. The grammar rates all these analyses according to how well they satisfy the whole constraint set and produces the analysis at the top of this list as the output. This is the optimal analysis of the given input, and the one assigned to that input by the grammar. The grammatically well-formed structures are those that are optimal in this sense. How does a grammar determine which analysis of a given input best satisfies a set of inconsistent well-formedness conditions? Optimality Theory relies on a conceptually simple but surprisingly rich notion of constraint interaction whereby the satisfaction of one constraint can be designated to take absolute priority over the satisfaction of another. The means that a grammar uses to resolve conflicts is to rank constraints in a strict dominance hierarchy. Each constraint has absolute priority over all the constraints lower in the hierarchy.
Page 9
Optimality Theory Chapter 1 3 1 One work that uses ranking as a systematic part of the analysis is Cole 1992 thanks to Robert Kirchner for bringing this to our attention. Such prioritizing is in fact found with surprising frequency in the literature, typically as a subsidiary remark in the presentation of complex constraints.1 We will show that once the notion of constraint-precedence is brought in from the periphery and foregrounded, it reveals itself to be of remarkably wide generality, the formal engine driving many grammatical interactions. It will follow that much that has been attributed to narrowly specific constructional rules or to highly particularized conditions is actually the responsibility of very general well-formedness constraints. In addition, a diversity of effects, previously understood in terms of the triggering or blocking of rules by constraints (or merely by special conditions), will be seen to emerge from constraint interaction. Although we do not draw on the formal tools of connectionism in constructing Optimality Theory, we will establish a high-level conceptual rapport between the mode of functioning of grammars and that of certain kinds of connectionist networks: what Smolensky (1983, 1986) has called Harmony maximization, the passage to an output state with the maximal attainable consistency between constraints bearing on a given input, where the level of consistency is determined exactly by a measure derived from statistical physics. The degree to which a possible analysis of an input satisfies a set of conflicting well-formedness constraints will be referred to as the Harmony of that analysis. We thereby respect the absoluteness of the term well-formed, avoiding terminological confusion and at the same time emphasizing the abstract relation between Optimality Theory and Harmony- theoretic network analysis. In these terms, a grammar is precisely a means of determining which of a pair of structural descriptions is more harmonic. Via pair-wise comparison of alternative analyses, the grammar imposes a harmonic order on the entire set of possible analyses of a given underlying form. The actual output is the most harmonic analysis of all, the optimal one. A structural description is well-formed if and only if the grammar determines it to be the optimal analysis of the corresponding underlying form. With an improved understanding of constraint interaction, a far more ambitious goal becomes accessible: to build individual phonologies directly from universal principles of well-formedness. (This is clearly impossible if we imagine that constraints must be surface- or at least level-true.) The goal is to attain a significant increase in the predictiveness and explanatory force of grammatical theory. The conception we pursue can be stated, in its purest form, as follows: Universal Grammar provides a set of highly general constraints. These often conflicting constraints are all operative in individual languages. Languages differ primarily in how they resolve the conflicts: in the way they rank these universal constraints in strict domination hierarchies that determine the circumstances under which constraints are violated. A language-particular grammar is a means of resolving the conflicts among universal constraints. On this view, Universal Grammar provides not only the formal mechanisms for constructing particular grammars, it also provides the very substance that grammars are built from. Although we shall be entirely concerned in this work with phonology and morphology, we note the implications for syntax and semantics.
Page 10
Chapter 1 Prince & Smolensky 4 2 This kind of reasoning is familiar at the level of grammar selection in the form of the Evaluation Metric (Chomsky 1951, 1965). On this view, the resources of UG define many grammars that generate the same language the members of that set are evaluated, and the optimal grammar is the real one. 1.2 Optimality The standard phonological rule aims to encode grammatical generalizations in this format: (1) A �� B / CD The rule scans potential inputs for structures CAD and performs the change on them that is explicitly spelled out in the rule: the unit denoted by A takes on property B. For this format to be worth pursuing, there must be an interesting theory which defines the class of possible predicates CAD (Structural Descriptions) and another theory which defines the class of possible operations A �� B (Structural Changes). If these theories are loose and uninformative, as indeed they have proved to be in reality, we must entertain one of two conclusions: (i) phonology itself simply doesnt have much content, is mostly periphery rather than core, is just a technique for data-compression, with aspirations to depth subverted by the inevitable idiosyncrasies of history and lexicon or (ii) the locus of explanatory action is elsewhere. We suspect the latter. The explanatory burden can of course be distributed quite differently than in the re-write rule theory. Suppose that the input-output relation is governed by conditions on the well-formedness of the output, markedness constraints, and by conditions asking for the exact preservation of the input in the output along various dimensions, faithfulness constraints. In this case, the inputs falling under the influence of a constraint need share no input-specifiable structure (CAD), nor need there be a single determinate transformation (A��B) that affects them. Rather, we generate (or admit) a set of candidate outputs, perhaps by very general conditions indeed, and then we assess the candidates, seeking the one that best satisfies the relevant constraints. Many possibilities are open to contemplation, but some well-defined measure of value excludes all but the best.2 The process can be schematically represented like this: (2) Structure of Optimality-theoretic grammar a. Gen (Ink) �� {Out1, Out2, } b. H-eval( Outi, 1#i#4 ) �� Outreal The grammar must define a pairing of underlying and surface forms, (inputi, outputj). Each input is associated with a candidate set of possible analyses by the function Gen (short for generator), a fixed part of Universal Grammar. In the rich representational system employed below, an output form retains its input as a subrepresentation, so that departures from faithfulness may be detected
Page 11
Optimality Theory Chapter 1 5 by scrutiny of output forms alone. A candidate is an input-output pair, here formally encoded in what is called Outi in (2). Gen contains information about the representational primitives and their universally irrevocable relations: for example, that the node F may dominate a node Onset or a node : (implementing some theory of syllable structure), but never vice versa. Gen will also determine such matters as whether every segment must be syllabified we assume not, below, following McCarthy 1979 et seq. and whether every node of syllable structure must dominate segmental material again, we will assume not, following It 1986, 1989. The function H-eval determines the relative Harmony of the candidates, imposing an order on the entire set. An optimal output is at the top of the harmonic order on the candidate set by definition, it best satisfies the constraint system. Though Gen has a role to play, the burden of explanation falls principally on the function H-eval, a construction built from well-formedness constraints, and the account of interlinguistic differences is entirely tied to the different ways the constraint-system H-eval can be put together, given UG. H-eval must be constructible in a general way if the theory is to be worth pursuing. There are really two notions of generality involved here: general with respect to UG, and therefore cross- linguistically and general with respect to the language at hand, and therefore across constructions, categories, descriptive generalizations, etc. These are logically independent, and success along either dimension of generality would count as an argument in favor of the optimality approach. But the strongest argument, the one that is most consonant with the work in the area, and the one that will be pursued here, broaches the distinction, seeking a formulation of H-eval that is built from maximally universal constraints which apply with maximal breadth over an entire language. Optimality Theory, in common with much recent work, shifts the burden from the theory of operations (Gen) to the theory of well-formedness (H-eval). To the degree that the theory of well- formedness can be put generally, the theory will fulfill the basic goals of generative grammar. To the extent that operation-based theories cannot be so put, they must be rejected. Among possible developments of the optimality idea, we need to distinguish some basic architectural variants. Perhaps nearest to the familiar derivational conceptions of grammar is what we might call harmonic serialism, by which Gen provides a set of candidate analyses for an input, which are harmonically evaluated the optimal form is then fed back into Gen, which produces another set of analyses, which are then evaluated and so on until no further improvement in representational Harmony is possible. Here Gen might mean: do any one thing: advance all candidates which differ in one respect from the input. The Gen W H-eval loop would iterate until there was nothing left to be done or, better, until nothing that could be done would result in increased Harmony. A significant proposal of roughly this character is the Theory of Constraints and Repair Strategies of Paradis 1988ab, with a couple of caveats: the constraints involved are a set of parochial level-true phonotactic statements, rather than being universal and violable, as we insist and the repair strategies are quite narrowly defined in terms of structural description and structural change rather than being of the do-unto-" variety. A key aspect of Paradiss work is that it confronts the problem of well-definition of the notion repair: what to do when applying a repair strategy to satisfy one constraint results in violation of another constraint (at an intermediate level of derivation). Paradis refers to such situations as constraint conflicts and although these are not conflicts in our sense of the term they cannot be, since all of her constraints are surface- or level- true and therefore never disagree among themselves in the assessment of output well-formedness her work is of unique importance in addressing and shedding light on fundamental complexities in
Page 12
Chapter 1 Prince & Smolensky 6 3 An interesting variant is what we might call anharmonic serialism, in which Gen produces the candidate set by a nondeterministic sequence of constrained procedures (do one thing do another one) which are themselves not subject to harmonic evaluation. The candidate set is derived by running through every possible sequence of such actions harmonic evaluation looks at this candidate set. To a large extent, classical Move-" theories (Chomsky, 1981) work like this. the idea of wellformedness-driven rule-application. The persisent rule theory of Myers 1991 can similarly be related to the notion of Harmony-governed serialism. The program for Harmonic Phonology in Goldsmith 1990, 1991 is even more strongly of this character within its lexical levels, all rules are constrained to apply harmonically. Here again, however, the rules are conceived of as being pretty much of the familiar sort, triggered if they increase Harmony, and Harmony itself is to be defined in specifically phonotactic terms. A subtheory which is very much in the mold of harmonic serialism, using a general procedure to produce candidates, is the Move-x theory of rhythmic adjustment (Prince 1983, Hayes 1991/1995).3 A contrasting view would hold that the Input �� Output map has no internal structure: all possible variants are produced by Gen in one step and evaluated in parallel. In the course of this work, we will see instances of both kinds of analysis, though we will focus predominantly on developing the parallel idea, finding strong support for it, as do McCarthy & Prince 1993. Definitive adjudication between parallel and serial conceptions, not to mention hybrids of various kinds, is a challenge of considerable subtlety, as indeed the debate over the necessity of serial Move-" illustrates plentifully (e.g. Aoun 1986, Browning 1991, Chomsky 1981), and the matter can be sensibly addressed only after much well-founded analytical work and theoretical exploration. Optimality Theory abandons two key presuppositions of earlier work. First, that it is possible for a grammar to narrowly and parochially specify the Structural Description and Structural Change of rules. In place of this is Gen, which generates for any given input a large space of candidate analyses by freely exercising the basic structural resources of the representational theory. The idea is that the desired output lies somewhere in this space, and the constraint system of the grammar is strong enough to single it out. Second, Optimality Theory abandons the widely held view that constraints are language-particular statements of phonotactic truth. In its place is the assertion that constraints are essentially universal and of very general formulation, with great potential for disagreement over the well-formedness of analyses an individual grammar consists of a ranking of these constraints, which resolves any conflict in favor of the higher-ranked constraint. The constraints provided by Universal Grammar are simple and general interlinguistic differences arise from the permutations of constraint-ranking typology is the study of the range of systems that re- ranking permits. Because they are ranked, constraints are regularly violated in the grammatical forms of a language. Violability has significant consequences not only for the mechanics of description, but also for the process of theory construction: a new class of predicates becomes usable in the formal theory, with a concomitant shift in what we can think the actual generalizations are. We cannot expect the world to stay the same when we change our way of describing it.
Page 13
Optimality Theory Chapter 1 7 1.3 Overall Structure of the Argument This work falls into three parts. Part I develops the basic groundwork, theoretical and empirical, and illustrates the characteristic kinds of analytical results that can be gotten from the theory. Part II propounds a theory of universal syllable typology at two levels of idealization, drawing on and then advancing beyond various constraints introduced in Part I. The syllable structure typology provides the basis for a full-scale analysis of the rich system of prosodically-conditioned alternations in the Lardil nominal paradigm. Part III begins with an investigation of the way that inventories are delimited both in UG and in particular grammars. A variety of issues are then explored which have to do with the conceptual structure of the theory and with its relation to other work along the same general lines. We conclude with an Appendix containing proofs of some theorems stated in the text proper and other material of interest. The argument ranges over a variety of issues, problems, generalizations, and theoretical constructions. Some are treated rapidly, with the aim of extracting a general point, others are pursued in detail sometimes the treatment is informal, at other times it is necessary to formalize carefully so that nonobvious results can be established by explicit proof. We have tried to segregate and modularize as much as possible, but the reader should feel free on first reading to tunnel through bits that do not appeal: the formalist can surely find another formal patch up ahead, the connoisseur of generalizations another generalization. We have tried to sign-post the way in the text. If the readers interest is piqued by the present contents, the following works, which make use of Optimality Theory in various ways, may be of interest: Archangeli, Diana and Douglas Pulleyblank. 1992. Grounded phonology. Ms. University of Arizona and University of British Columbia. Black, H. Andrew. 1993. Constraint-ranked derivation: truncation and stem binarity in Southeastern Tepehuan. Ms. UC Santa Cruz. Churchyard, Henry. 1991. Biblical Hebrew prosodic structure as the result of preference-ranked constraints. Goodman, Beverley. 1993. The integration of hierarchical features into a phonological system. Doctoral dissertation, Cornell University. Hung, Henrietta. 1992. Relativized suffixation in Choctaw: a constraint-based analysis of the verb grade system. Ms. Brandeis University. Hung, Henrietta. in preparation. The rhythmic and prosodic organization of edge constituents. Doctoral dissertation. Brandeis University. It, Junko. Yoshihisa Kitagawa. and R. Armin Mester. 1992. prosodic type preservation in japanese: evidence from zuuja-go. SRC-92-05. Syntax Research Center. UC Santa Cruz. It, Junko and R. Armin Mester. 1992. Weak layering and word binarity. Ms. University of California. Santa Cruz. It, Junko and R. Armin Mester. to appear. Licensed segments and safe paths. In Constraints, violations, and repairs in phonology. Special issue of the Canadian Journal of Linguistics. Kirchner, Robert. 1992a. Harmonic Phonology within One Language: An Analysis of Yidiny. MA thesis. University of Maryland. College Park. Kirchner, Robert. 1992b. Yidiny prosody in Harmony Theoretic Phonology. Ms. UCLA.
Page 14
Chapter 1 Prince & Smolensky 8 Legendre, G��raldine, William Raymond, and Paul Smolensky. Analytic typology of case marking and grammatical voice. McCarthy, John. to appear. A case of surface constraint violation. Canadian Journal of Linguistics. special issued edited by Carole Paradis. Darlene LaCharit��. and Emmanuel Nikiema. McCarthy, John and Alan Prince. 1993. Prosodic Morphology I: constraint interaction and satisfaction. Mester, R. Armin. to appear. The quantitative trochee in Latin. Natural Language & Linguistic Theory. Prince, Alan. 1990/92. Quantitative consequences of rhythmic organization. Rosenthall, Sam. in preparation. The phonology of vowels and glides. Doctoral dissertation. University of Massachusetts, Amherst. Samek-Lodovici, Vieri. 1992. Universal constraints and morphological gemination: a crosslinguistic study. Ms. Brandeis University. Samek-Lodovici, Vieri. 1993. A unified analysis of crosslinguistic morphological gemination. In Proceedings of CONSOL-1. Selkirk. Elisabeth. 1993. The prosodic structure of functional elements: affixes, clitics, and words. handout of talk presented at Signal to Syntax Conference. Brown University. Sherer, Tim. in preparation. Prosodic Phonotactics. Doctoral dissertation. University of Massachusetts. Amherst. Yip, Moira. 1992. Cantonese loan word phonology and Optimality Theory. To appear in Journal of East Asian Linguistics. Yip, Moira. 1993. Phonological constraints, optimality, and phonetic realization in Cantonese. UPenn Colloquium. Zec, Draga. in preparation. Coda constraints and conditions on syllable weight. Cornell University. Zoll, Cheryl. 1992. When syllables collide: a theory of alternating quantity. Ms. Brandeis University. Zoll, Cheryl. 1993. Ghost consonants and optimality. WCCFL, Santa Cruz.
Page 15
PART I Optimality and Constraint Interaction
Readership Statistics
471 Readers on Mendeley
by Discipline
90% Linguistics
1% Psychology
by Academic Status
25% Ph.D. Student
24% Student (Master)
11% Student (Bachelor)
by Country
21% United States
10% United Kingdom
5% Brazil
Sign up today - FREE
Mendeley saves you time finding and organizing research. Learn more
- All your research in one place
- Add and import papers easily
- Access it anywhere, anytime


