Are there really interactive processes in speech perception?

James M. McQueen, Anne Cutler, Dennis Norris

    Research output: Contribution to journalArticle

    38 Citations (Scopus)

    Abstract

    On both empirical and theoretical grounds, we argue that the affirmative answer of McClelland et al. [1] is premature. Contrary to the predictions of the TRACE model, which postulates interactive processing in speech perception, there is no lexically mediated compensation for coarticulation when there is a lexical bias in interpretation of the preceding fricative [2]. This cannot be dismissed with criticism of the fricative-final lexical contexts and their distance from the stops (Box 3 in Ref. [1]). The contexts in Ref. [2] were effective – there was a lexical bias on the fricatives; and the timing was appropriate – there was a compensation effect with the same fricative-stop gap in the nonword-context conditions. Furthermore, we suggest that the evidence on compensation that purports to show interaction is unconvincing. Some apparent lexical effects 3 and 4 A.G. Samuel and M.A. Pitt, Lexical activation (and other factors) can mediate compensation for coarticulation, J. Mem. Lang. 48 (2003), pp. 416–434. Abstract | PDF (256 K) | View Record in Scopus | Cited By in Scopus (22)4 are possibly higher-order transitional-probability effects [2]. Others perhaps reflect learning about experiment-internal biases; indeed, the effect in Ref. [5] did not seem to increase over time, but this null result is not definitive because learning could have occurred in the practice block. Perceptual retuning can explain the selective-adaptation data (e.g. Ref. [6]). Retuning of phonetic perception can arise after only ten critical trials [7]; selective adaptation effects depend on many more exposures (e.g. 768 in Ref. [6]). Thus, selective adaptation can follow from perceptual retuning. After listeners learn that the ambiguous phoneme is /s/, for example, it acts as an adaptor, reducing the number of /s/ responses to the test stimuli. Further analysis of the data in Ref. [6] reveals exactly this: early blocks of trials show retuning, whereas later blocks show adaptation (Figure 3 in Ref. [8]). The evidence on whether there is on-line lexical influence on prelexical processes is thus inconclusive. However, consensus has been reached on the existence of lexical feedback for learning 7 and 9 and, as just shown, this can also explain apparent evidence of on-line interaction. Why might there be feedback that affects learning but not processing? Feedback for learning is helpful because it enables the listener to adjust to speaker-related variability [9], but feedback in on-line processing is not beneficial and could even be harmful [10]. These views are based on rational analysis [11]. Analyzing the nature of the perceptual task generated a hypothesis (that was confirmed in Ref. [9]) about how speech perception should operate. According to the rational-analysis perspective, and McClelland et al. [1], the goal of the speech-recognition device is optimal interpretation. Bayesian methods provide the optimal way to combine independent sources of information for perceptual decisions. However, if interaction were permitted between information sources, those sources would no longer be independent and the decision would be suboptimal [10]. Interaction, therefore, makes optimal interpretation harder. If an interactive algorithm could be made to compute the correct Bayesian decision function, the interaction would be a property only of that algorithm, not of the underlying computation. What would be computed is exactly what a non-interactive system would compute. Hence, commitment to the computational principle of optimality requires no commitment to the algorithmic principle of interactive processing. No data require direct influences of the lexicon on prelexical mechanisms, and evidence and computational principles argue against interactive processing. Further evidence of lexical mediation of prelexical processes might yet appear. Indeed, if lexical retuning were implemented in the TRACE model using Hebbian learning 9 and 12, there could well be on-line processing effects. Such demonstrations would be further evidence of feedback in learning and not evidence of interactive processing, which, other than potentially as part of a learning mechanism, serves no useful function.
    Original languageEnglish
    JournalTrends in Cognitive Science
    Publication statusPublished - 2006

    Keywords

    • psycholinguistics
    • speech perception

    Fingerprint

    Dive into the research topics of 'Are there really interactive processes in speech perception?'. Together they form a unique fingerprint.

    Cite this