Discriminative learning and the lexicon: NDL and LDL
2021, Oxford Research Encyclopedia of Linguistics
https://doi.org/10.1093/ACREFORE/9780199384655.013.375Abstract
Chuang, Y.-Y., and R. H. Baayen Naive discriminative learning (NDL) and linear discriminative learning (LDL) are simple computational algorithms for lexical learning and lexical processing. Both NDL and LDL assume that learning is discriminative, driven by prediction error, and that it is this error that calibrates the association strength between input and output representations. Both words’ forms and their meanings are represented by numeric vectors, and mappings between forms and meanings are set up. For comprehension, form vectors predict meaning vectors. For production, meaning vectors map onto form vectors. These mappings can be learned incrementally, approximating how children learn the words of their language. Alternatively, optimal mappings representing the end state of learning can be estimated. The NDL and LDL algorithms are incorporated in a computational theory of the mental lexicon, the ‘discriminative lexicon’. The model shows good performance both with respect to production and comprehension accuracy, and for predicting aspects of lexical processing, including morphological processing, across a wide range of experiments. Since, mathematically, NDL and LDL implement multivariate multiple regression, the ‘discriminative lexicon’ provides a cognitively motivated statistical modeling approach to lexical processing.
References (52)
- Arnold, D., Tomaschek, F., Lopez, F., Sering, T., and Baayen, R. H. (2017). Words from spontaneous conversational speech can be recognized with human-like accuracy by an error-driven learning algorithm that discriminates between meanings straight from smart acoustic features, bypassing the phoneme as recognition unit. PLOS ONE, 12(4):e0174623.
- Baayen, R. H. (2012). Learning from the bible: computational modelling of the costs of letter transpositions and letter exchanges in reading classical hebrew and modern english. Lingue e linguaggio, 11(2):123-146.
- Baayen, R. H., Chuang, Y.-Y., and Blevins, J. P. (2018). Inflectional morphology with linear mappings. The Mental Lexicon, 13(2):232-270.
- Baayen, R. H., Chuang, Y.-Y., Shafaei-Bajestan, E., and Blevins, J. (2019). The discriminative lexicon: A unified computational model for the lexicon and lexical processing in comprehension and production grounded not in (de)composition but in linear discriminative learning. Complexity.
- Baayen, R. H., Milin, P., Filipović Durdević, D., Hendrix, P., and Marelli, M. (2011). An amorphous model for morphological processing in visual comprehension based on naive discriminative learning. Psychological Review, 118:438-482.
- Baayen, R. H., Milin, P., and Ramscar, M. (2016a). Frequency in lexical processing. Aphasiology, 30(11):1174-1220.
- Baayen, R. H. and Moscoso del Prado Martín, F. (2005). Semantic density and past-tense formation in three Germanic languages. Language, 81:666-698.
- Baayen, R. H., Shaoul, C., Willits, J., and Ramscar, M. (2016b). Comprehension without segmentation: A proof of concept with naive discriminative learning. Language, Cognition, and Neuroscience, 31(1):106-128.
- Baayen, R. H. and Smolka, E. (2020). Modelling morphological priming in German with naive discriminative learning. Frontiers in Communication, section Language Sciences. preprint on PsyArXiv, doi:10.31234/osf.io/nj39v.
- Balota, D., Cortese, M., Sergent-Marshall, S., Spieler, D., and Yap, M. (2004). Visual word recognition for single-syllable words. Journal of Experimental Psychology: General, 133:283- 316.
- Blevins, J. P. (2016). Word and paradigm morphology. Oxford University Press.
- Bozic, M., Tyler, L. K., Ives, D. T., Randall, B., and Marslen-Wilson, W. D. (2010). Bihemispheric foundations for human speech comprehension. Proceedings of the National Academy of Sciences, 107(40):17439-17444.
- Breiman, L. (2001). Random forests. Machine Learning, 45:5-32.
- Burnard, L. (1995). Users guide for the British National Corpus. British National Corpus consortium, Oxford university computing service.
- Chuang, Y.-Y., Bell, M. J., Banke, I., and Baayen, R. H. (2020a). Bilingual and multilingual mental lexicon: a modeling study with linear discriminative learning. Language Learning, pages 1-73.
- Chuang, Y.-Y., Loo, K., Blevins, J. P., and Baayen, R. H. (2020b). Estonian case inflection made simple. A case study in Word and Paradigm morphology with Linear Discriminative Learning. In Körtvélyessy, L. and Štekauer, P., editors, Advances in Morphology, pages 119-141. Cambridge University Press.
- Chuang, Y.-Y., Vollmer, M.-L., Shafaei-Bajestan, E., Gahl, S., Hendrix, P., and Baayen, R. H. (2020c). The processing of nonword form and meaning in production and comprehension: A computational modeling approach using linear discriminative learning. Behavior Research Methods, pages 1-51.
- Cibelli, E. S., Leonard, M. K., Johnson, K., and Chang, E. F. (2015). The influence of lexical statistics on temporal lobe cortical dynamics during spoken word listening. Brain and language, 147:66-75.
- Dalal, N. and Triggs, B. (2005). Histograms of oriented gradients for human detection. In 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), volume 1, pages 886-893.
- Danks, D. (2003). Equilibria of the Rescorla-Wagner model. Journal of Mathematical Psychology, 47(2):109-121.
- Divjak, D., Milin, P., Ez-zizi, A., Józefowski, J., and Adam, C. (2020). What is learned from exposure: an error-driven approach to productivity in language. Language, Cognition and Neuroscience, pages 1-24.
- Ellis, N. C. (2006). Language acquisition as rational contingency learning. Applied Linguistics, 27(1):1-24.
- Ellis, N. C. (2013). Second language acquisition. The Routledge Handbook of Second Language Acquisition, page 193.
- Frost, R. (2012). Towards a universal model of reading. Behavioral and Brain Sciences, page in press.
- Grainger, J., Dufau, S., Montant, M., Ziegler, J. C., and Fagot, J. (2012). Orthographic processing in baboons (papio papio). Science, 336(6078):245-248.
- Hannagan, T., Ziegler, J. C., Dufau, S., Fagot, J., and Grainger, J. (2014). Deep learning of orthographic representations in baboons. PLOS-one, 9:e84843.
- Hannun, A., Case, C., Casper, J., Catanzaro, B., Diamos, G., Elsen, E., Prenger, R., Satheesh, S., Sengupta, S., Coates, A., et al. (2014). Deep speech: Scaling up end-to-end speech recognition. arXiv preprint arXiv:1412.5567.
- Heitmeier, M. and Baayen, R. H. (2021). Simulating phonological and semantic impairment of English tense inflection with Linear Discriminative Learning. The Mental Lexicon, in press. PsyArXiv.
- Joanisse, M. F. and Seidenberg, M. S. (1999). Impairments in verb morphology after brain injury: a connectionist model. Proceedings of the National Academy of Sciences, 96:7592-7597.
- Kapatsinski, V. (2018). Changing minds changing tools: From learning theory to language acquisition to language change. MIT Press.
- Keuleers, E., Lacey, P., Rastle, K., and Brysbaert, M. (2012). The british lexicon project: Lexical decision data for 28,730 monosyllabic and disyllabic english words. Behavior Research Methods, 44(1):287-304.
- Landauer, T. and Dumais, S. (1997). A solution to Plato's problem: The latent semantic analysis theory of acquisition, induction and representation of knowledge. Psychological Review, 104(2):211-240.
- Liberman, A. and Mattingly, I. (1985). The motor theory of speech perception revised. Cognition, 21:1-36.
- Linke, M., Broeker, F., Ramscar, M., and Baayen, R. H. (2017). Are baboons learning "orthographic" representations? probably not. PLOS-ONE, 12(8):e0183876.
- Lund, K. and Burgess, C. (1996). Producing high-dimensional semantic spaces from lexical co- occurrence. Behaviour Research Methods, Instruments, and Computers, 28(2):203-208.
- Luo, X., Chuang, Y. Y., and Baayen, R. H. (2021). Judiling: an implementation in Julia of Linear Discriminative Learning algorithms for language modeling.
- Matthews, P. H. (1974). Morphology. An Introduction to the Theory of Word Structure. Cambridge University Press, Cambridge.
- Mikolov, T., Chen, K., Corrado, G., and Dean, J. (2013). Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781.
- Milin, P., Divjak, D., and Baayen, R. H. (2017a). A learning perspective on individual differences in skilled reading: Exploring and exploiting orthographic and semantic discrimination cues. Journal of Experimental Psychology: Learning, Memory, and Cognition.
- Milin, P., Feldman, L. B., Ramscar, M., Hendrix, P., and Baayen, R. H. (2017b). Discrimination in lexical decision. PLOS-one, 12(2):e0171935.
- Milin, P., Madabushi, H. T., Croucher, M., and Divjak, D. (2020). Keeping it simple: Implementation and performance of the proto-principle of adaptation and learning in the language sciences. PsyArXiv.
- Nixon, J. S. (2020). Of mice and men: Speech sound acquisition as discriminative learning from prediction error, not just statistical tracking. Cognition, 197:104081.
- Perrone-Bertolotti, M., Kujala, J., Vidal, J. R., Hamame, C. M., Ossandon, T., Bertrand, O., Minotti, L., Kahane, P., Jerbi, K., and Lachaux, J.-P. (2012). How silent is silent reading? intracerebral evidence for top-down activation of temporal voice areas during reading. Journal of Neuroscience, 32(49):17554-17562.
- Plag, I., Homann, J., and Kunter, G. (2017). Homophony and morphology: The acoustics of word- final S in English. Journal of Linguistics, 53(1):181-216.
- Ramscar, M., Dye, M., and Klein, J. (2013a). Children value informativity over logic in word learning. Psychological Science, 24(6):1017-1023.
- Ramscar, M., Dye, M., and McCauley, S. M. (2013b). Error and expectation in language learning: The curious absence of mouses in adult speech. Language, 89(4):760-793.
- Ramscar, M., Hendrix, P., Shaoul, C., Milin, P., and Baayen, R. H. (2014). Nonlinear dynamics of lifelong learning: the myth of cognitive decline. Topics in Cognitive Science, 6:5-42.
- Ramscar, M. and Yarlett, D. (2007). Linguistic self-correction in the absence of feedback: A new approach to the logical problem of language acquisition. Cognitive Science, 31(6):927-960.
- Ramscar, M., Yarlett, D., Dye, M., Denny, K., and Thorpe, K. (2010). The effects of feature-label- order and their implications for symbolic learning. Cognitive Science, 34(6):909-957.
- Rastle, K., Davis, M. H., and New, B. (2004). The broth in my brother's brothel: Morpho- orthographic segmentation in visual word recognition. Psychonomic Bulletin & Review, 11:1090- 1098.
- Rescorla, R. A. (1988). Pavlovian conditioning. It's not what you think it is. American Psychologist, 43(3):151-160.
- Rescorla, R. A. and Wagner, A. R. (1972). A theory of Pavlovian conditioning: Variations in the effectiveness of reinforcement and nonreinforcement. In Black, A. H. and Prokasy, W. F., editors, Classical conditioning II: Current research and theory, pages 64-99. Appleton Century Crofts, New York.