Cross‐Situational Statistics Present in an Early Language Learning Context: Evidence From Naturalistic Parent–Child Interactions

Abstract
According to the cross-situational learning account, infants aggregate statistical information from multiple parent naming events to resolve ambiguous word-referent mappings within individual naming events. While previous experimental studies have shown that infant and adult learners can build correct mappings based on statistical regularities encoded in multiple learning situations in an experiment, other studies that use more naturalistic stimuli (e.g., real-world video) reveal poor performance in adults’ ability to infer the correct referent. Based on those results derived from more naturalistic stimuli, the cross-situational learning solution cannot be useful to solve the mapping problem in the real world because cross-situational statistics from the real world are much more ambiguous than those created in experimental studies. To examine the feasibility of cross-situational learning in everyday contexts, the present study aims to quantify visual-audio statistics from one of everyday activities—parent–child toy play. We analyze parent naming events in a video corpus of infant-perspective scenes during parent–child toy play in a naturalistic lab setting, where we found three distinct properties that characterize statistical regularities perceived by young learners: (1) there are a limited number of visual scene compositions perceived by young learners at the moments when they hear object names; (2) the frequencies of parent naming events are distributed in a skewed, Zipfian fashion; and (3) cross-situational statistics in naturalistic toy play are comparable to those used in laboratory experiments. Our results underscore the importance of quantifying the statistical regularities in the input from the learner’s perspective in order to shed light on the mechanisms supporting early word learning.

Read Full Article (External Site)