Delving into the Power of Visual Heuristics for Verb Production

Published on July 31, 2023

Imagine trying to capture the essence of a story by looking at the cover of a book. In a similar way, speakers rely on visual cues to identify features of verbs, such as tense and aspect morphology. To explore this phenomenon, researchers conducted experiments with Japanese speakers who were asked to describe computer-generated animations depicting simple actions. The results showed that participants were able to use visual information, specifically goal-related visuals, to select the appropriate verb forms. Additionally, they produced a wide variety of verb forms. To explain these findings, a deep-learning model was created that mimicked human-like verb production using visual input. This model successfully used visual cues to determine the correct tense/aspect morphology. Interestingly, the model also predicted that video duration would impact verb complexity and that past tense production would increase when the endpoint of an action was shown. These predictions were confirmed in a separate study with Japanese adults. Overall, this research suggests that verb production is closely linked to visual heuristics that aid in understanding events.

Abstract
Tense/aspect morphology on verbs is often thought to depend on event features like telicity, but it is not known how speakers identify these features in visual scenes. To examine this question, we asked Japanese speakers to describe computer-generated animations of simple actions with variation in visual features related to telicity. Experiments with adults and children found that they could use goal information in the animations to select appropriate past and progressive verb forms. They also produced a large number of different verb forms. To explain these findings, a deep-learning model of verb production from visual input was created that could produce a human-like distribution of verb forms. It was able to use visual cues to select appropriate tense/aspect morphology. The model predicted that video duration would be related to verb complexity, and past tense production would increase when it received the endpoint as input. These predictions were confirmed in a third study with Japanese adults. This work suggests that verb production could be tightly linked to visual heuristics that support the understanding of events.

Read Full Article (External Site)

Leave a Reply

Your email address will not be published. Required fields are marked *

You may use these HTML tags and attributes:

<a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <s> <strike> <strong>