
Customized conversation technology
25-11-18 15:00
AI even learns language habits? A personalized conversation technology has emerged to help dementia patients communicate.
The Secret to AI's Human-Like Speech: The Evolution of 'Lexical Alignment' Technology
Conversational AI is becoming increasingly natural, but there's still a challenge left: the ability to select words that match the other person's language style, or "lexical alignment." A research team at Radboud University in the Netherlands recently presented a method for learning an individual's language patterns using just 10 minutes of conversational data. This approach holds great promise, particularly for supporting communication with dementia patients. [arxiv.org]
What is lexical alignment?
Lexical alignment is the natural phenomenon in which two speakers gradually use similar words during a conversation. For example, if one person uses the word "chair," the other person, even if they normally use "seat," will change it to "chair" to suit the context. This kind of coordination is crucial for smooth communication, especially when communicating with people who have language difficulties. [arxiv.org]
Why is it important in conversations with dementia patients?
Dementia patients often have difficulty finding words or use unintended words. Previous research has shown that the language of dementia patients exhibits characteristics such as reduced use of nouns, increased use of pronouns and adjectives, and increased use of conjunctions. Therefore, if AI can reflect these patterns in conversation, communication can be more effective. [arxiv.org]
Why 10 Minutes of Data Is Enough
The research team analyzed conversation data of various lengths and confirmed that profiles created from utterance recordings longer than 10 minutes maintained stable performance. While 5 minutes of data was insufficient, 10 minutes was sufficient to capture an individual's typical language style. [arxiv.org]
Optimal Profile Composition
We also experimented with how many words should be included for each part of speech. The results were as follows:
Adjectives and Conjunctions: 5 each
Nouns, Pronouns, Verbs, and Adverbs: 10 each
This combination best balanced data efficiency and performance. [arxiv.org]
Operates without Real-Time Input
The strength of this approach lies in its lack of reliance on real-time user input. Existing systems require continuous analysis during conversations, which limits their ability to consistently input data, such as from dementia patients. In contrast, leveraging pre-built profiles allows for reliable vocabulary alignment in new conversations. [arxiv.org]
Future Prospects
This study demonstrated the concept using conversation data from elderly individuals, rather than actual dementia patient data. If a method is developed to update profiles based on changes in the patient's cognitive status, effective AI care is expected to become possible in the long term.