石墨烯知识资源中心

专利标题: Computer-implemented method for adapting automatic speech recognition engine for processing dynamic domain voice queries, involves receiving voice query that includes action and requested media content at user device.
专利号: EP4064278-A2, US2022310076-A1, EP4064278-A3
发明人: KUMAR A, BRATT E O, HEO M, RAJSHREE N, MANGALATH P C, BRATT E
专利权人: ROKU INC, ROKU INC
国际专利分类: G10L015/065, G10L015/187, G10L015/22, G06F040/205, G06F040/295, G10L015/18, G10L025/33, G10L015/26
专利详细信息: EP4064278-A2 28 Sep 2022 G10L-015/187 202283 Pages: 25 English
申请详细信息: EP4064278-A2 EP164531 25 Mar 2022
优先权号: US214462

▎ 摘　　要

NOVELTY - The method involves receiving a voice query that includes an action and requested media content. A transcription of the voice query is generated. The transcription is parsed to identify an entity corresponding to the textual representation of the media content. A phonetic representation of the entity is generated. The phonetic representation includes graphene of the entity, a phoneme of the entity, and an N-gram of the entity. A fuzzy candidate list comprising fuzzy candidates representing potential matches to the requested media content is generated based on the phonetic representation. The fuzzy candidate list is ranked to form a ranked fuzzy candidate list including a highest ranked fuzzy candidate corresponding to a best potential match for the requested media content. The action associated with the highest ranked fuzzy candidate is performed. USE - Method for adapting automatic speech recognition engine. ADVANTAGE - The domain adapted audio command processing module generates domain-specific fuzzy candidates that potentially match with the content being requested by the voice query, thus effectively processing dynamic domain voice queries in an automatic speech recognition (ASR) system, and hence improving the performance of the ASR system. DESCRIPTION OF DRAWING(S) - The drawing shows a block diagram illustrating the multimedia environment. 102User 104Media system 106Media device 120Content server 122Content