TY - GEN
T1 - Automatic learning of phonetic mappings for cross-language phonetic-search in keyword spotting
AU - Bar-Yosef, Yossi
AU - Aloni-Lavi, Ruth
AU - Opher, Irit
AU - Lotner, Noam
AU - Tetariy, Ella
AU - Silber-Varod, Vered
AU - Aharonson, Vered
AU - Moyal, Ami
PY - 2012
Y1 - 2012
N2 - Phonetic-search (PS) is an extremely fast technique used for spoken keyword spotting over large amounts of audio data. PS is based on matching a desired phonetic pattern over existing phonetic lattices, avoiding heavy computations of acoustic probabilities during the search. Since PS requires substantial acoustic and language resources (LR) for training acoustic models, there is a need for reducing model training costs to support new target languages. Particular cases of under-resourced languages pose even a greater challenge for PS as the available LR are not sufficient for acoustic model training. This study examines methods for keyword search in a new target language, using existing models of another source language in the lattice generation phase. We explore methodologies for learning cross-language phonetic mappings depending on the availability of data in the target language. We describe three approaches for creating phonetic-mappings: linguistic, acoustic, and statistic, introducing an efficient way for learning a robust statistical cross-language mapping. Our cross-language PS experiments showed that learning a good cross-language mapping can alleviate acoustic mismatches between languages, to significantly improve cross-language phonetic-search.
AB - Phonetic-search (PS) is an extremely fast technique used for spoken keyword spotting over large amounts of audio data. PS is based on matching a desired phonetic pattern over existing phonetic lattices, avoiding heavy computations of acoustic probabilities during the search. Since PS requires substantial acoustic and language resources (LR) for training acoustic models, there is a need for reducing model training costs to support new target languages. Particular cases of under-resourced languages pose even a greater challenge for PS as the available LR are not sufficient for acoustic model training. This study examines methods for keyword search in a new target language, using existing models of another source language in the lattice generation phase. We explore methodologies for learning cross-language phonetic mappings depending on the availability of data in the target language. We describe three approaches for creating phonetic-mappings: linguistic, acoustic, and statistic, introducing an efficient way for learning a robust statistical cross-language mapping. Our cross-language PS experiments showed that learning a good cross-language mapping can alleviate acoustic mismatches between languages, to significantly improve cross-language phonetic-search.
KW - phonetic-mapping
KW - phonetic-search
KW - spotting
KW - under-resourced languages
UR - http://www.scopus.com/inward/record.url?scp=84871960357&partnerID=8YFLogxK
U2 - 10.1109/EEEI.2012.6376955
DO - 10.1109/EEEI.2012.6376955
M3 - ???researchoutput.researchoutputtypes.contributiontobookanthology.conference???
AN - SCOPUS:84871960357
SN - 9781467346801
T3 - 2012 IEEE 27th Convention of Electrical and Electronics Engineers in Israel, IEEEI 2012
BT - 2012 IEEE 27th Convention of Electrical and Electronics Engineers in Israel, IEEEI 2012
T2 - 2012 IEEE 27th Convention of Electrical and Electronics Engineers in Israel, IEEEI 2012
Y2 - 14 November 2012 through 17 November 2012
ER -