myG2P

所属分类:模式识别(视觉/语音等)
开发工具:Perl
文件大小:6398KB
下载次数:0
上传日期:2021-04-16 22:30:27
上 传 者sh-1993
说明:  no intro
(Myanmar (Burmese) Language Grapheme to Phoneme (myG2P) Conversion Dictionary for speech recognition (ASR) and speech synthesis (TTS).)

文件列表:
reference (0, 2021-04-17)
reference\G2P4Myanmar_WSSANLP_COLING2016.pdf (1231786, 2021-04-17)
reference\g2p-revised-ICCA2015.pdf (1158036, 2021-04-17)
reference\myg2p-PACLING2015.pdf (1186813, 2021-04-17)
tutorial (0, 2021-04-17)
tutorial\ch2col2.pl (657, 2021-04-17)
tutorial\ch2line.pl (635, 2021-04-17)
tutorial\g2p-tutorial.pdf (1392356, 2021-04-17)
tutorial\gradepos.pl (1370, 2021-04-17)
tutorial\mk-pair.pl (967, 2021-04-17)
tutorial\mk-wordtag.pl (3915, 2021-04-17)
ver1.1 (0, 2021-04-17)
ver1.1\myg2p.ver1.1.txt (1889863, 2021-04-17)
ver1 (0, 2021-04-17)
ver1\myg2p.ver1.txt (1889864, 2021-04-17)
ver2 (0, 2021-04-17)
ver2\myg2p.ver2.0.txt (2329808, 2021-04-17)

# myG2P Myanmar (Burmese) Language Grapheme to Phoneme (myG2P) Conversion Dictionary for speech recognition (ASR) and speech synthesis (TTS). --> [README in Myanmar Language](https://github.com/ye-kyaw-thu/myG2P/blob/master/README-Myanmar.md) ## Lincense Creative Commons Attribution-NonCommercial-Share Alike 4.0 International (CC BY-NC-SA 4.0) License [Details Info of License](https://creativecommons.org/licenses/by-nc-sa/4.0/) Contact email: wasedakuma[at]gmail.com ## Introduction We developed this myG2P (Myanmar Grapheme-to-Phoneme) dictionary for [VoiceTra](http://voicetra.nict.go.jp/en/index.html) (Multilingual Speech Translation Application) Myanmar language project of [NICT](http://www.nict.go.jp/en/), Japan (during 2014-2015). We mainly used MLC (Myanmar Language Commission) dictionary words. Please cite the [ICCA 2015 paper](https://github.com/ye-kyaw-thu/myG2P/blob/master/reference/g2p-revised-ICCA2015.pdf) and/or [COLING 2016 paper](https://github.com/ye-kyaw-thu/myG2P/blob/master/reference/G2P4Myanmar_WSSANLP_COLING2016.pdf), if you use myG2P dictionary. Please cite [PACLING 2015 paper](https://github.com/ye-kyaw-thu/myG2P/blob/master/reference/G2P4Myanmar_WSSANLP_COLING2016.pdf), if you are talking about sentence level grapheme-to-phoneme conversion of Myanmar language. ## Grapheme to Phoneme Mapping The Myanmar Language Commission (MLC) Pronunciation Dictionary can be used as a basis for pronunciation mapping. We found it necessary to extend the dictionary with foreign pronunciations. In the proposed mapping table there are 23 phonetic symbols for 33 consonants (some consonants share the same pronunciation, for example, “”, “”, “” and “” in Table1), 87 vowels combinations and 20 special symbols for foreign word pronunciations. Characters are grouped according to their pronunciation; the groups are unaspirated, aspirated, voiced and nasal and are shown in Table 1. Many Myanmar syllables containing unaspirated and aspirated consonants are pronounced as voiced consonants depending on the neighboring context. Some foreign pronunciations have to be expressed by special vowel combinations because Myanmar pronunciations do not include some pronunciations. See Table 3. MLC dictionary was extended by defining 26 more symbols to include phoneme mappings for foreign words for example, the Myanmar phonetic representation of the foreign name “Alex” “()” is e:le’S (here, S is for ()) and “Swift” “()()” is hswi’HPHT (here, HP is for () and HT is for ()).

Table 1: Groups of Myanmar consonants and their pronunciations

### Contextually Independent Pronunciation This section explains how the pronunciation of Myanmar syllables is normally derived from orthographic structure. Myanmar syllables are generally composed of consonants and (zero or more) vowel combinations starting with a consonant. Here, vowel combinations can be a single vowel, sequences of vowels starting with a consonant that modifies the pronunciation of the first vowel. The pronunciations of consonants when they are combined with vowels are shown in Table 2.

Table 2: Examples of vowel combinations and their pronunciations

### Contextually Dependent Pronunciations Some Myanmar syllables do not conform to these standard rules of pronunciation. The pronunciation of the syllables can depend on the context of syllables. Differences between standard pronunciations and correct pronunciations of some words are shown in Table 3 as examples.

Tagle 3: Examples of contextually dependent pronunciations of some Myanmar words

## Dictionary Format The dictionary format is distributed as a plain text file with one entry to a line in the format as follow: Word-ID\Word\Syllable-Breaked-Word\Pronunciation\IPA Example: ``` 19663 thu. ta. θu ta 196*** thu. ta. sa pei θu ta sa pe 19665 thu. ti. θu ti 19666 thu. tei tha- na. θu te θ na 19667 thu. tei thi θu te θi 19668 thu. da- ma za- ja' θu d ma z ja 19669 thou' da bou' θo da bo 19670 thu. na pa- ran ta. tain: θu na p a ta ta 19671 thu. ba. ja za θu ba ja za 19672 thu. min ga- la. θu m ɡ la ``` ## Versions [Version.1.0](https://github.com/ye-kyaw-thu/myG2P/tree/master/ver1), Released Date: May 30, 2017 [Version.1.1](https://github.com/ye-kyaw-thu/myG2P/tree/master/ver1.1), Released Date: Feb 25, 2019 [Version.2.0](https://github.com/ye-kyaw-thu/myG2P/tree/master/ver2), Released Date: Feb 15, 2021 ## Development and Support Contributors for developing myG2P dictionary are as follows: ### for myG2P (Version 1.0) [Win Pa Pa](https://sites.google.com/site/winpapaucsy/) [Ye Kyaw Thu](https://sites.google.com/site/yekyawthunlp/) ### for myG2P (version 2.0) - Honey Htun (Ph.D. Candidate, Yangon Technological University, Myanmar) - Ni Htwe Aung (Ph.D. Candidate,Yangon Technological University, Myanmar) - Shwe Sin Moe (a Master's student, Yangon Technological University, Myanmar) - Wint Theingi (a Master's student, Yangon Technological University, Myanmar) - [Ye Kyaw Thu](https://sites.google.com/site/yekyawthunlp/) (National Electronics and Computer Technology Center, Thailand) ## Acknowledgement We would like to express our gratitude to Ms. Aye Mya Hlaing and Ms. Hay Mar Soe Naing for checking G2P mappings. We also would like to thanks our NICT colleagues especially to Dr. Jinfu Ni and Dr. Yoshinori Shiga for their valuable suggestions on myG2P development. ## To Do -to add new Myanmar words from various domain ## Publication Ye Kyaw Thu, Win Pa Pa, Andrew Finch, Aye Mya Hlaing, Hay Mar Soe Naing, Eiichiro Sumita and Chiori Hori, "Syllable Pronunciation Features for Myanmar Grapheme to Phoneme Conversion", In Proceedings of the 13th International Conference on Computer Applications (ICCA 2015), February 5~6, 2015, Yangon, Myanmar, pp. 161-167. [Paper](https://github.com/ye-kyaw-thu/myG2P/blob/master/reference/g2p-revised-ICCA2015.pdf) [Best Paper Award] Ye Kyaw Thu, Win Pa Pa, Andrew Finch, Jinfu Ni, Eiichiro Sumita and Chiori Hori, 2015, "The Application of Phrase Based Statistical Machine Translation Techniques to Myanmar Grapheme to Phoneme Conversion", In Proceedings of the Pacific Association for Computational Linguistics Conference (PACLING 2015), May 19~21, 2015, Legian, Bali, Indonesia, pp. 170-176. [Paper](https://github.com/ye-kyaw-thu/myG2P/blob/master/reference/myg2p-PACLING2015.pdf) (revised paper has been published in Springer Communication in Computer and Information Science (CCIS), ISSN:1865-0929, pp. 238-250) _We used myG2P dictionary + extracted 5,276 sentences of BTEC corpus for this PACLING 2015 conference paper_ Ye Kyaw Thu, Win Pa Pa, Yoshinori Sagisaka, Naoto Iwahashi, "Comparison of Grapheme–to–Phoneme Conversion Methods on a Myanmar Pronunciation Dictionary", In Proceedings of the 6th Workshop on South and Southeast Asian Natural Language Processing (WSSANLP), COLING 2016, December 11-17, 2016, Osaka, Japan, pp. 11–22. [Paper](https://github.com/ye-kyaw-thu/myG2P/blob/master/reference/G2P4Myanmar_WSSANLP_COLING2016.pdf) ## Workshop Presentation Title: Grapheme-to-IPA Phoneme Conversion for Burmese (myG2P Version 2.0) Workshop: [the 2nd joint Workshop on NLP/AI R&D](https://isai-nlp-aiot2020.aiat.or.th/the-2nd-joint-myanmar-thai-nlp-ai-rd-workshop/), [iSAI-NLP 2020](https://isai-nlp-aiot2020.aiat.or.th/), Bangkok, Thailand. Authors: Honey Htun (YTU, Myanmar), Ni Htwe Aung (YTU, Myanmar), Shwe Sin Moe (YTU, Myanmar), Wint Theingi (YTU, Myanmar), Nyein Nyein Oo (YTU, Myanmar), Thepchai Supnithi (NECTEC, Thailand) and Ye Kyaw Thu (NECTEC, Thailand) ## Journal Paper to appear ## Reference 1. Myanmar-English Dictionary (1993), Department of the Myanmar Language Commission, Ministry of Education, Union of Myanmar. 2. https://en.wikipedia.org/wiki/International_Phonetic_Alphabet

近期下载者

相关文件


收藏者