wordlists

所属分类:collect
开发工具:Java
文件大小:0KB
下载次数:0
上传日期:2018-03-08 14:03:11
上 传 者sh-1993
说明:  no intro
(Wordlists dictionary for Burmese (Myanmar),)

文件列表:
LICENSE (474, 2018-03-08)
nrclist.list (4918, 2018-03-08)
postal code/ (0, 2018-03-08)
postal code/MyanmarPostalCode.AyeYarWaddy.csv (3741, 2018-03-08)
postal code/MyanmarPostalCode.Bago.csv (4650, 2018-03-08)
postal code/MyanmarPostalCode.Chin.csv (1309, 2018-03-08)
postal code/MyanmarPostalCode.Kayah.csv (568, 2018-03-08)
postal code/MyanmarPostalCode.Kayin.csv (1097, 2018-03-08)
postal code/MyanmarPostalCode.Khchin.csv (3055, 2018-03-08)
postal code/MyanmarPostalCode.Magway.csv (5195, 2018-03-08)
postal code/MyanmarPostalCode.Mandalay.csv (5218, 2018-03-08)
postal code/MyanmarPostalCode.Mon.csv (2644, 2018-03-08)
postal code/MyanmarPostalCode.Naypyitaw.csv (700, 2018-03-08)
postal code/MyanmarPostalCode.Rakhine.csv (3144, 2018-03-08)
postal code/MyanmarPostalCode.Sagaing.csv (6418, 2018-03-08)
postal code/MyanmarPostalCode.Shan.csv (5107, 2018-03-08)
postal code/MyanmarPostalCode.Tanintharyi.csv (1900, 2018-03-08)
postal code/MyanmarPostalCode.Yangon.csv (1369, 2018-03-08)
select.php (2173, 2018-03-08)
wikitionary/ (0, 2018-03-08)
wikitionary/WordListExtractor/ (0, 2018-03-08)
wikitionary/WordListExtractor/.classpath (301, 2018-03-08)
wikitionary/WordListExtractor/.project (393, 2018-03-08)
wikitionary/WordListExtractor/.settings/ (0, 2018-03-08)
wikitionary/WordListExtractor/.settings/org.eclipse.jdt.core.prefs (598, 2018-03-08)
wikitionary/WordListExtractor/bin/ (0, 2018-03-08)
wikitionary/WordListExtractor/bin/com/ (0, 2018-03-08)
wikitionary/WordListExtractor/bin/com/minthanthtoo/ (0, 2018-03-08)
wikitionary/WordListExtractor/bin/com/minthanthtoo/wordlists/ (0, 2018-03-08)
wikitionary/WordListExtractor/bin/com/minthanthtoo/wordlists/Main.class (489, 2018-03-08)
wikitionary/WordListExtractor/bin/com/minthanthtoo/wordlists/WikitionaryExtractor.class (4142, 2018-03-08)
wikitionary/WordListExtractor/data/ (0, 2018-03-08)
wikitionary/WordListExtractor/data/mywiktionary20150901pagesarticlesmultistream.xml (81, 2018-03-08)
wikitionary/WordListExtractor/data/mywiktionary20150901pagesarticlesmultistream.xml.out (1741258, 2018-03-08)
wikitionary/WordListExtractor/src/ (0, 2018-03-08)
wikitionary/WordListExtractor/src/com/ (0, 2018-03-08)
wikitionary/WordListExtractor/src/com/minthanthtoo/ (0, 2018-03-08)
wikitionary/WordListExtractor/src/com/minthanthtoo/wordlists/ (0, 2018-03-08)
wikitionary/WordListExtractor/src/com/minthanthtoo/wordlists/Main.java (419, 2018-03-08)
... ...

Kanaung-Wordlists ================= Wordlists dictionary for Burmese (Myanmar) Under construction We have built Burmese wordlists from Myanmar Letter Ka (U+1000) "" to Myanmar Letter A (U+1021) "". Currently some words are not in order and duplicate words occur. We will fix these errors after completing "Burmese Sorting". Don't hesitate if you want to help with it. Sources ======= Currently,all these words were taken from "Burmese Spelling Book", officially published in 2003 by Myanmar Department of Education Ministry. "( - )" . We got a PDF file and detected it was encoded in standardized Unicode 5.1 or later. Modifications ============= 1) As usual, PDF extraction cannot correctly detect text alignments, so some words are not in order, and ending-letters, such as Asat (U+103A) "", Lower Vowel (U+1030) "" are missing and we had to add manually these letters. 2) We consider the final lists to be clean and simple for other programming and research uses. This is why we removed all annotations explaining the correct usage of the words Purposes ======== 1) For dictionary writers, these wordlists will be a useful source. 2) For NLP(Natural Language Processing) researchers, it may be essential in several NLP works utilizing dictionary-lookup approach, such as POS-tagging, building N-grams, Myanmar-English bilingual corpora, applications in Myanmar OCR, etc. Future ========= 1) We'll update the lists with new words 2) Burmese sorting and related tools will be developed for several platforms.

近期下载者

相关文件


收藏者