omegat-tokenizer-voikko
所属分类:特征抽取
开发工具:Java
文件大小:78KB
下载次数:0
上传日期:2020-05-05 08:18:59
上 传 者:
sh-1993
说明: omegat标记器voikko,,
(omegat-tokenizer-voikko,,)
文件列表:
CHANGELOG.md (144, 2020-05-05)
COPYING (35147, 2020-05-05)
build.gradle (2239, 2020-05-05)
config (0, 2020-05-05)
config\checkstyle (0, 2020-05-05)
config\checkstyle\checkstyle.xml (7137, 2020-05-05)
config\checkstyle\header.txt (1138, 2020-05-05)
gradle.properties (272, 2020-05-05)
gradle (0, 2020-05-05)
gradle\wrapper (0, 2020-05-05)
gradle\wrapper\gradle-wrapper.jar (52818, 2020-05-05)
gradle\wrapper\gradle-wrapper.properties (237, 2020-05-05)
gradlew (5242, 2020-05-05)
gradlew.bat (2260, 2020-05-05)
settings.gradle (45, 2020-05-05)
src (0, 2020-05-05)
src\main (0, 2020-05-05)
src\main\java (0, 2020-05-05)
src\main\java\net (0, 2020-05-05)
src\main\java\net\briac (0, 2020-05-05)
src\main\java\net\briac\omegat (0, 2020-05-05)
src\main\java\net\briac\omegat\plugin (0, 2020-05-05)
src\main\java\net\briac\omegat\plugin\voikko (0, 2020-05-05)
src\main\java\net\briac\omegat\plugin\voikko\VoikkoGrammarCheck.java (3763, 2020-05-05)
src\main\java\net\briac\omegat\plugin\voikko\VoikkoInstance.java (1560, 2020-05-05)
src\main\java\net\briac\omegat\plugin\voikko\VoikkoIssue.java (4060, 2020-05-05)
src\main\java\net\briac\omegat\plugin\voikko\VoikkoSpellCheck.java (2191, 2020-05-05)
src\main\java\net\briac\omegat\plugin\voikko\VoikkoTokenizer.java (8659, 2020-05-05)
# Voikko Finnish Tokenizer for Omegat
## Description
This plugins adds a new `VoikkoTokenizer` in OmegaT. Because it relies on the [Voikko](https://voikko.puimula.org/) library it should gives more accurate tokenization than the default OmegaT tokenizer for language like Finnish.
It also includes Grammar verification and spellchecking.
## Installation
* You can download the plugin jar file from the [release page](../../releases). The OmegaT plugin jar should be placed in `$HOME/.omegat/plugins` (Linux), `~/Library/Preferences/OmegaT/plugins/` (macOS), or `%APPDATA%\OmegaT\plugins` (Windows) depending on your operating system.
* The Voikko library should also be installed on your system (see [this page](https://voikko.puimula.org/java.html) for detailed instructions).
* The Voikko dictionnaries (which can be [downloaded there](https://www.puimula.org/htp/testing/voikko-snapshot-v5/)) must be placed in the `$HOME/.omegat/voikko/dicts` directory. Files should be unzipped and directory structure kept (i.e. standard dictionary should be in `$HOME/.omegat/voikko/dicts/5/mor-standard`). Dictionaries in v.3 are supported as well,they need to be placed in `$HOME/.omegat/voikko/dicts/5` directory.
Once everything is installed, (re)start OmegaT, open your project, select Project → Properties → Source Language Tokenizer
### Windows library install
* Download the dll corresponding to your to you system (win32 or win***) from this page: https://www.puimula.org/htp/testing/voikko-sdk/win-crossbuild/libvoikko-4.1.1+win1/
* Copy the `libvoikko-1.dll` file in the folder `%APPDATA%\OmegaT\voikko`.
### macOS library install
* On macOS libvoikko is available in HomeBrew. It can be installed with `brew install libvoikko`. If the plugin fails to find the library, it might be necessary to copy its files to `~/Library/Preferences/OmegaT/voikko`: (`cp -r /usr/local/Cellar/libvoikko/4.3_2/lib/* ~/Library/Preferences/OmegaT/voikko/`).
## License
This project is distributed under the GNU general public license version 3 or later.
近期下载者:
相关文件:
收藏者: