AramorpherCpp.0.9.0_2
Category: Windows programming
Development tool: C/C++
File size: 1084KB
Downloads: 3
Upload date: 2013-01-08 18:56:12
Uploader: mhidy
Description: Arabic morphology processing
File list:
AramorpherDictionary.h (9982, 2009-09-27)
AramorpherUtils.cpp (1437, 2009-09-25)
AramorpherUtils.h (2442, 2009-09-26)
CompatTable.cpp (2659, 2009-09-26)
CompatTable.h (2797, 2009-09-26)
Doxyfile (55203, 2009-09-28)
getopt.c (3903, 2009-02-21)
getopt.h (609, 2009-02-18)
gpl.txt (18349, 2009-02-04)
Makefile (2143, 2009-09-26)
utf8.h (1555, 2009-04-15)
docs\html (0, 2009-09-28)
docs\html\annotated.html (4297, 2009-09-28)
docs\html\ArabicEncodings_8cpp.html (1713, 2009-09-12)
docs\html\ArabicEncodings_8h-source.html (13390, 2009-09-28)
docs\html\ArabicEncodings_8h.html (5590, 2009-09-28)
docs\html\arabic__8859__6_8cpp.html (2003, 2009-09-28)
docs\html\arabic__8859__6_8h-source.html (22343, 2009-09-28)
docs\html\arabic__8859__6_8h.html (4047, 2009-09-28)
docs\html\arabic__buckwalter_8cpp.html (1873, 2009-09-28)
docs\html\arabic__buckwalter_8h-source.html (16580, 2009-09-28)
docs\html\arabic__buckwalter_8h.html (3084, 2009-09-28)
docs\html\arabic__cp1256_8cpp.html (1994, 2009-09-28)
docs\html\arabic__cp1256_8h-source.html (23619, 2009-09-28)
docs\html\arabic__cp1256_8h.html (3984, 2009-09-28)
docs\html\arabic__utf8_8cpp.html (1874, 2009-09-28)
docs\html\arabic__utf8_8h-source.html (25104, 2009-09-28)
docs\html\arabic__utf8_8h.html (4535, 2009-09-28)
docs\html\AramorpherApp_8cpp.html (10508, 2009-09-28)
docs\html\AramorpherCppUnitTestDefs_8h-source.html (3740, 2009-09-12)
docs\html\AramorpherCppUnitTestDefs_8h.html (3580, 2009-09-12)
docs\html\AramorpherDefs_8h-source.html (18258, 2009-09-28)
docs\html\AramorpherDefs_8h.html (11050, 2009-09-28)
docs\html\AramorpherDictionary_8cpp.html (2179, 2009-09-28)
docs\html\AramorpherDictionary_8h-source.html (25572, 2009-09-28)
docs\html\AramorpherDictionary_8h.html (7423, 2009-09-28)
docs\html\AramorpherUtilsTest_8cpp.html (1530, 2009-09-12)
docs\html\AramorpherUtilsTest_8h-source.html (5110, 2009-09-12)
... ...
# README FOR ARAMORPHERCPP
# Note that this file is also used to generate the doxygen documentation main page, and the information below will be
# displayed there.
/**
* \file README
* \brief This file is used by doxygen to generate the main page, and contains an overview of AramorpherCpp.
*
* \mainpage AramorpherCpp
*
* \section what_is What is AramorpherCpp?
*
* AramorpherCpp is a C++ implementation of the Buckwalter Arabic Morphological Analyzer (Version 1.0) by Tim Buckwalter. It is released under the GNU GPL (included).
*
* AramorpherCpp provides two types of functionality. The first is as a stand-alone application. Compile the project using the included Makefile,
* and you can use the resulting binary the same way you use the original aramorph.pl Perl script (BAMA Version 1.0). It has some additional features available, though;
* use the "-h" option to see them. The advantage of using the application rather than the original Perl script (besides the extra features) is, of course, that being
* compiled in C++, it should run much faster.
*
* The second use of AramorpherCpp is as a library to include in your own C++ application. By using the classes contained here, you can create an Arabic morphological analyzer that can be
* loaded into memory and queried from your own code, much more efficiently than relying on a Perl script that must reload all the dictionaries and compatibility tables for each
* file to be analyzed.
*
* \section usage Usage
*
* If you just want a morphological analyzer that duplicates the aramorpher.pl Perl script, run the executable. (The Makefile is set up to generate this by default.)
*
* If you want to write your own C++ application and use Aramorpher methods, then all you have to do is include and compile these files in your project. The API is documented using doxygen.
*
* You can also compile the Aramorpher library as a static or shared library and then link to it. (I have tested this on Windows and Linux; the Makefile isn't set up to do it by default,
* but the procedure should be straightforward.)
*
* \section reqs Requirements and Dependencies
*
* The public distribution of Aramorpher comes with precompiled binaries for various platforms. In order to compile and use Aramorpher
* yourself, or develop your own applications using this library, a few additional resources will be required.
*
* \subsection bama BAMA Data files
*
* Aramorpher requires the BAMA Version 1.0 data files (dictionaries and compatibility tables). These are not included; they are available for free from the Linguistic Data Consortium:
*
* http://www.ldc.upenn.edu/Catalog/CatalogEntry.jsp?catalogId=LDC2002L49
*
* By default, Aramorpher assumes the BAMA data files are located in the current directory, but a different directory can be set
* (with the -d option for the executable, or by specifying it in the constructor when creating an Aramorpher object).
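*
* As a rough illustration of the default-versus-override behaviour described above, the sketch below picks a data directory from
* a list of command-line arguments and builds a path to one data file. The helper names resolveDataDir and dataFilePath are
* hypothetical and are not part of the AramorpherCpp API; the actual executable and constructor may handle this differently.
*
```cpp
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical helper (not part of AramorpherCpp): returns the value that
// follows a "-d" option, or "." (the current directory) if none is given.
std::string resolveDataDir(const std::vector<std::string>& args) {
    std::string dir = ".";  // default: current directory, as the README states
    for (std::size_t i = 0; i + 1 < args.size(); ++i) {
        if (args[i] == "-d") {
            dir = args[i + 1];
            ++i;  // skip the value that was just consumed
        }
    }
    return dir;
}

// Hypothetical helper: joins a directory and a file name, adding a
// separator only when the directory does not already end with one.
std::string dataFilePath(const std::string& dir, const std::string& fileName) {
    if (dir.empty() || dir.back() == '/' || dir.back() == '\\') {
        return dir + fileName;
    }
    return dir + "/" + fileName;
}
```
*
* For example, with the arguments "aramorpher -d /opt/bama", resolveDataDir yields "/opt/bama", and
* dataFilePath("/opt/bama", "dictStems") yields "/opt/bama/dictStems". The directory "/opt/bama" is made up for this example.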
*
* \subsection utf8_cpp utf8 cpp library
*
* Aramorpher makes use of the utf8 cpp library, by Nemanja Trifunovic. This is included.
*
* \subsection boost Boost
*
* Aramorpher makes use of the Boost string and regex libraries. Thus, you must have Boost installed on your system (and you will have to set
* the locations appropriately in the Makefile) in order to compile it. The stand-alone executable only requires the Boost DLL/Shared Object file if compiled to use dynamic libraries;
* if compiled to use the Boost static libraries, you only need Boost to compile, but can then distribute the application even to environments without Boost installed.
*
* \section how_does_it_work How does AramorpherCpp Work?
*
* The Arabic morphological analysis is exactly the same as in the original BAMA script, the details of which can be found in that release's documentation.
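*
* For readers without that documentation at hand, the general idea can be sketched as follows: a word (in Buckwalter transliteration)
* is split into every possible prefix + stem + suffix combination, each part is looked up in its dictionary, and a combination is kept
* only if the three compatibility tables allow the prefix-stem, prefix-suffix and stem-suffix category pairs. The code below is a toy
* illustration under stated assumptions, not the AramorpherCpp implementation: the class ToyAnalyzer, the category names and the handful
* of entries are invented for this example, and the length limits (4 characters for prefixes, 6 for suffixes) are believed to match the
* original script but should be verified against it.
*
```cpp
#include <cstddef>
#include <map>
#include <set>
#include <string>
#include <utility>
#include <vector>

using Lexicon = std::multimap<std::string, std::string>;            // form -> category
using CompatTable = std::set<std::pair<std::string, std::string>>;  // allowed category pairs

struct Solution {
    std::string prefix, stem, suffix;
};

// Toy stand-in for an analyzer: three dictionaries plus three compatibility tables.
struct ToyAnalyzer {
    Lexicon prefixes, stems, suffixes;
    CompatTable ab, ac, bc;  // prefix-stem, prefix-suffix, stem-suffix

    std::vector<Solution> analyze(const std::string& word) const {
        std::vector<Solution> out;
        const std::size_t n = word.size();
        // Try every split that leaves a stem of at least one character.
        for (std::size_t p = 0; p <= 4 && p < n; ++p) {
            for (std::size_t s = 0; s <= 6 && p + s < n; ++s) {
                const std::string pre = word.substr(0, p);
                const std::string stem = word.substr(p, n - p - s);
                const std::string suf = word.substr(n - s);
                const auto pr = prefixes.equal_range(pre);
                const auto st = stems.equal_range(stem);
                const auto su = suffixes.equal_range(suf);
                for (auto a = pr.first; a != pr.second; ++a) {
                    for (auto b = st.first; b != st.second; ++b) {
                        if (!ab.count(std::make_pair(a->second, b->second))) continue;
                        for (auto c = su.first; c != su.second; ++c) {
                            if (ac.count(std::make_pair(a->second, c->second)) &&
                                bc.count(std::make_pair(b->second, c->second))) {
                                out.push_back(Solution{pre, stem, suf});
                            }
                        }
                    }
                }
            }
        }
        return out;
    }
};

// Builds an analyzer with a few made-up entries (category names are invented).
ToyAnalyzer makeToyAnalyzer() {
    ToyAnalyzer t;
    t.prefixes = {{"", "PREF_NONE"}, {"w", "PREF_CONJ"}, {"Al", "PREF_DET"}};
    t.stems = {{"ktAb", "NOUN"}};
    t.suffixes = {{"", "SUFF_NONE"}, {"y", "SUFF_POSS"}};
    t.ab = {{"PREF_NONE", "NOUN"}, {"PREF_CONJ", "NOUN"}, {"PREF_DET", "NOUN"}};
    // The definite article is deliberately not paired with the possessive suffix.
    t.ac = {{"PREF_NONE", "SUFF_NONE"}, {"PREF_NONE", "SUFF_POSS"},
            {"PREF_CONJ", "SUFF_NONE"}, {"PREF_CONJ", "SUFF_POSS"},
            {"PREF_DET", "SUFF_NONE"}};
    t.bc = {{"NOUN", "SUFF_NONE"}, {"NOUN", "SUFF_POSS"}};
    return t;
}
```
*
* With these toy tables, analyzing "wktAby" yields a single solution (prefix "w", stem "ktAb", suffix "y"), while "AlktAby" yields none,
* because the prefix-suffix table has no entry pairing PREF_DET with SUFF_POSS. Building the tables once and calling analyze repeatedly
* also mirrors the load-once, query-many use of the library described earlier.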
*
* \section versions Version 1.0 vs. Version 2.0
*
* The Buckwalter Arabic Morphological Analyzer comes in two versions: Version 1.0 and Version 2.0. Version 2.0 has improved analysis capabilities and dictionaries. Both are available
* from the LDC, but unfortunately, only Version 1.0 is free and released under the GNU GPL. Version 2.0 requires an LDC membership, and derivative code is not freely distributable.
*
* Therefore, I am unable to distribute the C++ implementation of BAMA Version 2.0. However, if you have access to BAMA Version 2.0 yourself, you should find that modifying AramorpherCpp to work
* with it is not terribly difficult.
*
* \section documentation Documentation
*
* AramorpherCpp comes with a doxygen configuration file named "Doxyfile". If you run the doxygen tool (available at www.doxygen.org) on this file,
* it will generate a complete API, class diagrams, etc. This README file will become the main page of the documentation.
*
* \section aramorpherpl Aramorpher.pl
*
* As part of this SourceForge project, I have also created aramorpher.pl, which is just an improved version of the original aramorph.pl Perl script
* (BAMA Version 1.0). It has additional features (also implemented in AramorpherCpp) for those who want to stick with the Perl version.
*
* Questions? Comments? Bugfixes? David Edelstein: amadan@users.sourceforge.net
*
* \author David Edelstein
*/