Edinburgh

Category: Speech synthesis
Development tool: C++
File size: 1937KB
Downloads: 415
Upload date: 2005-10-28 15:07:52
Uploader: 名利不如闲
Description: The University of Edinburgh's speech processing system, which contains most of the algorithms needed for speech processing, recognition and synthesis.

File list:
speech_tools\.time-stamp (50, 1999-12-02)
speech_tools\audio\audioP.h (3629, 1999-10-15)
speech_tools\audio\esd.cpp (5314, 1999-10-04)
speech_tools\audio\gen_audio.cpp (11606, 1999-10-15)
speech_tools\audio\irixaudio.cpp (2523, 1998-07-31)
speech_tools\audio\Makefile (3419, 1999-10-15)
speech_tools\audio\mplayer.cpp (4180, 1999-05-30)
speech_tools\audio\nas.cpp (6531, 1999-10-15)
speech_tools\audio\os2audio.cpp (7042, 1999-06-01)
speech_tools\audio\sun16audio.cpp (10408, 1999-09-29)
speech_tools\audio\voxware.cpp (10541, 1999-09-29)
speech_tools\audio\win32audio.cpp (5183, 1999-05-30)
speech_tools\audio (0, 2005-07-13)
speech_tools\base_class\EST_Complex.cpp (4391, 1999-05-30)
speech_tools\base_class\EST_DMatrix.cpp (21087, 1999-05-30)
speech_tools\base_class\EST_Featured.cpp (4303, 1999-09-07)
speech_tools\base_class\EST_FeatureData.cpp (14295, 1999-05-30)
speech_tools\base_class\EST_Features.cpp (7316, 1999-09-07)
speech_tools\base_class\EST_features_aux.cpp (4060, 1999-06-10)
speech_tools\base_class\EST_features_io.cpp (10038, 1999-05-30)
speech_tools\base_class\EST_FMatrix.cpp (22886, 1999-06-10)
speech_tools\base_class\EST_get_function_template.h (4064, 1999-05-30)
speech_tools\base_class\EST_ilist_aux.cpp (4129, 1999-05-30)
speech_tools\base_class\EST_IMatrix.cpp (3444, 1999-05-30)
speech_tools\base_class\EST_matrix_support.cpp (5423, 1999-05-30)
speech_tools\base_class\EST_matrix_support.h (3175, 1999-05-30)
speech_tools\base_class\EST_Option.cpp (6638, 1999-05-30)
speech_tools\base_class\EST_Pathname_unix.cpp (6651, 1999-06-01)
speech_tools\base_class\EST_Pathname_win32.cpp (6800, 1999-06-14)
speech_tools\base_class\EST_slist_aux.cpp (6917, 1999-05-30)
speech_tools\base_class\EST_SMatrix.cpp (4287, 1999-05-30)
speech_tools\base_class\EST_StringTrie.cpp (5996, 1999-05-30)
speech_tools\base_class\EST_svec_aux.cpp (4206, 1999-05-30)
speech_tools\base_class\EST_TBuffer.cpp (5452, 1999-05-30)
speech_tools\base_class\EST_TDeque.cpp (6765, 1999-06-02)
speech_tools\base_class\EST_THash.cpp (7673, 1999-11-04)
speech_tools\base_class\EST_TKVL.cpp (9120, 1999-06-02)
speech_tools\base_class\EST_TList.cpp (5270, 1999-05-30)
speech_tools\base_class\EST_TMatrix.cpp (15064, 1999-05-30)
speech_tools\base_class\EST_TNamedEnum.cpp (10617, 1999-06-03)
... ...

EDINBURGH SPEECH TOOLS LIBRARY
VERSION 1.2.1
November 1999

Principal authors: Paul Taylor, Alan W Black, Richard Caley and Simon King

Centre for Speech Technology Research
University of Edinburgh
80 South Bridge
Edinburgh EH1 1HN, UK
http://www.cstr.ed.ac.uk/projects/speech_tools.html

Although this is considered a stable release, a number of parts are still known to be missing or incomplete.

------------------------------------------------------------------------

The Edinburgh speech tools system is a library of C++ classes, functions and utility programs that are frequently used in speech software. The system compiles to a single Unix library (.a) file which can be linked with other software. At present, C++ classes for several useful speech and language classes have been written, along with audio software and some basic signal processing software.

========================================================================

Edinburgh Speech Tools Library version 1.2.1
Centre for Speech Technology Research
University of Edinburgh, UK
Copyright (c) 1994-1999
All Rights Reserved.

Permission is hereby granted, free of charge, to use and distribute this software and its documentation without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of this work, and to permit persons to whom this work is furnished to do so, subject to the following conditions:

1. The code must retain the above copyright notice, this list of conditions and the following disclaimer.
2. Any modifications must be clearly marked as such.
3. Original authors' names are not deleted.
4. The authors' names are not used to endorse or promote products derived from this software without specific prior written permission.

THE UNIVERSITY OF EDINBURGH AND THE CONTRIBUTORS TO THIS WORK DISCLAIM ALL WARRANTIES WITH REGARD TO THIS SOFTWARE, INCLUDING ALL IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS, IN NO EVENT SHALL THE UNIVERSITY OF EDINBURGH NOR THE CONTRIBUTORS BE LIABLE FOR ANY SPECIAL, INDIRECT OR CONSEQUENTIAL DAMAGES OR ANY DAMAGES WHATSOEVER RESULTING FROM LOSS OF USE, DATA OR PROFITS, WHETHER IN AN ACTION OF CONTRACT, NEGLIGENCE OR OTHER TORTIOUS ACTION, ARISING OUT OF OR IN CONNECTION WITH THE USE OR PERFORMANCE OF THIS SOFTWARE.

========================================================================

The following C++ programs are available:

na_play: generic playback program for use with net_audio and CSTR ao.
ch_wave: waveform file conversion program.
ch_lab: label file conversion program.
ch_track: track file conversion program.
wagon: a CART tree build and test program.

And others.

The following C++ sub-libraries are available:

audio: C++ audio functions for Network Audio System, Suns, Linux and FreeBSD.
speech_class: C++ speech classes, including waveform and track.
ling_class: C++ linguistic classes.
sigpr: signal processing.
utils: various utilities.

COPYING
------------------------------------------------------------------------

Some files in the distribution do not fall under the above copyright. However, none have any commercial restrictions, and no files fall under the GPL. Specifically, the files that have different licences are as follows:

siod/
include/siod*.h
lib/siod/siod.scm
    Apart from some new functionality under our licence, the SIOD files fall under the licence given in siod/slib.cc, copyright Paradigm Associates. This code has been substantially modified by us, though our changes are under that licence.

editline.c
editline.h
el_complete.c
el_sysunix.c
el_unix.h
    Are copyright 1992 Simmule Turner and Rich Salz under the "BSD like" licence in siod/editline.c. Again, substantial changes have been made by us, which continue to be under that licence.

rxp/
include/rxp/
    Is Copyright Richard Tobin. We have a specific version of RXP which allows free (including non-commercial) use.

base_class/rateconv.cc
    Is copyright 1992, 1995 by Markus Mummert.

base_class/string/reg*.{c,cc}
    Copyright (c) 1986 by University of Toronto.

base_class/string/EST_strcasecmp.c
    Copyright (c) 1987, 1993 The Regents of the University of California. All rights reserved. The standard BSD licence.

ACKNOWLEDGEMENTS
------------------------------------------------------------------------

Most of the software was written by Paul Taylor, Alan W Black, Simon King and Richard Caley. Additional contributions have been made by the following people. Slightly different licence agreements accompany these pieces of code, so it is important that the various agreements be properly understood.

Tony Robinson: some signal processing code.
Markus Mummert: sample rate converter (rateconv.cc).
Craig Reese and Joe Campbell: ulaw conversion code.
Paul Bagshaw: pitch tracker.
Theo Veenker: IRIX audio support.
Stan Chen: AIX support.
Paradigm Assoc. and George Carrette: for Scheme In One Defun.
Simmule Turner and Rich Salz: for editline.
Henry Spencer: for regex code.
The Regents of the University of California: some string comparison code.
Richard Tobin: RXP, XML parser.

INSTALLATION
************

See the INSTALL file for details.

FILE/DIRECTORY STRUCTURE
------------------------------------------------------------------------

The main directory contains minimal wrap-around main() functions which compile to form executables that use the algorithms in the speech tools. All these main() functions really do is read command line options and read and write files; the algorithms themselves are completely contained within the libraries.

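The split between thin wrapper programs and library code described above can be sketched roughly as follows. This is only an illustration of the pattern, not code from the speech tools: scale_samples, parse_options, run_tool and the -gain option are all hypothetical names invented for this sketch, and the real programs also read and write files, which is left out here.

```cpp
#include <cstddef>
#include <cstdlib>
#include <map>
#include <string>
#include <vector>

// Hypothetical "library" routine standing in for an algorithm that would
// live in one of the sub-libraries: multiply every sample by a gain factor.
std::vector<double> scale_samples(const std::vector<double>& samples, double gain) {
    std::vector<double> out;
    out.reserve(samples.size());
    for (double s : samples) {
        out.push_back(s * gain);
    }
    return out;
}

// Hypothetical option parser: collects "-name value" pairs into a map.
std::map<std::string, std::string> parse_options(const std::vector<std::string>& args) {
    std::map<std::string, std::string> opts;
    for (std::size_t i = 0; i + 1 < args.size(); i += 2) {
        if (!args[i].empty() && args[i][0] == '-') {
            opts[args[i].substr(1)] = args[i + 1];
        }
    }
    return opts;
}

// Body of a thin wrapper: interpret the options, then hand the data to the
// library routine. No signal processing happens in the wrapper itself.
std::vector<double> run_tool(const std::vector<std::string>& args,
                             const std::vector<double>& input) {
    const std::map<std::string, std::string> opts = parse_options(args);
    double gain = 1.0;  // default when no -gain option is given
    const auto it = opts.find("gain");
    if (it != opts.end()) {
        gain = std::strtod(it->second.c_str(), nullptr);
    }
    return scale_samples(input, gain);
}
```

A real main() in this style would build the argument list from argv, load the input from a file, call something like run_tool, and write the result back out, so the algorithm stays reusable from the library.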
The directory structure is as follows:

config: shared make rules and configuration make rules.
speech_class: general speech class files, including waveforms, contours etc.
ling_class: general linguistic class files, including a stream-based architecture.
sigpr: signal processing code.
doc: basic documentation; generates HTML, info and PostScript.
include: class and general header files.
lib: where the .a files are kept.
main: directory containing main files which compile to executables.

DOCUMENTATION
------------------------------------------------------------------------

There are two sources of documentation. Man pages exist for the more important stand-alone programs. These are found in the main directory.

The library documentation has substantially improved in this version, using an integrated DOC++, JADE and DocBook combination. Classes are described at the overview, theoretical, practical and code level. The build process for the documentation still depends on some more obscure packages, so it is recommended that you take the pre-built documentation or access it online through http://www.cstr.ed.ac.uk/projects/speech_tools.html

The library documentation is in texinfo format, which can be converted into GNU info, HTML and PostScript; see the Makefile in doc/. Note that the documentation is not up to date.

BUGS/COMMENTS ETC
------------------------------------------------------------------------

Substantial parts of this release are considered mature and should be relatively free of bugs; however, there are newer parts which have not yet been rigorously tested. If you find any bugs, please send a bug report to speechtools@cstr.ed.ac.uk. Any useful comments will also be appreciated. See also the bugs file in the doc directory.

If you have some software that you think could be integrated into the speechtools system, please contact us at speechtools@cstr.ed.ac.uk.

The whole point of writing and releasing code such as this is to prevent duplication of effort, and we would therefore greatly appreciate any code which adds to the usefulness of the system. Signal processing routines are particularly welcome.

======================================================================
ADDITIONAL SOFTWARE
======================================================================

CSTR supplies a number of different speech technology systems, including speech synthesis systems and (in the future) a recognition system. For more information, see http://www.cstr.ed.ac.uk/
