OCR

所属分类:模式识别(视觉/语音等)
开发工具:Visual C++
文件大小:120KB
下载次数:475
上传日期:2008-04-15 21:03:56
上 传 者duoduoz
说明:  一个有名的开源OCR,值得下载研究和学习
(A well-known open source OCR, worthy of study and learning download)

文件列表:
OCR\arg_parser.cc (5640, 2007-06-29)
OCR\arg_parser.h (3469, 2007-06-29)
OCR\AUTHORS (111, 2004-12-28)
OCR\bitmap.cc (12836, 2007-06-29)
OCR\bitmap.h (2898, 2007-06-29)
OCR\blob.cc (7020, 2007-06-29)
OCR\blob.h (1930, 2007-06-29)
OCR\ChangeLog (4521, 2007-06-29)
OCR\character.cc (10024, 2007-06-29)
OCR\character.h (3006, 2007-06-29)
OCR\character_r11.cc (15499, 2007-06-29)
OCR\character_r12.cc (9159, 2007-06-29)
OCR\character_r13.cc (2294, 2007-06-29)
OCR\common.cc (4659, 2007-06-29)
OCR\common.h (2433, 2007-06-29)
OCR\configure (6027, 2007-06-29)
OCR\COPYING (35068, 2007-06-29)
OCR\doc\ocrad.1 (2073, 2007-06-29)
OCR\doc\ocrad.info (10578, 2007-06-29)
OCR\doc\ocrad.texinfo (10029, 2007-06-29)
OCR\examples\test.pbm (55451, 2005-03-12)
OCR\examples\test.txt (169, 2005-03-11)
OCR\feats.cc (9230, 2007-06-29)
OCR\feats.h (2370, 2007-06-29)
OCR\feats_test0.cc (30953, 2007-06-29)
OCR\feats_test1.cc (8642, 2007-06-29)
OCR\INSTALL (1492, 2006-10-31)
OCR\main.cc (14313, 2007-06-29)
OCR\Makefile.in (4124, 2006-12-30)
OCR\NEWS (330, 2007-06-29)
OCR\ocrad.png (219, 2006-01-18)
OCR\page_image.cc (8198, 2007-06-29)
OCR\page_image.h (3849, 2007-06-29)
OCR\page_image_io.cc (14962, 2007-06-29)
OCR\page_image_layout.cc (10489, 2007-06-29)
OCR\profile.cc (18287, 2007-06-29)
OCR\profile.h (2351, 2007-06-29)
OCR\rational.cc (5843, 2007-06-29)
OCR\rational.h (5490, 2007-06-29)
... ...

GNU Ocrad is an OCR (Optical Character Recognition) program based on a feature extraction method. It reads images in pbm (bitmap), pgm (greyscale) or ppm (color) formats and produces text in byte (8-bit) or UTF-8 formats. The pbm, pgm and ppm formats are collectively known as pnm. Ocrad includes a layout analyser able to separate the columns or blocks of text normally found on printed pages. See the file INSTALL for compilation and installation instructions. Try "ocrad --help" for usage instructions. Caveats. For better results the characters should be at least 20 pixels high. Merged characters are always a problem. Try to avoid them. Very bold or very light (broken) characters are also a problem. Always see with your own eyes the pnm file before blaming ocrad for the results. Remember the saying, "garbage in, garbage out". Ideas, comments, patches, donations (hardware, money, etc), etc, are welcome. --------------------------- Debug levels ( option -D ) 100 - Show raw block list. 99 - Show recursive block list. *** - Show main block list. 96..97 - reserved. 95 - Show all blocks from every character before recognition. 94 - Show main black blocks from every character before recognition. 90..93 - reserved. 89 - Show all blocks from every character. 88 - Show main black blocks from every character. 87 - Show guess list for every character. 86 - Show best guess for every character. 80..85 - reserved. 78..79 - reserved. 7X - X = 0 Show page as bitmap. X = 1 Show page as bitmap with marked zones. X = 2 Show page as bitmap with marked lines. X = 4 Show page as bitmap with marked characters. --------------------------- OCR Results File (ORF) Calling ocrad with option -x produces an orf file, that is, a parsable file containing the ocr results. The format is as follows: - All lines starting with '#' are ignored. - The first valid line has the form 'source file filename'. Where 'filename' is the name of the file being processed ('-' for stdin). This is the only line guaranteed to exist for every input file read without errors. If the file, or any block or line, has no text, the corresponding part in the orf file will be missing. - The second valid line has the form 'total text blocks n'. Where 'n' is the total number of text blocks in the source image. For each text block in the source image, the following data follows: - A line in the form 'text block i x y w h'. Where 'i' is the block number and 'x y w h' are the block position and size as described below for character boxes. - A line in the form 'lines n'. Where 'n' is the number of lines in this block. For each line in every text block, the following data follows: - A line in the form 'line i chars n height h'. Where 'i' is the line number, 'n' is the number of characters in this line, and 'h' is the mean height of the characters in this line (in pixels). - N lines (one for every character) in the form "x y w h b; g[, 'c'v]...". 'x' = the left border (x-coordinate) of the char bounding box in the source image (in pixels). 'y' = the top border (y-coordinate). 'w' = the width of the bounding box. 'h' = the height of the bounding box. 'b' = the percent of black pixels in the bounding box. 'g' = the number of different recognition guesses for this character. The result characters follow after the number of guesses in the form of a comma-separated list of pairs. Every pair is formed by the actual recognised char enclosed in single quotes, followed by the confidence value, without space between them. The higher the value of confidence, the more confident is the result. Running './ocrad -x test.orf examples/test.pbm' in the source directory will give you an example orf file.

近期下载者

相关文件


收藏者