tesseract-ocr 4.0: installation, deployment, and training for CAPTCHA recognition

1. Download the latest version of leptonica (leptonica-1.74.1.tar.gz at the time of writing)

2. Compile and install leptonica

tar -zxvf leptonica-1.74.1.tar.gz
cd leptonica-1.74.1
./configure
make
sudo make install
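
To confirm that the library installed above is visible to later builds, a quick check like the following may help. It is a minimal sketch that assumes leptonica was installed under the default /usr/local prefix and that its pkg-config module is named lept; lept_version is a hypothetical helper name, not part of leptonica.

```shell
#!/bin/sh
# Hypothetical helper: print the leptonica version that pkg-config can see.
# Assumption: "sudo make install" used the default /usr/local prefix.
lept_version() {
    # Search /usr/local/lib/pkgconfig first, keeping any existing search path.
    PKG_CONFIG_PATH="/usr/local/lib/pkgconfig${PKG_CONFIG_PATH:+:$PKG_CONFIG_PATH}" \
        pkg-config --modversion lept 2>/dev/null && return 0
    echo "leptonica not found by pkg-config"
    return 1
}

# Usage: lept_version
```

If the helper reports that leptonica was not found, the tesseract configure step later on is likely to fail in the same way, so it is worth resolving here first.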

3. Install the required dependency libraries

sudo apt-get install autoconf automake libtool
sudo apt-get install autoconf-archive
sudo apt-get install pkg-config
# libpng12-dev is not available on newer Ubuntu releases; use libpng-dev there instead
sudo apt-get install libpng12-dev
sudo apt-get install libjpeg8-dev
sudo apt-get install libtiff5-dev
sudo apt-get install zlib1g-dev

# If you plan to install the training tools, you also need the following libraries:

sudo apt-get install libicu-dev
sudo apt-get install libpango1.0-dev
sudo apt-get install libcairo2-dev
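
Package names change between distribution releases, so it can be useful to see which of the packages above are still missing before starting the build. The following is a rough sketch for Debian/Ubuntu systems; missing_packages is a hypothetical helper name, and the check relies only on whether dpkg has a record of the package.

```shell
#!/bin/sh
# Hypothetical helper: print each named package that dpkg has no record of.
missing_packages() {
    for pkg in "$@"; do
        # dpkg -s exits non-zero when dpkg knows nothing about the package
        dpkg -s "$pkg" >/dev/null 2>&1 || echo "$pkg"
    done
}

# Usage: list whatever is still missing from the packages named above
# missing_packages autoconf automake libtool autoconf-archive pkg-config \
#     libpng12-dev libjpeg8-dev libtiff5-dev zlib1g-dev
```

An empty result suggests everything is in place; any name that is printed can be passed to sudo apt-get install.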

4. Download, compile, and install the latest version of tesseract-4.0

git clone --depth 1 https://github.com/tesseract-ocr/tesseract.git
cd tesseract
./autogen.sh
./configure --enable-debug
LDFLAGS="-L/usr/local/lib" CFLAGS="-I/usr/local/include" make
sudo make install
sudo ldconfig
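
The build above installs the engine, but recognition with -l eng also needs the matching language data file (eng.traineddata), which a source install does not necessarily provide. The sketch below only checks whether that file is present. It assumes the data lives in the directory named by TESSDATA_PREFIX, falling back to /usr/local/share/tessdata; the exact location may differ between tesseract builds, and has_traineddata is a hypothetical helper name.

```shell
#!/bin/sh
# Hypothetical helper: succeed if <lang>.traineddata exists in the data directory.
# Assumption: TESSDATA_PREFIX names the directory holding the .traineddata files,
# with /usr/local/share/tessdata as the fallback for a default source install.
has_traineddata() {
    lang="$1"
    dir="${TESSDATA_PREFIX:-/usr/local/share/tessdata}"
    [ -f "$dir/$lang.traineddata" ]
}

# Usage:
# has_traineddata eng || echo "eng.traineddata is missing from the tessdata directory"
```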

5. Usage

# Check the version number
tesseract -v

# List the languages tesseract supports
tesseract --list-langs

# Recognize the text in test.jpg; the result is written to out.txt
tesseract test.jpg out -l eng
more out.txt
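
When there are many images to process, such as a folder of CAPTCHA samples, the single command above can be wrapped in a loop. This is a minimal sketch, not a definitive script: ocr_dir is a hypothetical helper name, and it assumes the images end in .jpg and that the output text files should sit next to them.

```shell
#!/bin/sh
# Hypothetical helper: run tesseract on every .jpg in a directory.
# Each image foo.jpg produces foo.txt beside it, mirroring
# "tesseract test.jpg out -l eng" from the step above.
ocr_dir() {
    dir="$1"
    lang="${2:-eng}"                   # default to English when no language is given
    for img in "$dir"/*.jpg; do
        [ -e "$img" ] || continue      # skip the unexpanded pattern when no .jpg exists
        # ${img%.jpg} strips the extension; tesseract appends .txt itself
        tesseract "$img" "${img%.jpg}" -l "$lang" || echo "failed: $img" >&2
    done
}

# Usage: ocr_dir ./captchas eng
```

For CAPTCHA images that contain a single line of text, adding --psm 7 (treat the image as a single text line) to the tesseract call may be worth experimenting with.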
A small step every day, a giant step in life! Good luck~
Original article: https://www.cnblogs.com/jkmiao/p/6417167.html