View on GitHub

Olena

Support for Indic Scripts in Olena (Scribo) Project

Download this project as a .zip file Download this project as a tar.gz file

Indic-OCR Logo

Olena

Indic OCR uses Scribo module of Olena for layout analysis.

Olena has dependencies on a number of packages.

Following is the list of DEB packages that we installed on our Ubuntu system to compile Olena.

sudo apt-get install build-essential
sudo apt-get install autotools-dev 
sudo apt-get install libtiff5-dev 
sudo apt-get install libpango1.0-dev
sudo apt-get install libleptonica-dev 
sudo apt-get install bibtex2html 
sudo apt-get install autoconf libtool doxygen hevea
sudo apt-get install swig
sudo apt-get install python-dev
sudo apt-get install texlive-latex-base
sudo apt-get install docutils docutils-common 
sudo apt-get install  docutils-common 
sudo apt-get install graphicsmagick-libmagick-dev-compat 
sudo apt-get install libcfitsio-dev
sudo apt-get install libfftw3-dev
sudo apt-get install libcfitsio3-dev  
sudo apt-get install git
sudo apt-get install libleptonica-dev 
sudo apt-get install tesseract-ocr
sudo apt-get install wget
sudo apt-get install libjpeg8-dev 
sudo apt-get install libjpeg-dev
sudo apt-get install libgif-dev
sudo apt-get install libicu-dev 
sudo apt-get install graphicsmagick-imagemagick-compat
sudo apt-get install libboost-dev
sudo apt-get install fop
sudo apt-get install xsltproc 
sudo apt-get install latex2html
sudo apt-get install qt4-default

Chackout the code from

git clone https://github.com/indic-ocr/olena.git

And compile it using following commands

./bootstrap
./configure --enable-scribo
make

For Indic Script support please install models from https://indic-ocr.github.io/tessdata/

Authors and Contributors

@rkvsraman