Olena
Indic OCR uses the Scribo module of Olena for layout analysis.
Olena depends on a number of packages.
The following DEB packages are the ones we installed on our Ubuntu system to compile Olena:
sudo apt-get install build-essential
sudo apt-get install autotools-dev
sudo apt-get install libtiff5-dev
sudo apt-get install libpango1.0-dev
sudo apt-get install libleptonica-dev
sudo apt-get install bibtex2html
sudo apt-get install autoconf libtool doxygen hevea
sudo apt-get install swig
sudo apt-get install python-dev
sudo apt-get install texlive-latex-base
sudo apt-get install docutils docutils-common
sudo apt-get install graphicsmagick-libmagick-dev-compat
sudo apt-get install libcfitsio-dev
sudo apt-get install libfftw3-dev
sudo apt-get install libcfitsio3-dev
sudo apt-get install git
sudo apt-get install tesseract-ocr
sudo apt-get install wget
sudo apt-get install libjpeg8-dev
sudo apt-get install libjpeg-dev
sudo apt-get install libgif-dev
sudo apt-get install libicu-dev
sudo apt-get install graphicsmagick-imagemagick-compat
sudo apt-get install libboost-dev
sudo apt-get install fop
sudo apt-get install xsltproc
sudo apt-get install latex2html
sudo apt-get install qt4-default
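Before compiling, it can help to confirm which of the packages above are actually installed. The following is a minimal sketch, assuming a Debian or Ubuntu system where `dpkg-query` is available; the function name `missing_packages` is hypothetical, and the package names passed to it are a subset of the list above.

```shell
# Print every package name given as an argument that dpkg does not
# report as fully installed. (missing_packages is a hypothetical helper.)
missing_packages() {
    for pkg in "$@"; do
        if ! dpkg-query -W -f='${Status}' "$pkg" 2>/dev/null \
                | grep -q "install ok installed"; then
            echo "$pkg"
        fi
    done
}

# Example: check a few of the build dependencies listed above.
missing_packages build-essential autoconf libtool swig libleptonica-dev libboost-dev
```

Any name it prints can then be installed with `sudo apt-get install`. Package names differ between Ubuntu releases, so a name reported as missing may simply be unavailable under that name on your release.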
Check out the code:
git clone https://github.com/indic-ocr/olena.git
Then change into the cloned olena directory and compile it using the following commands:
./bootstrap
./configure --enable-scribo
make
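The three build steps can also be wrapped so that a failure in one step stops the rest. This is only a sketch: the helper names `job_count` and `build_olena` are hypothetical, and the parallel `make -j` invocation assumes that Olena's makefiles build correctly in parallel, which we have not verified. Fall back to plain `make` if it fails.

```shell
# Number of parallel jobs: the CPU count if nproc is available, otherwise 1.
job_count() {
    nproc 2>/dev/null || echo 1
}

# Run the build steps in order, stopping at the first failure.
# Call this from the top of the cloned olena directory.
build_olena() {
    ./bootstrap &&
    ./configure --enable-scribo &&
    make -j"$(job_count)"
}
```

After sourcing these definitions, running `build_olena` inside the olena directory performs the same steps as the commands above.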
For Indic script support, please install the models from https://indic-ocr.github.io/tessdata/
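Once the models are installed, one way to confirm that Tesseract can see them is to inspect the output of `tesseract --list-langs`. The sketch below assumes the model is registered under the language code `hin` (Tesseract's usual code for Hindi). The codes used by the models you install may differ, and the helper name `lang_listed` is hypothetical.

```shell
# Succeed if the language code $1 appears as a whole line on stdin.
lang_listed() {
    grep -qx -- "$1"
}

# Only query Tesseract if it is installed. Some Tesseract versions print
# the language list on stderr, so both streams are captured.
if command -v tesseract >/dev/null 2>&1; then
    if tesseract --list-langs 2>&1 | lang_listed hin; then
        echo "hin model found"
    else
        echo "hin model not found"
    fi
fi
```

If the model is not found, check that the model files were copied into the tessdata directory that your Tesseract installation reads.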