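CentOS 5.5: Converting PDF to TXT
Install djvulibre-3.5.24
Download: http://djvu.sourceforge.net/
Download the archive, extract it, and run the following in the extracted directory: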
# ./configure
# make && make install && make clean
Install poppler-0.18.3.tar.gz
Download: http://poppler.freedesktop.org/
# ./configure --enable-xpdf-headers --disable-poppler-qt4 --disable-cms --disable-poppler-cpp
# make && make install && make clean
Errors encountered while installing poppler:
Package requirements (fontconfig >= 2.0.0) were not met:
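Fix: install fontconfig-devel with yum, then run configure again; it should now pass.
The --enable-xpdf-headers option is required at configure time. Without it, building pdf2djvu-0.7.12 fails with: fatal error: GfxState.h: No such file or directory
Install pdf2djvu-0.7.12
Download: http://code.google.com/p/pdf2djvu/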
# ./configure --prefix=/usr/local/pdf2djvu-0.7.12 --without-graphicsmagick --without-xslt
# make && make install && make clean
You can also specify which compiler to use:
# ./configure CFLAGS="-O3" CC="gcc462" CXXFLAGS="-O3" CXX="g++462" --prefix=/usr/local/pdf2djvu-0.7.12 --without-graphicsmagick --without-xslt
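A minimal sketch of choosing the compiler automatically, assuming the newer compilers are installed under the names gcc462 and g++462 used above. The helper pick_compiler is a hypothetical name introduced here for illustration. It only prints the configure command instead of running it.

```shell
# pick_compiler is a hypothetical helper: it prints the first of the
# given command names that exists on PATH, or fails if none does.
pick_compiler() {
    for name in "$@"; do
        if command -v "$name" >/dev/null 2>&1; then
            echo "$name"
            return 0
        fi
    done
    return 1
}

# Prefer the newer compilers, fall back to the system ones.
CC=$(pick_compiler gcc462 gcc cc) || echo "no C compiler found" >&2
CXX=$(pick_compiler g++462 g++ c++) || echo "no C++ compiler found" >&2

# Show the resulting command; run it by hand in the pdf2djvu source directory.
echo "./configure CFLAGS=\"-O3\" CC=\"$CC\" CXXFLAGS=\"-O3\" CXX=\"$CXX\" --prefix=/usr/local/pdf2djvu-0.7.12 --without-graphicsmagick --without-xslt"
```

Printing the command first makes it easy to see which compiler was picked before a long build starts.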
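Set the pkg-config search path, which configure uses to locate the installed libraries. Set this way, it applies only to the current shell: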
export PKG_CONFIG_PATH=/usr/local/lib/pkgconfig:$PKG_CONFIG_PATH
export PKG_CONFIG_PATH=/usr/lib/pkgconfig:$PKG_CONFIG_PATH
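The two exports can be wrapped in a small function that skips directories already on the path and avoids a trailing colon when PKG_CONFIG_PATH starts out empty. This is only a sketch; prepend_pkg_config_path is a hypothetical name, not part of pkg-config.

```shell
# prepend_pkg_config_path is a hypothetical helper: it puts a directory
# at the front of PKG_CONFIG_PATH unless it is already listed.
prepend_pkg_config_path() {
    dir=$1
    case ":${PKG_CONFIG_PATH:-}:" in
        *":$dir:"*) ;;  # already present, nothing to do
        *) PKG_CONFIG_PATH="$dir${PKG_CONFIG_PATH:+:$PKG_CONFIG_PATH}" ;;
    esac
    export PKG_CONFIG_PATH
}

# Same directories and same order as the two export lines above.
prepend_pkg_config_path /usr/local/lib/pkgconfig
prepend_pkg_config_path /usr/lib/pkgconfig
echo "PKG_CONFIG_PATH=$PKG_CONFIG_PATH"
```

To keep the setting across logins in bash, the same lines can be appended to ~/.bash_profile.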
Error handling:
checking for DJVULIBRE... no
configure: error: Package requirements (ddjvuapi >= 3.5.17) were not met:
No package 'ddjvuapi' found
Set the pkg-config search path so that the djvulibre installed under /usr/local can be found:
export PKG_CONFIG_PATH=/usr/local/lib/pkgconfig:$PKG_CONFIG_PATH
configure reports that the POPPLER version is too old:
checking for POPPLER... no
configure: error: Package requirements (poppler-splash >= 0.7.3) were not met:
Download and install a newer version.
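Both configure failures above come from pkg-config lookups, so they can be checked before running configure again. A minimal sketch follows; check_pkg is a hypothetical helper, and the two module specs are taken from the error messages above.

```shell
# check_pkg is a hypothetical helper: it asks pkg-config whether a module
# spec such as 'ddjvuapi >= 3.5.17' is satisfied.
check_pkg() {
    if ! command -v pkg-config >/dev/null 2>&1; then
        echo "pkg-config not found on PATH"
        return 2
    fi
    if pkg-config --exists "$1"; then
        echo "ok: $1"
    else
        echo "missing: $1"
        return 1
    fi
}

# The same requirements that configure tests for.
for spec in 'ddjvuapi >= 3.5.17' 'poppler-splash >= 0.7.3'; do
    check_pkg "$spec" || echo "hint: check PKG_CONFIG_PATH or install a newer version"
done
```

If a module is reported missing even though the library is installed, the directory holding its .pc file is probably not on PKG_CONFIG_PATH.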
GfxState.h cannot be found:
fatal error: GfxState.h: No such file or directory
Reinstall poppler-0.18.3.tar.gz, adding the --enable-xpdf-headers option to configure.
The gcc compiler is too old:
g++ -fopenmp -g -O2 -Wall -Werror=overloaded-virtual -pthread -I/usr/local/include -I/usr/local/include/poppler -c -o config.o config.cc
cc1plus: error: unrecognized command line option "-Werror=overloaded-virtual"
To install a newer gcc, see "CentOS 5.5 手動安裝 GCC4.6.2" (Manually installing GCC 4.6.2 on CentOS 5.5): http://www.linuxidc.com/Linux/2014-01/95388.htm
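Before building, the compiler version can be compared against a minimum. The sketch below assumes that the -Werror= form needs GCC 4.2 or later; that threshold is an assumption and is not stated in these notes. version_ge is a hypothetical helper.

```shell
# version_ge is a hypothetical helper: it succeeds when dotted version $1
# is greater than or equal to $2, comparing field by field.
version_ge() {
    a=$1
    b=$2
    while [ -n "$a" ] || [ -n "$b" ]; do
        x=${a%%.*}
        y=${b%%.*}
        x=${x:-0}
        y=${y:-0}
        [ "$x" -gt "$y" ] && return 0
        [ "$x" -lt "$y" ] && return 1
        # Drop the field just compared; stop when no dot is left.
        case $a in *.*) a=${a#*.} ;; *) a= ;; esac
        case $b in *.*) b=${b#*.} ;; *) b= ;; esac
    done
    return 0
}

# Assumed minimum version for -Werror=...; adjust if it turns out to differ.
min_gcc=4.2
if command -v gcc >/dev/null 2>&1; then
    have=$(gcc -dumpversion)
    if version_ge "$have" "$min_gcc"; then
        echo "gcc $have looks new enough"
    else
        echo "gcc $have is older than $min_gcc; install a newer gcc first"
    fi
fi
```

The helper avoids `sort -V`, which older coreutils releases may not provide.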
Usage:
# ./pdf2djvu -o /data/sphinx_doc_zhcn_0.9.djvu /data/sphinx_doc_zhcn_0.9.pdf
# djvutxt /data/sphinx_doc_zhcn_0.9.djvu /data/sphinx_doc_zhcn_0.9.txt
djvutxt is located in /usr/bin by default.
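The two steps can be chained for several files at once. A minimal sketch, assuming pdf2djvu and djvutxt are both on PATH; pdf_to_txt is a hypothetical helper that uses only the options shown above.

```shell
# pdf_to_txt is a hypothetical helper: it converts one PDF to DjVu and then
# extracts the text, writing both files next to the input.
pdf_to_txt() {
    pdf=$1
    base=${pdf%.pdf}
    pdf2djvu -o "$base.djvu" "$pdf" && djvutxt "$base.djvu" "$base.txt"
}

# Convert every PDF passed on the command line.
if [ "$#" -gt 0 ]; then
    for f in "$@"; do
        pdf_to_txt "$f" || echo "failed: $f" >&2
    done
fi
```

Saved under a name of your choice, for example pdf2txt.sh, it could be run as: sh pdf2txt.sh /data/sphinx_doc_zhcn_0.9.pdf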