MS Word (doc)
Requirements#
- The ldc-doc library.
-
The
antiword
binary available onPATH
-
Debian/Ubuntu:
sudo apt install antiword
- Windows: Softpedia
Plugins#
Extracting text from MS Word (.doc) documents#
ldc-convert \
-l INFO \
from-doc-pt \
-l INFO \
--input "./input/*.doc" \
to-txt-pt \
-l INFO \
--output "./output/"