Skip to content

MS Word (doc)

Requirements#

  • The ldc-doc library.
  • The antiword binary available on PATH

  • Debian/Ubuntu: sudo apt install antiword

  • Windows: Softpedia

Plugins#

Extracting text from MS Word (.doc) documents#

ldc-convert \
  -l INFO \
  from-doc-pt \
    -l INFO \
    --input "./input/*.doc" \
  to-txt-pt \
    -l INFO \
    --output "./output/"