Windows batch script to extract text from PDF

480 pts.
Tags:
Command line
DOS
Microsoft Windows
PDF
Programming Languages
Windows Batch Files
Windows Server 2008
I need to automate the conversion of a PDF file to text format, or extract the text from the PDF file. Is there a Windows batch script that is available to do this, or a good freeware that I can download.  One that is compatible to run on Windows XP, Vista or Windows server 2008. A freeware that supports command line operations as I will need to do the conversion from the command line only and NOT from the GUI. Any help will be greatly appreciated.

Answer Wiki

Thanks. We'll let you know when a new response is added.

Never used any of these but you could go through and see if they have what you are looking for.

Thanks Ramsey for your response.

I did find this site during my search, however I figure that they would not be useful because they seen to only support GUI interface and did not make any reference to supporting command line operations which is what I need.

I need to automate the conversation process so I can incorporate the command line operations into a script and run it that way.

I did find a limited amount that support this but needed to get some recommendation or input from someone who is already using a pdf conversation software.

Discuss This Question: 8  Replies

 
There was an error processing your information. Please try again later.
Thanks. We'll let you know when a new response is added.
Send me notifications when members answer or reply to this question.

REGISTER or login:

Forgot Password?
By submitting you agree to receive email from TechTarget and its partners. If you reside outside of the United States, you consent to having your personal data transferred to and processed in the United States. Privacy
  • TomLiotta
    Are you comfortable that the PDFs that you will process will always have text in them? Not all PDFs have text. Have you opened many PDFs with Notepad (or other basic text editor) to see what they look like inside? In particular, view a couple of the PDF examples that you would expect to process in the future to be sure of what your desired utility needs to handle. Tom
    125,585 pointsBadges:
    report
  • Kccrosser
    Have you checked out http://www.a-pdf.com/text/? This site claims to do PDF to text and claims to have a command line version.
    3,830 pointsBadges:
    report
  • jutpro
    Yes I did check this site, http://www.a-pdf.com/text/, and found that only the GUI verion is free.
    480 pointsBadges:
    report
  • RamseyB
    what about this one. Looks all command based. You will need to download their free software but that is just so you can point to in in cmd converter
    report
  • RamseyB
    Or just move to linux lol
    2,115 pointsBadges:
    report
  • RamseyB
    Damn cost money too- another one. The top one listed says Command Lines freeware- http://www.adultpdf.com/products/txttopdf/index.html
    2,115 pointsBadges:
    report
  • jutpro
    Thanks for the site RamseyB. I will check out the freeware version and see how it works. Sorry, moving to Linux is not an option, but certainly not a bad suggestion.
    480 pointsBadges:
    report
  • Genderhayes
    Upload a document file and click on "Convert file". After a short time you will be able to download your converted text document. If you have a PDF file with scans or images with text, select the OCR functionality to enable character recognition.
    7,950 pointsBadges:
    report

Forgot Password

No problem! Submit your e-mail address below. We'll send you an e-mail containing your password.

Your password has been sent to:

To follow this tag...

There was an error processing your information. Please try again later.

REGISTER or login:

Forgot Password?
By submitting you agree to receive email from TechTarget and its partners. If you reside outside of the United States, you consent to having your personal data transferred to and processed in the United States. Privacy

Thanks! We'll email you when relevant content is added and updated.

Following