Extract fields from PDF in bulk, using ABAP

5 pts.
Tags:
Microsoft Excel
PDF
SAP ABAP

Hi all,

We have a requirement from a client. They receive purchase orders in PDF format. Data like PO number / date. Ship to party etc has to be pulled out from the PDF and saved as an Excel file. This data will be used to post sale orders in SAP (using BAPI)

My questions:

  1. Can we write an ABAP program to read the PDF file and extract the field data?
  2. If the PDF files are not in a standard format, how difficult does the above task become?
  3. The POs can be in different languages as well?

Has anyone done the above for any of their projects? If so, kindly help me out.

Thanks and regards,

Madhu

Answer Wiki

Thanks. We'll let you know when a new response is added.
You may have to do it in steps. If you have Adobe Acrobat here are the directions from their site.

How to convert a PDF file to Excel:

  1. Open a file in Acrobat.

  2. Click on the Export PDF tool in the right pane.
  3. Choose spreadsheet as your export format, and then select Microsoft Excel Workbook.
  4. Click Export. If your PDF contains scanned text, Acrobat will run text recognition automatically.

  5. Name the Excel file and save it in a desired location.

Discuss This Question:  

 
There was an error processing your information. Please try again later.
Thanks. We'll let you know when a new response is added.
Send me notifications when members answer or reply to this question.

Forgot Password

No problem! Submit your e-mail address below. We'll send you an e-mail containing your password.

Your password has been sent to:

To follow this tag...

There was an error processing your information. Please try again later.

Thanks! We'll email you when relevant content is added and updated.

Following

Share this item with your network: