MSPSUG February 2017 Virtual Meeting – Parsing PDF Files with PowerShell

Our February and March meetings will be postponed by one week from our normal meeting date due to scheduling conflicts. They will each be held on the third Tuesday of the month (February 21st and March 21st).

For the February 2017 Mississippi PowerShell User Group virtual meeting,  Rohn Edwards will be presenting “Parsing PDF Files with PowerShell” on Tuesday, February 21st  at 8:30pm central time.

Parsing text from any source can be a hassle. Parsing text from a PDF file, though, can sometimes seem impossible. In this session, we’ll cover using iTextSharp, an open source library for working with PDFs, to handle simple to intermediate text parsing from PowerShell. We’ll start with simple documents that have very little formatting, and move up to extracting columns and tables from pages. While there’s no single generic tool for extracting readable text from all PDFs with formatting, hopefully this session will get you familiar enough with the process to allow you to create your own tools for the job.

Rohn Edwards is the winner of the advanced category in the 2012 Scripting Games and co-founder of the Mississippi PowerShell User Group.

The Mississippi PowerShell User Group Meetings are held online (via Skype for Business) on the second Tuesday of each month at 8:30pm Central Time and are free to attend. The system requirements to attend these online meetings can be found on the MSPSUG website under the “Attendee Info” section.

Register via EventBrite to receive the URL for this meeting.


Leave a Reply

This site uses Akismet to reduce spam. Learn how your comment data is processed.