ilscot
Topic Author
Posts: 3
Joined: August 22nd, 2020, 10:49 pm

need to extract pages from a pdf based on content

August 22nd, 2020, 10:55 pm

Good morning,
  I need to extract pages from a PDF (many thousands of pages long) based on the content of each page:
  some pages have transaction detail
  some pages have open position detail
  some pages have accounting detail (margins, etc.)

  Once extracted, I would like to save the extracted pages as a single (new) pdf.

  It appears the "Action Wizard" might have the ability to do this, IF I knew JavaScript.

Anyone??  THANK YOU in advance.
 
Alan
Posts: 10654
Joined: December 19th, 2001, 4:01 am
Location: California

Re: need to extract pages from a pdf based on content

August 23rd, 2020, 2:26 pm

I've used Adobe Acrobat over the years, but never automated it this way. But I see there are a number of Action downloads. If it's important, either bite the bullet and learn enough JavaScript or hire somebody. There's also an Adobe community forum for questions and tutorials.
 
ilscot
Topic Author
Posts: 3
Joined: August 22nd, 2020, 10:49 pm

Re: need to extract pages from a pdf based on content

December 1st, 2020, 5:37 pm

answering my own question (months later)

After much trial and error: economical and, importantly, effective commercial apps are available (vs. the Adobe 'solution'). If interested, contact me privately to avoid a commercial endorsement issue.
In addition, newer versions of Excel can (with some creative VBA) be made to cycle through and convert a series of PDF files. This is not as robust as the commercial apps, but effective for most uses.
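
For anyone who would rather script the task than buy an app, here is a minimal sketch in Python. It is not the method anyone in this thread used: it assumes the open-source pypdf library is installed and that the pages contain real (not scanned) text. The keyword phrases, function names, and file names are hypothetical placeholders; replace the phrases with text that actually appears on the pages you want.

```python
from typing import Dict, Iterable, List

# Hypothetical marker phrases, one list per page category from the question.
KEYWORDS: Dict[str, List[str]] = {
    "transactions": ["transaction detail"],
    "positions": ["open position"],
    "accounting": ["margin"],
}


def matching_pages(page_texts: Iterable[str], keywords: List[str]) -> List[int]:
    """Return 0-based indices of pages whose text contains any keyword (case-insensitive)."""
    wanted = [k.lower() for k in keywords]
    return [
        i for i, text in enumerate(page_texts)
        if any(k in text.lower() for k in wanted)
    ]


def extract_pages(src_path: str, dst_path: str, keywords: List[str]) -> int:
    """Copy every matching page of src_path into one new PDF; return the page count."""
    # Imported here so matching_pages() can be used without pypdf installed.
    from pypdf import PdfReader, PdfWriter

    reader = PdfReader(src_path)
    # extract_text() may yield nothing for image-only pages, so fall back to "".
    texts = [(page.extract_text() or "") for page in reader.pages]
    hits = matching_pages(texts, keywords)

    writer = PdfWriter()
    for i in hits:
        writer.add_page(reader.pages[i])
    with open(dst_path, "wb") as fh:
        writer.write(fh)
    return len(hits)
```

A call such as `extract_pages("statement.pdf", "positions.pdf", KEYWORDS["positions"])` would write the open-position pages to a single new file. Scanned statements would need OCR first, which this sketch does not attempt.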