I have input that consists of invoices of two different types (type 1 or 2. They are to many different account numbers. There may be any number (including 0) of either type of invoice to a particular account number.
The goal is that the invoices should be printed, sorted by account number, subsorted by invoice type: For any given account number the type1(s) should be come first, the type2 next. However, if there are type2 invoices that do not have at least one matching type1, those should not be printed. (Actually the output is a PDF, but you get the idea.)
I have all the invoices formatted in a single PDF, sorted by account number, subsorted by invoice type, and have the account number and invoice type printed in the margin in the same place on every page.
So how to finish up by filtering out invoice type 2s that have no matching type 1?
My thought was to read the PDF into a datamapper, separate into records based on change of accountnumber. That (I think) gives one record per account number?
I should then be able to delete any record that is not invoicetype1 on the first page. But do I do this with some sort of filter in a preset? Or is this something best done in the workflow?
It seems simple, but I’m not sure how to proceed. Advice, please?
First, thanks for the advice on how to do this with scripts in workflow. That seems a good workaround to the issue. I also have been told that the filter is not the optimum way to do this: the best way is to have a mapper that opens the sorted PDF, breaks it into “records” on change of accountnumber (without extracting, just look for a change in a region of the PDF); then a true/false branch if a region of the PDF contains “doctype1”; If true, extract fields. If not true, action step “skip record”.
This yields metadata where all the records (which might actually be several page consisting of several invoices all with the same account number) begin with doctype 1, and any records that begin with any other doctype have no metadata extracted.
Then when I build a new PDF from this, I should get the results I want.
However, every record is printed, paying no attention to whether there is metadata for the record or not.
I can’t see why it would print. Any ideas?