Upland OL User community

Workflow, separation, and arrival of files

Good morning,
Here is the process I use to generate the production files.
The separation is created via a task creation, it is parameterized by 1000 sheets.
In the case where we have a file of 50,000 sheets, all the production files arrive at the same time at the end of the process.
Est-il possible de les faire apparaitre au fur et à mesure ?

Hello @JulienBat,

To achieve this you’d need to first split your data into chunk.

If I understand correctly, should I split the csv files upstream?
Wouldn’t there be a setting to add to the process?

You would need to use a data splitter, if this is what you mean by “setting”.
How is the data working? Is it 1 CSV line per record? Or is there grouping done on a field change?

It’s a flat file in csv, one line per record, there is no grouping, I send the entire file. “By parameter”, I meant to add an action to the workflow process, is that necessary so that split PDFS arrive gradually?

@JulienBat: note that if you create 10 000 separate jobs (because your CSV has 10 000 lines), the entire production will be much less efficient than processing a single job and splitting it through Job Creation. So you will be slowing down the production in order to see each file appear one by one, instead of all at once.

Perhaps you could split your CSV file into larger chunks (for instance, 50 lines per chunk) so that each chunk would generate 50 documents at once? This would have less of an impact on performance.

If you could explain a bit more why you need those files to be generated one by one, it would help us determine if there are better methods for processing the jobs.

1 Like

Hello Phil,
I work in sending advertising mailings, we send mailings for printing.
We mainly work in imposition from flat csv files, and for production constraints we have to separate the PDFs into batches of 1000 sheets.
Currently we are waiting for the end of the process to recover all the print files.
I wanted to know if it was possible for them to arrive gradually, so that production could start in parallel.
I hope my explanations will be clear?

If your template creates a single sheet per record, then you can split your data file into chunks that contain 1000 lines each. This will create PDFs sequentially, and you can start processing the first ones before the last ones have been produced.

But if your template creates a variable number of sheets for each record, then you have to process the entire file at once because the imposition depth (1000 sheets, in your case) might fall in the middle of a record.

Thank you for your explanations, it seems clear to me