Example:1file_a.csv:2item, quantity3apple, 34orange, 95banana, 2Copied!1file_b.csv:2item, quantity3pear, 94apple, 35banana, 2Copied!After processingfile_a.csv
, if we addfile_b.csv
to the same directory and run a job, only the row containingpear, 9
will be processed, asapple, 3
andbanana, 2
were already seen whenfile_a.csv
was processed. The same applies within the same file - if we'd addedpear, 9
tofile_a.csv
instead of creatingfile_b.csv
, the net result would be the same:pear, 9
would be the only new row.
Example:1file_a.csv:2item, quantity3apple, 34orange, 95banana, 2Copied!1file_b.csv:2item, quantity3pear, 94apple, 35banana, 2Copied!After processingfile_a.csv
, if we addfile_b.csv
to the same directory and run a job, all three records infile_b.csv
will be processed. If instead we'd added those records tofile_a.csv
, the duplicated records (apple, 3
,banana, 2
) would be skipped, and the new recordpear, 9
would be the only new record processed.