Split

A Split node should be used when there is a need to split csv data processing. This might occur when csv data processing becomes excessively time consuming. Such time-consuming behavior might occur when you send big csv data to the flow input terminal or it might become memory intense when complex data validation is being performed. The Split node can be used with these, and with other data processing tasks that can be done in parallel.

The Split splits data records into batches. Each batch of csv data is processed in parallel and passed for further processing by the downstream nodes. Using the Split node can improve the performance and scalability of your flow design.