Split Node

A Split Node should be used when there is a need to split CSV data processing. This might occur when CSV data processing becomes excessively time consuming.

Such time-consuming behavior might occur when you send big CSV data to the flow input terminal or it might become memory intense when complex data validation is being performed. The Split Node can be used with these, and with other data processing tasks that can be done in parallel.

The Split Node splits data records into batches. Each batch of CSV data is processed in parallel and passed for further processing by the downstream nodes. Using the Split node can improve the performance and scalability of your flow design.

The Split node now supports the splitting JSON Arrays. If a JSON document contains a large JSON arrays, it is possible to speed up the JSON document processing by using the Split node, user can specify path to the JSON array inside the JSON document that should be split by the Split node and processed in parallel by downstream nodes.