Approach to loading multiple data sources sequentially within one job

Former Member
0 Kudos

Hi there,

I'm working on a job that needs to load four data sources in order; if any one of them fails, the job should still continue to the end.

To do that, I use a series of workflows connected sequentially. Within each workflow there is one data flow that loads one data source into the target table. The target table is composed of those four data sources, so in each data flow I have to use Table_Comparison to make sure the fields are filled based on the primary key. This approach satisfies the requirements; however, each workflow runs for quite a while and the whole chain takes too much time. Does anyone have an idea on how to improve the performance, or another strategy to handle this case? Thanks a lot!
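Conceptually, each data flow is doing an upsert against the target on the primary key. A rough SQL equivalent, with hypothetical table and column names, would be:

-- Rough equivalent of one DF's Table_Comparison + loader (hypothetical
-- names): update rows whose keys match, insert the rest.
MERGE INTO TARGET_TABLE tgt
USING SOURCE_1 src
  ON tgt.KEY1 = src.KEY1 AND tgt.KEY2 = src.KEY2
WHEN MATCHED THEN
  UPDATE SET tgt.ATTR_A = src.ATTR_A, tgt.ATTR_B = src.ATTR_B
WHEN NOT MATCHED THEN
  INSERT (KEY1, KEY2, ATTR_A, ATTR_B)
  VALUES (src.KEY1, src.KEY2, src.ATTR_A, src.ATTR_B);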

Screenshots of each layer are attached below: the WF chain, the inside of each WF, and the inside of each DF.

Accepted Solutions (1)

former_member187605
Active Contributor
0 Kudos

Check my document for a discussion on optimal settings for Table_Comparison.

Former Member
0 Kudos

Hi Dirk,

Actually, I have already checked the doc you mentioned above. I sorted the source on the primary key and then selected 'Sorted input' in the T_C. However, it did not help. My data set is not as huge as 1 million rows; in fact, it's only roughly 300,000. I have no idea why the T_C took so long.

Thanks for your help!

Cindy

former_member187605
Active Contributor
0 Kudos

I am not convinced the TC is the culprit.

  1. Is the sort pushed down to the source? If not, it is performed by DS, which may be time-consuming.
  2. How much faster is the data flow without the TC? The extraction process might be the bottleneck.
Former Member
0 Kudos

Hi Dirk,

How is the sort pushed down to the source? I checked the source with "View data" and found that the source data is already sorted by one of the PKs. The way I implemented it is Query -> Order by -> the two PKs. And in the T_C, I put the two PKs on the left and the other fields on the right as the compare columns. Is this the correct way?

former_member187605
Active Contributor
0 Kudos

That seems like the right approach indeed. In fact, that's how it always works for me with larger data volumes.

You mentioned your input is 300K records. Could you post some extra absolute figures?

  • execution time of every data flow
  • time it takes to load the first set of data without Table_Comparison

Also, can you remove the temp table from your data flow? That might be another bottleneck.

Former Member
0 Kudos

Hi Dirk,

I tested the load without the T_C; apparently it went much faster than with the T_C. I also removed the temp table, but with the T_C still in place that did not help a lot.

I checked some docs (actually a lot); many people mentioned indexing the fields, but I haven't found more details on how to do it. Would you please give me some hints on indexing? Thanks so much!

Former Member
0 Kudos

I have also attached a screenshot of the log; the job has been running for more than one hour and the second WF is still processing. It's like it's running forever.

former_member187605
Active Contributor
0 Kudos

From my document on Table_Comparison:

Sorted Input:

Often the most efficient solution when dealing with large data sources, because DS reads the comparison table only once. This option can only be selected when it is guaranteed that the incoming data are sorted in exactly the same order as the primary key in the comparison table. In most cases incoming data must be pre-sorted, e.g. using a Query transform with an Order-by (that may be pushed down to the underlying database), to take advantage of this functionality.
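In SQL terms, a correctly pre-sorted input just means the incoming SELECT carries an ORDER BY on exactly the comparison table's primary key columns, in the same order. A minimal sketch with hypothetical names:

-- The ORDER BY must list the comparison table's PK columns, in PK order,
-- so DS can stream both sides through the TC in a single pass.
SELECT KEY1, KEY2, ATTR_A, ATTR_B
FROM STAGE_TABLE
ORDER BY KEY1, KEY2;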

From the log, it seems:

  1. You don't have a primary key in the comparison table. Therefore DS has to sort it first, caching it all in memory, a very time-consuming process.
  2. Your source is a content extractor, isn't it? And it doesn't produce the data in the correct order, so DS has to perform another sort operation.

Stage your input data in a database table with the correct primary key (you can use the Data_Transfer transform to that end) and make sure your comparison table has the same primary key column(s). Your job will then finish in seconds.
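As a sketch, assuming hypothetical column types, the staging table would be keyed exactly like the comparison table:

-- Hypothetical staging table keyed like the comparison table, so the
-- database can return the rows pre-sorted on the key.
CREATE TABLE SAPSTG.STG_MATERIAL_PLANT (
  MATNR VARCHAR(18) NOT NULL,
  WERKS VARCHAR(4)  NOT NULL,
  PSTAT VARCHAR(15),
  PRIMARY KEY (MATNR, WERKS)
);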

Former Member
0 Kudos

Hi Dirk,

I really appreciate your help. The reason I did not put a primary key on the comparison table is that one of the data sources includes NULL values in one field of the PK. So I removed the PK from the target table and used 'Identify key' as the PK. For now, I have just taken that data source out and put the PK back on the comparison table.

In addition, I used a stage table to sort the data source and then used the sorted stage table for the T_C. However, I got an error message on it. It seems like the 'Order by' does not work during the first DF; the stage table is still not sorted. Please see below. To resolve this error, I had to put another Query in front of the T_C (Order by one more time). The error is gone, but there is no improvement in performance compared with the old solution. Would you know where I went wrong? Thanks again!
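As an aside, a generic workaround for NULLs in a key column, sketched here with hypothetical names, would be to substitute a sentinel value in a Query before the comparison, so the column can stay part of the primary key:

-- Hypothetical sketch: replace NULL key values with a sentinel so the
-- column can participate in the PK and the sorted comparison.
SELECT KEY1,
       COALESCE(KEY2, '#') AS KEY2,
       ATTR_A
FROM SOURCE_4;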

former_member187605
Active Contributor
0 Kudos

Sorting in the first data flow is useless. You need the Order by in the 2nd, before the TC.

Can you try and do so? And post the generated SQL (from DS Designer menu: Validation > Display Optimized SQL...)?

Set the Degree of Parallelism (in Dataflow Properties) to 1, run the job, and post the Monitor output again too, if you want.

Former Member
0 Kudos

Hi Dirk,

Thanks for your patience. I added the Order by in the second DF, before the TC, and also changed the Degree of Parallelism to 1. Unfortunately, the performance has not improved. I checked the generated SQL; it looks like this:

In the first DF (load to the stage table):

SELECT MATNR , WERKS , PSTAT , LVORM , BWTTY , XCHAR , MMSTA , MMSTD , MAABC , (... all remaining fields ...) ODQ_CHANGEMODE , ODQ_ENTITYCNTR

FROM PLANT_ATTR_PJ_ECC_DATA_SQL

In the second DF (two SQL statements):

SELECT  "TCRdr_4"."MATNR"  ,  "TCRdr_4"."WERKS"  ,  "TCRdr_4"."PSTAT"  ,  "TCRdr_4"."LVORM"  ,  "TCRdr_4"."BWTTY"  ,  "TCRdr_4"."XCHAR"  ,  "TCRdr_4"."MMSTA"  (xxxx..................................................................................................................................................................................................................................................................................................)

FROM "SAPSTG"."MaterialPlant" "TCRdr_4"

ORDER BY  "TCRdr_4"."MATNR" COLLATE Latin1_General_BIN ASC ,  "TCRdr_4"."WERKS" COLLATE Latin1_General_BIN ASC

SELECT  "StagingPlantAttr"."MATNR"  ,  "StagingPlantAttr"."WERKS"  ,  "StagingPlantAttr"."PSTAT"  ,  "StagingPlantAttr"."LVORM"  ,  "StagingPlantAttr"."BWTTY"  ,  "StagingPlantAttr"."XCHAR"  ,  "StagingPlantAttr"."MMSTA"  ,  (xxxxxx..................................................................................................................)

ORDER BY  "StagingPlantAttr"."MATNR" COLLATE Latin1_General_BIN ASC ,  "StagingPlantAttr"."WERKS" COLLATE Latin1_General_BIN ASC

Thanks again!

former_member187605
Active Contributor
0 Kudos

This seems 100% correct, unless there's something wrong or inconsistent in your code page settings. The sort is pushed down to your data sources as binary, and (maybe - I cannot see, because you haven't posted the monitor output) DS sorts again because the key columns are character.
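For illustration (hypothetical example, SQL Server syntax): the same column sorts differently under a binary collation than under a dictionary collation, which is why a collation or code page mismatch can force DS to sort again on its own:

-- Binary collation: code-point order, uppercase before lowercase ('B' < 'a').
SELECT MATNR FROM "SAPSTG"."MaterialPlant"
ORDER BY MATNR COLLATE Latin1_General_BIN;

-- Case-insensitive dictionary collation: 'a' sorts before 'B'.
SELECT MATNR FROM "SAPSTG"."MaterialPlant"
ORDER BY MATNR COLLATE Latin1_General_CI_AS;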

Former Member
0 Kudos

Dirk,

I have replaced the 'Query' in the first DF with 'Data_Transfer'. It looks a little better, but overall it still takes around 30 minutes for the two data sources. I have attached the monitor output; apparently the first DF, which loads the data into the staging table (using Data_Transfer), took a while. Would you please suggest any further improvements, if possible? Really appreciated!

former_member187605
Active Contributor
0 Kudos

Yes, from that log output it seems the extraction process is terribly slow. That probably means there are no issues with the TC and the sorts are OK.

It's your BCE that's causing the problem. There's nothing you can do in DS about this. You should check with your Basis team whether they can improve the extractor performance.

Former Member
0 Kudos

Hi Dirk,

Thank you so much! After all these attempts, it now looks okay and the load speed is acceptable. I really appreciate your help!

Answers (5)

Former Member
0 Kudos

Did you try the Auto Correct Load option?

former_member199543
Contributor
0 Kudos

Hi

1. Make two separate flows: the first flow is Source table > Query > target table, and the second one contains the TC. By doing so, the first flow will be pushed down to the DB, so no job server is involved in the calculations.

2. How do you optimize the TC transform? It cannot be pushed down to the DB, so local resources are used. It depends on the source and target tables: if both are properly indexed according to your comparison options, performance should be fine. The best bet here would be to avoid the TC transform and simply use a script instead that compares your two tables; since there is no SCD2 needed, this is very easy to maintain (see the sketch below).
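A minimal sketch of such a compare script in plain SQL (SQL Server syntax, hypothetical staging and target tables keyed on KEY1 and KEY2, no SCD2 handling):

-- Hypothetical compare script: first update rows whose attributes changed...
UPDATE tgt
SET tgt.ATTR_A = src.ATTR_A
FROM TGT_TABLE tgt
JOIN STG_TABLE src
  ON tgt.KEY1 = src.KEY1 AND tgt.KEY2 = src.KEY2
WHERE tgt.ATTR_A <> src.ATTR_A;

-- ...then insert rows that are not in the target yet.
INSERT INTO TGT_TABLE (KEY1, KEY2, ATTR_A)
SELECT src.KEY1, src.KEY2, src.ATTR_A
FROM STG_TABLE src
WHERE NOT EXISTS (
  SELECT 1 FROM TGT_TABLE tgt
  WHERE tgt.KEY1 = src.KEY1 AND tgt.KEY2 = src.KEY2
);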

Custom scripts are always faster and better than predefined transformations.

former_member187605
Active Contributor
0 Kudos

This is definitely not true! I have literally hundreds of examples of transforms largely outperforming any script.

former_member199543
Contributor
0 Kudos

It depends on who did the scripting, right? BODS is good in general, but when it comes to more complex stuff, as you well might know, we might hit the limitations of graphical tools - BODS, SSIS, Informatica...

former_member187605
Active Contributor
0 Kudos

IMHO, it rather depends on who does the DS design.

DS is not a graphical tool at all; only DS Designer is. At runtime, "real" code is generated and executed. DS has been optimised to generate very efficient code, which is not always so easy to produce manually.

0 Kudos

You can also try using sorted input in the TC (remember to sort the data before the TC).

Former Member
0 Kudos

Hi Allan,

Thanks for the reply. I have already applied sorted input, but it doesn't help :(

former_member198401
Active Contributor
0 Kudos

Can you let me know the record count of the target table for each of the data sources? If the amount of data is not too huge, try using 'Cached Comparison Table' in the Table_Comparison; its performance is fine as long as the data volume stays moderate.

Regards

Arun Sasi

Former Member
0 Kudos

Hi Arun,

Thanks for the response. There are more than 300K records. I understand 'Cached Comparison Table' is supposed to be used with a data set much smaller than that. Is that right?

former_member198401
Active Contributor
0 Kudos

Yeah, Cindy!!

Better not to use the cached comparison method in this case. Do you have any lookups in your job? Also try creating indexes on the target table if possible; if there are more updates than inserts, indexes play a very important role in performance tuning.
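For example (hypothetical index and table names, key columns as in this thread), indexing the comparison key columns of the target table:

-- Hypothetical index on the target's comparison key columns, to speed up
-- the per-row lookups and updates during the load.
CREATE INDEX IX_TARGET_MATNR_WERKS
ON TARGET_TABLE (MATNR, WERKS);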

Regards

Arun Sasi

Former Member
0 Kudos

Hi Cindy,

Please refer to the document below to optimize your BODS job.

Thanks,

Surya B.