bulk insert behavior - impact on the package's performance


i'm seeing an increase in run times between client development world , production.  on client, using production volumes, package runs in hour.  on production runs 80-90 minutes.  both environments using remote file servers data sources.  production has luxury of not having cross server joins development requires 1 , still runs faster.  input data source contains 2.5 million records can produce many times number of errors on error table , call duplicates unchanged records on duplicates table.

i've checked see if indexes didnt make prod, no such luck.

so i'm starting @ opportunities efficiency , explanations differences.

i've got stats 1 or both environments specific things , focused on bulk inserts @ moment.  bulk inserts being requested in package via "table or view - fast load option" no lock, no check constraint, no keep identity, no keep nulls, no rows/batch , no max insert commit size.

when watch graphics of client pkg running,  appears records being inserted 3 bulk inserts (import, error, work) tables data moving thru buffer.  i'm guessing isnt started until of data ready insert.  true?

my dba ran profiler , saw 44 minutes elapsed  (they overlap)  in 3 pkg bulk inserts.   error table shared amongst number of apps (i make own staging one) import , work owned app. 

might better performance using different option inserts import , work tables if records allowed "trickle" these tables  as buffer moving?  maybe with lock turned on?   assuming error table is dealt somehow so doesnt become critical path completion of pkg?      

 

validating data bulk inserted prior insertion doesn't seem efficient.  feasible disable check constraints, bulk insert data, validate bulk inserted data using t-sql?  realize may involve alot of "heavy lifting" database engine.  however, imagine faster attempting validate data using client-side code.

hth

duane douglas, mcad | http://www.ssisbi.com | please mark post(s) answered question.


SQL Server  >  SQL Server Integration Services



Comments

Popular posts from this blog

Azure DocumentDB Owner resource does not exist

RFC_ERROR_SYSTEM_FAILURE with SAP ECC 6 Unicode

C# System.Data.Common DbCommand and getting Datasets from Oracle