Commit Frequency

Hi Gurus,
   Can anybody help me with how to set the commit frequency in ODI? I have 5 million records to load, and it has to commit after every 10,000 records.

You can refer to my blog. I believe you can still optimize the KM; this is just the beginning.
http://dwteam.in/commit-frequency-in-odi/
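For context, the core pattern such a customized KM generates is a batched bulk load with an intermittent commit. Here is a minimal PL/SQL sketch of that pattern, assuming hypothetical tables SRC_TABLE and TGT_TABLE with identical structure (these names are not from the blog):

    declare
      cursor c_src is
        select * from src_table;                          -- hypothetical source
      type t_rows is table of src_table%rowtype;
      l_rows t_rows;
    begin
      open c_src;
      loop
        fetch c_src bulk collect into l_rows limit 10000; -- batch size = commit frequency
        exit when l_rows.count = 0;
        forall i in 1 .. l_rows.count
          insert into tgt_table values l_rows(i);         -- hypothetical target, same columns
        commit;                                           -- one commit per 10,000-row batch
      end loop;
      close c_src;
    end;
    /

In ODI itself you would achieve this by editing the IKM steps rather than hand-writing the loop; the sketch only shows the commit pattern the KM needs to produce.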

Similar Messages

  • Bulk Insert - Commit Frequency

    Hi,
    I am working on OWB 9.2.
    I have a mapping that inserts more than 90,000 records into a target table.
    When I execute the SQL query generated by OWB directly in the database, it returns the result in 6 minutes.
    But when I executed this OWB mapping in 'Set based fail over to row based' mode with bulk size = 50, commit frequency = 1000 and max errors = 50, it took almost 40 minutes. While this mapping was running I checked the target table's record count; it was increasing by 50 at a time and taking a long time to finish.
    What changes do I need to make to speed up these inserts?

    If it is inserting 50 rows at a time, it means that set-based mode is failing and the mapping is switching to row-based mode.
    If it always fails in set-based mode, it is better to run in row-based mode from the start, so as not to spend time on the set-based attempt.
    In row-based mode, you can increase the bulk size (to 5000, 10000 or more) for efficient execution. Keep the commit frequency the same as the bulk size.

  • Implementing COMMIT frequency

    I've been asked to create a procedure which should delete from a table those records which are older than a specified period (an input parameter), with a user-specified commit frequency (another input parameter).
    I have a column called created_time in mytable which I can use in the WHERE clause of the DELETE statement.
    create or replace procedure delete_mytable (
           p_commitfreq     in number,
           p_no_of_days_old in number )
    as
    begin
      DELETE FROM mytable WHERE created_time < sysdate - p_no_of_days_old;
      -- code to implement COMMIT frequency
    end;
    How can I implement the COMMIT frequency in this procedure? The client is going to COMMIT after every 35,000 rows, but they don't want this hard-coded. I have seen other threads on OTN where gurus say that including a COMMIT frequency is a bad thing. It just happens to be in the specs, which I don't get to design (we are actually converting a purge script written in C++ to PL/SQL).

    I have seen other threads in OTN where Gurus are saying that including COMMIT frequency is a bad thing.
    We consider the use of COMMIT in the middle of a business transaction to be bad practice for a number of reasons:
    (1) Multiple commits are generally slower.
    (2) It is frequently associated with an implementation consisting of record-by-record processing inside a PL/SQL cursor loop, when set-based processing is far more efficient.
    (3) It can be harder to recover from a failed business transaction if the data is in an inconsistent state.
    (4) Such implementations are prone to ORA-01002 and ORA-01555 errors.
    Now the question which occurs to me in your case is, "Why do your users care?" In fact, would they even notice if you implemented the procedure as you show in your post and just ignored p_commitfreq?
    However, if your conscience won't let you do that, the best solution would be something like this:
      << commit_loop >>
      loop
           delete from mytable
           where created_time < sysdate - p_no_of_days_old
           and rownum <= p_commitfreq;
           exit when sql%rowcount = 0;
           commit;
      end loop commit_loop;
      commit;
    This solution will be slower than a single-commit approach but avoids the other issues.
    Cheers, APC
    Blog : http://radiofreetooting.blogspot.com/

  • Can we control the Materialized view commit frequency

    Hi
    I am creating a huge MV that requires a lot of undo space when it is created initially or when I do a complete refresh. Can we control the commit interval, e.g. have the MV commit after every 1 million records during initialization?
    Any suggestions will be highly appreciated.

    No, you cannot do incremental commits when refreshing a materialized view. At least not if you want Oracle to refresh it for you. Even if you could, you almost certainly wouldn't want to. You would generate more undo in total trying to refresh 1 million rows at a time. And your data would be inconsistent. Plus, you'd have to build additional structures to track which rows had already been processed/updated so that Oracle could recover in the event that there is a failure in the middle of refreshing a materialized view.
    If you're not concerned by the inconsistent data, depending on the Oracle version and how you've configured the environment, along with whether you're doing incremental or complete refreshes, you could set atomic_refresh to FALSE so that Oracle truncates and reloads the data each time, which generates less UNDO but causes the materialized view to be empty during the refresh. Or you could set the materialized view to refresh incrementally, which may be substantially faster than doing a complete refresh.
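    For reference, a non-atomic complete refresh is just a parameter to DBMS_MVIEW.REFRESH (10g onwards); a minimal sketch, with a hypothetical materialized view name:

      begin
        -- method => 'C' requests a complete refresh; atomic_refresh => false
        -- lets Oracle truncate and reload via direct path, generating far
        -- less undo, but the MV appears empty while the refresh runs.
        dbms_mview.refresh(list           => 'MY_BIG_MV',   -- hypothetical MV name
                           method         => 'C',
                           atomic_refresh => false);
      end;
      /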
    Justin

  • COMMIT FREQUENCY / BULK SIZE ignored?

    OWB Client: 9.2.0.2.8
    OWB Repository: 9.2.0.2.0
    I have a simple SRC-to-TRG toy map which does an update (row based).
    I was hoping to achieve the following: if any error is encountered, then 1) abort and 2) roll back any changes.
    I could achieve 1) by setting MAX_ERRORS=0.
    For 2) I tried COMMIT_FREQUENCY = BULKSIZE = some_very_high_number.
    But the map always insists on updating at least some rows before encountering the first error (and then aborting).
    I tried changing bulk processing to true and got the same results. I changed UPDATE to INSERT => similar behavior.
    Why is the COMMIT_FREQUENCY = BULK SIZE setting being ignored? Or is this just a bug in my version?

    Hi,
    Good question... it should be the case as you describe (I think). I'd need to look at the code for this, but you may want to contact support on this one so they can reproduce this...
    Jean-Pierre

  • Commit after every 1000 records

    Hi dears,
    I have to update or insert around 1 lakh (100,000) records every day on an incremental basis.
    Currently the commit happens only after all the records are processed, so if some problem occurs in between, all my processed records get rolled back.
    I need it to commit after every batch of records, say every 1000 records.
    Does anyone know how to do it?
    Thanks in advance
    Regards
    Raja

    Raja,
    There is an option in the configuration of a mapping in which you can set the Commit Frequency. The Commit Frequency only applies to non-bulk-mode mappings; bulk-mode mappings commit according to the bulk size (which is also a configuration setting of the mapping).
    When you set the Default Operating Mode to row based and Bulk Processing Code to false, Warehouse Builder uses the Commit Frequency parameter when executing the package: it commits data to the database after processing the number of rows specified in this parameter.
    If you set Bulk Processing Code to true, set the Commit Frequency equal to the Bulk Size. If the two values are different, Bulk Size overrides the Commit Frequency and Warehouse Builder implicitly performs a commit after every bulk.
    Regards,
    Ilona

  • Correlated commit in OWB

    Hi,
    Can anyone explain how correlated commit works in OWB? Suppose I enable correlated commit in a mapping, and when running that mapping I pass a runtime parameter such as commit frequency = 10000.
    Will it commit every 10000 records in the target because of the commit frequency, or will it commit at the end of loading all records into the target because of the correlated commit? Please comment on this issue.
    Regards
    naren

    Commit Frequency and Correlated Commit are two independent configuration parameters. Setting Commit Frequency to 10000 will make sure that a commit happens after every 10000 records, irrespective of whether Correlated Commit is set or not.
    Commit Frequency is applicable only in row-based and row-based (target) modes.
    Correlated Commit is applicable when you have a single source populating multiple target tables in a single mapping. The OWB User Guide, chapter 11 (page 11-16), has a very good explanation of how correlated commit works.
    Hope this helps.

  • OWB and COMMIT

    OWB version: 9.0.2
    I noticed that OWB puts several COMMIT statements in the generated PL/SQL package code.
    I need to execute the COMMIT statement only at the end of the PL/SQL procedure, or when I decide, or to issue a ROLLBACK if some error occurs. So my question is:
    Is it possible to configure an OWB mapping so that the COMMIT statements are completely removed from the generated code?
    I removed the COMMIT statements manually, but it seems that some part of the generated code still commits: when the transaction finishes and I issue a rollback, all the inserts remain in the tables.
    TIA

    Javier,
    OWB indeed always includes commit statements in its code. However, even if you removed all commits (assuming you run the 904 or 92 version of OWB) you would still end up with commits after every execution, because every execution runs in a separate session, and when a session exits cleanly the database implicitly commits.
    In OWB92 we introduced a feature called correlated commit (a configuration parameter on a mapping) that helps you control commits in a multi-target mapping. With that feature you can restrict commits to happen only at the end of the mapping in set-based mode. In row-based mode or row-based bulk mode, the behavior is that each row from the PL/SQL cursor is applied to all targets together rather than one target at a time.
    The commit behavior is as follows:
    - in set-based mode, there is one commit per statement; with correlated commit set to true, there is one commit at the end of the mapping.
    - in row-based bulk mode, the commit frequency is the bulk size. To force row-based mode, set the bulk size to 1.
    - in row-based mode, the commit frequency can be configured via the commit frequency configuration parameter.
    So, if possible, run in set-based mode (this is fastest anyway) and set correlated commit to true. This will commit the mapping once at the end, and in case of any error the entire mapping will roll back.
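    To make the set-based case concrete: the generated load is essentially one DML statement per target, so a single commit (or a single rollback on error) covers the whole load. A schematic sketch with hypothetical table and column names:

      -- Set-based: the whole load is one statement, so one COMMIT
      -- (or one ROLLBACK on error) covers every row.
      insert into tgt_table (id, val)
      select id, val
      from   src_table;
      commit;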
    Hope this helps,
    Mark.

  • OWB 10gR2: Commit everything or nothing

    Hello.
    We have a mapping which must run in row-based mode because it deletes records. There is one source and multiple targets. One flow directs into a logtable (LOG_LT). The other flow directs into 4 targets (MOD_LT for all targets) depending on the mode given in the recors (Update/Insert/Merge/Delete). The Logtable gets a part of the primary keys of the incoming rows wich are used for a replication application.
    It is important that there is nothing in the 4 targets which is not stored (part of pk as described) in the logtable. Otherwise information will be lost for the replication.Therfore we set target to order to use the LOG_LT first.
    a) We started with a high commit high freqency larger than the number of rows in source and autocommit. If the data flow had a error (pk e.g.) only this and not the log-flow was rollbacked. (Table based rollback)
    b) We used autocommit correlated. The logflow and the dataflow are ok, but the mapping was commiting to early (not the value in commit freqency).
    Why?
    c) The next idea was to set the mapping to manual commit and commit in a postmapping procedure if everything was ok. The mapping can be deployed but when we start the mapping we get the warning:
    RTC-5380: Manual commit control configured on maping xy is ignored.
    Why?
    Has anybody a good practice for a row-based-mapping (with multiple targets and one source) to commit everything or nothing?
    Thanks for your help
    Stephan

    Hello Stephan,
    In row-based mappings with a single source and multiple targets, the following are the different ways of loading the targets.
    1. Correlated Commit = true and Row Based - Suppose there are 50 rows in the source and there are 4 target tables. When the mapping is run in this mode and it encounters an error while loading row 10 into, say, target table TGT_3, then this row is rolled back from all four targets and processing continues, finally loading 49 records into all 4 tables.
    When you run in row-based mode, please also check the bulk processing mode. If it is set to true, the bulk size overrides the commit frequency you set. It is therefore advisable to make the bulk size and the commit frequency equal, or to set the bulk processing mode to false and then set the commit frequency.
    HTH
    -AP

  • Optimizing a looping procedure - when to commit

    Does anyone have a link to an article that explains when a commit should be timed to be optimal (i.e. after X number of statements or after inserting/updating Y MB)?
    I have a procedure that loops through and inserts (ugly, but inherited and I have no time to re-code) and I'm not sure how to go about optimizing the commit point other than by trial and error. Right now I'm committing every 1000 iterations of the loop, which is faster than committing every iteration but slower than no commits at all. How can I figure out what is best?
    Connected to:
    Oracle8i Enterprise Edition Release 8.1.7.4.0 - Production
    With the Partitioning option
    JServer Release 8.1.7.4.0 - Production
    Lemme know if more info is needed.
    Thanks,
    Pete

    Yes, this is obvious: each commit has a small amount of overhead, and the more commits you do, the more that overhead adds up.
    So the most optimal approach available to you is to commit once at the end.
    If you have 27 million inserts this may not be possible due to resource issues. Do the tables have a lot of indexes? Try a commit every 2 million rows; 1000 is ridiculously tiny.
    Try the query in this post to see how much undo you use before committing after 1000 rows, and then multiply up to see how much you can get away with:
    Re: Suggestions for improving this update in batches of 100 records
    But again, I have run tests before showing that a LOOP versus plain SQL is hundreds of times slower, while committing after every row in the LOOP versus committing once after the LOOP makes a difference of only a few percent, so I do not think you are going to squeeze much of a runtime improvement just by reducing the commit frequency.
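    A minimal sketch of that kind of undo check, run from a second session while the batch transaction is still open (assumes SELECT privilege on the V$ views; the username filter is a hypothetical placeholder):

      -- USED_UBLK = undo blocks held, USED_UREC = undo records held
      -- by each currently open transaction.
      select s.sid, s.username, t.used_ublk, t.used_urec
      from   v$transaction t, v$session s
      where  s.taddr = t.addr
      and    s.username = 'BATCH_USER';   -- hypothetical schema name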

  • PL/SQL procedure for deleting rows

    We have to delete rows from a table by initiating parallel processes, depending on the number of connections, and with a variable commit frequency. The procedure has to restart by itself in case of failure of one or more parallel processes, by identifying the position where it stopped. Can someone please help me with the logic needed to write the procedure.
    Thanks in Advance
    Edited by: 864979 on Jun 9, 2011 10:02 PM

    Be careful of how this is designed and coded. It is very easy to do it horribly wrong, causing contention and problems.
    Have a look at DBMS_PARALLEL_EXECUTE.
    If the package is not available on your Oracle version, then look at {message:id=1534900} for a manual approach.
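    A minimal sketch of the DBMS_PARALLEL_EXECUTE approach (11gR2 onwards); the owner, table, chunk size, retention period and parallel level below are all hypothetical placeholders. Each chunk commits independently, and failed chunks can be rerun with RESUME_TASK, which covers the restart requirement:

      begin
        dbms_parallel_execute.create_task(task_name => 'purge_task');
        -- Split the table into rowid ranges of roughly 10000 rows each.
        dbms_parallel_execute.create_chunks_by_rowid(
            task_name   => 'purge_task',
            table_owner => 'APP',            -- hypothetical owner
            table_name  => 'MYTABLE',        -- hypothetical table
            by_row      => true,
            chunk_size  => 10000);
        -- Run the delete with 4 parallel job slaves; each chunk is
        -- committed on its own as it completes.
        dbms_parallel_execute.run_task(
            task_name      => 'purge_task',
            sql_stmt       => 'delete from app.mytable
                               where rowid between :start_id and :end_id
                               and created_time < sysdate - 90',
            language_flag  => dbms_sql.native,
            parallel_level => 4);
        -- After a failure: dbms_parallel_execute.resume_task('purge_task');
        dbms_parallel_execute.drop_task(task_name => 'purge_task');
      end;
      /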

  • Dynamic Lookup in OWB 10.1g

    Can we implement a dynamic lookup in OWB 10.1g?
    I want to update the columns of the target table based on the previous values of those columns.
    Suppose there is a record in the target table with previous-status and current-status columns.
    The source table consists of 10 records which need to be processed one at a time in a single batch. We need to compare the status of each record with the current status in the target table. If the source contains the next higher status, the current status of the target record needs to move to previous status, and the new status coming from the source needs to overwrite the current status of the target record.
    We have tried using the row-based option as well as setting the commit frequency to 1, but we are not able to get the required result.
    How can we implement this in OWB 10.1g?

    OK, now what I would do in an odd case like this is to look at the desired FINAL result of a run rather than worry so much about the intermediate steps.
    Based on your statement of the status incrementing upward, and only upward, your logic can actually be distilled down to the following:
    At the end of the load, the current status for a given primary key is the maximum status, and the previous status will be the second highest status. All the intermediate status values are transitional status values that have no real bearing on the desired final result.
    So, let's try a simple prototype:
    --drop table mb_tmp_src; /* SOURCE TABLE */
    --drop table mb_tmp_tgt; /*TARGET TABLE */
    create table mb_tmp_src (pk number, val number);
    insert into mb_tmp_src (pk, val) values (1,1);
    insert into mb_tmp_src (pk, val) values (1,2);
    insert into mb_tmp_src (pk, val) values (1,3);
    insert into mb_tmp_src (pk, val) values (2,2);
    insert into mb_tmp_src (pk, val) values (2,3);
    insert into mb_tmp_src (pk, val) values (3,1);
    insert into mb_tmp_src (pk, val) values (4,1);
    insert into mb_tmp_src (pk, val) values (4,3);
    insert into mb_tmp_src (pk, val) values (4,4);
    insert into mb_tmp_src (pk, val) values (4,5);
    insert into mb_tmp_src (pk, val) values (4,6);
    insert into mb_tmp_src (pk, val) values (5,5);
    commit;
    create table mb_tmp_tgt (pk number, val number, prv_val number);
    insert into mb_tmp_tgt (pk, val, prv_val) values (2,1,null);
    insert into mb_tmp_tgt (pk, val, prv_val) values (5,4,2);
    commit;
    -- for PK=1 we will want a current status of 3, prev =2
    -- for PK=2 we will want a current status of 3, prev =2
    -- for PK=3 we will want a current status of 1, prev = null
    -- for PK=4 we will want a current status of 6, prev = 5
    -- for PK=5 we will want a current status of 5, prev = 4
    Now, let's create a pure SQL query that gives us this result:
    select pk, val, lastval
    from (
        select pk,
               val,
               max(val) over (partition by pk) maxval,
               lag(val) over (partition by pk order by val) lastval
        from (
            select pk, val
            from mb_tmp_src mts
            union
            select pk, val
            from mb_tmp_tgt mtt
        )
    )
    where val = maxval;
    (NOTE: UNION, not UNION ALL, to avoid duplicates where tgt = src; you would also want a DISTINCT in the union if multiple instances of the same value can occur in the source table.)
    OK, now I'm not at my work right now, but you can see how unioning (a SET operator) the target with the source, passing the union through an expression to get the analytics, and then through a filter to get the final rows before updating the target table will get you what you want. And the bonus is that you don't have to commit per row. If you can get OWB to generate this sort of statement, then it can go set-based.
    EDIT: And if you can't figure out how to get OWB to generate this entirely within the mapping editor, then use it to create a view from the main subquery with the analytics, and then use that as the source in your mapping.
    If your problem were time-based, where the code values could go up or down, then you would do pretty much the same thing, except you would grab the last change and have that become the current value in your dimension. The only time you would care about the intermediate values is if you were coding for a type 2 SCD, in which case you would need to track all the changes.
    Hope this helps.
    Mike
    Edited by: zeppo on Oct 25, 2008 10:46 AM

  • OWB Repository Browser

    Hi,
    I use the Repository Browser to see the execution details while a mapping is executing.
    But the Repository Browser does not show the number of records inserted or updated until the mapping finishes its execution. This is happening in 10gR2.
    Is there a way by which I can get real-time information on how many records have been updated so far while the mapping execution is in progress?
    Thanks
    MD

    You have to take into account that you have a commit frequency and a maximum number of errors; if an error occurs, the work will probably be rolled back. You also have the operating mode (row based, set based, etc.). So you will probably not see your results until the execution is done.
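    As a rough database-level workaround (not OWB-specific), long-running DML often registers in V$SESSION_LONGOPS, which you can poll from another session; a minimal sketch, assuming SELECT privilege on the view:

      -- SOFAR/TOTALWORK give an approximate completion ratio for
      -- operations that are still in flight.
      select sid, message, sofar, totalwork,
             round(sofar / totalwork * 100, 1) pct_done
      from   v$session_longops
      where  totalwork > 0
      and    time_remaining > 0;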

  • Fetching records in a block(db) for mass processing

    My user interface takes a commit frequency as input for intermittent commits. We compare it to the number of records processed and then issue a commit (whilst processing millions of records).
    Now, we want to process all records belonging to a set of data blocks, to avoid contention issues (GC and other locking issues), and then issue a commit.
    So the question is: how do I query the records belonging to a particular block, or identify the blocks that hold my data?
    Any pointer is greatly appreciated.
    thanks
    Kiran.

    kkulkarn wrote:
    This is needed for a DW instance wherein we select a recordset from an OLTP instance running 4-node RAC. The user then inputs a condition based on which the records are selected.
    We are committing based on parameters defined by the DBAs - the commit frequency. This increases contention, especially if the records are stored in contiguous locations. Add to this the RAC, and the GC traffic is huge.
    So what we want to do is grab all the records in a particular block, read the whole block, and then process it to the DW. I was looking for a query that can get me the block numbers involved in this process.
    Could you clarify:
    Is it the OLTP that is on RAC, or the DW, or both ?
    Which type of contention - can you show us the critical wait information - and is this on the OLTP or RAC system, and is it the OLTP or RAC system where the GC traffic is huge ?
    You seem to be asking how you can specify a query in an OLTP system that is going to visit many table blocks - but "pre-sort" the data in some way so that it visits each table block only once, and visits the table blocks in order so that you don't revisit the same table block and run the risk of extra interconnect traffic on the OLTP system. Is this a correct interpretation of your requirement ?
    Does your transfer process do something like:
    counter := 0;
    for r in (select from OLTP system) loop
        insert into DW system values (r.thing....);
        counter := counter + 1;
        if counter = {limit} then
            commit;
            counter := 0;
        end if;
    end loop;
    commit;
    Regards
    Jonathan Lewis
    http://jonathanlewis.wordpress.com
    http://www.jlcomp.demon.co.uk
    To post code, statspack/AWR report, execution plans or trace files, start and end the section with the tag {noformat}{noformat} (lowercase, curly brackets, no spaces) so that the text appears in fixed format.
    "Science is more than a body of knowledge; it is a way of thinking"
    Carl Sagan

  • Rollback all steps in a mapping if an error occurs

    We have a mapping with several steps (a pre-mapping procedure, some operators and a post-mapping procedure) and we want to commit or roll back the entire mapping as one transaction, so that either everything succeeds or nothing is changed by this mapping. Which parameters do we have to configure (Maximum number of errors: 1, Commit frequency: 0, ...)?
    Thanks.
    Alessandro

    You should probably make sure that the pre-mapping procedure does not contain commits (that should be easy). Also, don't do DDL in there!
    Then set the commit frequency to a large value or, even better, make sure the mapping runs in set-based mode. That way any error rolls everything back (be sure to set this to SET BASED ONLY, otherwise OWB will fall back to row-by-row processing and commit along the way).
    Make sure that the post-mapping procedure NEVER runs on failure (e.g. configure it to run only on success of the main mapping body).
    That should do the trick.
    Jean-Pierre
