DI Job Execution Hangs

It took a while to build our DI Linux Job Server, so we initially installed a local DI Job Server on each of the laptops in Windows.
We're still having to work out some firewall rules as well-
Here is the DI issue. We design a job in the Designer, and can execute it on the local job server (on the laptop) and it runs fine and to completion.
Becuase of the firewall rules, we can't execute a job from within the Designer, but rather, we execute it from the DI Web Admin Console.
When we execute the same job on the Linux Job Server, it appears to be running, but the trace log reads this:
(11.7) 01-07-09 10:51:55 (21168:3086001856) JOB: The initial environment locale <eng_us.utf-8> has been coerced to <Unicode (UTF-16)> ().
(11.7) 01-07-09 10:51:55 (21168:3086001856) ORA_LIB: Loaded Oracle Client library <libclntsh.so> is in the path: </u01/app/oracle/product/10.2.0/client_1/lib>.
(11.7) 01-07-09 10:51:55 (21168:3086001856) JOB: Reading job <89857676_72d8_4bf9_818f_24dc45457e2c> from the repository; Server version is <11.7.3.5>; Repository version is
<11.7.0.0000>.
(11.7) 01-07-09 10:51:55 (21168:3086001856) JOB: Current directory of job <89857676_72d8_4bf9_818f_24dc45457e2c> is </opt/DataIntegrator/NDDI/bin>.
(11.7) 01-07-09 10:51:55 (21168:3086001856) JOB: Starting job on job server host <DataIntegrator-v1.dc.nd.edu>, port <3500>.
(11.7) 01-07-09 10:51:55 (21168:3086001856) JOB: Job <JOB_ADV> of runid <20090107105155211683086001856> is initiated by user <boadmin>.
(11.7) 01-07-09 10:51:55 (21168:3086001856) JOB: Processing job <JOB_ADV>.
And never completes.
This is also the log in the error_JobService.log file:
01-07-09 11:55:02 (20754:-1208867136) JSERVICE: Sleeping for 30sec
01-07-09 11:55:32 (20754:-1208867136) JSERVICE: Polling the JobServer(s) ..
01-07-09 11:55:32 (20754:-1208867136) JSERVICE: Polling the Data Integrator Web Server..
01-07-09 11:55:32 (20754:-1208867136) JSERVICE: Polling the AccessServer(s) ..
01-07-09 11:55:32 (20754:-1208867136) JSERVICE: Sleeping for 30sec
01-07-09 11:56:02 (20754:-1208867136) JSERVICE: Polling the JobServer(s) ..
01-07-09 11:56:02 (20754:-1208867136) JSERVICE: Polling the Data Integrator Web Server..
01-07-09 11:56:02 (20754:-1208867136) JSERVICE: Polling the AccessServer(s) ..
01-07-09 11:56:02 (20754:-1208867136) JSERVICE: Sleeping for 30sec
01-07-09 11:56:32 (20754:-1208867136) JSERVICE: Polling the JobServer(s) ..
01-07-09 11:56:32 (20754:-1208867136) JSERVICE: Polling the Data Integrator Web Server..
01-07-09 11:56:32 (20754:-1208867136) JSERVICE: Polling the AccessServer(s) ..
01-07-09 11:56:32 (20754:-1208867136) JSERVICE: Sleeping for 30sec
Additionally, for the PID noted above, 21168, there is an entry in the stack_trace.txt log in the log directory that reads this:
boadmin@DataIntegrator-v1 log$ cat stack_trace.txt
=== Start Stack Trace dump. PID: 14417===
=== End Stack Trace dump. PID: 14417 ===
=== Start Stack Trace dump. PID: 17290===
=== End Stack Trace dump. PID: 17290 ===
=== Start Stack Trace dump. PID: 21168===
=== End Stack Trace dump. PID: 21168 ===
boadmin@DataIntegrator-v1 log$
Why is the job just hanging like that when we execute it on the Linux job server? Is there something else that needs to be done?
Thanks!

is the job hanging or it has crashed ?
do you see this pid in the output of following command
ps -aef
looks like its crashing, check if there are any core dump file in the $LINK_DIR/bin folder, check the ulimit -a to find if core file size is not set to 0
what is the exact version of DI 11.7.0.0 or 11.7.2.x, or 11.7.3.x
try running the job by enabling trace all option, this can give some information

Similar Messages

Test Execution hangs while executing a test from Command prompt option

I have done the following
1) Created a script Test driver
2) Added different scripts as Script->Properties->Assets to the driver script
3) Call each of this asset script from TestDriver using command getScript("<<ScrptName>>".run(1, true, true, true);
4) Now calling this TestDriver script from command prompt with command
e:\\OracleATS\\agent
runScript.bat E:\\INTEGRATION_SERVICES\\Identity_management\\Automation\\Scripts\\MasterScripts\\TestDriver
TestDriver.jwg
The script execution hangs at "Initializing VU 1 for Script TestDriver. After few mintues of seeing this message i get error "Execution Timed Out".
Do I need to keep openscript UI open for script execution from command line ?
Is there any way to increase this Execution Time out ? May be it is taking time to load the internal script assets. AS other scripts are running fine on the same machine from command prompt itself.
Complete error is as follows
C:\Documents and Settings\oracle>e:\\OracleATS\\agent\\runScript.bat E:\\INTEGRATION_SERVICES\\Identity_management\\Automation\\Scripts\\MasterScripts\\TestDriv
er\\TestDriver.jwg
Running "TestDriver" ...
Agent started. Available commands:
stop - Stop the virtual user after it finishes the current iteration.
abort - Abort the virtual user cleanly, before it finishes the current itera
tion.
exit - Terminate the process immediately
You may type the above commands at any time.
12:17:53,418 INFO [1] Initialized script service "oracle.oats.scripting.modules.utilities.api.UtilitiesService"
12:17:53,418 INFO [1] Initialized script service "oracle.oats.scripting.modules.browser.api.BrowserService"
12:17:53,418 INFO [1] Initialized script service "oracle.oats.scripting.modules.functionalTest.api.FunctionalTestService"
12:17:53,465 INFO [1] Initialized script service "oracle.oats.scripting.modules.webdom.api.WebDomService"
12:17:53,856 INFO [1] Initialized script service "oracle.oats.scripting.modules.datatable.api.DataTableService"
12:17:53,856 INFO [1] Initializing VU 1 for script TestDriver
Execution Timed Out.

Hi e_raja_sekar,
You wrote:
public static void main(String args){should be
public static void main(String args[]){Granted, the below code was tested on Windows XP using J2SE SDK 1.4.1_02 (and not 1.2.2 like Sridhar is using), but when I compiled and executed this code:
public class BadMain {
public static void main(String args) {
System.out.println("Hello World");
}The output I got was:
Exception in thread "main" java.lang.NoSuchMethodError: mainSo I don't think this is Sridhar's problem (but I could be wrong) since he says that he gets no error message.
Cheers,
Avi.

Schedule Job Execution problem

Hi All,
The schedule jobs are taking more time.... to complete
We have schdule jobs which will run for every hour,soo to execute the job sometimes it is taking 1hr and sometime 30mins
please help me to have the Job execution total time to be constant to near 30mins.

preethi wrote:
Hi All,
The schedule jobs are taking more time.... to complete
We have schdule jobs which will run for every hour,soo to execute the job sometimes it is taking 1hr and sometime 30mins
please help me to have the Job execution total time to be constant to near 30mins.Thread: HOW TO: Post a SQL statement tuning request - template posting
HOW TO: Post a SQL statement tuning request - template posting

Sql Server Agent Job Execution

Hi All,
I want to run a sql agent job (one job is execution one package) in such a way that as soon as job1 completes , it should be run again (without any time loss)
I am folowing Below approach:
Job 1 has 2 steps , in both the steps I am calling same package to execute . After that on Success/Failure action. I am selecting the same step.it is running in an infinite loop and continue fetching data.
is there any dis-advantages on the above approach?
Is there any other approach to achive the Same?
Thanks in Advance....

Hello,
The two-step recursive (on success) technique should be fine. I would just recommend the introduction of short-circuit”techniques that stops a runaway job in the event of logical issues (the “on failure” action will still catch the general error issues).
Other patterns that are sometimes used for continual-execution requirements:
Service Broker (for example, continually watch the queue and act on new messages)
T-SQL home-grown job that runs in a loop and then executes the job based on some condition – and would require you to check job execution state
For a continuously running package watching for new files in a folder, one example is detailed in the following blog post:
Continuously watching files with WMI Event Watcher Task
Best Regards, Joe Sack, MVP, SQL MCM | SackHQ.com

Enable server group in BPC job execution

Dear Experts
[Note 1476057 - Enable server group in BPC job execution|https://websmp130.sap-ag.de/sap/bc/bsp/spn/sapnotes/index2.htm?numm=0001476057&nlang=E]
After upgrade, the server group logic will be :
1, Create a correct server group with server assigned,
    job will be executed in specified server group;
2, Create a server group without server assigned,
     job will be successfully submitted to BW,
     but will not be executed until server is assigned to the server group;
3, Incorrectly specify a server group in web admin,
     client will get error message when run package.
As I understand the group server is defined in BW Server. Where should I specify this group server in BPC. I could not find the parameter in BPC Web Admin - Appset Parameters or BPC Server (.Net Server) - Server Options.
Appreciate if somebody can help ?
Thanks
Ashish
Edited by: Ashish Narway on Oct 14, 2010 2:55 PM

HI John,
The group settings in WGM SL and Leopard are different.
Yes, they are. I booted to my eval copy of SL Server and ran through the settings in WGM, Server Admin and Server Preferences; as you pointed out, all the mail list admin tools I could find are in the Mail service in Server Admin.
The only thing that may be of interest that I could find is that for the group mail list, the email address for the group is group_shortname@ServerDNSname, not groupshortname@yourdomain (this is in the Mail Services Admin Guide for 10.6). This might explain why your email to the group may have been rejected?
-Doug

Got error in job execution, but the standalone procedure runs fine.

I got error in job execution. But it runs fine as standalone procedure. Where could it be wrong?
_>exec dbms_job.run(145373);
BEGIN dbms_job.run(145373); END;
ERROR at line 1:
ORA-12011: execution of 1 jobs failed
ORA-06512: at "SYS.DBMS_IJOB", line 481
ORA-06512: at "SYS.DBMS_JOB", line 275
ORA-06512: at line 1
I checked the alert log file
ORA-06550: line 1, column 101:
PLS-00103: Encountered the symbol "RPTIMPORT" when expecting one of the following:
:= . ( @ % ;
Thu May 2 12:59:34 2013
Errors in file /oracle/admin/PDAMLPR1/udump/pdamlpr11_ora_15574.trc:
ORA-12012: error on auto execute of job 146925
ORA-06550: line 1, column 133:
PLS-00103: Encountered the symbol "" when expecting one of the following:
:= . ( @ % ;
The symbol ";" was substituted for "" to continue.
The trace file:
*** ACTION NAME:() 2013-05-02 12:43:48.261
*** MODULE NAME:(SQL*Plus) 2013-05-02 12:43:48.261
*** SERVICE NAME:(SYS$USERS) 2013-05-02 12:43:48.261
*** SESSION ID:(1811.10644) 2013-05-02 12:43:48.261
*** 2013-05-02 12:43:48.261
ORA-12012: error on auto execute of job 145373
ORA-06550: line 1, column 133:
PLS-00103: Encountered the symbol "" when expecting one of the following:
:= . ( @ % ;
The symbol ";" was substituted for "" to continue.
*** ACTION NAME:(********* Schedule Job: Report C) 2013-05-02 12:47:58.178
*** MODULE NAME:(PATIENT_REMEDIATE_JOB) 2013-05-02 12:47:58.178
*** 2013-05-02 12:47:58.178
ORA-12012: error on auto execute of job 145373
ORA-06550: line 1, column 133:
PLS-00103: Encountered the symbol "" when expecting one of the following:
:= . ( @ % ;
The symbol ";" was substituted for "" to continue.
*** 2013-05-02 12:52:43.409
ORA-12012: error on auto execute of job 146924
ORA-06550: line 1, column 101:
PLS-00103: Encountered the symbol "RPTIMPORT" when expecting one of the following:
:= . ( @ % ;
*** 2013-05-02 12:59:34.651
ORA-12012: error on auto execute of job 146925
ORA-06550: line 1, column 133:
PLS-00103: Encountered the symbol "" when expecting one of the following:
:= . ( @ % ;
The symbol ";" was substituted for "" to continue.
~
~
The job was created as the following with 30 min interval:
The job number: 145373
DECLARE
X NUMBER;
BEGIN
SYS.DBMS_JOB.SUBMIT
( job => X
,what => 'RPTIMPORT.pr_exec_pop_ddl21_data_irt'
,next_date => to_date(sysdate)
,interval => '/*30:Mins*/ sysdate + 30/(60*24)'
,no_parse => TRUE
SYS.DBMS_OUTPUT.PUT_LINE('Job Number is: ' || to_char(x));
END;
Thanks for help!

the issue is resolved. I missed to put ";" at end of what statement. Thank all.

JOB execution fails due to Access failure to procedures and views using certificates (encryption)

Job execution fails using SQLServerAgent account but runs successfully under another windows personal account.
Both have same security access on server and DB.
The Windows account is mapped to dbo on the database .
The package uses procedures and views which are using encryption by certificates.
Please advice on how to provide security so that job runs from the agent

Hi Prady26,
If the SQL Server Agent Service account and the proxy account generated based on a Windows user account have the same permissions on both the SQL Server database and the File System of the server, it may be related to the protection level of the package.
If the package is set to encrypt with UserKey, the job will fail unless it runs under the security context of the Windows user that designs the package in BIDS/SSDT. You can check the
ProtectionLevel of the package in the BIDS/SSDT to see if it is EncryptSensitiveWithUserKey or EncryptAllWithUserKey, if it is, re-deploy/reinstall the package and set the package protection
level to “Relay on server storage for encryption”.
If the issue persists, post the error message in the job history for further analysis.
Regards,
Mike Yin
If you have any feedback on our support, please click
here
Mike Yin
TechNet Community Support

Find aging of concurrent jobs execution for last one year in R12

DB:11.2.0.3.0
EBS: 12.1.3
O/S: Red Hat Linux
Hi all,
Could anyone please share the resolution for the below question in R12 ?
1) Considering only 3 weeks of data is maintained in FND tables, which means we cannot use fnd_concurrent_requests to get list of jobs submitted in last one year. Is there any other way what we can get to find out aging of jobs execution for last one year?
Thanks for your time!
Regards,

Locking this thread - Pl continue the discussion in your original post - Identifying customizations in R12
Unless you have backups of the deleted data, this is not possible

Error on 11g 2 during external job execution using credential

Hi,
I need a help from you.
I am getting below error on 11g 2 during external job execution using credentials -
EXTERNAL_LOG_ID="job_805812_412409",
ORA-27369: job of type EXECUTABLE failed with exit code: Arg list too long
STANDARD_ERROR="Launching external job failed: Invalid username or password"
Find credentials creation & other required details -
EXECUTE DBMS_SCHEDULER.create_credential(credential_name =>'SYS.SCRIPTS',username =>'scripts',password => 'u5342222');
BEGIN
DBMS_SCHEDULER.CREATE_JOB(
job_name => 'AUTOLOAD_JOB',
job_type => 'EXECUTABLE',
job_action => '/qpshell/dev/scripts/autoload.sh',
repeat_interval => 'freq=MINUTELY;interval=5;byhour=1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22;',
enabled => true,
auto_drop => false,
credential_name => 'SYS.SCRIPTS');
end;
-rw-r--r-- 1 root oinstall 1571 Nov 11 14:42 /u01/app/oracle/product/11.2.0/dbhome_1/rdbms/admin/externaljob.ora
content of externaljob.ora -
run_user = nobody
run_group = nobody
run_user=scripts
run_group=qvcscript
-rwsr-x--- 1 root oinstall 3068424 Nov 11 13:13 extjob
-rwxr-xr-x 1 oracle oinstall 3068424 Nov 11 13:13 extjobo
root@knudwdbs2:/u01/app/oracle/product/11.2.0/dbhome_1/bin> ls -ltr extjob*
-rwx------ 1 oracle oinstall 3068424 Nov 11 12:45 extjoboO
-rwsr-x--- 1 root oinstall 3068424 Nov 11 12:45 extjobO
-rwxr-xr-x 1 oracle oinstall 3068424 Nov 11 13:13 extjobo
-rwsr-x--- 1 root oinstall 3068424 Nov 11 13:13 extjob
Kindly suggest how it can be fixed. Early response much appreciated.
Thanks
Ajay
--

Hi,
When you are logged in to the OS, can you do an "su" to the "scripts" user using this exact password (passwords are case-sensitive) ?
Also make sure that you have libpam.so located somewhere in $LD_LIBRARY_PATH and that it points to a real file.
external jobs with credentials use jssu, but I don't think permissions on jssu are the problem here (it should be setuid to root).
Hope this helps,
Ravi.

Job executions lost in EM Grid Control

Hello,
We have lost all executions from the job execution pane except the backup jobs. We have lost also all scheduled jobs. Any idea about ?
Thanks,
Manuel.

Did you create the email Notification Schedule?

ODI Interface execution hanging

Hi,
suddenly some interfaces execution hanging, what is the reason for this issue, and how i can overcome this issue?
thanks

The interface is loading from SQL table to oracle table, My IKM are : IKM Oracle Incremental Update and LKM SQL to Oracle,
How can I know it's using memory engine or not ?
what do you mean by ODI Agent is hosed? I am using (Local Agent)
thanks

Background Job execution enabled via pro-active monitoring

Hi community,
We're interested in CPS Redwood to help us in proactive monitoring of background jobs (Ad Hoc and Periodic) and impacts in system performance.
The idea would be to have somehow a tool that monitor the number of available working process and system performance. When the system workload reaches a certain level, ex: 95%, this tool would intercept new jobs execution. After this, when the system workload goes down and the resources are free, the system would release all jobs to normal run again.
Is it possible via CPS Redwood?
Is there any other way to have something automatic like this?
Thanks,
Jose Antonio.

Hi Jose,
This is possible indeed using CPS. CPS can monitor various load metrics of your system, then adapt the scheduling logic if thresholds are exceeded.
In your scenario, you would basically always intercept all jobs, but normally not enforce any limits so effectively that means all jobs still run immediately. If the load is too high, CPS would enforce a limit, so then jobs will be delayed until the load is lower again.
Regards,
Anton.

Jobs are hanging in running status

Hey ,
I have some overnight jobs that refresh materialized views over a dblink from a remote db. I came this morning and noticed that some of the jobs where hanging in RUNNING state with "more data from dblink" event but there was no corresponding sessions in the remote db. It seems like the refresh process was already done (by the size of the mviews). Tracing the session is useless (i've tried) since no actual job is being done. The alert log gave no clue.
any ideas on what the problem is or what to do in order to find it?
could it be a locking issue ?
Thanks

Hi,
I think you may need to use normal database hang diagnostic methods to try to figure out what is going on.
The first thing to do is look in the dba_scheduler_running_jobs view, there should be sessions associates with the running view (you can also check that there are real j0 slave processes running the jobs). Once you have checked that there is a real session that is still running the job maybe you could check the v$session view and any other v$ views that should be able to tell you what SQL statement the job is hanging on. You may also be able to check locks held by those sessions.
You may also be able to use Oracle's ADDM reporting tools to get some diagnostic info.
Another possible thing to try is running the job action by hand manually in pl/sql and seeing whether that hangs and possibly figure out where.
Where do you see the "more data from dblink" message ?
-Ravi

How to suspend (all) job execution ?

Can I define jobs but suspend their execution (until I enable job execution again) ?
If yes, how do I suspend them ?

Hi,
You can visit here:
http://download-uk.oracle.com/docs/cd/B19306_01/server.102/b14231/scheduse.htm#i1009724
and also visit here:
http://download-uk.oracle.com/docs/cd/B19306_01/server.102/b14231/scheduse.htm#i1009725
Adith

Is there a way to generate job execution scripts from the command line?

The customer is planning to change DI production server configuraiton and needs to regenerate job execution scripts for hundreds of jobs. Their question is - instead of going through web admin and doing it one by one, is there a way to run a batch command and generate all jobs with the same job server information from one repository? I looked at the "al_engine" command but didn't find the options to do so. Did I miss something? This is for a DI 11.5.3 on HPUX.
Thanks,
Larry

I also suggested that but the customer was so afraid of messing things up that they wanted to find out if there is another way. Thanks a lot for the confirmation.
Larry

DI Job Execution Hangs

Similar Messages

Maybe you are looking for