Jobs are hanging in running status

Hey,
I have some overnight jobs that refresh materialized views over a dblink from a remote db. I came in this morning and noticed that some of the jobs were hanging in RUNNING state with the "more data from dblink" event, but there were no corresponding sessions in the remote db. It seems the refresh itself had already finished (judging by the size of the mviews). Tracing the session is useless (I've tried) since no actual work is being done. The alert log gave no clue.
Any ideas on what the problem is, or what to do in order to find it?
Could it be a locking issue?
Thanks

Hi,
I think you may need to use normal database hang diagnostic methods to try to figure out what is going on.
The first thing to do is look in the dba_scheduler_running_jobs view; there should be sessions associated with the running jobs (you can also check that there are real j0nn job slave processes, such as j000 or j001, running the jobs). Once you have confirmed that a real session is still running the job, check the v$session view and any other v$ views that can tell you what SQL statement the job is hanging on. You may also be able to check locks held by those sessions.
You may also be able to use Oracle's ADDM reporting tools to get some diagnostic info.
Another thing to try is running the job action by hand in PL/SQL, seeing whether that hangs, and possibly figuring out where.
Where do you see the "more data from dblink" message?
-Ravi
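
The checks Ravi describes could be sketched roughly as below. This is only a sketch under stated assumptions: the SQL assumes Oracle 10g or later and SELECT privilege on DBA_SCHEDULER_RUNNING_JOBS and V$SESSION, and the Python names (`RUNNING_JOBS_SQL`, `find_stuck_jobs`), the 600-second threshold, and the sample rows are hypothetical and not taken from the thread. The helper takes rows shaped like the query result and flags jobs whose session has been waiting on a dblink event for a long time.

```python
# Query joining running scheduler jobs to their sessions.
# DBA_SCHEDULER_RUNNING_JOBS.SESSION_ID corresponds to V$SESSION.SID.
RUNNING_JOBS_SQL = """
SELECT r.owner, r.job_name, s.sid, s.serial#, s.event,
       s.seconds_in_wait, s.sql_id, s.blocking_session
  FROM dba_scheduler_running_jobs r
  JOIN v$session s ON s.sid = r.session_id
"""


def find_stuck_jobs(rows, min_wait_seconds=600):
    """Return (owner, job_name, sid) for sessions that have waited on a
    dblink-related event for at least min_wait_seconds (assumed threshold)."""
    stuck = []
    for row in rows:
        event = (row.get("event") or "").lower()
        waited = row.get("seconds_in_wait") or 0
        if "dblink" in event and waited >= min_wait_seconds:
            stuck.append((row["owner"], row["job_name"], row["sid"]))
    return stuck


# Hypothetical rows, shaped like the result of RUNNING_JOBS_SQL.
sample_rows = [
    {"owner": "APP", "job_name": "REFRESH_MV_SALES", "sid": 142,
     "event": "SQL*Net more data from dblink", "seconds_in_wait": 21600},
    {"owner": "APP", "job_name": "REFRESH_MV_STOCK", "sid": 187,
     "event": "db file sequential read", "seconds_in_wait": 2},
]

print(find_stuck_jobs(sample_rows))
```

How the rows are fetched is deliberately left out; any Oracle client that returns the columns above would do. The blocking_session column in the same query is one way to follow up on the locking question.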

Similar Messages

  • Repeating Job Can Hang In "Started" Status On Multi-Cpu Windows Machine

    Dear All,
    I am stuck with a known bug, Bug 3092358 "Repeating Job Can Hang In "Started" Status On Multi-Cpu Windows Machine". The workaround is described in Note:307448.1 on MetaLink.
    In that note, Fix-3 says another option that worked is to upgrade to EM 9.2.0.6 and apply the Apr05 CPU.
    I searched MetaLink for a patch to upgrade EM from 9.2.0.1 to 9.2.0.6 but was not able to find one. Can someone tell me where I can get this patch?
    OS: Windows 2003 Server
    DB: 9.2.0.1
    OWB: 9.2
    Please let me know if there are any other workarounds.
    Thanks for your help.
    Rgds,
    Satya.

    While I dunno the specifics of NT threads, one way to spread 1 thread across multiple processors is to simply round-robin the thread between the processors over time, i.e. thread A spends 0.5 seconds executing on cpu1, for the next 0.5 seconds the thread is moved over to cpu2 and executed there, then it is moved back to cpu1, and so on...
    I think the Convex SPP used this model, the motivation perhaps being to spread load evenly among all cpus. Made sense in a multi-user multi-cpu system with a real fast bus.
    cheers
    -Ragu
    You know you've been spending too much time on the computer when your friend misdates a check, and you suggest adding a "++" to fix it.

  • Messages are hung in "Delivering" status

    Hi everyone,
    I have an IDoc -> PI -> File scenario. Everything works fine until the message reaches the file adapter. In the Communication Channel monitoring there is no activity for that message, and the channel is correctly configured and started. However, in the Message Monitoring, when I filter for my receiver interface, I see that there are 4 messages that have been in "Delivering" status for 2 days. Then there are a lot more messages in "To Be Delivered", which I guess are waiting for the first ones to finish.
    I stopped and started the channel and tried to resend the messages, but nothing happened. Any suggestions? Thank you very much!
    Regards,
    Pablo

    Hi Pablo,
    "However, in the Message Monitoring, when I filter for my receiver interface, "
    --- First remove all the filters in the RWB.
    Now set the status filter to "System Error" and select a date range of more than six months or one year.
    If you see any messages, delete them.
    Then set the status to "To Be Delivered" and delete all of them.
    Now set the status to "Delivering" and restart them.
    This is what I do whenever I face this problem.
    Regards,
    Karna...

  • Reports Stuck in Running Status

    We are having a problem where reports are stuck in running status. Looking on the server, each report stuck in running status takes up a "jobserverchild" process and does not complete.
    Has anyone experienced this issue, and how can we resolve it?

    Ok - here we go - we have been experiencing this issue for the longest time - it looks like we have found the culprit:
    1) Check the default printer of the report developer who placed this report into Enterprise.
    2) Ensure that exact same driver is on the Business Objects Server.
    3) Report should schedule fine.
    Another option is to set "No Printer" within the report design itself by clicking on File > Page Setup. This way there is no dependency on printer drivers.
    Edited by: Troy Underwood on Nov 25, 2009 11:29 PM

  • Jobs with running status in OEM while it was already finished

    Hi
    We have rebooted our server, and all the jobs which are supposed to run some scripts on the server have been showing a running state for more than 16 hours now! According to the script logs, the scripts executed successfully and finished. When we try to delete the running jobs and schedule new ones, a job refuses to stop (although it is not even running in reality). What can we do to clear this running status and let OEM schedule new ones? Or how can we resolve this situation?
    We are running Enterprise Manager 10g.
    Swaid

    Hi Jozsef,
    I've tried to follow the instructions in that Metalink note, and the job seems to be deleted from the mgmt_job table, as are the executions for that job in mgmt_job_execution. But everything is still hanging, and when I checked the EM page it still shows the job in running state! How can I clean it up? And why is it hanging in the first place?
    I've looked into the agent trace, and it has not been updated since the problem started. Here are some of the last messages we have in the trace:
    2009-04-17 12:05:09 Thread-2413 ERROR http: secondary header = Host, value = snmsmaster:3938
    2009-04-17 12:05:09 Thread-2413 ERROR http: secondary header = Connection, value = Keep-Alive, TE
    2009-04-17 12:05:09 Thread-2413 ERROR http: secondary header = TE, value = trailers, deflate, gzip, compress
    2009-04-17 12:05:09 Thread-2413 ERROR http: secondary header = User-Agent, value = RPT-HTTPClient/0.3-3
    2009-04-17 12:05:09 Thread-2413 ERROR http: secondary header = Accept-Encoding, value = gzip, x-gzip, compress, x-compress
    2009-04-17 12:05:09 Thread-2413 ERROR http: secondary header = Content-type, value = application/octet-stream
    2009-04-17 12:05:09 Thread-2413 ERROR http: secondary header = Content-length, value = 593
    2009-04-17 12:05:09 Thread-2413 ERROR http: --- Error context dump end for incoming request ---
    The emoms trace is being updated with repeated error messages that look like the ones below:
    2009-04-18 15:56:57,114 [HealthMonitor] ERROR em.jobs pingPastDue.604 - Job step continuing
    2009-04-18 15:57:13,362 [HealthMonitor] ERROR emd.main run.291 - HealthMonitor : Found errant task : TaskRegn:ID3440,Callback:class oracle.sysman.emdrep.jobs.JobWorker,Iterative:true,Duration:900,DueTime:1240059433359
    2009-04-18 15:57:13,363 [HealthMonitor] ERROR em.jobs pingPastDue.602 - Entry - 15 Minute timeout error for jobstep:Stepname: Command
    Commandname: remoteOp
    Commandtype: Short-Running
    StepId: 97914
    jobIdStr: 679733b18b7a5698e04400144fa10220
    executionIdStr: 67bd0ff86c9b4268e04400144fa10220
    iterateParam: null
    iterateParamIndex: -1
    Swaid

  • How to find the particular record in 1000's of workflow jobs are running

    Hi,
    In Data Manager, in the Workflow tab: if a record is in check-out mode and there are thousands of jobs running in that Workflow tab, can anyone tell me how to find that particular record among those workflow jobs?
    Can anyone show me the difference in getting a record in 5.5 and 7.1?

    Hello COTI
    Unfortunately, SAP MDM doesn't have good workflow (WF) search capabilities.
    All workflows are classified by their status (Unlaunched, Available, Received, Completed, Error, etc.).
    SAP MDM assigns a unique Job ID to each workflow, and this ID is shown in the Job ID field in the Data Manager Workflow tab.
    You can sort the workflow list by any workflow field, such as Job ID, Step, User, or Start, and try to find your workflow that way.
    You can also use the Java API; it is one of the best solutions for workflow management and, for example, workflow mass upload.
    Regards
    Kanstantsin Chernichenka

  • Scheduled jobs are not running DPM 2012 R2

    Hi,
    I recently upgraded my DPM 2012 SP1 to 2012 R2. The upgrade went well, but I got 'Connection to the DPM service has been lost.' (event ID 917, plus other event IDs in the event log such as 999 and 997). A few DPM backups succeed, but most of the DPM backup consistency checks fail.
    After investigating the log files I found two SQL Server services running on the DPM 2012 R2 server: the 'sql server 2010' and 'sql server 2012' services. I stopped the SQL 2010 service and started only the SQL Server 2012 service using (.\MICROSOFT$DPM$Acct).
    Now the DPM console issue (event ID 917) is gone, but a new issue has occurred: none of the scheduled jobs run, although I can run all backups manually without any issues. I am getting the event log errors below:
    Log Name:      Application
    Source:        SQLAgent$MSDPM2012
    Date:          7/20/2014 4:00:01 AM
    Event ID:      208
    Task Category: Job Engine
    Level:         Warning
    Keywords:      Classic
    User:          N/A
    Computer:      
    Description:
    SQL Server Scheduled Job '7531f5a5-96a9-4f75-97fe-4008ad3c70a8' (0xD873C2CCAF984A4BB6C18484169007A6) - Status: Failed - Invoked on: 2014-07-20 04:00:00 - Message: The job failed.  The Job was invoked by Schedule 443 (Schedule 1).  The last step to run was step 1 (Default JobStep).
     Description:
    Fault bucket , type 0
    Event Name: DPMException
    Response: Not available
    Cab Id: 0
    Problem signature:
    P1: TriggerJob
    P2: 4.2.1205.0
    P3: TriggerJob.exe
    P4: 4.2.1205.0
    P5: System.UnauthorizedAccessException
    P6: System.Runtime.InteropServices.Marshal.ThrowExceptionForHRInternal
    P7: 33431035
    P8: 
    P9: 
    P10: 
    Log Name:      Application
    Source:        MSDPM
    Date:          7/20/2014 4:00:01 AM
    Event ID:      976
    Task Category: None
    Level:         Error
    Keywords:      Classic
    User:          N/A
    Computer:      
    Description:
    The description for Event ID 976 from source MSDPM cannot be found. Either the component that raises this event is not installed on your local computer or the installation is corrupted. You can install or repair the component on the local computer.
    If the event originated on another computer, the display information had to be saved with the event.
    The following information was included with the event: 
    The DPM job failed because it could not contact the DPM engine.
    Problem Details:
    <JobTriggerFailed><__System><ID>9</ID><Seq>0</Seq><TimeCreated>7/20/2014 8:00:01 AM</TimeCreated><Source>TriggerJob.cs</Source><Line>76</Line><HasError>True</HasError></__System><Tags><JobSchedule /></Tags></JobTriggerFailed>
    the message resource is present but the message is not found in the string/message table
    Please help me resolve this error.
    jacob

    Hi,
    I would try reinstalling DPM:
    Back up the DPM DB
    Uninstall DPM
    Install the same DPM version as before
    Restore the DPM DB
    Run dpmsync.exe -sync
    Finished
    Seidl Michael | http://www.techguy.at |
    twitter.com/techguyat | facebook.com/techguyat

  • What SQL Server Agent jobs are currently running and for how long

    Is there a way to tell what SQL Server Agent jobs are currently running and for how long?  View history isn't working in my case because it's only showing completed jobs.  John Schroeder

     You can use the "Job Activity Monitor" for doing this.  Do the following steps. It will open up the Job Activity Monitor.
    1.    In Object Explorer, connect to an instance of the SQL Server Database Engine, and then expand that instance.
    2.    Expand SQL Server Agent.
    3.    Right-click Job Activity Monitor and click View Job Activity.
    4.    In the Job Activity Monitor, you can view details about each job that is defined for this server.
    On the Job Activity Monitor page there is a column called Status, which indicates whether a job is executing or idle. The Last Run column gives the last invocation time of the job.
    Use the following link for further reference.
    http://msdn.microsoft.com/en-us/library/ms187449.aspx
    Thanks,
    Sateesh.
    Mark Post as helpful if it provides any help. Otherwise, leave it as it is.
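
    For a scripted alternative to the Job Activity Monitor, the same information can be read from the msdb catalog. The sketch below is an assumption-laden illustration, not part of the original answer: the T-SQL uses msdb.dbo.sysjobactivity, msdb.dbo.sysjobs and msdb.dbo.syssessions and needs read access to msdb, while the Python names (`RUNNING_AGENT_JOBS_SQL`, `running_for`) and the sample row are hypothetical. The helper turns each start time into an elapsed "HH:MM:SS" string.

```python
from datetime import datetime

# Jobs started in the current SQL Server Agent session that have not stopped yet.
RUNNING_AGENT_JOBS_SQL = """
SELECT j.name, ja.start_execution_date
  FROM msdb.dbo.sysjobactivity AS ja
  JOIN msdb.dbo.sysjobs AS j ON j.job_id = ja.job_id
 WHERE ja.session_id = (SELECT MAX(session_id) FROM msdb.dbo.syssessions)
   AND ja.start_execution_date IS NOT NULL
   AND ja.stop_execution_date IS NULL
"""


def running_for(rows, now):
    """Map each job name to how long it has been running, as 'HH:MM:SS'.

    rows: iterable of (job_name, start_execution_date) tuples.
    now:  the reference time, passed in so the result is reproducible.
    """
    result = {}
    for name, started in rows:
        total = int((now - started).total_seconds())
        hours, rest = divmod(total, 3600)
        minutes, seconds = divmod(rest, 60)
        result[name] = f"{hours:02d}:{minutes:02d}:{seconds:02d}"
    return result


# Hypothetical row, shaped like the result of RUNNING_AGENT_JOBS_SQL.
sample_rows = [("Nightly ETL", datetime(2014, 7, 20, 4, 0, 0))]
print(running_for(sample_rows, now=datetime(2014, 7, 20, 5, 30, 15)))
```

    Running `EXEC msdb.dbo.sp_help_job @execution_status = 1` is another way to list the jobs that are currently executing, though it does not report the elapsed time directly.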

  • SQL MP Job Last Run Status Monitor doesn't alert

    Hi Guys,
    The Last Run Status monitor in the SQL MP, which checks the last run status of agent jobs and generates alerts, is not generating any alerts in SCOM, even though we have overridden the monitor to trigger alerts.
    The jobs are discovered under the SQL Agent Job State view and the state changes to warning as well, but there are no alerts at all.
    Please guide.
    Regards,
    Daya Ram

    Hi!
    Enable "Generates Alert" by override.
    Set "Alert On State" to "The monitor is in a warning health state" by override.
    Optionally change the "Alert severity" so that it matches the health state.
    Set "Auto-Resolve Alert" to true by override.
    Cheers,
    Patrick
    Please remember to click “Mark as Answer” on the post that helped you.
    Patrick Seidl (System Center and Private Cloud)
    Website: http://www.syliance.com
    Blog: http://www.systemcenterrocks.com

  • Scheduled Brio/IR Jobs are not running as expected in EPM 11.1.2.2

    Hi All,
    We are observing that some scheduled jobs are not running as expected: they do not run at the scheduled time.
    More details are given below.
    We have 2 jobs running at 6:00 AM daily based on a single Recurring Time Event, and these jobs are not running on time (e.g. 6:06 AM EST).
    I checked Consolidate Job Status in the Workspace admin options and the v8_RecurtimeEvent table for any miss in the 'Next Run Time', but both showed the scheduled date/time of the run. No clue from the EPM logs.
    An interesting observation is that the same jobs run after moving the schedule time slightly later (e.g. 6:06 AM to 6:07 AM), but after some days the updated job (6:07 AM) stops running again.
    Thanks,

    Put the IR job logs in TRACE:32 mode, which will give more information in the logs. Refer to this document for how to enable TRACE mode: http://docs.oracle.com/cd/E17236_01/epm.1112/epm_install_troubleshooting_1112200.pdf
    Thanks,
    KK

  • T code to check the job run status

    Hi Gurus,
    I am filling the setup tables for inventory, but I didn't run it in the background. I clicked the Execute button, but suddenly I got disconnected from the network and from my server as well.
    Now I have to check the job run status to see whether it is still running or not.
    Please provide your suggestions.
    Thanks & Regards,
    Saketh

    Hi,
    If you ran the job in the background, you can see it in SM37. Use the proper time, date, and user ID to find your job.
    If you ran it in the foreground, you won't see a job. If your job completed, you can see the data in SE11 using the setup table name, or in transaction NPRT using the name of the run.
    If you can't find your job, just delete your setup tables and fill them again.
    No issues.
    Thanks

  • Jobs Scheduled But Never Run or EM jobs remain in 'running' status

    Hi All,
    Please, I need help with this: I have a job scheduled through Oracle Enterprise Manager (OEM), but each time our Windows server is shut down for patch updates the scheduled job fails, and the status column shows that the job is still running. (Note: I do know how to stop/delete this job.) What I would like to know is whether there is anything I need to do so that the scheduled job does not fail to run each time our server is shut down.
    Thanks
    Wale

    What is the output of the user_jobs and dba_jobs_running views when this happens? In particular the last_date, last_sec, failures, and broken columns.

  • Jobs are in "Cancelling" Status

    Hi Team,
    Some of the jobs have been in "Cancelling" status for a long time, more than a month. Could you please help me delete those jobs?
    Is there any way we can do a job status check (like in SM37)?
    Thanks,
    Preethish

    Hi Preetish,
    You can use the System_ForceJobStatusUnknown JobDefinition to set a particular job to Unknown, after that you can delete the Unknown job.
    Regards,
    Anton.

  • Prime Infrastructure 2.1 Can I make a report on the run status of jobs?

    To be more specific: I want a report mailed to me that shows the Last Run Status of a Configuration Archive job.

    Hi Edwin,
    Unfortunately, I don't see this option available in PI at the moment.
    Thanks-
    Afroz

  • Jobs are in ready state

    Hi,
    Kernel Release-701
    OS-Linux
    Patch level-117
    dev_disp
    ERROR => DpWPCheck: W55 (pid 7299) died (severity=0, status=11) [dpxxdisp.c   15884]
    child (pid=7299) killed with signal 11
    ERROR => DpWPCheck: W56 (pid 7523) died (severity=0, status=11) [dpxxdisp.c   15884]
    child (pid=7523) killed with signal 11
    ERROR => DpWPCheck: W57 (pid 23473) died (severity=0, status=11) [dpxxdisp.c   15884]
    child (pid=23473) killed with signal 11
    ERROR => DpWPCheck: W58 (pid 7526) died (severity=0, status=11) [dpxxdisp.c   15884]
    child (pid=7526) killed with signal 11
    ERROR => DpWPCheck: W60 (pid 7528) died (severity=0, status=11) [dpxxdisp.c   15884]
    child (pid=7528) killed with signal 11
    SM21 logs
    08:00:05 BTC  055 000 SAPSYS                  EC  F Failed to create log for job /
    08:00:05 BTC  055 000 SAPSYS                  EA  Y Failed to read status entry for job
    08:00:05 BTC  055 000 SAPSYS                  EB  C > Job
    08:00:05 BTC  055 000 SAPSYS                  F2  0 Calling program reports invalid handle for TemSe object (magic==X'NULL-ptr')
    08:00:05 BTC  057 000 SAPSYS                  EC  F Failed to create log for job /
    08:00:05 BTC  057 000 SAPSYS                  EA  Y Failed to read status entry for job
    08:00:05 BTC  057 000 SAPSYS                  EB  C > Job
    08:00:05 BTC  057 000 SAPSYS                  F2  0 Calling program reports invalid handle for TemSe object (magic==X'NULL-ptr')
    08:00:06 BTC  060 000 SAPSYS                  EC  F Failed to create log for job /
    08:00:06 BTC  060 000 SAPSYS                  EA  Y Failed to read status entry for job
    08:00:06 BTC  060 000 SAPSYS                  EB  C > Job
    08:00:06 BTC  060 000 SAPSYS                  F2  0 Calling program reports invalid handle for TemSe object (magic==X'NULL-ptr')
    08:00:06 BTC  056 000 SAPSYS                  EC  F Failed to create log for job /
    08:00:06 BTC  056 000 SAPSYS                  EA  Y Failed to read status entry for job
    08:00:06 BTC  056 000 SAPSYS                  EB  C > Job
    08:00:06 BTC  056 000 SAPSYS                  F2  0 Calling program reports invalid handle for TemSe object (magic==X'NULL-ptr')
    08:00:06 BTC  058 000 SAPSYS                  EC  F Failed to create log for job /
    08:00:06 BTC  058 000 SAPSYS                  EA  Y Failed to read status entry for job
    Some jobs are going into ready state.
    If I check them, the jobs then execute successfully.

    Hi,
    In my personal experience, there are a lot of errors in kernel patch level 117 of the 701 release. Try patch level 111, which is good.
    Also, run the TemSe consistency check to look for any inconsistencies.
    Thanks
    Sunny
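
    To see at a glance which work processes died in a dev_disp excerpt like the one in the question, the "died" lines can be summarized with a short script. This is a minimal sketch: the regular expression is derived only from the lines quoted above, and the names `DIED` and `died_work_processes` are hypothetical.

```python
import re
from collections import Counter

# Matches lines such as:
# ERROR => DpWPCheck: W55 (pid 7299) died (severity=0, status=11) [dpxxdisp.c   15884]
DIED = re.compile(
    r"DpWPCheck: W(\d+) \(pid (\d+)\) died \(severity=(\d+), status=(\d+)\)"
)


def died_work_processes(log_text):
    """Return (work_process_number, pid, status) for every 'died' entry."""
    return [
        (int(m.group(1)), int(m.group(2)), int(m.group(4)))
        for m in DIED.finditer(log_text)
    ]


# Two of the entries quoted in the question.
log = """ERROR => DpWPCheck: W55 (pid 7299) died (severity=0, status=11) [dpxxdisp.c   15884]
child (pid=7299) killed with signal 11
ERROR => DpWPCheck: W56 (pid 7523) died (severity=0, status=11) [dpxxdisp.c   15884]
child (pid=7523) killed with signal 11
"""

entries = died_work_processes(log)
print(entries)
# Count how often each exit status occurs.
print(Counter(status for _, _, status in entries))
```

    In the excerpt above every entry has status 11, and signal 11 on Linux is SIGSEGV (a segmentation fault). The work processes that died (W55 to W58 and W60) are the same numbers that appear as BTC processes in the SM21 log.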
