Log sequence error - possible causes?

We're using C++, DBXML 2.1.7, with underlying Berkeley 4.3.28 - core 5 linux (2.6.16.28). We use transacted write's, with no nesting of transactions. We've been running with this version of DBXML for some time and this is the first time we've seen any sort of data corruption with the database.
In this case, the database server was shutdown, and the system restarted - on restart, the database server core dumped. Repeated attempts to restart the database gave the same failure. We enabled error output for the Berkeley DB and we get the following errors:
Finding last valid log LSN: file: 1 offset 8234100
Recovery starting from [1][7965842]
Log sequence error: page LSN 1 1664073; previous LSN 1 5236280
Recovery function for LSN 1 8228918 failed on forward pass
PANIC: Invalid argument
PANIC: fatal region error detected; run recovery (repeated several times)
followed by a segfault in libdb_cxx-4.3.so.
So I have 2 questions, the first (and most important) being - how can the log file get corrupted? Is this an OS/file system problem? Or could we have a problem in our database server? It's relatively simple - there is a single thread for read's/write's, and a separate "checkpoint" thread that periodically calls the txn_checkpoint function. Something was just changed on the system that has to do with mirroring, specifically on the partition that holds our database, but I don't know the details (I can get the info, though).
The second question - why is Berkeley choking on the error path, instead of causing a database panic? Granted, in this situation it would appear that we're hosed either way, but a panic is at least a little more user-friendy than a core dump. Looking at the core file, it appears that we've entered the error handling portion of dbenv_open, and the mp_handle of the environment object is NULL - we fail in the call to __dbenv_refresh because of that. (If you're interested, we rebuilt Berkeley with debug symbols - I can give you a stack trace with details for the segfault.)
Oh, the startup flags for the database server are: DB_CREATE|DB_INIT_LOCK|DB_INIT_LOG|DB_INIT_MPOOL|DB_INIT_TXN|DB_RECOVER|DB_THREAD
Thanks!
Wendy

Thanks Michael -
Here's the stack trace from one of the core files, generated with the debug version of the library:
#0 0xb7d97c3a in __dbenv_refresh (dbenv=0x80b5430, orig_flags=1024,
rep_check=0) at ../dist/../env/env_open.c:722
#1 0xb7d9993d in __dbenv_open (dbenv=0x80b5430,
db_home=0x80b5334 "/pivot3/repository/xml", flags=188513, mode=432)
at ../dist/../env/env_open.c:415
#2 0xb7d1fe78 in DbEnv::open (this=0xbfa94488,
db_home=0x80b5334 "/pivot3/repository/xml", flags=188513, mode=0)
at ../dist/../cxx/cxx_env.cpp:442
#3 0x0804e0e8 in main (argc=134537448, argv=0xbfa94488) at dbserver.cxx:179
As to the "why's": the database files are never moved (in normal operation - I did copy them to a similar system for debug, but the failure is the same on both systems), we always run recovery when we restart the server, we never physically access the environment outside of the server, and there is only one database server on the system, so no "cross contamination" from another server (basically, we're using DBXML as an embedded database to store configuration information - we have it running on multiple, identical systems, and this is the first time in over 2 years that we've seen any sort of database corruption).
So based on what you've said, the only other real possibility is if something happened to the partition holding the log file (all of the database files are on the same partition). Would there be any smoking guns we could look for on the physical system that might indicate what happened? The system that the failure originally occurred on is still in the 'last booted' state, in case there was anything that we could look for. The partition mirroring changes were just made this past week, which is why we thought it might be something at a lower level.
I just ran db_printlog and the entry for 1664073 looks like this:
[1][1664073]__bam_repl: rec: 58 txnid 80000047 prevlsn [0][0]
fileid: 21
pgno: 3
lsn: [1][8228918]
indx: 3
isdeleted: 0
orig: 0xc
repl: 0x11
prefix: 8
suffix: 23
The other record (5236280) does not exist in the log file, based on the output from db_printlog.
If there's anything else from the log print that you need, just let me know.
Thanks!

Similar Messages

  • Sequence stopped on user interface when a sequence error happens on a step. Is possible to change this?

    I'd like to know if when developing an user interface over TS using MFC, if a sequence error occurs, I get an error on the user interface and the sequence stops without giving any other option. Is possible to continue the sequence, and show as failed the sequence that caused the error?
    Dani

    hello Daniel -
    If you are talking about the behavior of the simple operator interface example for MFC, then the reason why the error immediately causes a 'skip to clean-up' is because it is set explicitly in the code for the error case. To fully handle the error (as the sequence editor would), you would want to modify the case where the error is detected and perhaps tap into the TestStand API for Engine.DisplayRunTimeErrorDialog( ). If you are able to view the C code written for the full version of the CVI operator interface, you may want to use that as an example of how to completely handle every error.
    An alternative is always to handle errors yourself. If you are interested in knowing strategies for such customizations I recommend you look into the shipping examp
    les and downloadable programs for sequences leveraging the PostStepRunTimeError callback.
    Regards,
    Elaine R.
    National Instruments
    http://www.ni.com/ask

  • Error -32810 occurred at The FieldPoint server initialization failed. Possible causes: 1. Missing or corrupt configuration file; 2. Failed to create callbacks. : FP Open.vi

    I have two cFP-2120 RT controller. They both have NI-Serial RT2.5.6, DataSocket for LV RT4.2, NI-VISA 3.6, FP Drivers 5.1.0, FP VI Manager 3.1.0 LV RT 7.1.1 installed. But one same vi can run on one controller but not another.
    the error message got when it runs on another controller is "Error -32810 occurred at  The FieldPoint server initialization failed. Possible causes: 1. Missing or corrupt configuration file; 2. Failed to create callbacks. : FP Open.vi"
    I wonder by what it is caused?
    Thanks

    Hi Matthew,
    As mentioned previously, this KnowledgeBase article may be of some help. Also, if your code is being run on an embedded controller and then a network controller (not embedded), you will need to run the program differently. On the embedded controller, you should deploy your VI in a Project Explorer or programmatically (as described here.) However, you do not need to deploy the program to a network controller, but simply run the VI.
    I would also suggest creating a new .IAK file. Do you still see the same error? Also, double-check that the software installed on the controller matches the software installed on the host PC. If you have any file paths associated with one FieldPoint controller, ensure that you are changing them to match the second controller.
    Please keep us posted on your troubleshooting steps. Have a great day!
    Amanda Howard
    Americas Services and Support Recruiting Manager
    National Instruments

  • Generic service 'Analysis Services (TEST1)' could not be brought online (with error '1060') during an attempt to open the service. Possible causes include: the service is either not installed or the specified service name is invalid.

    Hi,
    We have a cluster with 2 nodes. Everything works fine in Node1. When I try to failover TEST1 database to Node-2 it fails with this message.
    Generic service 'Analysis Services (TEST1)' could not be brought online (with error '1060') during an attempt to open the service.  Possible causes include: the service is either not installed or the specified service name is invalid.
    Any help is much appreciated.
    Thanks

    Hello,
    The error message is pretty straight forward, it's saying either the service isn't installed or it's not installed as the same service on that node. Did you install analysis services on the second node (from the error it seems like it isn't)?
    Sean Gallardy | Blog |
    Twitter

  • Recently, any attempt to log on to any site with my valid userid and password fails with FireFox but passes with Internet Explorer. What could possibly cause this?

    I used FireFox a long time for email and other scure sites. Out of the blue my logins started rejecting. In frustration, I went back to Internet Explorer and found I could log in using the same UID and PWD.
    I cain't imagine what setting could be causing this to happen. Have you ever heard of such a thing? And is there a solution?

    In Firefox 3.6.4 and later the default connection settings have been changed to "Use the system proxy settings".
    See "Firefox connection settings" in [[Server not found]]
    You can find the connection setting here: Tools > Options > Advanced : Network : Connection
    If you do not need to use a proxy to connect to internet then select No Proxy
    Another possible cause is security software (firewall) that blocks or restricts Firefox without informing you about that,.
    Remove all rules for Firefox from the permissions list in the firewall and let your firewall ask again for permission to get full unrestricted access to internet for Firefox.
    See [[Server not found]] and [[Firewalls]] and http://kb.mozillazine.org/Firewalls

  • It shows the log sequence not found error while all the log seqences exist?

    Why the database shows the log sequence not found error while all the log seqences exist?
    SQL> select*from v$log_history;
    RECID STAMP THREAD# SEQUENCE# FIRST_CHANGE# FIRST_TIM NEXT_CHANGE#
    RESETLOGS_CHANGE# RESETLOGS
    1 657328345 1 1 519368 13-JUN-08 543434
    519368 13-JUN-08
    2 657370849 1 2 543434 13-JUN-08 567705
    519368 13-JUN-08
    3 657373012 1 3 567705 14-JUN-08 570408
    519368 13-JUN-08
    RECID STAMP THREAD# SEQUENCE# FIRST_CHANGE# FIRST_TIM NEXT_CHANGE#
    RESETLOGS_CHANGE# RESETLOGS
    4 657373039 1 4 570408 14-JUN-08 570464
    519368 13-JUN-08
    5 657374623 1 5 570464 14-JUN-08 573505
    519368 13-JUN-08
    6 657375015 1 6 573505 14-JUN-08 573856
    519368 13-JUN-08
    RECID STAMP THREAD# SEQUENCE# FIRST_CHANGE# FIRST_TIM NEXT_CHANGE#
    RESETLOGS_CHANGE# RESETLOGS
    7 657376537 1 1 570465 14-JUN-08 573543
    570465 14-JUN-08
    8 657377084 1 2 573543 14-JUN-08 575109
    570465 14-JUN-08
    9 657377094 1 3 575109 14-JUN-08 575114
    570465 14-JUN-08
    RECID STAMP THREAD# SEQUENCE# FIRST_CHANGE# FIRST_TIM NEXT_CHANGE#
    RESETLOGS_CHANGE# RESETLOGS
    10 657377103 1 4 575114 14-JUN-08 575121
    570465 14-JUN-08
    11 657377106 1 5 575121 14-JUN-08 575123
    570465 14-JUN-08
    11 rows selected.
    RMAN> flashback database to sequence=10 thread=1;
    Starting flashback at 14-JUN-08
    using target database control file instead of recovery catalog
    allocated channel: ORA_DISK_1
    channel ORA_DISK_1: sid=155 devtype=DISK
    RMAN-00571: ===========================================================
    RMAN-00569: =============== ERROR MESSAGE STACK FOLLOWS ===============
    RMAN-00571: ===========================================================
    RMAN-03002: failure of flashback command at 06/14/2008 13:03:15
    RMAN-20206: log sequence not found in the repository

    Apart from the output being hardly readable (please refer to the FAQ in the right upper hand corner how to post output), I don't see any sequence 10.
    You seem to mistake the column recid for the sequence column.
    The sequences are listed in -not surprisingly- the sequence# column.
    Sybrand Bakker
    Senior Oracle DBA

  • CFG0008: Error occured while processing. Possible Cause: URN_NOT_FOUND : urn "JRM" : Not

    Dears , Kindly advise
    I have a server running Ciscoworks LMS 4.2.3 version. Server has been working without any issues.
    Now when I try to go into Netconfig Jobs, I get the following popup:
    CFG0008: Error occured while processing.
    Possible Cause: URN_NOT_FOUND : urn "JRM" : Not found !!
    I've been looking at various threads similar to my situation and tried several, to no avail.

    It may be that the related services are down. Share the output of NMSROOT/bin/pdshow -brief from the server to share service status.
    Many times, it happens when the port required by service is being used by other. Try the following procedure and see it works :
    1.Stop daemon manager: net stop crmdmgtd
    2.Go the location: CSCOpx\MDC\tomcat\webapps\rme\WEB-INF\lib take a copy of the following
    file and them delete them: ctmregistery and ctmregistery.backup
    3.Restart the daemon manager: net start crmdmgtd (wait for 10 minutes)
    Check if everything works fine.
    -Thanks
    Vinod

  • The database alert log has errors. Please help me

    Hi all
    I have an Oracle 8i Database that seams to be functioning well but when I
    Checked the Database alert Log I realized that something is not fine
    Part of the alert log content is here under. What can be problem and how can I
    Resolve it. Please I need you assistance...
    Thu Feb 07 08:50:37 2008
    Thread 1 advanced to log sequence 266
    Thu Feb 07 08:50:37 2008
    Current log# 11 seq# 266 mem# 0: E:\ORACLE\ORADATA\MUKREC\LOGS\REDO11.LOG
    Thu Feb 07 08:50:38 2008
    ARC0: Beginning to archive log# 10 seq# 265
    Thu Feb 07 08:51:07 2008
    ARC0: Completed archiving log# 10 seq# 265
    Thu Feb 07 09:23:22 2008
    Errors in file E:\Oracle\admin\MUKREC\udump\ORA03976.TRC:
    Thu Feb 07 09:23:23 2008
    Errors in file E:\Oracle\admin\MUKREC\udump\ORA04092.TRC:
    ................................................................................................................

    I'll just throw a shameless copy-paste from Metalink note : 164839.1 , hope you can use it to get further ? :
    Subject: FATAL ERROR IN TWO-TASK SERVER: error = 12571 Found in Alert File
    Doc ID: Note:164839.1 Type: PROBLEM
    Last Revision Date: 17-DEC-2007 Status: PUBLISHED
    fact: Oracle Server - Enterprise Edition
    symptom: Errors appear in alert file
    symptom: FATAL ERROR IN TWO-TASK SERVER:
    symptom: ERROR = 12571
    symptom: trace file generated
    symptom: Database operations continue successfully
    cause: The most common cause for the above error is an ungraceful
    disconnection of a session from the oracle db while the db is currently running
    a dml statement issued by that session. The error is recorded when oracle
    attempts to reply back to the session with the results of the dml and cannot
    access the session. Overall database operations are usually not affected.
    An ungraceful disconnection could cause by but is not limited to any of the
    following:
    - the client machine crashed
    - the network connection crashed
    - the user exited the session improperly (not issuing the 'exit' command)
    - the user application allows the user to exit the application without properly
    terminating the session.
    The above can cause problems with corrupted rollback segments if occurring on a
    regular basis and is not addressed. This would require db recovery and
    possibly a db rebuild (not a light matter)
    PMON will usually rollback most transactions in the rollback segments for a
    session if it finds that the session has been ungracefully disconnected, but
    there is always a chance that it cannot and this will lead to rollback segment
    corruption.
    fix:
    The dml and the user that issued the dml can be determined from the trace file.
    The current dml is in the tracefile header section. The user can be found in
    the process state dump of the trace. The process state shows the machine,
    o/suser, and user for the session.
    The DBA can use this information to determine what the user was doing at the
    time and if there was an ungraceful exit from the session the user was
    utilizing.
    The DBA should then address the cause of the ungraceful exit to reduce the
    possibility of recurrence.

  • Planning Sequence error - You do not have sufficient authorization

    Hello Experts,
    I am encountering the following error during execution of my planning sequence:
    You do not have sufficient authorization
    Message no. EYE007
    Diagnosis
    You do not have sufficient authorization for the requested data records.
    Procedure
    Either select other data or get the required authorizations from your administrator.
    Can anyone please suggest me what could be the possible cause of this error and how can I overcome it?
    Thanks in advance,
    Nitish.
    Edited by: Nitu_0105 on Jan 12, 2012 11:54 AM

    Check the planning level on which planning sequence is based  for values restriction.Looks like values for which it is been restricted you are not authorized or Roles assigned to you are not restricted with values in planning level.
    Also check if level is been restricted with any variable on which authorization is been restricted.Some are user specific example specific business area are assigned to only some users ...so you need to add your user ID to those variable.

  • DRM Error 3321 Causes & Resolution

    [ Problem Background ]
    Adobe Access DRM needs to download a device-specific key before protected content can be consumed. This step is called individualization. If user is getting error code 3321 it means the individualization step failed.
    [ Root Cause ]
    The client needs to make network connection to http://individualization.adobe.com in order to individualize. The failure most likely was caused by a temporary network connection failure. To verify: use the browser and connect to http://individualization.adobe.com/flashaccess/i15n/v5 . View the page source. If everything is fine you will see something like:
    <html>
    <head>
    </head>
    <body>
    <!-- Version xxxxxxxxxx-->
    </body>
    </html>
    [ Resolution ]
    Make sure the network connection is fine.  Pay special attention to any proxies and firewalls which may be in place. 
    Also, you may add additional logic to your video application to determine if it is able to query the URL endpoing for individualization in the event handler for 3321.  If the connection isn't possible, display a dialog to the end user to check for network connectivity issues (to http://individualization.adobe.com) - perhaps they are behind a firewall or in a network that restricts access to the individualization server.
    [ Problem Not Resolved? ]
    This article details what Adobe believes will resolve 3321 errors.  If this hasn't resolved your issue, we would love to hear more from you!  Please reply below with the following information:
    - If you don't see the <html> code that is shown above, what are you seeing?  Please cut & paste.
    - Date & time that you noticed this outage
    - OS & OS version of the affected computer
    - Browser & browser version of the affected computer

    I have recently begun having this problem with:
    Firefox (32.0.3)
    flash (15.0.0.152)
    windows 7 pro 64 bit (Version    6.1.7601 Service Pack 1 Build 7601)
    (works fine in Chrome (37.0.2062.124) btw)
    I have tried uninstalling/reinstalling both flash and firefox a number of times.
    I've deleted my firefox profile, and recreated.
    I've deleted %appdata%\adobe\flash player\
    I've deleted %appdata%\macromedia\flash player\
    I deauthorized the computer
    Here is my log from the test defined here: Re: DRM Error 3321 Causes & Resolution
    http://drmtest2.adobe.com:8080/AccessPlayer/player.html
    12:07:41.363 : DRMError event received
    Error Code = 3313 [WriteMicrosafeFailed]
    Sub Error Code = 1000055
    Error Details =
    drmUpdateNeeded = false
    systemUpdateNeeded = false
    12:07:39.504 : Acquiring License
    12:07:35.467 : Metadata received
    12:07:35.418 : loading metadata from metadata file...
    12:07:31.758 : Player mode changed to  TVP
    12:07:28.186 : DRM Error: 3313 [WriteMicrosafeFailed]
    12:07:24.277 : Load http://drmtest2.adobe.com:8080/Content/anonymous.f4v
    12:07:07.056 : This swf version is 20
    12:07:07.056 : Player Initialized!
    12:07:07.049 : Player mode changed to  SVP
    12:07:06.938 : Can't access local storage: Error #2134
    When I change the amount of "local storage" that the flash player can have, it won't "stick".  I go back and it is still at 0.
    I've added drmtest2.adobe.com to my global list of allowed sites, and that doesn't help.
    A screen shot of the permissions for %appdata%\adobe\flash player\
    A screen shot of the permissions for %appdata%\macromedia\flash player\
    The global storage settings:
    Tried adding drmtest2.adobe.com specifically (still didn't help)
    I change this to something > 0, click close, re-open it and it is still 0.
    I use AVG AnitVirus Free Edition 2014 (2014.0.4765), I've tried disabling it, to no avail.
    This is the error I get:
    DRM AUTHORIZATION ERROR
    Write to the file system failed.
    (Error Code: 3313. Suberror Code: 1000055)

  • Thread 1 cannot allocate new log, sequence in Alert log

    Could someone help, I am getting the below message in my alert log
    System: AIX6.1 - Oracle 11r2
    GROUP#     THREAD#     MEMBER                                  ARCHIVED     STATUS             MB
    1     1     +DATA01/mydb/redolog_group1_member1     NO         INACTIVE             1536
    1     1     +DATA01/mydb/redolog_group1_member2     NO         INACTIVE             1536
    2     1     +DATA01/mydb/redolog_group2_member2     NO         CURRENT          1536
    2     1     +DATA01/mydb/redolog_group2_member1     NO         CURRENT          1536
    3     1     +DATA01/mydb/redolog_group3_member2     NO         INACTIVE             1536
    3     1     +DATA01/mydb/redolog_group3_member1     NO         INACTIVE             1536
    Tue Jul 10 18:37:48 2012
    Thread 1 advanced to log sequence 28831 (LGWR switch)
      Current log# 1 seq# 28831 mem# 0: +DATA01/mydb/redolog_group1_member1
      Current log# 1 seq# 28831 mem# 1: +DATA01/mydb/redolog_group1_member2
    Tue Jul 10 19:12:01 2012
    Thread 1 advanced to log sequence 28832 (LGWR switch)
      Current log# 2 seq# 28832 mem# 0: +DATA01/mydb/redolog_group2_member1
      Current log# 2 seq# 28832 mem# 1: +DATA01/mydb/redolog_group2_member2
    Tue Jul 10 19:39:00 2012
    Thread 1 cannot allocate new log, sequence 28833
    Private strand flush not complete
    Tue Jul 10 19:39:18 2012
      Current log# 2 seq# 28832 mem# 0: +DATA01/mydb/redolog_group2_member1
      Current log# 2 seq# 28832 mem# 1: +DATA01/mydb/redolog_group2_member2
    Tue Jul 10 19:41:21 2012
    Thread 1 advanced to log sequence 28833 (LGWR switch)
      Current log# 3 seq# 28833 mem# 0: +DATA01/mydb/redolog_group3_member1
      Current log# 3 seq# 28833 mem# 1: +DATA01/mydb/redolog_group3_member2
    Tue Jul 10 20:15:28 2012
    Thread 1 cannot allocate new log, sequence 28834
    Private strand flush not complete
      Current log# 3 seq# 28833 mem# 0: +DATA01/mydb/redolog_group3_member1
      Current log# 3 seq# 28833 mem# 1: +DATA01/mydb/redolog_group3_member2
    Tue Jul 10 20:16:21 2012Could an increase on the redo log file sizes solve this problem? Unfortunately I do not have a test platform, where I could have tested this.

    mseberg wrote:
    It just a warning. I would read these two notes:
    I suspect the OP is more worried about the apparent two minute gap between the "cannot switch" and the "advanced" at     19:39:18 and 19:41:12 respectively. Did the switch really have to wait for that two minutes, or is this a spurious threat caused by an anomaly in the writing of the alert log.
    As a cross check, I would examine v$event_histogram (or an AWR/Statspack for the interval) for time spent in the event "log file switch (private strand flush incomplete)". In prinicple I wouldn't expect to see waits longer than a few (less than 10) milliseconds; if the histogram shows long waits consistent with the alert log reports than I'd contact Oracle because something odd is happening.
    (Footnote - it's possible that the event times out after a limited interval, log file sync used to time out after 1 second and has recently changed to something shorter and adjustable, so 2 seconds may (for example) appear as 20 x 0.1 seconds.)
    Regards
    Jonathan Lewis

  • Log sequence

    In production database 8.1.6
    I got a message at a particular time
    cannot allocate log sequence seq#
    Thread 1 advanced log sequence seq#
    After this, I increased the log buffer value
    and number of log files also.
    But I am getting the same error. Is there
    any suggestions please?

    Check the status of the log files by using following query
    select * from v$log;
    On log file should have status='CURRENT' and should have the highest sequence number;
    Other's should have status = 'INACTIVE' and ARC=Y.
    Its possible that the redo logs are waiting to be archived because of space problem.
    <BLOCKQUOTE><font size="1" face="Verdana, Arial">quote:</font><HR>Originally posted by maran:
    In production database 8.1.6
    I got a message at a particular time
    cannot allocate log sequence seq#
    Thread 1 advanced log sequence seq#
    After this, I increased the log buffer value
    and number of log files also.
    But I am getting the same error. Is there
    any suggestions please?<HR></BLOCKQUOTE>
    null

  • Cannot allocate new log, sequence Checkpoint not complete

    Hi,
    Im having very frequent log switches and Im getting error as
    " cannot allocate new log, sequence Checkpoint not complete"
    I was having 3 redo log groups with 50 MB each. After I found this error in the alert log; I increased teh number of redo log groups to 6 with 50MB each. Still the issue is not getting resolved.
    Please suggest what will be the best solution for this.
    Following is a snippet from alertlog.
    ==================================================
    Sun Apr 19 09:14:08 2009
    Thread 1 advanced to log sequence 5811
    Current log# 2 seq# 5811 mem# 0: /u01/app/oracle/oradata/mview/redo02.log
    Thread 1 cannot allocate new log, sequence 5812
    Checkpoint not complete
    Current log# 2 seq# 5811 mem# 0: /u01/app/oracle/oradata/mview/redo02.log
    Sun Apr 19 09:14:18 2009
    Thread 1 advanced to log sequence 5812
    Current log# 3 seq# 5812 mem# 0: /u01/app/oracle/oradata/mview/redo03.log
    Thread 1 cannot allocate new log, sequence 5813
    Checkpoint not complete
    Current log# 3 seq# 5812 mem# 0: /u01/app/oracle/oradata/mview/redo03.log
    Thread 1 advanced to log sequence 5813
    Current log# 1 seq# 5813 mem# 0: /u01/app/oracle/oradata/mview/redo01.log
    Thread 1 cannot allocate new log, sequence 5814
    Checkpoint not complete
    Current log# 1 seq# 5813 mem# 0: /u01/app/oracle/oradata/mview/redo01.log
    Sun Apr 19 09:14:32 2009
    Thread 1 advanced to log sequence 5814
    Current log# 2 seq# 5814 mem# 0: /u01/app/oracle/oradata/mview/redo02.log
    Thread 1 cannot allocate new log, sequence 5815
    Checkpoint not complete
    Current log# 2 seq# 5814 mem# 0: /u01/app/oracle/oradata/mview/redo02.log
    Thread 1 advanced to log sequence 5815
    Current log# 3 seq# 5815 mem# 0: /u01/app/oracle/oradata/mview/redo03.log
    Thread 1 cannot allocate new log, sequence 5816
    Checkpoint not complete
    Current log# 3 seq# 5815 mem# 0: /u01/app/oracle/oradata/mview/redo03.log
    Sun Apr 19 09:14:44 2009
    =========================================================
    Regards
    Pratheej

    Anand... wrote:
    Hi Sir,
    Although i too had suggested increasing the redo logfile size, but after going through [http://asktom.oracle.com/pls/asktom/f?p=100:11:0::::P11_QUESTION_ID:69012348056] i was little confused as Tom has mentioned
    Another way is to make the log files smaller, hence increasing the frequency with which we checkpoint ---- Can you explain why so.
    I was a little surprised when I read that posting - but noticed it was dated May 2000 - when databases were still quite small and less busy. (And Tom suggested 25MB as being "modest" rather than "tiny" - which is probably what many people would call 25MB these days). And May 2000 probably means 8.0 or 8.1 - and the whole log buffer, redo generation, checkpointing technology has changed a lot since then.
    Basically, if you hit "checkpoint not complete", you need more online log space so that it's possible during the busiest times to keep generating redo log information while the checkpoint queues are being cleared far enough to allow older log files to be recycled.
    You can do this by adding more log files, or by increasing the sizes of the log files you use. Tom's point, I think, was that if you chose the option to add more files and kept them small (or even made them smaller) then the volume of dirty data blocks that you could create while filling a log file would be small, so the database writer wouldn't have to do much work to make each log file available for re-use. (I'm not sure I'd agree with the approach, though - even for 8i - because it could easily lead to an increase in the volume of datablocks written, even if it did bypass the checkpoint issue).
    Regards
    Jonathan Lewis
    http://jonathanlewis.wordpress.com
    http://www.jlcomp.demon.co.uk
    "For every expert there is an equal and opposite expert."
    Arthur C. Clarke

  • Thread 1 cannot allocate new log, sequence 28703

    Hello,
    I have a problem in my production database. I have an Oracle database 10.2.0.2 64 bits, and Checking the alertlog I see this message some days ago:
    Thread 1 cannot allocate new log, sequence 19943
    Private strand flush not complete
    Current log# 1 seq# 19942 mem# 0: E:\ORADATA\INCARLOP\REDO01A.LOG
    Current log# 1 seq# 19942 mem# 1: F:\ORADATA\INCARLOP\REDO01B.LOG
    My archivelogs size is 90 MB and I have enough free space in my filesystem
    what can be the problem?
    Thanks!!!

    fjfranken wrote:
    The problem is caused by the fact that Oracle wants to start using a redo-log file which is still in use by a running checkpoint.
    Most of the time this problem is solved by adding more redo-log-groups.No, what you are describing would be more the "+Check point not complete+" error, which could be solved by adding more logfile group or increase the current size.
    The message of the OP is similar, but not the same. It would rather means that log switch cannot occur because the redo has not been written, this could be a disk contention to write down the redo. If this is eratic behaviour it could safely be ignore.
    Nicolas.

  • Thread 1 cannot allocate new log, sequence 1558 Checkpoint not complete

    hi,
    i m working on oracle 10g rac database on aix machine . i m getting this error on peck time
    Thread 1 cannot allocate new log, sequence 1558 Checkpoint not complete
    i read lots of documents and they asked to increase size of redo file or add more redo files.
    can u plz describe me y m i getting this error ? & how adding redo file can help in this error.
    thxs

    when yours current redo log filled and then started to switch another log then checkpoint occurs ,this checkpoint started to write dirty buffer from buffer cache to datafile , you cannot reuse this logfile unless checkpoint process writes alls dirty buffer from buffer cache to disk which contained this redo log file.If you attempt to reuse the same log file which cause to checkpoint upon log switch then you will get this error.
    Typically this error comes where yours number of redo log switches occuring too frequently or you have less number of redo logs.
    lets say if you have 2 redolog file A and B,yours A redolog filled and then oracle switch from redo log A to B,checkpoint occurs,DBWRn started to write dirty buffer to disk meanwhile yours redo log B also get filled antoher log switch occurs to be attempt to reuse redo log file A ,but redo log A will not be entertain unless the previous checpoint completed to write alls dirty block from buffer cache to hard disk which is contained thats redo log A.
    Adding redo log will be helpful in this case that redo log will switch to another new added redo log say C and A log file will get more time to be completed checkpoint which he/she contains contents.
    Khurram

Maybe you are looking for