Document size performance considerations

We are trying to determine the performance implications of different approaches to document storage with BDB. Most of the XML we need to store will contain between 5,000 and 20,000 nodes, at about 6MB per XML block. Best practices for document size and data breakdown are not well explained on the documentation websites. We are working primarily in a single-threaded environment.
1. Does performance degrade considerably when BDB containers hold documents upwards of 500MB? 2GB? Or should documents generally be small? (Assuming node-type storage.)
2. Does anyone know of or have any best practices for data breakdown and storage within BDB?
And completely unrelated:
3. Are Environments completely portable when fully moved to different systems?
Many thanks.

Hi Taka,
1. Does performance degrade considerably when BDB containers hold documents upwards of 500MB? 2GB? or should documents generally be small? (Assuming Node type storage)
Documents being written need to be parsed (larger docs take longer), inserted into the database and the appropriate indices updated (the more indices, the longer it takes). I think that the best thing to do is to build a prototype of your application, populate a database and benchmark the performance.
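One way to approach the prototype suggested above is a small timing harness. The sketch below is only an illustration: `make_xml`, `benchmark_inserts`, and the `store` callback are hypothetical names, and the in-memory dict is a stand-in for whatever call puts a document into your real container. It times nothing specific to BDB until you plug that call in.

```python
import time
from typing import Callable, Dict, Iterable


def make_xml(node_count: int) -> str:
    """Build a synthetic XML document with node_count <item> elements."""
    items = "".join(f"<item id='{i}'>value {i}</item>" for i in range(node_count))
    return f"<root>{items}</root>"


def benchmark_inserts(store: Callable[[str, str], None],
                      node_counts: Iterable[int],
                      repeats: int = 3) -> Dict[int, float]:
    """Return the best wall-clock time (seconds) to store one document per size."""
    results: Dict[int, float] = {}
    for n in node_counts:
        doc = make_xml(n)
        best = float("inf")
        for r in range(repeats):
            start = time.perf_counter()
            store(f"doc-{n}-{r}", doc)  # hypothetical: name, then content
            best = min(best, time.perf_counter() - start)
        results[n] = best
    return results


if __name__ == "__main__":
    # Stand-in store (assumption): a plain dict. Replace with a function
    # that writes the document into your actual container.
    fake_container: Dict[str, str] = {}
    timings = benchmark_inserts(fake_container.__setitem__, [5000, 10000, 20000])
    for n, seconds in timings.items():
        print(f"{n} nodes: {seconds:.6f} s")
```

Running the same harness with different index configurations and document sizes should show where parsing and index maintenance start to dominate for your data.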
3. Are Environments completely portable when fully moved to different systems?
There are two issues with copying or moving databases: database page log sequence numbers (LSNs), and database file identification strings.
Because database pages contain references to the database environment log records (LSNs), databases cannot be copied or moved from one transactional database environment to another without first clearing the LSNs. Note that this is not a concern for non-transactional database environments and applications, and can be ignored if the database is not being used transactionally. Specifically, databases created and written non-transactionally (for example, as part of a bulk load procedure), can be copied or moved into a transactional database environment without resetting the LSNs. The database's LSNs may be reset in one of three ways: the application can call the DB_ENV->lsn_reset method to reset the LSNs in place, or a system administrator can reset the LSNs in place using the -r option to the db_load utility, or by dumping and reloading the database (using the db_dump and db_load utilities).
Because system file identification information (for example, filenames, device and inode numbers, volume and file IDs, and so on) are not necessarily unique or maintained across system reboots, each Berkeley DB database file contains a unique 20-byte file identification bytestring. When multiple processes or threads open the same database file in Berkeley DB, it is this bytestring that is used to ensure the same underlying pages are updated in the database environment cache, no matter which Berkeley DB handle is used for the operation.
The database file identification string is not a concern when moving databases, and databases may be moved or renamed without resetting the identification string. However, when copying a database, you must ensure there are never two databases with the same file identification bytestring in the same cache at the same time. Copying databases is further complicated because Berkeley DB caches do not discard cached database pages when database handles are closed. Cached pages are only discarded when the database is removed by calling the DB_ENV->remove or DB->remove methods.
Bogdan Coman
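
The in-place reset with the db_load utility mentioned above can be scripted. The sketch below builds the command line for `db_load -r lsn` (and `db_load -r fileid`, which resets the file identification bytestring of a copied database). The function names, the environment path, and the file name are hypothetical, and it assumes `db_load` is on your PATH; check the options against the utility shipped with your Berkeley DB release before relying on it.

```python
import subprocess
from typing import List, Optional


def db_load_reset_command(db_file: str,
                          what: str = "lsn",
                          env_home: Optional[str] = None) -> List[str]:
    """Build the argv for an in-place reset: db_load -r lsn|fileid [-h home] file."""
    if what not in ("lsn", "fileid"):
        raise ValueError("what must be 'lsn' or 'fileid'")
    cmd = ["db_load", "-r", what]
    if env_home is not None:
        cmd += ["-h", env_home]  # database environment home directory
    cmd.append(db_file)
    return cmd


def reset_in_place(db_file: str,
                   what: str = "lsn",
                   env_home: Optional[str] = None) -> None:
    """Run the reset; raises CalledProcessError if db_load exits non-zero."""
    subprocess.run(db_load_reset_command(db_file, what, env_home), check=True)


if __name__ == "__main__":
    # Hypothetical file name; only prints the command, does not run it.
    print(" ".join(db_load_reset_command("container.dbxml", "lsn")))
```

For a copied (rather than moved) database in a transactional environment, you would typically run both resets on the copy before opening it.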

Similar Messages

  • Document size changes when creating pdf from jpeg

    When I create a pdf from a jpeg file, the document size increases considerably. The native jpeg was saved in the "save to web" feature in Photoshop. I cannot find settings in ImageReady or Acrobat to make my document the correct size. Any suggestions?

    Why not save the PDF from Photoshop directly? This gives full control
    over compression settings. It will not, however, downsample so you
    should do that first.
    Aandi Inston

  • XML Embedded in Stored Function - Performance Considerations

    A developer in my company approached us with a question about performance considerations while writing stored procedures or functions that embed XML.
    The primary use for this function would be to provide a quick decision given a set of parameters. The function will use the input parameters along with some simple calculations and DB lookups to come up with an answer. These parameters will be stored in the database. Potentially even more parameters that are currently represented in the xml will be available in the DB and therefore could be looked up by the function.
    My biggest question is if this way of using XML as an input parameter introduces any performance considerations or concerns for storage/bandwidth etc.
    Thank you
    Edited by: user8699561 on May 19, 2010 9:24 AM

    Storage and bandwidth will depend on the size of the XML document, but there are ways to keep both to a minimum (binary XML support in JDBC, for example). As for performance overhead in general: it depends on how you set it up.

  • Reducing pixel dimensions increases document size.

    In photoshop cs6. Working on an image of 15000x18662 pixels, with about 100+ layers.  doc size 800mb/6.43GB.  To optimize speed, I reduced the file size to the absolute minimum I need which is 13200x16423.  However now the file size grew bigger  620.2M/6.45GB.   Some layers extend beyond the visible image area, so I cropped the image "deleting cropped pixels" , but this increased the image even further to 6.50GB.  What's going on here? 
    Why is the file size increasing when I decrease pixel size?
    Why would deleting cropped pixels increase file size?

    This is a user to user Forum.
    If no one who has bothered to read the original post considers it of enough interest to investigate or advise that is up to them.
    Can you provide the original image?
    Depending on the interpolation settings on the content downsampling might result in a lessened effect of the applied compression.
    You seem to give the approximated values from "Document Sizes" in the status bar – what is the actual file’s size?
    Quote from the Reference:
    Document Sizes Displays information on the amount of data in the image. The number on the left represents the printing size of the image
    —approximately the size of the saved, flattened file in Adobe Photoshop format. The number on the right indicates the file’s approximate
    size including layers and channels.

  • Document size and final print size

    I am beginning a new aerial drawing in Illustrator CS4 on a 2009 iMac, to be printed at 40x40". If the document is sized at 40x40, the computer slows down as detail accumulates. Since Illustrator is a vector-based application, can the document size be set smaller than the final print size, say at 20x20" or 10x10", to keep file size down and speed up, without losing print quality?
    Thanks for any help!

    without losing print quality?
    Depends on the specific content of the file and what exactly you mean by 'setting the document size smaller.'
    Better would be to try to discern what is causing the performance issue. Again, that comes down to what the content is.
    JET

  • Page & cache size performance tuneup

    Hi
    I am doing a performance evaluation of BDB. Please help me find answers to the questions below.
    1. Page size: Do I need to choose the page size based on my XML document size? Is there any relation (formula) between page size and XML document size that gives optimum memory usage?
    2. Cache size: Does the cache size need to be equal to or larger than the document size to minimize query response time? Could you please suggest an optimum cache size for a 1MB XML document?
    3. I have started with BDB version 2.3.10, but I read in this forum that there is some performance improvement in BDB version 2.3.10. What version should I use for my evaluation? Is the latest (4.6.21) the best (most stable)?
    4. Are there any other parameters (other than page and cache size) I need to tune to get optimum memory usage and minimal CPU utilization?
    Is there any reference document where I can get more details on BDB performance?
    Thanks,
    Santhosh

    Hi Santhosh,
    It’s hard to give solid suggestions without knowing more about your application, what you are measuring and what your performance requirements are. What language are you implementing in?
    Is query response time most important, or document insertion or updates?
    I am going to request that you respond to this Performance Questionnaire and answer as many questions as you can at this time. Send the questionnaire to me at Ron dot Cohen at Oracle.
    http://forums.oracle.com/forums/ann.jspa?annID=426
    In addition to the information requested, you can see from the questionnaire that the utility
    db_stat -m is useful for looking at a number of things, including the effectiveness of the cache size you have.
    Have you taken any measurements yet? I would suggest going with the default pagesize but using a cachesize larger than the default. I don’t know how much real memory you have but for a first measurement you could try a cachesize of 100MB-500MB (or larger) depending on your workload and how much memory you have available. I am not recommending that as a final cache size, just giving you a number to start with.
    http://tinyurl.com/2mfn6f
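
As a rough illustration of the cache sizing advice above: Berkeley DB's `DB_ENV->set_cachesize` takes the size as a gigabytes part plus a bytes part, and `db_stat -m` reports how often requested pages were found in the cache. The helpers below are a sketch with hypothetical names that only do the arithmetic; they do not call Berkeley DB, and the hit and miss counts in the example are made up.

```python
from typing import Tuple

GIB = 1024 ** 3  # bytes in one gigabyte as counted by set_cachesize


def split_cache_size(total_bytes: int) -> Tuple[int, int]:
    """Split a cache size into the (gbytes, bytes) pair set_cachesize expects."""
    if total_bytes < 0:
        raise ValueError("cache size must be non-negative")
    return divmod(total_bytes, GIB)


def cache_hit_ratio(hits: int, misses: int) -> float:
    """Fraction of page requests served from the cache."""
    total = hits + misses
    return hits / total if total else 0.0


if __name__ == "__main__":
    gbytes, extra = split_cache_size(500 * 1024 ** 2)  # the 500MB starting point
    print(f"gbytes={gbytes}, bytes={extra}")
    # Illustrative numbers only, not real db_stat output.
    print(f"hit ratio: {cache_hit_ratio(9000, 1000):.2%}")
```

If the hit ratio stays low under a representative workload, that is a hint the cache is too small for the working set.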
    You will likely find a lot of improvements in performance can be obtained by your indexing strategy. This may be where you get the best results. You may want to spend some time reviewing that and the documentation on indexes:
    http://tinyurl.com/2522sc
    Also, take a look in the same document at the indexing sections.
    Berkeley DB XML 2.3 (Berkeley DB 4.5.20) should be fine to start (though you may have read on this forum about the speed improvements in Berkeley DB XML 2.4 which is currently in test mode).
    Please do respond to the survey, send it to me and we will try to help you further.
    Ron

  • Document size in quick look

    Hi Community,
    I installed a new Mac running Snow Leopard and now I have a small document size in quick look. I can change the size by drag and drop the lower right corner but a) the next document is in the same small size and b) when I reopen a document in quick look I adjusted the size before, it will be displayed also in the small size again. In other words the quick look doesn't remember the custom size setting.
    Is there anywhere in the system a way to change the settings of quick look?
    Thx & Bye Tom

    Yes, I have the same issue: text is tiny on my large screen when previewing Word or other text documents. The implementation of Quick Look for text could be improved. For images it obviously works great, but for text I would define a "quick look" as a quick scan of the contents, not just the layout, because it's text. When you are editing HTML or looking at CVs a lot, you end up needing to read a little to see what the item actually is, as a lot of layouts are the same for these types of documents.

  • How to change Default document size for CC Library Graphics (InDesign)

    Good morning --
    I have just discovered the amazing uses of the CC Library setup in InDesign and would love to be able to use it as a tool to collaborate template galleries amongst coworkers.
    I work in an industry that offers many different templates for our Marketing pieces available to the sales team. What we have in place right now is about 50+ different templates in their own packaged files. Now, this isn't all that difficult to maneuver around, but the CC Library has made it so I can "drag and drop" templates into place. This drag & drop feature is easy enough for me to grasp, and with the proper training it should be for the rest of the team (should being the keyword). However, in some cases if you double-click on a Library graphic it opens the file in its own editable file with a custom document size of 8.5x11 Portrait (which I imagine to be the internal InDesign preset).
    My questions therefore are: when I double-click on the library "Graphics", can the document size be anything other than 8.5x11? Can each Graphic have its own document preset?
    - If I open a file that is set for landscape, all of the "Graphic" falls off of the spread.
    - Some "eblasts" are much longer than 8.5x11 (sometimes as much as 8.5x20), thus knocking some of the "Graphics" into that weird space area.
    Please let me know if I am not being clear enough.
    Thank you!
    Steve

    @Steve – you are totally clear what you are doing and what you like to do.
    But I fear an automatic setting that is creating the page size according to an asset's size does not exist right now.
    I can change the size of a new document beforehand, if no document is open and create a new size in the document's properties dialog with the keyboard shortcut alt + cmd + p (in my German InDesign on Mac OSX 10.7.5). But you have to know in advance what size that should be.
    Example: If I chose A3 instead of A4 (my default) when no document is open and double click an asset in a CC Library the asset will be placed in a one page document sized A3.
    1. Changing the size (no document is open) in my German UI of InDesign CC-2014.2:
    2. Double click and the result is according to the settings I made:
    Uwe

  • Using Scripts to detect document size in inches?

    Is it possible that a script detects the document size in inches not in pixels?
    Thanks for the help!

    The size at which an image will print is a simple calculation: the number of pixels wide divided by the image's current DPI resolution, and the number of pixels high divided by the image's current DPI resolution. You can always change the DPI while maintaining the same number of pixels by using Image Size with Resample NOT checked, which changes the image's print size. All of this information can be retrieved in a script, and the script can change the image's print size with or without resampling the image.
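
The calculation described above is just a division. The sketch below shows it in Python with a hypothetical helper name; it is not Photoshop scripting code, only the arithmetic a script would perform once it has read the pixel dimensions and resolution.

```python
from typing import Tuple


def print_size_inches(width_px: int, height_px: int, ppi: float) -> Tuple[float, float]:
    """Print size in inches: pixel dimensions divided by the resolution."""
    if ppi <= 0:
        raise ValueError("resolution must be positive")
    return width_px / ppi, height_px / ppi


if __name__ == "__main__":
    # Example values chosen for illustration.
    width_in, height_in = print_size_inches(3000, 2400, 300)
    print(f"{width_in} x {height_in} inches")
```

Changing the resolution without resampling keeps the pixel counts fixed, so only the result of this division changes.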

  • How to find out the Pixel Dimensions and Document Size of placed images?

    Is it possible to find out the placed image details like Pixel Dimensions(Width,Height) Document Size(Width,Height) in illustrator through scripting. Kindly advice me with some examples.

    http://www.adobeforums.com/webx/.3bbaa316.59b84fd8
    I hope the snippet therein will get you started.
    But after all, when you place an image via pi=layer.placedItems.add() it should pop in with its default dimensions, so that you have pi.height and pi.width ready.
    Is this what you are aiming at?

  • Change default document size in PDF

    The default document page setting in my PDF is set as 36"x24". I am not able to print it on A4 even after setting it to fit to A4. Is there any solution to change the default document size to A4?

    CreatePDF, perhaps?  Try the forum at http://forums.adobe.com/community/createpdf

  • In MAC, I want to change document size from 8.5X11 to 18X24 to create a poster to print through Staples. I created the doc originally in WORD, changed the size in WORD, converted to PDF doc. But PDF doc is still in 8.5X11. Read ADOBE support help info. Te

    On a Mac, I want to change the document size from 8.5X11 to 18X24 to create a poster to print through Staples. I created the doc originally in Word, changed the size in Word, and converted it to a PDF doc. But the PDF doc is still 8.5X11. I read the Adobe support help info. It tells me to change the size in the application rather than the printer. BUT Acrobat Pro does not give me a page setup option in FILE. I can only find one in the printer dialog box. Help!

    from the FAQs on Staples website:
    I have a file that I know is a PDF, but the website claims it is not in a PDF format. What should I do?
    Check to see that the file has the .PDF extension. Also, check that the filename does not have any special characters such as an ampersand (“&”).
    Regarding your measurements set to centimeters rather than inches; is it just in MS Word?
    Or does it occur in all other applications.
    Check your Work preferences first:
    If it is happening in all your applications, check your Mac OS System Preferences.

  • Copy & Paste From Illustrator to Photoshop not matching document size

    I just recently reformatted my computer and I've been having an issue with copy and paste from Illustrator to Photoshop. Before, I could copy a 2"x2" vector box from Illustrator into Photoshop, making a new document, and the size and resolution would match. That is, the new document would say 2"x2" at 300 ppi in Photoshop and the preset would read "clipboard". I would then hit OK and paste, and it would match perfectly. Now, since I've reformatted, Photoshop always creates a new document with the wrong size regardless of the settings I enter in Illustrator or Photoshop. I have document raster effects set at 300 ppi in Illustrator and I have the new document set at 300 in Photoshop, but it always transfers the 2"x2" box as some other size no matter what resolution I change it to. Again, this used to work before, so I'm assuming there is some setting I forgot about. Does anyone know? The main issue is that the clipboard preset is not matching the document size; it's almost as if Illustrator is telling Photoshop it's a different resolution when it's not. Copying to 72 ppi seems to have no issue, which makes me think Illustrator just isn't sending it out at 300 ppi, but I don't know why, as I have the document set at 300 ppi.

    Have you found an answer that wasn't posted here yet? I have the same issue.

  • Changing ppi changes pixel dimensions but not document size?

    I have an image that is W=4.375 H=6.124 at 355 ppi the pixel dimensions area under Img > Image Size indicates it is W=1553 px H=2174
    When I change the resolution to 72 ppi it is only changing the pixel dimensions and not the document size. I'm confused.
    1. What is the difference between pixel dimensions and document size?
    2. Is the document the same physical size regardless if I make it 200 ppi, 150 ppi or 72?
    Thanks.

    Let me see if I understand all of this:
    - If I change the resolution the print dimension will stay the same, though what is it actually doing to the image if I increase the resolution, though it still is the same print dimension? I am guessing you can not just add pixels to a given size image and expect it to print better.
    - If I change the dimensions, the resolution will change because there will be either more pixels for a smaller area or fewer pixels for a larger area, depending on how I size it.
    Thanks.

  • Changing Resolution and Document size

    Hello - I have a jpeg from a digital camera that I Saved As photo.psd with an original Document Size of 44" x 66" @ 72 ppi resolution.
    I am using CS3 and I want to prepare the photo for printing while maintaining the highest quality possible. The first thing I did was uncheck Resample Image. Then I changed the resolution to 300. This gave me a Document Size of 10.56 x 15.84.
    I want to enlarge the Document Size to 16" x 24" (the same proportions). So, it seems to me that the next step is to now check Resample Image and put in my new Document Size of 16" x 24". I would then apply some Unsharp Mask as desired.
    So, should I follow the steps above or do you recommend a different workflow?
    I've read that Photoshop will upres very well up to 200% and all I need is 166%, so it should work ok. Also, do you recommend standard Bicubic or Bicubic Smoother?
    TIA

    What you're suggesting will work, though you don't really need to do it in multiple steps.  One step, with resampling checked, once you set all the fields as you want, will do the trick.
    And for all the hype and mystery surrounding the two alternate choices for resampling, in all my work for minor resampling I always find I prefer Photoshop's original Bicubic method.  For extreme upsampling I prefer, personally, a plug-in called Genuine Fractals.
    Also, modern print drivers are getting very good at size matching...  If the image quality in your file is to your liking, you COULD just set the print size in inches (without resampling) and be done with it.  This would yield an image at a resolution lower than 300 pixels/inch, but would save you some work, and it may very well be that you will not be able to see any difference by letting the driver do the resampling as needed.
    -Noel
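
The numbers in this thread can be checked with a little arithmetic. The sketch below uses hypothetical helper names and is plain Python, not Photoshop scripting. It derives the pixel dimensions from the original 44" x 66" at 72 ppi, the print size at 300 ppi without resampling, and the linear enlargement needed for 16" x 24" at 300 ppi. By this arithmetic the enlargement works out to roughly 152 percent, under the 200 percent figure mentioned in the question.

```python
from typing import Tuple


def pixel_dimensions(width_in: float, height_in: float, ppi: float) -> Tuple[int, int]:
    """Pixel dimensions implied by a document size and resolution."""
    return round(width_in * ppi), round(height_in * ppi)


def upsample_factor(current_px: int, target_in: float, target_ppi: float) -> float:
    """Linear scale factor needed to reach target_in inches at target_ppi."""
    return (target_in * target_ppi) / current_px


if __name__ == "__main__":
    width_px, height_px = pixel_dimensions(44, 66, 72)
    print(f"pixels: {width_px} x {height_px}")
    print(f"size at 300 ppi, no resampling: {width_px / 300:.2f} x {height_px / 300:.2f} in")
    factor = upsample_factor(width_px, 16, 300)
    print(f"enlargement for 16 x 24 in at 300 ppi: {factor:.1%}")
```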
