Is it possible to add or change OCR text in a batch process?

Hello,
This is my first submission. I really hope you can help; if there's a solution to this, it will significantly help our business.
Is it possible to 'batch process' the adding or changing of OCR text in a PDF?
This might sound like a strange process but let me explain what we do:
1. We scan old, handwritten books and registers. Typically these registers contain lists of information. Each page will be scanned to a filename like pge01.jpg, pge02.jpg, pge03.jpg etc.
2. We transcribe the content of each page. Each page will contain multiple records (e.g. 20 records). Typical fields might be:
Unique ID
Surname
Forename
Year
Address
Filename (e.g. pge02.jpg)
3. We provide this content back within database-driven software, so that when a user performs a search on, say, 'Surname=Jones' & 'Year=1945', all of the scanned pages containing handwritten text that matches the search criteria are displayed in a list. The user can then click on a search result and see the scanned page containing that record.
Rather than provide database driven software, we'd simply like to produce a standard PDF file.  Each page within the PDF will show each of the scanned pages (pge01.jpg, pge02.jpg, pge03.jpg etc.).  But where you would normally store the 'ocr recognised text' behind each image, we would like to show our transcribed content.
If this is possible, then I realise that you probably can't do field searches (e.g. 'Surname=Jones' & 'Year=1945'), but at the very least I'd be able to type 'Jones' into the search box and it would find all pages that contained the transcribed word 'Jones'.
If it's possible to add the transcribed data as 'ocrd text' then is it possible to do it in some sort of batch process?  We scan lots of big books and capture millions of records - so doing it manually is not an option.
Any help that anyone can provide will be hugely appreciated.
Thanks,
Paul

I don't think it's possible to manually add OCRd text, but you can add form
fields with the text in them. And yes, it is possible to search the content
of form fields, using a script.
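
As a rough illustration of the batch side of this, the transcribed records could first be grouped by scanned page so that each page has one block of searchable text. The sketch below is plain JavaScript rather than anything Acrobat-specific; the record shape and the `buildPageText` and `findPages` names are assumptions made up for illustration, and how the resulting text is attached to each PDF page (for example in form fields, as suggested above) is left open.

```javascript
// Hypothetical records, using the fields listed in the question.
const records = [
  { id: 1, surname: "Jones", forename: "Mary", year: 1945, address: "1 High St", filename: "pge01.jpg" },
  { id: 2, surname: "Smith", forename: "John", year: 1946, address: "2 Low Rd", filename: "pge01.jpg" },
  { id: 3, surname: "Jones", forename: "Alan", year: 1950, address: "3 Mill Ln", filename: "pge02.jpg" },
];

// Group records by scanned page and join each page's fields into one text block.
function buildPageText(recs) {
  const pages = new Map();
  for (const r of recs) {
    const line = [r.id, r.surname, r.forename, r.year, r.address].join(" ");
    const existing = pages.get(r.filename);
    pages.set(r.filename, existing ? existing + "\n" + line : line);
  }
  return pages;
}

// Case-insensitive plain-text search, like typing a word into a search box.
function findPages(pages, word) {
  const needle = word.toLowerCase();
  const matches = [];
  for (const [filename, text] of pages) {
    if (text.toLowerCase().includes(needle)) matches.push(filename);
  }
  return matches;
}

const pageText = buildPageText(records);
console.log(findPages(pageText, "jones"));
```

Because the grouping is driven entirely by the Filename field, the same loop would cover any number of pages without manual work per page.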

Similar Messages

  • Is it possible to add a gradient to text in adobe edge animate?

    I'm wondering if it's possible to add a gradient to text in Adobe Edge. I come from the Flash world and am new to Edge. I'm trying to eliminate importing a series of text blocks as art. Can anyone add some insight please?
    thanks in advance!

    Check this out: http://css-tricks.com/snippets/css/gradient-text/

  • PI Sheet : Is it possible to Add Phase or Operation Text?

    Hi Experts,
    Is it possible to add phase or operation text along with phase number?
    We are using the browser-based PI sheet, and at the beginning of the frame the PI sheet shows the phase number automatically, like "Phase0101". Is there any way to show the short text of the phase instead of the phase number, or both?
    This will really make my life easy.
    Thanks in anticipation.
    Kind Regards
    RajaRam

    In 013c, when defining the process message category for the process characteristic, also include the standard characteristics PP_PI Operation, PP-PI Operation text, PP-PI Phase and PP-PI Phase text as characteristics.
    After this, when defining the PI category in 012c, these characteristics appear along with the process message characteristic. Define them there as output characteristics. This will display the phase number, phase text, operation number and operation text each time the respective process characteristic is recorded.

  • How to Change Column text of Iview HCM Process Overview

    ECC Ehp4 sps7
    Portal 7.01 sps7
    Please Help,
    I am trying to globally personalize the Copy of the Iview HCM Process Overview . The column texts have titles such as Object Type and Object Key. I want to rename them Person ID etc.
    I was able to "Hide" some columns in the Iview, but when I change the text in the "Settings for Current Configuration" it does not apply or change the display.
    I am in the Portal Content Administrator, with Admin privileges. I am opening the Iview object and then using Preview.
    I then Ctrl + right-click to bring up the Settings for Current Configuration. I am looking at the View UI Element Tree and opening the Grouping of UI Elements. I see the text for the column "Object Key" and change it, but when I save, the Iview text does not change. When I open the "Settings for Current Configuration" again, the changed text is still there but the Iview has not changed.
    Any help on this matter would be appreciated.
    thank you,
    Sarah

    Yes,
    I have opened the Role object, edited the Work Overview Page inside the role, clicked on the Iview HCM Process Overview, opened it, previewed it, and used Ctrl + right-click for Settings for Current Configuration.
    Selected the UI Element "Object Type" TEXT and made my changes
    Sarah

  • Is it possible to add and delete some text in a file?

    Hi everybody!!!
    I would like to add something to a text file and delete it again later on. Is it possible? Does anybody have any idea to help me out?
    ciao...

    Read the text file in sequentially, write out the modified version to a new text file sequentially.
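
A minimal sketch of that read-then-rewrite approach, assuming Node.js and its built-in `fs`, `os` and `path` modules (the thread does not say which language is in use). The `rewriteFile` name, its parameters and the sample text are made up for illustration. For simplicity it reads the whole file at once and then walks the lines in order, rather than streaming.

```javascript
const fs = require("fs");
const os = require("os");
const path = require("path");

// Copy inputPath to outputPath, dropping every line that contains `toDelete`
// and appending `toAdd` as a new final line.
function rewriteFile(inputPath, outputPath, toDelete, toAdd) {
  const lines = fs.readFileSync(inputPath, "utf8").split("\n");
  const kept = lines.filter((line) => !line.includes(toDelete));
  kept.push(toAdd);
  fs.writeFileSync(outputPath, kept.join("\n"), "utf8");
}

// Demo on temporary files so nothing real is touched.
const dir = fs.mkdtempSync(path.join(os.tmpdir(), "rewrite-"));
const input = path.join(dir, "in.txt");
const output = path.join(dir, "out.txt");
fs.writeFileSync(input, "keep me\nremove me\nkeep me too", "utf8");
rewriteFile(input, output, "remove", "added line");
console.log(fs.readFileSync(output, "utf8"));
```

Writing to a new file leaves the original intact until the result has been checked; the new file can then replace the old one.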

  • Is there a script to change a setting incrementally when batch processing in Camera Raw?

    I am editing a batch of about 1000 photos in Camera Raw. There are certain settings like contrast or clarity that will be the same for all of them. But with the Exposure, I would like to have it decrease incrementally from the first picture (set to +1.25) to the last picture (set to -2.00).
    So for certain settings that will be the same on all images I have done a simple "select all" and "synchronize." But I am wondering if there is a script that will have Camera Raw (or any other Adobe program) automatically split the difference between an exposure of +1.25 to -2.00 and apply incremental values to the images?
    Any thoughts?

    Copy and paste the script into ExtendScript Toolkit
    This gets installed with Photoshop and can be found:-
    PC: C:\Program Files\Adobe\Adobe Utilities
    MAC: <hard drive>/Applications/Utilities/Adobe Utilities
    Start Bridge
    PC: Edit - Preferences - Startup Scripts
    Mac: Adobe Bridge menu - Preferences - Startup Scripts
    At the bottom click the "Reveal Button" this will open the folder where the script should be saved.
    Close and restart Bridge.
    Accept the new script.
    To use:-
    Navigate to the folder with your Raw files.
    Tools - Set Average Exposures
    Make sure you do a test first on a temp folder of the raw files!
    #target bridge
    if( BridgeTalk.appName == "bridge" ) {
        // Add the command to the end of the Tools menu.
        var exposureMenu = MenuElement.create("command", "Set Average Exposures", "at the end of Tools","exp");
        exposureMenu.onSelect = function () {
            var d=app.document;
            d.deselectAll();
            var sels = d.getSelection("cr2,crw,nef,mos,orf");
            var noOfFiles =sels.length;
            // Total change of 3.25 (+1.25 down to -2.00), spread evenly over the files.
            var diff = (3.25 / (noOfFiles-1));
            var exp = 1.25;
            var exp2=exp;
            for(var a = 0;a<noOfFiles;a++){
                exp2 = Number((exp - diff*a).toFixed(3));
                // The settings are written to the .xmp file next to each raw file.
                var file = new File(sels[a].spec.toString().replace(/\....$/,'.xmp'));
                setExposure( file, exp2 );
            }
            app.document.chooseMenuItem('PurgeCache');
        };
    }
    // Read the .xmp file, set the Camera Raw exposure, and write it back.
    function setExposure( file, expose ){
         file.open('r');
         file.encoding = "UTF8";
         file.lineFeed = "unix";
         file.open("r", "TEXT", "????");
         var xmpStr = file.read();
         file.close();
         if (ExternalObject.AdobeXMPScript == undefined) ExternalObject.AdobeXMPScript = new ExternalObject("lib:AdobeXMPScript");
         var xmp = new XMPMeta( xmpStr );
         xmp.setProperty( XMPConst.NS_CAMERA_RAW, "Exposure", Number(expose) );
         file.open('w');
         file.encoding = "UTF8";
         file.lineFeed = "unix";
         file.write( xmp.serialize() );
         file.close();
    }
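
The arithmetic behind the script can be checked outside Bridge. The sketch below is plain JavaScript with a hypothetical `exposureSteps` helper (not part of the script above) that returns evenly spaced exposure values between a start and an end value for a given number of files.

```javascript
// Evenly spaced values from `start` to `end` (inclusive) for `count` files,
// rounded to three decimals like the script above.
function exposureSteps(start, end, count) {
  if (count < 1) return [];
  if (count === 1) return [start]; // avoid dividing by zero for a single file
  const step = (end - start) / (count - 1);
  const values = [];
  for (let i = 0; i < count; i++) {
    values.push(Number((start + step * i).toFixed(3)));
  }
  return values;
}

console.log(exposureSteps(1.25, -2.0, 5));
```

With 1000 files the step is 3.25 / 999, so the first file gets +1.25 and the last gets -2.00. The single-file guard matters because dividing by `count - 1` would otherwise divide by zero.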

  • Batch processing with JTA not possible?

    hi ng,
    i'm hoping that someone can tell me if it is actually possible to use JTA transactions in non-app server spawned threads and, if so, how to do it.
    The problem we have is that we need a daemon thread executing updates over a cached cluster (hence the need for JTA), but sun 7 uses a thread local to store the context and transaction info; if we try to perform JTA transactions in our own thread, it bombs out trying to look up the missing data.
    We've tried to re-create what is expected in the thread local and bind it in but this is a large hack involving changing accessibility levels etc and we can't even get it to work anyway.
    The other option we've considered is to do the processing in a servlet but that would make the process widely accessible.
    Any help would be most appreciated.
    Thanks,
    Cam

    Hi Marje,
    It's possible to include both resizing and image overlay in a custom Fireworks command that can be used in batch processing.  To get started, you could check out this tutorial that deals with the first step.
    That article describes how to perform image resize and overlay (in that case, a watermark), and then how to record the steps and turn them into a custom command that can be later used in batch processing.
    Once you have saved the custom command, click File >> Batch Process, and follow the steps below:
    In the first window, select the images you want to process.
    On the next screen, open the Commands dropdown menu and select the custom command you created (it'll probably be on the bottom of the list), and click the Add button to include it in the batch process list.
    Finally, on the next screen select the location of the processed files, and optionally save the batch script for later use.
    Good luck!

  • Batch process to add Javascript to PDF files

    Hi All,
    I have written a small piece of Javascript for my PDF files. The idea is to add a date stamp to each page of the document before printing. To do this, I have added the following code to the "Document Will Print" action:
    for (var pageNumber = 0; pageNumber < this.numPages; pageNumber++) {
        var dateStamp = this.addField("Date","text",pageNumber,[700,10,500,40]);
        dateStamp.textSize=8;
        dateStamp.value = "Date Printed: " + util.printd("dd/mmm/yyyy",new Date());
    }
    My question is this: Does anyone know of a way to batch process a whole directory (of around 600 PDFs) to insert my Javascript into the "Document Will Print" action of each file?
    Many thanks for any information you may have.
    Kind regards,
    Aaron

    > Can I just confirm a few things please? Firstly, should I be going into "Batch Sequences" -> "New Sequence" and selecting "Execute JavaScript" as my sequence type?
    Yes, you are creating new batch sequence that will use JavaScript.
    > My second question is, how can I insert my body of script into the variable "cScript"? I have quotation marks and other symbols that I imagine I will have to escape if I wish to do this?
    You can either use different quotation marks or use the JavaScript escape character '\' to insert quotation marks.
    Your WillPrint code will only work in a full version of Acrobat and not in Reader, because Reader will not allow the addition of fields. Also, each time you print you will be creating duplicate copies of the field. So it might be better to add the form field only in the batch process and then just add the script to populate the date field in the WillPrint action.
    // add form field to each page of the PDF
    for (var pageNumber = 0; pageNumber < this.numPages; pageNumber++) {
        var dateStamp = this.addField("Date","text",pageNumber,[700,10,500,40]);
        dateStamp.textSize=8;
    }
    // at print time, look the field up by name and fill in the date
    this.setAction("WillPrint", "this.getField(\"Date\").value = \"Date Printed: \" + util.printd(\"dd/mmm/yyyy\", new Date());");
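
The two quoting options mentioned above can be compared in plain JavaScript. This is only a sketch of the string handling: the Acrobat calls appear as text inside the strings and are not executed here, and the variable names `escaped` and `singleQuoted` are made up for illustration.

```javascript
// Option 1: keep double quotes outside and escape the inner ones with a backslash.
const escaped = "this.getField(\"Date\").value = \"Date Printed: \" + util.printd(\"dd/mmm/yyyy\", new Date());";

// Option 2: wrap the whole script in single quotes so the inner double quotes need no escaping.
const singleQuoted = 'this.getField("Date").value = "Date Printed: " + util.printd("dd/mmm/yyyy", new Date());';

// Both literals produce exactly the same script text.
console.log(escaped === singleQuoted);
console.log(singleQuoted);
```

Either string could then be passed as the script argument; the single-quoted form is usually easier to read when the script itself contains many double quotes.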

  • When creating a custom SearchPlugin, is it possible to add more code such as uppercase conversion of the SearchText and IF statements that change the URL depending on what is typed?

    When creating a custom SearchPlugin, is it possible to add more code such as uppercase conversion of the searchTerms and IF statements that change the URL depending on the searchTerms? Every time I try to add something firefox doesn't want to add it as a search plugin. I need to create a more powerful search tool for personal use.

    I've found some external software applications that will do it, so that leads me to believe its not possible within ID CC.

  • Is it possible to add a word at the end of a line?

    Hi, I'm looking for a way to automatically add a word at the end of paragraphs with a certain paragraph style.
    Is this possible using a script?
    I don't script myself, but if someone could find me a way to do this it would be really appreciated.
    Here's an example of what I need to do:
    I have paragraph style X in a current book document, and I need to edit the book and add the copyright source at the end of every line that has style X applied to it,
    so I would need something that asks me for
    the targeted paragraph style
    which word I want to append to it, and then applies it throughout the document.
    Thanks in advance

    Yes, the screen shot shows where you got it wrong:
    Find: (.)$
    and set your paragraph style in the find format area
    Change: $1XYZ
    where XYZ is the text you want to insert.
    .. you forgot the '1' right after the '$'. It's an important omission: "$1" is a GREP code, meaning "insert the found text, parenthesized group #1" -- and there is also a "$2", "$3", etc. up to "$9". It's great if you have to switch two, or even more, items around! But without the digit, the dollar sign indicates ... just a dollar sign.
    Change the "Change to" text to
    $1~S(CEMEQ)
    and you'll be fine.
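
The same capture-and-reinsert idea can be tried in ordinary JavaScript, whose regular expressions also use `$1` for the first parenthesized group. This is only an analogy to the Find/Change settings above, not InDesign scripting; the `appendToLines` helper is hypothetical.

```javascript
// Append `suffix` after the last character of every non-empty line.
function appendToLines(text, suffix) {
  // In a replacement string "$$" means a literal "$", so double any "$" in the suffix.
  const safeSuffix = suffix.replace(/\$/g, "$$$$");
  // (.)$ captures the last character of each line; "$1" puts it back before the suffix.
  return text.replace(/(.)$/gm, "$1" + safeSuffix);
}

console.log(appendToLines("first line\nsecond line", " XYZ"));

// Without the digit, the found character is replaced instead of kept.
console.log("abc".replace(/(.)$/gm, "$XYZ"));
```

The second call shows the mistake described above: the last character is lost because nothing in the replacement refers back to the captured group.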

  • Is it possible to add more input values for a payroll element?

    Hi All
    I have created an element with 1 default input value 'pay value' and another input value '1xx1'. I have linked this to payroll and attached it to many employees. Now, due to a business requirement, I need to add one more input value '2yy2' and I also need to change the formula. Even though it's a minor change in the formula, it will not work without a new input value. I tried to add a new one by datetracking to the element creation date, without success.
    Is it possible to add input values after saving an element?
    Regards
    Rahman

    You can Change the Required Input Value to Not-required
    Alter the Sequence of the Input Value to be displayed in the Element Entry Screen
    You can Change the Input Value Validation Rule minimum, Maximum, Lookup and Formula
    Change your specification of which input values create Database Items. Note, however, you cannot remove Database Items if they are used in any formulas or QuickPaint reports.
    You cannot remove existing input values or add new ones if you have created any entries for the element
    For more details you can refer to http://ramesh-oraclehrms.blogspot.com/2007/08/maintaining-element.html

  • Is it possible to add high resolution photos to an ipad2 without synchronising it?

    Hi,
    I'm looking into offering potential clients an iPad with all their wedding photos preloaded, but is it possible to add photos without synchronising each iPad to my Mac? When I first got my iPad 2, I couldn't use it for anything until I had connected and registered it with Apple. I have got the camera connection kit, but haven't found a workaround for adding photos without synchronising yet.
    I haven't been able to find much info on the subject so far? There must be a workaround?
    Cheers,
    MacSlates

    The first thing that currently needs to be done with a new iPad is to activate it by connecting it to a computer that is running iTunes - it can't currently be used 'properly' until after that is done. After iOS 5 is released and is pre-installed on iPads then I think that should change and you will no longer need to first connect to a computer.
    The camera connection kit can be used to add high-resolution photos to the iPad, unlike syncing via iTunes where the photos get 'optimised' by the transfer process. If adding the photos to the SD card yourself then you need to create a DCIM directory off the root of the card with the photos underneath it, and the photo filenames need to be exactly 8 characters long (no spaces) plus the file extension i.e. in a similar format as if a camera had created/written them
    Otherwise there are third-party apps such as Simple Transfer which can copy photos to/from the iPad from your computer via your wifi network. But photos transferred this way will only go into the Saved Photos/Camera Roll album in the Photos app, not into a 'proper' album - and until iOS 5 is released and installed you won't be able to move the photos between albums on the iPad : from http://www.apple.com/ios/ios5/features.html#photos :
    Even organize your photos in albums - right on your device
    Also a problem with syncing photos to the iPad via iTunes is that they then need to be on the other person's computer when they then connect to it - otherwise the photos are likely to be removed from the device. Photos copied via the connection kit or copied via third-party apps shouldn't have that problem.

  • Is it possible to add a firewall Filter or Rule Set to the Extreme Router (802.11n)

    Is it possible to add a firewall Filter or Rule Set to the setting for the Extreme Router (802.11n) like the following:
    "ALLOW TCP/UDP IN/OUT to 208.67.222.222 or 208.67.220.220 on Port 53"  and
    "BLOCK TCP/UDP IN/OUT all IP addresses on Port 53"
    The goal of this is to create a firewall rule to only allow DNS (TCP/UDP) to OpenDNS' servers and restrict all other DNS traffic to any other IPs.
    Or, alternatively is there a way to configure same applied to the Network preferences on IMAC OS X?
    Thanks and much appreciation to anyone who has any clue about this.

    Sorry, I think you've got it backwards.
    The concern is NOT that the child can make changes to our hardware/AEBS, or even our network software on my IMAC - nothing's been changed.
    BUT, he changed the dns settings on his OWN device (ie chromebook) to google public server, accessed the AE using our home wifi network BUT bypassed our dns settings. Capeesh?
    See: http://www.pocketables.com/2013/03/how-to-use-change-the-dns-settings-on-your-chromebook-and-use-googles.html

  • Is it possible to add a custom menu item to built-in apps like addressbook, safari etc in iPhone

    Is it possible to add a custom menu item to built-in apps like addressbook, safari etc in iPhone

    Apple expects the OS to remain constant for all users - helps to maintain a quality experience. Devs are not allowed to make changes to built-in apps.

  • Is it possible to add new columns with format "Text" once a table is linked to a form

    Is it possible to add new columns with format "Text" once a table is linked to a form in Numbers for iPhone, or is it impossible and thus a serious bug? (Rating stars and numeric values seem to work.)
    The bug happens both for new spreadsheets and for existing ones, like the demo. When you are in the form, only the numeric keyboard shows up.
    Pat from the Apple Store Rosenstrasse/Germany confirmed that it looks like a bug during the Numbers workshop I attended: it is not possible to add new columns with format "Text". I reported the error for version 1.4, but there has been no update, nor have I received any acknowledgement of the issue.

    Hi Knochenhort,
    I see what you are talking about now. Without knowing how the program actually works, I think this is what's going on:
    When you add a new column to an already existing table (with already existing formats), the new cells come already formatted like the previous column. So when you add a column to the end of the demo table, the cells are already formatted like stars, and when you add a column to the beginning, they're already formatted like number.
    I think this is why it's different when you add columns to a table with blank (unformatted) columns. In that case, the new cells aren't already "tainted" with a set format, so you can change to text format without issue.
    It seems like the problem is that you can't format cells that are already formatted as "number" as "text" format (even if it doesn't look like they are, because they are blank). IMO, this is a bug! This is why you don't see this issue when adding columns to a new table, because the new cells don't already come with a format.
    To work around this, you can highlight the body cells after adding the new column, and delete the cells. This will "clear" the formatting, so you can then go into the inspector, format them as text, and the correct keyboard will pop up.
    Hope that helps!
