Need help in text search

We have a many folders which contain a lot of pdf documents.  We need to write a script and search for a particular keyword through all the pdf documents and print the path and filename which contains this keyword. 
Thats it.  I am new to Adobe and I do not know whether adobe has any API to do this and looking for some sample programs.
Thanks in Advance
Chari

You can use a batch process (http://livedocs.adobe.com/acrobat_sdk/9.1/Acrobat9_1_HTMLHelp/BatchSeq_BatchSequences.96.1 .html) to scan through a collection of PDFs. You can also seach for words in a PDF using the getPageNthWord JavaScript method: http://livedocs.adobe.com/acrobat_sdk/9.1/Acrobat9_1_HTMLHelp/JS_API_AcroJS.88.486.html
So the batch process would be a JavaScript that loops through the pages of a document, loops through the words on each page, and if a match is found, it could write the file path (http://livedocs.adobe.com/acrobat_sdk/9.1/Acrobat9_1_HTMLHelp/JS_API_AcroJS.88.411.html) to the JavaScript console.

Similar Messages

  • Need help doing text search of Blob

    I presently have a simple single table candidate tracking application that create records of candidate information and you attach their resume in a word format into a blob column named RESUME within the table.
    I need to be able to do a full text boolean search of the attached documents, since we're going to use this to allow us to search an internal database for people with particular skill sets.
    I'm not a DBA or a developer, I just started working with Oracle Application Express and need a simple way of creating this search feature.
    I also need to add the search feature into a region for users to input search terms.
    Any help would be greatly appreciated.
    Edited by: user10608055 on Nov 25, 2008 11:57 AM

    try
    http://tahiti.oracle.com/
    in the list of books, look at 'TEX' for Oracle text.
    Interesting you should bring this up. I'm slighly involved with US Transition team.
    Many resumes are coming in.
    Tim

  • Need help in building search query

    Guys ..
    Problem Description:
    I have a huge table that is indexed using CONTEXT.
    I want to write a search query that considers the following:
    1. number of keywords match
    2. takes care of spelling mistakes, synonyms and acronyms
    3. proximity - the keywords should not be too far of each other.
    e.g. I have this phrase: "Horizontal Stabilizer Trim Brake"
    I was thinking of writing a query like:
    SELECT SCORE(1) SCORE,
    TEXT text
    FROM MY_TABLE
    WHERE CONTAINS(TEXT, '(Horz | Horizontal) ACCUM (Stab | Stabilier) ACCUM Trim ACCUM (Brk | Break)', 1) >= 0
    ORDER BY SCORE DESC
    The results doesnt look satisfactory. I have not used "near" operator as i dont know how to use it.
    Please help me as I am very much new to Oracle Text.
    -G

    Well, I'm not going to write the function for you, but we can at least talk through a general strategy.
    A lot depends on how you help your users on the front end -- for example, if they're searching a technical document, you may want to return results that aren't perfect matches but you do want to make sure the user picks 'mandatory' and 'useful' keywords in a way that lets you figure out which ones are really important. On the other hand, if you're google and have to handle queries like 'horizontal stabilizer trim brake' and 'were Pete and Jenny in the break room' then you run the risk of spending too much time looking for interesting words, almost doing a full-text search on the query trying to derive meaning.
    So I'm going to presume that you have some control over what/how the users generate their searches so that finding keywords isn't the issue.
    The plan will be to parse the query a bit to find the interesting words, clean them up, and weigh their importance, then use transformed data to build the query template to score various combinations.
    So here's some pseudocode for the function:
    function parse_query(pQueryWords in clob) returns clob as
    begin
        generate_token_list (); -- split the query into a set of individual tokens/words
        for each token in token_list
            if it's a mandatory word then accumtokenlist := accumtokenlist || ' ' || token ||'*10' -- weigh the presence of the token strongly
            if it's a useful word then accumtokenlist := accumtokenlist || ' ' || token ||'*5' -- domain-specific words are also important
            if it's a stopword or reserved word, then do not add it to the list
            if it's not on my lists, then accumtokenlist := accumtokenlist || ' ' || token
                                         and normaltokenlist := normaltokenlist ||' ' || token
        end;
        --so now, we have two lists, one for NEAR and one for ACCUM
        now build the guts of the template
            querytemplate := querytemplate || '<seq> || normaltokenlist || '</seq>';
            querytemplate := querytemplate || '<seq> || replace (accumtokenlist, ' ',' ACCUM ') || '</seq>';
            querytemplate := querytemplate || '<seq>$' || replace(normaltokenlist,' ','$') || '</seq>';
            querytemplate := querytemplate || '<seq>? || replace(replace(accumtokenlist,' ',' ?'),' ', ' accum ') || </seq>';  -- first fuzzy the words, then accum
            querytemplate := querytemplate || '<seq>? || replace(replace(normaltokenlist,' ',' ?'),' ', ' near ') || </seq>';  -- first fuzzy the words, then near
        return querytemplate
    end;So, with a 'cooked' query text that is template-friendly, all we need to do is apply a template that is aware of your inputs:
    query_Template_string := '
    <query>
       <textquery lang="ENGLISH" grammar="CONTEXT"> horizontal stabilizer*5 trim brake*10
         <progression> '
    || parse_query('horizontal stabilizer trim brake')  ||
    '     </progression>
       </textquery>
      <score datatype="INTEGER" algorithm="COUNT"/>'
    </query>So that's an example of one approach.

  • HELP! Need help generating TEXT-ONLY portal page...

    Text Only Portal Question:
    PLATFORM:
    =================================================================
    Sun Solaris (5.2 if memory serves) for db and mid-tier, running
    8.1.7 DB and 3.0.9 (1.0.2.2) portal.
    THE NEED:
    =================================================================
    I need to display text only portal pages. Some of the more
    detailed concerns at this point are below. Also, I've had an open
    tar on Metalink for about two weeks, and after research from
    their end has resulted in no help.
    THE ISSUES (so far):
    =================================================================
    IMAGES:
    If an anchor [A HREF=...] tag uses an image as it's "text", I
    need to strip out the ALT= text to show inside the anchor. If no
    ALT text is available, then I would like to show the image name
    as a default.
    For example:
    <img src=home.gif
    alt=Home>
    should display as:
    Home
    FORMS:
    How do I get the resulting page from a form (which include the
    login inputs and submit button, search box, advanced search page,
    etc.) to be displayed by the text only page?
    For example:
    When a form is called, the <FORM> elements are as follows:
    METHOD=GET or POST
    ACTION=url (relative or absolute) to the script.
    In this case, the action value is:
    ACTION=/servlet/page?
    pageid=6&dad=portal30&_schema=PORTAL30.
    This calls the advanced search API.
    I would expect that to redirect the browser back to some
    text-only version, the ACTION= element would have to be changed
    to be something like:
    ACTION=[pathscraper]?/servlet/page?
    pageid=6&dad=portal30&_schema=PORTAL30
    REDIRECTION:
    What happens when portal pages redirect internally? How do you
    get back to the text-only page?
    For example:
    The login link on the standard Oracle Portal home page flips
    from url to url to get to the actual login page. Our
    implementation of Oracle portal goes from
    [DOMAIN]/pls/portal30_sso/portal30_sso.wwsso_app_admin.ls_login
    to [domain]/pls/portal30_sso/portal30_sso.login_page.
    Since this is standard Oracle redirection, how can it be
    intercepted so the portal30_sso.login_page can be presented as
    text only?
    TRIED SO FAR:
    =================================================================
    I've written a socket/text scraper in Perl, running it from a web
    server. The problems mentioned above are really causing problems,
    plus the whole cookie thing. Since Oracle Portal tries to push a
    cookie to the client, when the client is another UNIX server,
    the cookie thing doesn't work.
    POSSIBLE OTHER SOLUTIONS:
    =================================================================
    Something...anything. I've tried to think of some method to
    create some sort of PL/SQL procedure to catch the content then
    strip out the HTML calls.
    An Applet to do the same thing, but on the client side, but
    since time is an issue, coding a complete Java applet isn't
    really an option.
    THE CONCLUSION:
    =================================================================
    HELP! I need some help. This is for a client that is government
    funded, and to meet Section 508 (part of the Americans with
    Disabilities Act that states web sites and applications must be
    made accessible. A text-only page is one of the requirements for
    an accessible page.
    Thanks,
    Ryan Stefani
    ps: feel free to contact me via [email protected] or
    [email protected]

    Use Find/Change and the GREP tab.
    Search for .+ and set the Find formatting to find the charcteristics you want.
    What will you do with this text once found? You'll need something to "change" to, either new text or Change Formatting options...

  • Need Help regarding text Output

    Dear gurus.
    I need help regarding formatting of a text.
    I want to format a employee sub group text.
    im getting a text workers (7) from a table t503t having field ptext.
    i want to show only (7) in the output not the whole text how can i do this ?
    Please help
    regards
    Saad.Nisar

    DATA: BEGIN OF itab_odoe OCCURS 0,
      department_text LIKE t527x-orgtx,"Holds the short text for department
      department_no LIKE pernr-orgeh,
      pernr LIKE pernr-pernr,
      ename LIKE pernr-ename,
      grade like t503t-ptext,   "THIS AREA GET ME TEXT OF EMPLOYEE SUBGROUP"
    *  department_text LIKE t527x-orgtx,"Holds the short text for department
      current_year LIKE sy-datum,
      wt0001 LIKE q0008-betrg,"Basic Pay
      wt1101 LIKE q0008-betrg," COLA
      wt3002 LIKE p0015-betrg,"Overtime
      per_basic type p DECIMALS 2,"Overtime percentage on basic
      per_basic_sum type p decimals 2,"Overtime Sum Division
      overtime_sum LIKE p0015-betrg,"holds sum of overtime
      basic_sum like q0008-betrg,"holds sum of basic
    END OF itab_odoe.
    Im using the select statement to get the employee subgroup from the table
    select single ptext
        from t503t
        into itab_odoe-grade
        where persk eq pernr-persk
        AND SPRSL eq 'EN'.
    now in itab_odoe-grade the values comes is Workers (7) , Snr Mgt (M3)
    i want to show only the text in Brackets.

  • Need help in text field with 2D array

    text field with 2D array
    Hi
    I need help to represent (i) in from field and (j) in to field
    I and j are 2D an array indices.
    This code are not complated
    import java.applet.*;
    import java.awt.*;
    import java.awt.event.*;
    //declaring class
    public class test3 extends Applet implements ActionListener
    { //declaring the TextField
    private TextField fromField ,toField;
    //declaring an array
    int weight[][];
    int m = 99; // m is infinity
    int N; // Set of Nodes
    int d; // distance
    int i; // source Node
    int j; // destition Node
    //declaring values of text field
    private int from = i; // start Node
    private int to = j; // end node
    public void init()
    setBackground(Color.white);
    setForeground(Color.red);
    //giving labels
    Label TITLE2,TITLE1;
    TITLE1 = new Label("from:");
    add(TITLE1);
    fromField = new TextField(5);
    add(fromField);
    // register listener using void add actionListener
    fromField.addActionListener(this);
    TITLE2 = new Label("to");
    add(TITLE2);
    toField = new TextField(5);
    add(toField);
    // register listener using void add actionListener
    toField.addActionListener(this);
    // event handler methods
    public void actionPerformed(ActionEvent event) {
    //declaring textfield
    from=Integer.parseInt(fromField.getText());
    to=Integer.parseInt(toField.getText());
    weight =new int[7][7];
    weight[1][1] = 0; weight[2][1]= 2;
    weight[1][2]= 2; weight[2][2]= 0;
    weight[1][3]= 5; weight[2][3]= 3;
    weight[1][4]= 1; weight[2][4]= 2;
    weight[1][5]= 99; weight[2][5]= 99;
    weight[1][6]= 99; weight[2][6]= 99;
    weight[3][1]= 5;
    weight[3][2]= 3;
    weight[3][3]= 0;
    weight[3][4]= 3;
    weight[3][5]= 1;
    weight[3][6]= 5;
    for (int i=1; i<7; ++i) {
    for (int j=1; j<7; ++j)

    all your base are belong to us

  • Clarifications needed for full text search

    Hi,
    I need some clarification regarding full text search.
    1) Is japanese part of the standard Oracle full text search?
    2) if it is not, how to install the japanese lexer?
    3) how oracle is sorting international characters. If a column contains both english, japanese and french, how will be the output?
    Thanks
    Muneer

    Follwoing is the sql statement and the result i got
    select language, description,lengthb(description) bytes, length(description) length, vsize(description) vsize from t2;
    LANGUAGE      DESCRIPTION      BYTES      LENGTH      VSIZE
    English      abcdefghij      10      10      10
    English      zyxwvutsrq      10      10      10
    French      désignéess      16      12      16
    French      réconcilia      13      11      13
    German      Einfuhrzöl      13      11      13
    German      müßtämpfer      19      13      19
    Greek      Î´Î·Î¼Î¿ÏƒÎ¹Î¿Î³ÏÎ±      40      20      40
    Greek      Î±Ï€Î¿ÎºÎ»ÎµÎ¹ÏƒÏ„ι      42      20      42
    Russian      Ð¿Ñ€ÐµÑÑ‚упник      42      20      42
    Russian      Ð¿Ñ€Ð¸Ð²ÐµÐ´ÐµÐ½Ð¸Ñ      41      20      41
    Japanese      å…¥é–“市の会社の銀行口      65      30      65
    Japanese      ç¥žæˆ¸å¸‚ä¸å¤®åŒºã®æ±éŠåœ’      62      30      62
    Korean      ì¶œë°œì ë¶€í„°ì¶œë°œì ë¶€í„°      64      30      64
    Korean      ë³´ì¢Œê´€ì„출발점부터경      64      30      64
    Hindi      à¤•à¤¤à¤œà¤¨à¤¤à¤®à¤¨à¤œà¤¤à¤¶à¥à¤°      73      36      73
    Hindi      à¤¨à¥€à¤šà¥‡à¤¨à¥€à¤šà¥‡à¤¨à¥€à¤šà¥‡à¤¨à¥€à¤šà¥‡à¤¨à¥€à¤šà¥‡      130      60      130 I think it explains a lot. I am facing another problem in searching blob columns when it contains japanese or korean characters. I tried with multi lexer (adding japanese as sub lexer and making english as default lexer). But it is not searching the column. Do i have to set any other parameters (editing registry, changing enviornment setting etc). I used the following script to set the lexer.
    begin
         ctx_ddl.create_preference('english_lexer','basic_lexer');
         ctx_ddl.set_attribute('english_lexer','index_themes','yes');
         ctx_ddl.set_attribute('english_lexer','theme_language','english');
         ctx_ddl.create_preference('german_lexer','basic_lexer');
         ctx_ddl.set_attribute('german_lexer','composite','german');
         ctx_ddl.set_attribute('german_lexer','mixed_case','yes');
         ctx_ddl.set_attribute('german_lexer','alternate_spelling','german');
         ctx_ddl.create_preference('japanese_lexer','japanese_vgram_lexer');
         ctx_ddl.create_preference('korean_lexer','KOREAN_MORPH_LEXER');
         ctx_ddl.set_attribute('korean_lexer','COMPOSITE','NGRAM');
         ctx_ddl.create_preference('global_lexer', 'multi_lexer');
         ctx_ddl.add_sub_lexer('global_lexer','default','english_lexer');
         ctx_ddl.add_sub_lexer('global_lexer','german','german_lexer','ger');
         ctx_ddl.add_sub_lexer('global_lexer','japanese','japanese_lexer','jpn');
         ctx_ddl.add_sub_lexer('global_lexer','korean','Korean_lexer');
    end;Hope i presented enough details.

  • Need help with text() processing in XSL

    Hello,
    I have an xml that contains such text in my xml:
    before<a>inside</a>after
    and an xsl that transforms it to HTML (a cut for xsl):
    <xsl:template match="a">
    <xsl:apply-templates/>
    </xsl:template>
    <xsl:template match="text()">
    <xsl:value-of disable-output-escaping="yes" select="."/>
    </xsl:template>
    The result is: inside before after
    but I need: before inside after
    It seems it happens 'cause of this: http://www.w3.org/TR/xslt#conflict
    but I cannot find a way to solve this problem :(
    I had tried to use priority in xsl:template, but it didn't help :(
    Thanks a lot.

    DrClap
    here are xml and xsl.
    That's not a real xml and xsl, but they might describe the idea and problem. I hope I miss nothing.
    P.S. I cannot control xml, that's why I cannot use: <xsl:text> in xml.
    Thank you!
    xml:
    <?xml version="1.0" encoding="UTF-8"?>
    <root>
    <title>Page title</title>
    <page>
    Location: <red>http://host</red>
    </page>
    </root>
    xsl:
    <?xml version='1.0' encoding='ISO-8859-1'?>
    <xsl:stylesheet version="1.0"
    xmlns:xsl="http://www.w3.org/1999/XSL/Transform"
    xmlns:fo="http://www.w3.org/1999/XSL/Format"
    xmlns:fox="http://xml.apache.org/fop/extensions"
    exclude-result-prefixes="fo">
    <xsl:template match="root">
    <html>
    <head>
    <title>
    <xsl:apply-templates select="title"/>
    </title>
    </head>
    <body>
    <xsl:apply-templates select="page"/>
    </body>
    </html>
    </xsl:template>
    <xsl:template match="page">
    <xsl:apply-templates/>
    </xsl:template>
    <xsl:template match="title">
    [Test]: <xsl:apply-templates/>
    </xsl:template>
    <xsl:template match="red">
    <xsl:element name="span"><xsl:attribute name="style">color:red</xsl:attribute><xsl:apply-templates/></xsl:element>
    </xsl:template>
    <xsl:template match="text()">
    <xsl:value-of disable-output-escaping="yes" select="."/>
    </xsl:template>
    </xsl:stylesheet>

  • Need help removing text that is covering the document - Adobe Acrobat Standard X

    I am trying to remove "preview only" that is splashed across our document diagonally.  I can remove this on my computer which has Adobe Acrobat XI standard with content editing --> edit text and images  (a cursor pops up and i can just use the delete button), but not on another computer with Adobe Acrobat X standard.  The only thing we could do was add a red line over the "preview only" and delete that red line. 
    I did try to do my research, but all of the search ideas I was using didn't prove fruitful.
    Any help or ideas you could provider, I would certainly appreciate.
    Thank you in advance.

    Hey courtney evans,
    Please let me know how have you sent the PDF file on other computer.
    Also, are you viewing the file in a browser or downloading it and then opening in Acrobat.
    Is the document scanned?
    Please specify and let me know.
    Regards,
    Anubha

  • Need Help Adding Text To A Template

    I am using this template: http://www.templatemonster.com/flash-templates/21091.html
    I would like to add text as it opens, going across the flag, something like this....
    Sam Young. Not Uncle Sam, but here to serve you!
    Now, obviously, that's not what I want it to say, but it's an example, of how I want it to move across the page, on the flag, before it opens to the first page. I've tried everything I know to do. I'm sure I'm just missing something silly. Can anyone help me?
    Thanks!

    Wow... am I asking the impossible here? lol
    I'm still looking for help with this, if anyone can help me.
    I am searching for template help tutorials, and I don't know if I'm just calling it the wrong thing or what.
    I want the text to scroll ACROSS the screen, on entry to the site, across one of the stripes on the flag. Every tutorial I have found tells you how to add scrolling text in the box, like paragraphs. I just want one line, a Welcome Message, if you will, that goes across the screen. I was able to do this in DHTML previously, but Flash is a whole new ball game for me.
    Ok, I just did a search, and it's a MARQUEE that I want. So, I've searched, and found this: http://www.kirupa.com/forum/archive/index.php/t-3601.html
    So, I'll try that and see if it does what I want it to do.
    I'm including all of this, in the event that someone else is interested.

  • I need help adding text!

    I need some help with PE 12 and adding text...

    this forum is for photoshop elements and photoshop elements doesn't support editing GIF.
    You might want to post to photoshop community Photoshop General Discussion

  • Need Help Printing Text Messages From E71

    I need to print some saved text messages that are on my e71 but cannot ge **bleep** to hook up to my bluetooth printer it just never finds it. Is there another way to print these messages fromt he phone i really need them

    connect to pc and use ovi suite
    If  i have helped at all a click on the white star below would be nice thanks.
    Now using the Lumia 1520

  • Need Help removing text from an image.

    I am using illustrator Cs3 version.  I have an image of a sun with text in front of it, now I only need the image of the sun to then use in photoshop. How do I remove the text and still have a full-color image of the only the sun? Please Help! (Image below)

    Hi,
    You have posted your question in the Adobe Illustrator Draw iOS app forum. To get help for your question please post in the Adobe Illustrator desktop forum: https://forums.adobe.com/community/illustrator.
    Regards,
    Jose

  • Need help moving text

    Using CS4 on Win7 Pro. My client bought a template that has a flash piece and I am a total novice in Flash but I do know that you edit the FLA file and export it as a movie. So I opened the FLA file and I'm able to edit the text (all that I want to do right now) but some of the text I'm replacing is longer than the original so it gets hidden under a replay button and some is shorter and is spaced too far away from the replay button (I also want to change the color of that and I think I can figure that one out). There is plenty of room to the left for the text to move to as the original file does have varying lengths of text and it does adjust for that but I can't figure out how to change it for what I've done (I haven't received the final text yet so this is a practice run)... I changed the text by going to the timeline and selecting text from the icon (Edit Symbols) in the upper right of the screen. But where do I tell it where the text should start or how wide it needs to be? I tried moving them individually in the timeline but that didn't change it.
    Here's the file: http://do-rightweb.com/fertility/flash/header_vJT.fla (I only want to leave this up temporarily because it is huge. The test location of the file in action is here: http://do-rightweb.com/fertility/
    In the future I may have to swap out some of the images leaving the transitions which I think I can do by adding them as a layer deleting the ones I'm replacing...
    So if anyone can lend a hand in helping me figure out how to adjust the text width I would be totally jacked! Any help and advice about swapping images or changing colors would be greatly appreciated too. Or even a link to a video to help me understand how to reverse engineer this would be cool.
    Thanks in advance for your help and assistance!

    Hi,
    Since the text animation is done using the timeline in this file you have to manually edit the positioning of the each symbol element in every keyframe i.e
    You have to go in to the editing mode for txt_c instance of txt_2 > Layer 8 where the required elements are placed
    1. You have to position the Layer 1(under Layer 8) items at every key frame for the display text
    you may have problem while positioning second text onwards as you will not be seeing them on stage. For this you may want to duplicate the symbol(txt_5) at key frame 37 and edit it to have only the second text and remove everything else. And you can swap the existing symbols at key frame 44 and key frame 54. You can repeat this for the rest of the text.
    2. Next you have to position the numbers which spread across three layers (txt_3, txt_4 and Layer 6) under Layer 8. They are for prefix number, suffix number and the dot respectively
    Thanks!

  • Need help entering text into my website

    Hello,
    You'll have to forgive me as I am new to Dreamweaver, HTML
    and such things.
    I have created my website interface, as shown here:
    http://img88.imageshack.us/img88/1452/mywebsitepm9.jpg
    I created this in photoshop, then sliced the image into
    slices and exported it into Dreamweaver.
    I wish to add text to the large white box in the middle of
    the page, what is the best way to do this?
    Thanks for any help

    > I created this in photoshop, then sliced the image into
    slices and
    > exported it
    > into Dreamweaver
    A method that is highly unlikely to produce a worthwhile
    website. Photoshop
    is an image editing program, not a website building program.
    If you are
    trying to become a bona fide web designer I suggest you start
    by learning
    HTML & CSS. This is a good starting point:
    http://www.amazon.com/XHTML-Sixth-Visual-Quickstart-Guide/dp/0321430840/sr=1-1/qid=1165172 849/ref=pd_bbs_sr_1/102-5389401-2687307?ie=UTF8&s=books
    If you are just trying to create a one-time site for your
    personal use you
    are probably better off buying a template and then just
    filling in the
    blanks. Search this NG for sources of quality templates.
    Beware, many
    template sites sell pure junk. Try here:
    http://groups.google.com/group/macromedia.dreamweaver
    (You may want to bookmark that page.)
    Walt
    "shallowdeep" <[email protected]> wrote in
    message
    news:ekuht8$s0n$[email protected]..
    > Hello,
    >
    > You'll have to forgive me as I am new to Dreamweaver,
    HTML and such
    > things.
    >
    > I have created my website interface, as shown here:
    >
    >
    http://img88.imageshack.us/img88/1452/mywebsitepm9.jpg
    >
    > I created this in photoshop, then sliced the image into
    slices and
    > exported it
    > into Dreamweaver.
    >
    > I wish to add text to the large white box in the middle
    of the page, what
    > is
    > the best way to do this?
    >
    > Thanks for any help
    >
    >
    >

Maybe you are looking for