Searching For Keywords embedded in metadata tags of PDF files

We have a large number of Adobe Acrobat PDF files, converted from paper documents a number of years ago, that we would like to store in OCS Content Services. They were scanned as bitmap images, so they contain no OCR-readable text.
We have already embedded keywords in the metadata tag fields associated with each PDF file, but the Ultrasearch engine in OCS does not appear to "see" them. Is there any way to tune Ultrasearch so that it can search these fields?
Thanks!
Mark

it calls for me to print out the line that the keyword is on. Is there an easy way to change what I already have to implement that, or should I start from scratch again?

Well, I haven't taken the time to read all your code to see exactly what you are doing, but it does look a little too complicated to me. I think I would simply read one line at a time from the file and search that line for keywords. This link shows the general structure: http://javaalmanac.com/egs/java.io/ReadLinesFromFile.html
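
For what it's worth, the read-one-line-at-a-time idea could look roughly like the sketch below. It is only an illustration: the class name KeywordLineFinder, the method findLines and the sample text are made up here, and the match is a plain case-sensitive substring test, which may not be what the assignment wants.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.io.UncheckedIOException;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper: names are illustrative, not from the original post.
class KeywordLineFinder {
    // Returns "lineNumber: line" for every line that contains at least one keyword.
    static List<String> findLines(Reader source, List<String> keywords) {
        List<String> hits = new ArrayList<>();
        try (BufferedReader reader = new BufferedReader(source)) {
            String line;
            int lineNumber = 0;
            while ((line = reader.readLine()) != null) {
                lineNumber++;
                for (String keyword : keywords) {
                    if (line.contains(keyword)) { // case-sensitive substring match
                        hits.add(lineNumber + ": " + line);
                        break; // report each line only once
                    }
                }
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return hits;
    }

    public static void main(String[] args) {
        String sample = "alpha beta\ngamma\nbeta delta\n";
        for (String hit : findLines(new StringReader(sample), Arrays.asList("beta"))) {
            System.out.println(hit);
        }
    }
}
```

To run it against a real file, pass a java.io.FileReader for that file instead of the StringReader.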

Similar Messages

  • PDF Portfolio Search for keywords

    Hello all,
    I am trying to find out how you can search a pdf portfolio for keywords that I have added into pdf documents.
    I have added the keywords through the document properties window.
    It seems odd to be able to add keywords if you can't search for them - any help greatly appreciated
    Thanks
    Matt

  • I have ca. 30 PDF documents I need to search for keywords; how can I do this on my Mac?

    I have ca. 30 PDF documents I need to search for keywords. When I open these documents in Adobe Reader on my Mac, it shows a Search tool; however, when I search for keywords I know are in the document, none are found. How can I do a keyword search?

    Do you know if the text has been through OCR? Are the original documents scans?
    An easy way to find out is to try selecting an individual word or letter. If you can only select a whole block of text, the document will need to be put through Optical Character Recognition (OCR) software before you can do a keyword search.

  • Improving Search for Keywords in NI Example Finder

    Search for Keywords is presently limited to logical "OR" search. It would highly profit from logical "AND" Search.

    Thinking out of the box here...
    It's possible to search just the bookmarks with a script. The problem is how to show the result, since it's not possible to highlight the found bookmarks.
    One option I can think of is to change the color of the matches to a different color, like red. Unfortunately, this doesn't work in Reader...
    The only other option that might work in Reader is to tell the user where to look, ie which position in the tree (and what items are before and after it).

  • Tell me Logic for search for duplicate words(or strings) in a large file.

    Search for duplicate words (or strings) in a text file containing one word per line. For each word that occurs more than once in the flat file output should be as follows
    <word> <number of occurrences> <line numbers in the file where the word occurs>
    For example, if the word Hello occurs thrice in a file at lines 100, 178 and 3456 the output should read
    Hello, 3, [100, 178, 3456]

    Incidentally, I wrote similar code some days back. You need to make some modifications to get the exact output you want, but I hope it will be of some help.
    One more thing: it's written using Java 5.

    import java.io.File;
    import java.io.FileReader;
    import java.io.IOException;
    import java.io.StreamTokenizer;
    import java.util.Collection;
    import java.util.LinkedHashSet;

    public class Test {
         private static final String COLLECTIONS_TEXT = "C:\\Documents and Settings\\amrainder\\Desktop\\Collections.txt";

         public static void main(String[] args) throws IOException {
              findDuplicateWords();
         }

         // As posted, this prints each distinct word once; counting is left to the reader.
         private static void findDuplicateWords() throws IOException {
              Collection<String> words = new LinkedHashSet<String>();
              File file = new File(COLLECTIONS_TEXT);
              StreamTokenizer streamTokenizer = new StreamTokenizer(new FileReader(file));
              int token = streamTokenizer.nextToken();
              while (token != StreamTokenizer.TT_EOF) {
                   if (token == StreamTokenizer.TT_WORD) {
                        words.add(streamTokenizer.sval);
                   }
                   token = streamTokenizer.nextToken();
              }
              System.out.println(words);
         }
    }

    Cheers,
    Amrainder
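
    One possible way to get the exact output asked for in the question (word, number of occurrences, line numbers) is to map each word to the list of lines where it was seen. The sketch below is only an illustration under stated assumptions: the class name DuplicateWordReport and the method report are made up, and it takes the lines as an in-memory list, which may not suit a very large file.

    ```java
    import java.util.ArrayList;
    import java.util.Arrays;
    import java.util.LinkedHashMap;
    import java.util.List;
    import java.util.Map;

    // Hypothetical helper: names are illustrative, not from the original post.
    class DuplicateWordReport {
        // Input: one word per line. Output: "<word>, <count>, [<line numbers>]"
        // for every word that occurs more than once, in first-seen order.
        static List<String> report(List<String> lines) {
            Map<String, List<Integer>> positions = new LinkedHashMap<>();
            for (int i = 0; i < lines.size(); i++) {
                String word = lines.get(i).trim();
                if (word.isEmpty()) {
                    continue; // skip blank lines
                }
                // Line numbers are 1-based, as in the question's example.
                positions.computeIfAbsent(word, k -> new ArrayList<>()).add(i + 1);
            }
            List<String> out = new ArrayList<>();
            for (Map.Entry<String, List<Integer>> entry : positions.entrySet()) {
                List<Integer> where = entry.getValue();
                if (where.size() > 1) {
                    out.add(entry.getKey() + ", " + where.size() + ", " + where);
                }
            }
            return out;
        }

        public static void main(String[] args) {
            List<String> sample = Arrays.asList("Hello", "World", "Hello", "Foo", "Hello");
            for (String row : report(sample)) {
                System.out.println(row);
            }
        }
    }
    ```

    For a real file, the list could come from java.nio.file.Files.readAllLines; this version uses lambdas, so it needs Java 8 or later rather than Java 5.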

  • How do I search for and select a repeating phrase in a file.

    How do I search for and select a repeating phrase in a file read via AppleScript? The file contains details of phone calls submitted to a charity, and I want to be able to analyse the types of call taken. The phrase is 'Type of enquiry' and I want to extract the description that follows. I am a beginner at AppleScript and have been working my way through AppleScript 1-2-3 but cannot find an obvious solution, although I have managed to open and read the file! All suggestions gratefully received.

    In AppleScript you can use something like the following:
    set {oldTID, my text item delimiters} to {my text item delimiters, "Type of enquiry"}
    set dataBits to text items of textToParse
    set my text item delimiters to oldTID
    The variable dataBits will then be a list of strings. The first item is whatever precedes the first occurrence of the phrase; every item after it starts with the text that follows an occurrence.
    You could do regular expressions searches as well, but you'd need to download and install the Satimage osax.

  • How can I embed metadata in a pdf file using Acrobat XI Pro?

    Hi, everyone!
    I need some info how I can put things like key words, description of the file and the like permanently in the file to let it 'travel' in the file everywhere, no matter what the OS is. I guess I should use Properties 'path' but it resembles Windows system properties, so I'm not sure. Moreover, there's so little about it in manuals....
    Once I've done with it, can it be read just by Adobe Reader?
    And one important thing more: is there any inner search engine inside Acrobat XI to search the files by key words and other similar metadata criteria or I'm solely dependant on the OS search function?

    Thanks for the link. It sounds highly specialised, but simple experiments suggest it's not really complicated: if I open the Properties window and enter a description, an author or a keyword, it is readable by Adobe Reader and by the system search engine. Mind you, I don't mean right-clicking on the file, where the options are poor and probably those of the OS, but editing the Properties after opening the document in Acrobat XI Pro. I hope this metadata is Adobe's own and maybe I won't need Bridge to read it. I know that in the suggested thread the problem is graphics metadata within the PDF file....

  • How to read metadata from a pdf file

    Hello,
    I have got the XMP SDK for Windows.
    I want to read the metadata from a PDF file, but I cannot find a way to do so.
    I cannot work out which method to use to open the file whose metadata I want to read.
    If someone could show me with a little code example, it would be a great help.
    Thanks

    The XAPDumper sample reads the metadata in a file (PDF or not) if it is valid. If you want to keep the XAPMeta object, don't delete this object in ProcessSubstring().

  • Is there an ADOBE plugin for MS Project 2010 to create Dynamic PDF files?

    I wonder if there exists any Adobe plugin for MS Project 2010 to create dynamic PDF files?

    We are not going to use form fields. An employee is going to create large projects in MS Project 2010 and wants to share his projects with other employees, so all we need is a viewer. I guess we need to check out the third-party MS Project viewers that are out there. Thanks for your answers, guys!

  • Searching for keywords in Adobe Bridge

    Comment on question 3
    When I press Ctrl+F in Bridge and type in a keyword (for example, clouds), nothing comes up except a message that says ‘no items to display.’

    In the last few hours my reading has told me that the problem I've described above is a pesky standing issue with Spotlight and QuickTime and that I must therefore use a work-around instead of waiting like Wilkins Micawber for an improvement in the operating system.
    My work-around is to insert, in the "keywords" section of QT7's metadata window, a baseline en dash (commonly known in the innernets as an "underscore", even if it doesn't underscore anything) between the words that I want the computer to regard as phrases during searches, for instance "good_idea" instead of "good idea". Then, when I'm setting up a phrase-search of the QT keywords, I include the baseline en dashes in the search-term so the computer will bring up only those word-strings whose words are separated from each other by the dashes. It works well, and the results are precise.
    It's a clunky work-around, because I can't globally tweak my existing metadata in the "keywords" section of every one of my QT files, but at least I can casually do such retrofitting from now onwards whenever I have cause to enter a particular QT file's metadata window. And my work-around still allows a keywords-search to interpret any particular word in a tweaked phrase as a solitary searchable word, such as "idea" in "good_idea".

  • Is it possible to search for keywords in an XML file?

    Hey all,
    I'm currently searching for ways to count the number of occurrences of pre-defined keywords in an XML file. An example would be trying to find how many times "war" is mentioned in a news network's RSS feed <title></title> and adding it to a variable.
    I've found a few custom AS3 XML class files out there but nothing I've been able to adapt. I'm at the point of considering just taking the title of these news stories converting them all to strings and searching them that way...
    I'd be grateful for any help
    Thanks!

    An XML element can have multiple attributes such as:
    <title keyword="war" category="international">War Breaks Out in the Middle East!</title>
    In the above example, the XML element <title> has 2 attributes named keyword and category.
    So in my example I'm searching for the attribute named "keyword" within the element <newsItem>.
    The XML document could be formatted something as the following:
    ============
    <someNewsFeed>
         <items>
              <newsItem keyword="war" category="international" url="www.cnn.com/headline2010020111">War Breaks Out in the Middle East!</newsItem>
              <newsItem keyword="healthcare" category="politics" url="www.cnn.com/headline2010020321">Healthcare Bill Still Has Some Hurdles to Jump</newsItem>
              <newsItem keyword="lady gaga" category="entertainment" url="www.cnn.com/headline2010020981">Lady Gaga Has 360 Reach</newsItem>
         </items>
    </someNewsFeed>
    ===========
    So in my example, the loop visits each XML node named <newsItem>, checks whether it has an attribute called "keyword" and, if it does, whether that attribute equals "war". If it DOES, it performs the following actions.
    ===========
    for (var q:int = 0; q < newsFeed.items.newsItem.length(); q++) { // loop over q assumed; only the if was posted
         if (newsFeed.items.newsItem[q].attribute("keyword") == "war") {
              warKeywordCounter = warKeywordCounter + 1; // increase your keyword counter (= assigns; == only compares)
              trace("Current value of warKeywordCounter = " + warKeywordCounter);
         }
    }
    ==========
    I'm assuming you already know how to load external XML documents into flash.
    I hope this helps.
    Message was edited by: dchaparro
    Corrected if/else code to include right parentheses.
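
    For anyone doing the same count outside Flash, here is a rough Java sketch of the attribute check described above, using the standard DOM parser. The class name KeywordAttributeCounter and the method count are made up for illustration, and the sample feed is a cut-down version of the one in this reply.

    ```java
    import java.io.ByteArrayInputStream;
    import java.nio.charset.StandardCharsets;
    import javax.xml.parsers.DocumentBuilderFactory;
    import org.w3c.dom.Document;
    import org.w3c.dom.Element;
    import org.w3c.dom.NodeList;

    // Hypothetical helper: names are illustrative, not from the original post.
    class KeywordAttributeCounter {
        // Counts elements called elementName whose "keyword" attribute equals keyword.
        static int count(String xml, String elementName, String keyword) {
            try {
                Document doc = DocumentBuilderFactory.newInstance()
                        .newDocumentBuilder()
                        .parse(new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8)));
                NodeList items = doc.getElementsByTagName(elementName);
                int matches = 0;
                for (int i = 0; i < items.getLength(); i++) {
                    Element item = (Element) items.item(i);
                    // getAttribute returns "" when the attribute is absent.
                    if (keyword.equals(item.getAttribute("keyword"))) {
                        matches++;
                    }
                }
                return matches;
            } catch (Exception e) {
                throw new IllegalArgumentException("Could not parse XML", e);
            }
        }

        public static void main(String[] args) {
            String feed = "<someNewsFeed><items>"
                    + "<newsItem keyword=\"war\">War Breaks Out</newsItem>"
                    + "<newsItem keyword=\"healthcare\">Healthcare Bill</newsItem>"
                    + "</items></someNewsFeed>";
            System.out.println(count(feed, "newsItem", "war"));
        }
    }
    ```

    Like the ActionScript above, this only looks at the keyword attribute; counting words inside the element text would need a separate string search.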

  • Search for Blank or find metadata that is empty

    Hello super-intelligent people,
    How do I search for all assets that have no keywords?
    Thanks,
    Jacob

    Unfortunately you can't from the interface. (Send feedback to Apple if you want to have your voice heard.)
    You'd have to script a custom solution to extract all the assets and test for empty keywords fields.

  • Can't search for keywords - there are too many of them

    It's probably a known bug/issue, but when you have too many keywords, you can't search for all of them.
    This grey box pops up with all your keywords, which are ordered alphabetically. Unfortunately I'm not able to see my keywords from A to L, because there are too many.
    Here is an image to further illustrate what I mean.
    http://farm4.static.flickr.com/3222/23789607331e284cabe4o.jpg

    Thanks, what a bummer . I was hoping for a fix or a freeware solution :P.
    Is there a way to notify Apple about this? Keywords seem poorly designed, or at least not thought through, compared to other programs.

  • Search for keyword within musical genre

    Trying to help my wife find a children's song on a particular topic for her kindergarten class. How does one conduct a search for a keyword WITHIN a particular genre? For example, how do I search for "Friday" or "Weekend" within the Children's Music genre?
    Thanks...

    I didn't see an obvious way to do it in the iTunes Store, since Power Search does not have a genre choice.
    At the Amazon.com MP3 Download Store, you can go directly to Children's Music and do the search.

  • Searching for Keyword

    Hi,
    Is it possible to search for a keyword in a folder that contains several PDFs? For instance, if I have several PDF documents in a folder, can I search the content of all the PDFs for the keyword and display the result as a list of the PDF documents that contain it?

    I think you would have to write a java application to do something like
    that.
