Search Query that crawls words in PDFs

Hello, I'm building a website where people can look at PDFs for a library. It will be exactly like this http://173.12.11.248/
What I'm stuck on is building the search page.  http://173.12.11.248/search.htm
I used the Adobe tutorial on building a search and results page and I have a database recordset, but how do I enter the PDFs into MySQL so that their contents can be searched?
I tried making the PDF column a TEXT field holding the link to the PDF on the site, but that didn't work.
Any ideas?

It's not clear to me what you are asking. It sounds as if you have built a page that searches a database, but what you need is to search file contents. The link you provided is using ASP, so it's going to be difficult to do a PDF file search with that. If you were using PHP you could use the pdfsearch.php class, but that's going to be extremely slow when searching many documents. You're better off using an indexing search engine like Zoom: www.wrensoft.com/zoom
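To illustrate why an indexing engine is faster than scanning every file per query, here is a minimal sketch of an inverted index in Python. It assumes the text has already been extracted from each PDF by some other tool (extraction is not shown); the file names and sample text are hypothetical, and this is not how Zoom itself is implemented.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphanumeric words."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(documents):
    """Map each word to the set of document names that contain it."""
    index = defaultdict(set)
    for name, text in documents.items():
        for word in tokenize(text):
            index[word].add(name)
    return index


def search(index, query):
    """Return the documents that contain every word in the query."""
    words = tokenize(query)
    if not words:
        return set()
    result = set(index.get(words[0], set()))
    for word in words[1:]:
        result &= index.get(word, set())
    return result


# Hypothetical extracted text; in practice this comes from a PDF text extractor.
documents = {
    "annual-report.pdf": "Library annual report and budget",
    "catalog.pdf": "Library catalog of rare books",
}
index = build_index(documents)
```

The index is built once, when documents are added, so each search is a handful of set lookups rather than a pass over every file.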

Similar Messages

  • Search Query that shows articles based on the current logged in users Office location

    Hi,
    I've set up a "news" channel that shows articles targeted to the current user. Basically this is done by querying the property of the custom "Office Location" term set.
    My setup is something like the following in my term store:
    The setup works perfectly now. Countries are not taggable, so at the moment users need to choose City 1, City 2, etc. for their articles, which works fine since articles are sometimes targeted only to cities and not countries.
    Anyway, we now have two users in City 4. It makes no sense that articles should have to be targeted to both City 3 and City 4 in order to show them to all of Country 2. Basically, what I want is that an article targeted to City 3 is also shown to City 4 users.
    How is this best achieved? I have tried making the countries available for tagging as well, but nothing shows up, since the user property is of course still City 3 or City 4.
    Thanks for your time

    Hi,
    You cannot get hold of the parent in your scenario. You want the parent of the tagged term in the user profile, and there is no built-in mechanism in search to get this. So, as I mentioned, you have to use JSOM to get the parent term and then use that in your owstaxIdOfficex0020Location:term query.
    owstaxIdOfficex0020Location:{User.SPS-Location.IDWithChildren}
    would be a valid query, except it won't help you much :)
    Thanks,
    Mikael Svenson - Search Enthusiast
    SharePoint MVP/MCT/MCPD - If you find an answer useful, please up-vote it.
    http://techmikael.blogspot.com/
    Author of Working with FAST Search Server 2010 for SharePoint

  • Need help in building search query

    Guys ..
    Problem Description:
    I have a huge table that is indexed using CONTEXT.
    I want to write a search query that considers the following:
    1. number of keywords match
    2. takes care of spelling mistakes, synonyms and acronyms
    3. proximity - the keywords should not be too far from each other.
    e.g. I have this phrase: "Horizontal Stabilizer Trim Brake"
    I was thinking of writing a query like:
    SELECT SCORE(1) SCORE,
    TEXT text
    FROM MY_TABLE
    WHERE CONTAINS(TEXT, '(Horz | Horizontal) ACCUM (Stab | Stabilizer) ACCUM Trim ACCUM (Brk | Brake)', 1) >= 0
    ORDER BY SCORE DESC
    The results don't look satisfactory. I have not used the NEAR operator because I don't know how to use it.
    Please help me, as I am very new to Oracle Text.
    -G

    Well, I'm not going to write the function for you, but we can at least talk through a general strategy.
    A lot depends on how you help your users on the front end -- for example, if they're searching a technical document, you may want to return results that aren't perfect matches, but you do want to make sure the user picks 'mandatory' and 'useful' keywords in a way that lets you figure out which ones are really important. On the other hand, if you're Google and have to handle queries like 'horizontal stabilizer trim brake' and 'were Pete and Jenny in the break room', then you run the risk of spending too much time looking for interesting words, almost doing a full-text search on the query to derive meaning.
    So I'm going to presume that you have some control over what/how the users generate their searches so that finding keywords isn't the issue.
    The plan will be to parse the query a bit to find the interesting words, clean them up, and weigh their importance, then use transformed data to build the query template to score various combinations.
    So here's some pseudocode for the function:
    function parse_query(pQueryWords in clob) return clob as
    begin
        generate_token_list(); -- split the query into a set of individual tokens/words
        for each token in token_list loop
            if it's a mandatory word then accumtokenlist := accumtokenlist || ' ' || token || '*10' -- weigh the presence of the token strongly
            if it's a useful word then accumtokenlist := accumtokenlist || ' ' || token || '*5' -- domain-specific words are also important
            if it's a stopword or reserved word then do not add it to either list
            if it's not on my lists then accumtokenlist := accumtokenlist || ' ' || token
                                     and normaltokenlist := normaltokenlist || ' ' || token
        end loop;
        -- so now we have two lists, one for NEAR and one for ACCUM
        -- now build the guts of the template
        querytemplate := querytemplate || '<seq>' || normaltokenlist || '</seq>';
        querytemplate := querytemplate || '<seq>' || replace(accumtokenlist, ' ', ' ACCUM ') || '</seq>';
        querytemplate := querytemplate || '<seq>$' || replace(normaltokenlist, ' ', ' $') || '</seq>';
        querytemplate := querytemplate || '<seq>?' || replace(replace(accumtokenlist, ' ', ' ?'), ' ', ' accum ') || '</seq>';  -- first fuzzy the words, then accum
        querytemplate := querytemplate || '<seq>?' || replace(replace(normaltokenlist, ' ', ' ?'), ' ', ' near ') || '</seq>';  -- first fuzzy the words, then near
        return querytemplate;
    end;
    So, with a 'cooked' query text that is template-friendly, all we need to do is apply a template that is aware of your inputs:
    query_Template_string := '
    <query>
       <textquery lang="ENGLISH" grammar="CONTEXT"> horizontal stabilizer*5 trim brake*10
         <progression> '
    || parse_query('horizontal stabilizer trim brake')  ||
    '     </progression>
       </textquery>
      <score datatype="INTEGER" algorithm="COUNT"/>
    </query>';
    So that's an example of one approach.
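The token-weighting step of the pseudocode above can be sketched in a few lines of Python. This is only an illustration of the string-building logic: the MANDATORY, USEFUL, and STOPWORDS sets and both function names are hypothetical, and the generated operators ($ for stemming, ? for fuzzy, ACCUM, NEAR) simply mirror the pseudocode and have not been validated against an Oracle Text instance.

```python
# Hypothetical word lists; a real system would load these from domain data.
MANDATORY = {"brake"}
USEFUL = {"stabilizer"}
STOPWORDS = {"the", "a", "of", "and"}


def weigh_tokens(query):
    """Split the query and return (accum_tokens, normal_tokens).

    Mandatory words get weight *10, useful words *5, stopwords are dropped,
    and all other words go unweighted into both lists.
    """
    accum, normal = [], []
    for token in query.lower().split():
        if token in STOPWORDS:
            continue
        if token in MANDATORY:
            accum.append(token + "*10")
        elif token in USEFUL:
            accum.append(token + "*5")
        else:
            accum.append(token)
            normal.append(token)
    return accum, normal


def build_progression(query):
    """Build the <seq> steps, from strictest to most relaxed."""
    accum, normal = weigh_tokens(query)
    steps = [
        " ".join(normal),                         # plain phrase
        " ACCUM ".join(accum),                    # weighted accumulate
        " ".join("$" + t for t in normal),        # stemmed words
        " ACCUM ".join("?" + t for t in accum),   # fuzzy, then accumulate
        " NEAR ".join("?" + t for t in normal),   # fuzzy, then proximity
    ]
    return "".join("<seq>" + step + "</seq>" for step in steps)


progression = build_progression("horizontal stabilizer trim brake")
```

Building the lists first and joining them afterwards avoids the nested replace() calls, which are easy to get wrong when a list starts with a space.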

  • How to modify search query based on new FE

    Hi,
    As per a requirement, in All Accounts search (advanced search section), I had to replace 5 checkboxes (e.g. Role1, Role2, Role3, Role4, Role5) by a single dropdown (codelist) name= "Account_Roles" containing same values (Role1, Role2, Role3, Role4, Role5). I created Customer's XBO and declared field of the codelist type, name= "Account_Roles". So, the XBO now has below 6 fields :
    Role1, Role2, Role3, Role4, Role5 (type = Indicator)
    Account_Roles                            (type = codelist)
    The existing search on All Accounts worked on the 5 checkboxes and fetched those accounts for which the values matched.
    E.g.
    If in search parameters, I check Role1 & Role2, and for Account1, these values are stored as "true", then Account1 will be shown in the result list.
    Now the search UI doesn't have the checkboxes but a codelist ('Account_Roles'), which can have the value 'Role1', 'Role2', 'Role3', 'Role4', or 'Role5'; combinations such as 'Role1' OR 'Role2' are also possible.
    My question here is -
    Will the old search query, which worked on the indicator fields, handle the new FE "Account_Roles" automatically?
    If not, will the existing search query need to be modified?
    If modification is required, what steps do I need to follow to handle this new field instead of the checkboxes?
    I need the exact steps to modify a search query. I tried several things in the PDI but could not confirm whether this requirement can be implemented by modifying the search query or whether it is not feasible from a PDI perspective.
    Appreciate all your inputs.
    Thanks,
    Sachin.

    Hi,
    You can create a link between the vendor objects themselves, i.e. link 0vendor with 0ven_compc and 0ven_purorg. This should give you a list of all vendors multiplied by the company codes multiplied by the purchasing organizations. It may be possible to create another link between attributes here (e.g. comp_code of 0ven_purorg with comp_code of 0ven_compc). If that is not possible, you need to add this link information some other way. Another option might be to create 2 queries: one on an InfoSet of 0vendor and 0ven_purorg, and another on 0vendor and 0ven_compc.
    regards
    Siggi

  • How to add named destination while exporting ms word to pdf that will support in adobe reader 10 and 11

    Hi,
    I need to add named destinations to a PDF when exporting from MS Word, such that the named destinations work in Adobe Reader 10 and 11.
    I tried the option to create bookmarks using Word bookmarks, but that is also not supported in Adobe Reader 10/11.
    Please let me know how we can create bookmarks while exporting MS Word to PDF.
    Also, when we open the PDF in IE with a search parameter, it always opens the PDF with 2 documents in the left search panel.
    Suneetika

    Might be better to post this in the Acrobat or Indesign forums.

  • How to Search word in PDF?

    Hi,
    I am developing a Windows application that accepts a PDF file path as input.
    Users are allowed to search for a specific word in the PDF and replace it with a new word.
    I don't have any idea how to do this, so can anyone help me get started?
    How can I search for a word in a PDF and edit the PDF file?
    Thanks,

    PDF is not a format that enables replacement of textual content, since it doesn't reflow. You need to discuss with the person who gave you the job how difficult, if not impossible, this is.

  • Programmatically search for a word in PDF file

    In the program I am developing, I open a PDF file from the application. Is there any way to programmatically search for a particular word in the PDF file and move to the page containing the first occurrence of that word? I am using VC++ to develop the application.
    Any guidance is appreciated and thanks in advance.

    Thanks for your reply Leonard.
    I am not using any library now. Currently I am just opening the PDF file in Adobe Acrobat Reader using the ShellExecute() API, passing the PDF filename.
    Now I want to open the PDF file in Adobe Acrobat Reader and move to the page containing a particular text automatically.
    Is it possible through any command-line arguments to AcroRd32.exe, like
    AcroRd32.exe <filename.pdf> search:<wordtosearch>
    or do I have to use a third-party library to do this?

  • Help- trouble searching any words in PDFs!

    Hi all,
    I have been having trouble searching for any words in PDFs:
    When I downloaded a PDF on my Windows XP machine, it opened and searched fine at first with Adobe Reader. Then I zipped it over to my iMac (OS X), and when I opened it in Preview, it would not find any text that I searched for. I re-downloaded it straight from the internet to my Mac several times, and each time Preview still could not search it.
    Then I tried downloading and installing Adobe Reader on my Mac so I could open it with that, but it still would not find anything in the PDF.
    Then I downloaded yet another PDF reader called Skim, installed it, and tried that on my Mac. Still no luck. I went back to the original site where I got this PDF, found the most recent version, deleted the former, and re-downloaded the latest to both my Windows machine and my Mac. Finally, this time it worked! So happy! Unfortunately, just today, a day or two later, I opened a different PDF on my Mac. It opened automatically in Adobe Reader (rather than the usual default Preview app, as I guess I set Adobe Reader as the default while I was tinkering around trying to troubleshoot it all), and again I could not get the search to work at all! Then I changed my Mac prefs to make Preview the default PDF opener, but Preview would not find any text in this PDF either!
    Can anyone please help? I have been going crazy chasing the solution to this perplexing problem! Thanks!
    BTW, I use Firefox (always the latest version) as my browser, and the PDFs I am discussing are already downloaded and opened from my OS (not the internet).

    import java.util.ArrayList;
    import java.util.List;

    public class WordCheck {
        public void checkWord() {
            String word1 = "1stWord";
            String word2 = "2ndWord";
            String word3 = "3rdWord";
            // One flag per word; each is set once the word is seen in the list.
            boolean w1 = false;
            boolean w2 = false;
            boolean w3 = false;
            List<String> list = new ArrayList<>();
            list.add("1stWord");
            list.add("2ndWord");
            list.add("3rdWord");
            list.add("4thWord");
            for (String listValue : list) {
                if (listValue.equals(word1)) w1 = true;
                else if (listValue.equals(word2)) w2 = true;
                else if (listValue.equals(word3)) w3 = true;
            }
            // Check the flags only after the whole list has been scanned.
            if (w1 && w2 && w3) {
                System.out.println("All Matched");
            }
        }
    }
    Will it work?

  • Search Query Help – more than one word

    I have been struggling with this for weeks.  I used Dreamweaver to set up a lot of this and I think it is hindering me as I make changes to the code.
    I have a search interface that takes text input from a user, searches through the db to match words, then returns the results ordered by how many times the input is found in the db.
    I have this working very well for a single-word input. If you search for one word only, it works great. But if you try to input two words together, it returns no results.
    I have tried several different things to get this to work – like exploding the input then searching the db for each word in order.  The problems I get, I think, is that this conflicts with the code Dreamweaver wrote – I think where the code requests the value string.  But, each time I think I find a fix for one error – it creates several others – not just in the php code above the header – but in the php code in the body when I try to retrieve the results.
    It seems that I am just going around in circles.  But, there has to be simple way to search for more than one word in the db – count the matches – add the matches from each search together – then return the results ordered by the largest number of matches.
    I have even tried using a foreach statement – but received errors that I have never seen before.  Then after researching and trying to fix those errors – just received more and more errors.
    Please help.  If it cannot be done given the code I have – then please point me in the right direction to rewrite the code.
    Code is attached.

    Based on what you explained as your requirements, it is very easy. I'll just speak in pseudo-code. You will need to parse the input into separate words. If there can be an unknown number of words, you can store those in a string array.
    Then just use the array to build your WHERE clause using the IN predicate or using the = and OR operators.
    select count(*), keyword
    from your_table  -- placeholder; use the table that holds your keywords
    where keyword in (term[0], term[1], etc)
    group by keyword
    order by count(*) desc
    If this is not what you meant, then you will need to explain your requirements in more detail. As you have discovered, once your requirements exceed the capabilities of DW's behaviors, you will need to become very familiar with SQL and your server-side scripting language. If you don't understand what the DW code is doing (as evidenced by your comments in the attachment), then you really are not ready to start modifying it.
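The IN-predicate approach above can be sketched as follows. The original thread uses PHP and MySQL; this example uses Python's built-in sqlite3 module as a stand-in so that it is self-contained, and the table name article_keywords, its columns, and the function name count_matches are all assumptions made for illustration.

```python
import sqlite3


def count_matches(conn, search_text):
    """Count how often each search word appears, most frequent first."""
    terms = search_text.lower().split()
    if not terms:
        return []
    # One "?" placeholder per term keeps user input out of the SQL string.
    placeholders = ", ".join("?" for _ in terms)
    sql = (
        "SELECT keyword, COUNT(*) AS hits FROM article_keywords "
        "WHERE keyword IN (" + placeholders + ") "
        "GROUP BY keyword ORDER BY hits DESC, keyword"
    )
    return conn.execute(sql, terms).fetchall()


# Hypothetical sample data in an in-memory database.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE article_keywords (article_id INTEGER, keyword TEXT)")
conn.executemany(
    "INSERT INTO article_keywords VALUES (?, ?)",
    [(1, "pdf"), (2, "pdf"), (2, "search"), (3, "index")],
)
results = count_matches(conn, "PDF search")
```

Only the placeholder list is built by string concatenation; the search words themselves are passed as bound parameters, which avoids SQL injection whatever the user types.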

  • Why can't I type in my new word doc that I changed from PDF?

    I need help! I can't type in my word doc that I converted from PDF.

    You may have visited a website that has disabled the location bar.
    Try to open a new window to see if that gives you a working location bar.
    If that works, then close the non-working window and close and restart Firefox to see if that stays.
    * http://kb.mozillazine.org/Prevent_websites_from_disabling_new_window_features
    * http://kb.mozillazine.org/JavaScript#Advanced_JavaScript_settings

  • Internal query that is used during the message search from EMC

    Hi,
    We have a requirement to identify the internal query that is executed (a few mails are being deleted on the production instance, so we want to know the details) when we search for data via the following navigation:
    Oracle Email Center -> Messages -> Search.
    Could you please help us find the exact query as early as possible? It is of high priority.

    Thanks for your input.
    Let me explain how we solved our problem in brief:
    - We created a new InfoObject for the item number with 5 digits
    - We added this to the key in the DEV system
    - We created a copy of the active table as it would look after the transport, on DB level
    - We copied the data (380 million records) into the new table on DB level
    - We renamed the active table to xxx_old to have a back-up and then re-created an empty active table via SE14
    - We imported the transport when all 3 DSO tables were empty, which was very fast
    - We dropped the empty active table
    - We renamed the copy of the active table to carry the name of the active table
    During this process we ensured beforehand that SIDs are available for the new IOBJ. We also filled the new IOBJ by copying the values from the 3-digit IOBJ.
    All went well.

  • I am having the same problem I think.  With mobileme you simply copy documents to the idisk folder and then sync.  I cannot seem to sync that folder anymore.  Any idea as to how I can simply copy folders to icloud and then access the MS Word and PDF file

    I am having the same problem, I think. With MobileMe you simply copy documents to the iDisk folder and then sync. I cannot seem to sync that folder anymore. Any idea how I can simply copy folders to iCloud and then access the MS Word and PDF files on my iPhone?

    "Apple never bothered to explain that this would happen"
    Your iDisk is still accessible after moving to iCloud in exactly the same way as before. Nothing is deleted until June 30th 2012.
    "... so I could easily have lost ALL of the files I kept on iDisk."
    No, you couldn't. Firstly, nothing was deleted from your iDisk. Secondly, any files stored on your iDisk should never be your only copy. Even if your iDisk spontaneously combusted, you should keep local backups elsewhere.
    "Does Apple WANT people to move their storage elsewhere and stop paying Apple for it?"
    Yes. Apple doesn't provide such a service anymore, nor are you paying them for it.
    "Apple has made no effort to suggest remedies for the problem it has given iDisk users"
    They've provided instructions on how to download your files from your iDisk. What you do with them after that is your choice.

  • I am trying to subscribe so that I can convert PDF to word. Get so far but only options for address are in USA, I live in England

    I am trying to subscribe so that I can convert PDF to word. Get so far but only options for address are in USA, I live in England

    Hi golfgerald,
    I'm sorry that you're having trouble subscribing. Please take a look in the lower-left corner of the Adobe.com website. Do you see "United Kingdom" listed as the region? If not, please click Change Region, and then choose United Kingdom.
    Did that do the trick?
    Best,
    Sara

  • Just converted word to PDF so that I can save on iPhone and subsequently use. Where is the file??

    Just converted word to PDF so that I can save on iPhone and subsequently use. Where is the file?? I have no idea where it is!! Extremely frustrating and not at all user friendly!! Took my 6.99 British pounds and not a sausage to be had!!

    It sounds like you may have purchased a CreatePDF subscription through Safari, using our web interface. If this is the case, the good news is that the subscription will also work with the (free) Adobe Reader. Simply install the Reader from the App Store:
    https://itunes.apple.com/us/app/adobe-reader/id469337564?mt=8
    Once installed, you can then use "Open In" from another app like Mail to open the Word document in Adobe Reader. Reader will show the Word document, but it will also show you a message pointing to the "convert" button (a document with an arrow through it in the toolbar). Pressing this button should prompt you to sign into Acrobat.com. Use the same account you used to purchase your subscription and Reader will upload the Word document for conversion, placing it in the Acrobat.com and Outbox section of the Reader. Please let me know if you have trouble!

  • Web search request that is a pdf

    Why does a web search request that is a pdf go directly to a file to save when I click it to open?

    That is a setting of your browser, not of the Adobe plugin.
