Programmatically search text in PDF

Hi,
I am using IAFormApp formApp = new AFormApp(); to search for fields, but I am getting the following error:
An unhandled exception of type 'System.Runtime.InteropServices.COMException' occurred in mscorlib.dll
Additional information: Retrieving the COM class factory for component with CLSID {7CD069A1-50AA-11D1-B8F0-00A0C9259304} failed due to the following error: 80040154 Class not registered (Exception from HRESULT: 0x80040154 (REGDB_E_CLASSNOTREG)).
I am using a 64-bit computer and the DLL is 32-bit, so I assume that is why it is not getting registered. Kindly help me, since this is an important project and I am stuck.

Try the Acrobat SDK forum.
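One common cause of 0x80040154 in the situation described above is a 64-bit process trying to load a 32-bit in-process COM DLL. As a minimal sketch of how to check which kind of process is actually running (shown in Python purely for illustration; the function name process_bitness is hypothetical and not part of any Acrobat API):

```python
import struct


def process_bitness() -> int:
    """Return 32 or 64 depending on the pointer size of the running process."""
    # struct.calcsize("P") is the size in bytes of a native pointer (void *).
    return struct.calcsize("P") * 8


if __name__ == "__main__":
    print(f"This process is {process_bitness()}-bit")
```

In a .NET project the analogous knob is the platform target: building for x86 is the usual way to get a 32-bit process that can load a 32-bit COM DLL, assuming the DLL itself is registered correctly.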

Similar Messages

  • Search text in PDF and MS Word document

    Can anybody tell me how to search text in PDF and MS Word documents through Java code? Does anybody have code or any suggestions?
    Thank You
    Adnan

    > Can anybody tell me how to search text in PDF
    > and MS Word documents through Java code? Does
    > anybody have code or any suggestions?
    Yes.
    First, you need to work out how to read each document type from Java.
    E.g., for MS Word you could use Apache Jakarta POI - HWPF: http://jakarta.apache.org/poi/hwpf/index.html
    Then you use Apache Lucene to index and search.
    See http://lucene.apache.org/java/docs/index.html
    ~D
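The "index, then search" idea in the reply above can be illustrated with a tiny inverted index. This is only a sketch of the concept in plain Python, not Lucene's API; it assumes the text of each document has already been extracted, and the names tokenize, build_index and search are made up for the example.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphanumeric words."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(documents):
    """Map each word to the set of document names that contain it."""
    index = defaultdict(set)
    for name, text in documents.items():
        for word in tokenize(text):
            index[word].add(name)
    return index


def search(index, query):
    """Return the sorted names of documents containing every query word."""
    words = tokenize(query)
    if not words:
        return []
    hits = set(index.get(words[0], set()))
    for word in words[1:]:
        hits &= index.get(word, set())
    return sorted(hits)


if __name__ == "__main__":
    # Hypothetical documents whose text was extracted beforehand.
    docs = {
        "report.pdf": "Quarterly sales report for the northern region",
        "notes.doc": "Meeting notes about the sales forecast",
    }
    idx = build_index(docs)
    print(search(idx, "sales report"))  # ['report.pdf']
```

A real indexer such as Lucene adds ranking, stemming and on-disk storage on top of this basic word-to-documents mapping.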

  • Searching text in PDF

    I believe that I have heard that in GW 8 it will be possible to search text in PDF documents.
    I have tried it, but it doesn't work.
    Is there a way to make it work in GW 8?
    Thanks,
    Tomislav

    Dave Parkes wrote:
    > I don't know enough about the Linux setup to know precisely what is called
    > on that OS.
    It's still called the document converter agent on Linux. I would set off
    an indexing run to see if it kicks it all off properly. My PDFs have been
    indexing here for a good long time :)
    Danita
    Time to upgrade to GW8!
    http://www.caledonia.net/gw8upg.html

  • How to search text in pdf

    Dear All,
    I just want to search for particular text in a PDF and get the page number where the text is located.
    Please do the needful.
    Regards,
    Parthasarathy.S

    Hi,
    I'm using it on the desktop only, on Windows XP. I'm using Visual Studio 2005 and VB.NET for development. Can you help me with some sample code?
    Thanks in advance.
    Parthasarathy.S
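Once the text of each page has been extracted, finding the page numbers is a simple loop. The sketch below assumes some PDF library (not shown) has already produced one string per page; page_texts and find_pages are hypothetical names, and Python is used only for illustration since the asker works in VB.NET.

```python
def find_pages(page_texts, needle, case_sensitive=False):
    """Return the 1-based page numbers whose text contains `needle`."""
    if not needle:
        return []
    if not case_sensitive:
        needle = needle.lower()
    pages = []
    for number, text in enumerate(page_texts, start=1):
        haystack = text if case_sensitive else text.lower()
        if needle in haystack:
            pages.append(number)
    return pages


if __name__ == "__main__":
    # Hypothetical per-page text, as an extractor might return it.
    pages = ["Introduction", "Invoice total: 120", "See the INVOICE above"]
    print(find_pages(pages, "invoice"))  # [2, 3]
```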

  • Search text in PDF file

    I would like to do a text search in a PDF file through Java (VJ++). Is it possible through java.io? I'm getting junk text.
    I also tried to add a COM wrapper through VJ++, but the file is not getting loaded. Any examples?
    Thank you

    any ideas
    searching for PDF
    help required

  • How to search text in pdf file?

    Hi all
    I have to store the cover of a newspaper, which includes images and text, and then be able to search for keywords in the cover.
    I've read about storing it in PDF format and using interMedia Text.
    I am just wondering how to store it and how to do the search.
    Thanks all

    Hi,
    You need to store the PDF document in a BLOB column and create an index of type CTXSYS.CONTEXT on it.
    E.g. (for .doc files):
        CREATE INDEX I_DOC ON DOC_TABLE (DOC_COLUMN) INDEXTYPE IS CTXSYS.CONTEXT PARAMETERS ('SYNC (ON COMMIT)');
    Then you can test it with this SQL:
        select score(1) from DOC_TABLE where contains(DOC_COLUMN, 'My text', 1) > 0;
    In my case, I use this index to search Word documents (.doc).
    Maybe this link will help you create an index type using FILTERS, in order to search PDF files:
    http://www.oracle.com/technology/products/text/htdocs/altfilters.htm
    Cheers

  • Search text in PDF?

    I was wondering if it is possible to apply a text search to a collection of
    PDFs. I don't know; usually the Verity collection stores documents like
    Word and txt files, but what about PDFs with text?
    Can somebody answer me?

    Erick_Rod wrote:
    >
    >
    Yes

  • Preview Doesn't Find All Searched Text in PDFs

    I downloaded a PDF copy of my home insurance policy and opened it in Preview. The pages are displayed in the sidebar. When I type in a search query such as "building" or "personal" or "dwelling", it finds few occurrences, if any, yet I know there should be plenty of hits.
    I guess I can try Adobe Acrobat.
    Thanks for any advice.
    Kelvin

    I get 4 results when I search for <identifier> on that web page.
    Start Firefox in Safe Mode to check if one of the extensions (Firefox/Tools > Add-ons > Extensions) or if hardware acceleration is causing the problem.
    *Switch to the DEFAULT theme: Firefox/Tools > Add-ons > Appearance
    *Do NOT click the Reset button on the Safe Mode start window
    *https://support.mozilla.org/kb/Safe+Mode
    *https://support.mozilla.org/kb/Troubleshooting+extensions+and+themes

  • Extract text from PDF without opening PDF in window C#

    Hello,
    I'm creating an application for searching text in PDFs. I found some code which uses the Acrobat SDK (installed on my system), but all the snippets I find seem to open a PDF window and then extract the text. Is it possible to extract the text without opening this window? I think this would reduce the search time, since I need to search a lot of files, and I just need a list with the file name and page number where the search string is found.
    AcroAVDoc avDoc = (AcroAVDoc)gAppClass.GetInterface("Acrobat.AcroAVDoc");
    avDoc.Open(System.IO.Path.GetFullPath(filespec), filespec);
    Then I use the JavaScript objects to access "getPageNumWords" and "getPageNthWord" in a loop, putting each word in a string.
    I didn't want to put the entire code here because it's easily found all over the web.
    Thanks in advance for your help.

    Hello,
    I own a copy of Acrobat Pro 9, and it is for my own use. I am not a professional developer and this application will not be distributed.

  • Search text programmatically with axAcroPDF control

    Hello,
    I want to use the axAcroPDF control to display PDF files inside my Windows Forms application. That's easy: just put the control there and call the LoadFile method. The control has methods to change page, hide the toolbar, etc., but not to search text. I want to search text, with or without exact-phrase matching, and I need to do that programmatically. Do you know how I can do that?

    Post only once.

  • Search for text in PDF binary

    Hello experts,
    We use an SAP BI tool to generate reports in file format. The SAP standard code does the generating and returns an internal table with the filename and the file content in binary form.
    Now we would like to search this PDF binary for a special text or string and use it to change the filename.
    Is there any way to do that? Every idea and hint is welcome.
    Best regards,
    Peter

    > Now we would like to search in this PDF binary for an special text or string to use them for changing filename. Is there any way to do that?
    Based on your posting it sounds a bit like you're doing ABAP processing. However, I'll ignore that for now and just say that in the Java environment I have had good experience with the Java library iText PDF (http://itextpdf.com/). I'm not sure what SAP offers in that area, but they must have something, because TREX (http://help.sap.com/saphelp_nw70/helpdata/EN/a4/929d4206b70931e10000000a1550b0/frameset.htm) "understands" PDF (though that doesn't mean that you have a nice API for parsing PDFs).
    You probably investigated this already, but I'd take a look at possibilities to hook in before (or at the time) the PDF gets generated (might be easier to craft and export a filename there). Thanks to the enhancement framework (http://help.sap.com/saphelp_nw70ehp2/helpdata/en/94/9cdc40132a8531e10000000a1550b0/frameset.htm) you usually have quite a few ways to get things done...
    Note that even if you're able to read a PDF, it doesn't necessarily mean that you can parse it the way you want. A silly example would be scanned pages, where the page is stored as an image and at best the scanner software runs some OCR (with possibly buggy results) to provide capabilities for searching the PDF. In your case that's probably not an issue, but still the question might be if the information you're looking at is structured enough to get it back...
    Cheers, harald
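To illustrate why a plain byte search on the PDF binary often fails: page content is usually stored in compressed streams. The following is only a rough sketch in Python (not ABAP, and not a real PDF parser) that also looks inside Flate-compressed streams. The name binary_contains is made up for the example, and even after decompression the text may be font-encoded or split across operators, so a hit is not guaranteed.

```python
import re
import zlib

# Matches the raw bytes between the "stream" and "endstream" keywords.
STREAM_RE = re.compile(rb"stream\r?\n(.*?)endstream", re.DOTALL)


def binary_contains(pdf_bytes, needle):
    """Return True if `needle` occurs in the raw bytes or in any
    Flate-compressed stream of `pdf_bytes`."""
    if needle in pdf_bytes:
        return True
    for match in STREAM_RE.finditer(pdf_bytes):
        try:
            # decompressobj tolerates the end-of-line bytes that trail the
            # compressed data before "endstream" (they land in unused_data).
            data = zlib.decompressobj().decompress(match.group(1))
        except zlib.error:
            continue  # not Flate-compressed, or uses another filter
        if needle in data:
            return True
    return False


if __name__ == "__main__":
    # Hypothetical, hand-built fragment that mimics one compressed stream.
    content = b"BT (Invoice 4711) Tj ET"
    fragment = (b"1 0 obj\n<< /Filter /FlateDecode >>\nstream\n"
                + zlib.compress(content) + b"\nendstream\nendobj\n")
    print(binary_contains(fragment, b"Invoice 4711"))
```

For anything beyond a quick check, a proper PDF library (such as the iText mentioned above) is the safer route, because it also handles fonts, encodings and encryption.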

  • Search for text in PDF by VBA with only Adobe Reader installed

    My problem is widely known and frequently posted, for instance:
    "Can anyone help me to open and search for a specific text string in a PDF document, return a true or false indicator (and nothing else)?"
    The answers mostly refer to and include
      Set gApp = CreateObject("AcroExch.App")
    which, as I understand, works only with a certain level of Adobe Acrobat being installed.
    My question now:
    I want to give this type of functionality (via an MS Access form, i.e. populate a ComboBox with the PDF filenames that answer YES to certain text occurrences) to, say, 20 users in my company who have Adobe Reader 9.1 installed and nothing more.
    Buying this number of Adobe Acrobat licenses for just this purpose would be heavy overkill, which I just can't afford.
    Any suggestions? many thanks in advance.

  • TREX is not searching text in any document types other than PDF.

    Dear All
    We are implementing DMS in ECC 6.0. We have configured TREX 7.0 text search in the ABAP stack. TREX is not searching text in .dwg (AutoCAD) or .doc (Word) files in the SAP system through transaction CV04N.
    It is searching only PDF files.
    System Details:
    Server ECC 6.0
    SAP_BASIS - SAPKB70010
    SAP_ABA - SAPKA70010
    SAP_APPL - SAPKH60007
    EA-APPL - SAPKGPAD07
    Error Message:
    We have added the MIME types for full-text search in the SAP system under SPRO -> Cross-Application Components -> Document Management -> General Data -> Settings for Storage Systems -> Maintain Storage System, as application/acad and application/doc, and also in the usr\sap\<SID>\TRX00\Trex\TREXValidMimeTypes.ini file on the TREX server.
    After adding them we restarted the TREX server, re-indexed in the SAP system and tried again, but it is still not searching the text in AutoCAD files.
    Kindly help us solve this issue.
    Regards
    Harshavardhan.G
    Mob: - 91 99130 88039

    Hi Harshavardhan,
    could you please create an OSS ticket for BC-TRX and attach an example DWG document to it? Please also check whether the includehidden parameter (TREXFilter.ini) is set to true.
    Best regards,
    Mikhail

  • Randomly Missing Text in PDF Created from FrameMaker

    This problem relates to a structured FM document, but I suspect it might be a general issue and have posted it here in the general forum for that reason.
    I am generating PDFs that are missing text somewhat randomly throughout. I tried searching the forum for solutions, but none of the suggested fixes worked and none of the posts specifically addressed the issue I am experiencing.
    I am working in structured FM. The templates we use were originally created for FM8. We use both FM8 and FM10 in our work group. We are able to duplicate the same problem in both versions and on multiple computers.
    I thought I had narrowed the problem down to certain paragraph formatting, since it only occurs in three or four paragraph formats (a bullet list, table text, etc.). Garden-variety formatting. But in most places in the document, these formats appear perfectly. The strangest occurrence is a single instance where the page number is missing from the footer.
    I thought it might be a font issue, as I've had similar issues in the past. I had a missing font warning in the console, but I am pretty sure that this has nothing to do with it, since they are fonts we are not using and all the other text from the same formats appears.
    I tried turning off "Remember Missing Font Names" in preferences. No help.
    I checked that the fonts are in the local directory and appear as embedded subsets in the PDF.
    I also tried checking and unchecking the "Rely on system fonts only; do not use document fonts" option in the PDF output settings. Also no help.
    The randomness of the missing fonts bewilders me and I've exhausted my own troubleshooting abilities. I would be happy to share a source file if anyone thinks they could help me that way.
    Thanks in advance,
    Douglas

    There is a known bug in Windows XP that causes random dropped text in
    PDF. The hotfix is here, though the link does not seem to be working at
    the moment:
    http://support.microsoft.com/?id=952909
    However, the above link directs you to a download link that is here:
    http://support.microsoft.com/Hotfix/KBHotfix.aspx?kbnum=952909&kbln=en-us

  • How to select and search text in this document?

    http://www.oracle.com/technology/products/manageability/database/pdf/ow05/PS_S003_274003_106-1_FIN_v2.pdf
    is a document I can read but cannot copy text from. I can't search for any text in it either. Is there a way to convert it to a PDF file I can select and search text in? What did the author do to make it "encrypted"? Thanks.
    Yong Huang

    I notice Google can convert it to plain text:
    http://74.125.95.132/search?q=cache:e4rkLs8pPekJ:www.oracle.com/technology/products/manageability/database/pdf/ow05/PS_S003_274003_106-1_FIN_v2.pdf+understanding+shared+pool&cd=1&hl=en&ct=clnk&gl=us
    (If that long URL doesn't work, just search for "understanding shared pool" and click "View as HTML".)
    For now I'll use that. Thanks everyone.
    Also, my local desktop search program Copernic can also index keywords in the article.
