Creating and Searching index files

Hi,
This is my first posting so apologiese in advance if it is difficult to understand.
Firstly, I am from a development background of some 6 years (c#, asp.net, vb.net, SQL) so i have a solid grounding and am happy to give any new development a try.
I have a client who has 10,000's of scanned pdf documents but no real way of retrieving and searching these documents.  In fact another person in my team is involved in making sure these pdfs are searchable by doing some OCR and rescanning work.  I have been asked to come up with a way to allow client to retrieve(open) and search the pdf files.
Here is what i am proposing based on what I have leared about Acrobat Pro 9 and Adobe Reader 8/9.
I actually want to do a lot more than above but think that is enough to get me started.  In a nutshell the client would like a web based solution to search and retrieve (open) there scanned pdf documents.  I have read on other posts in the forum an understand it can be quite difficult to search pdf's (pdx file) from Internet Explorer.
As a start i had the following tasks in mind as needing completed:
Task 1 - Create directory structure on file server for all the scanned pdfs.  Create catalog(s) for these documents and then create an index which creates a .pdx file.  Given the number of documents it looks like I will have X number of catalogs and X number of pdx files.
Task 2 - create a web application that allows a tree view to be displayed that replicates the directory structure above so a user can open the pdf from the browser.
Task 3 - I know the users can use Adobe Reader advanced find functionality and select the indexes i have created.  However, I would like to create a plugin for Adobe Reader that will load all of the indexes into the selectable index list and select all by default.  Given the volume of documents it is likely the user will not know exactly what they are looking for so will need to search across all indexes.
Thanks in advance for any help anyone can provide me in getting started with these tasks.

Some links that may be helpful.
Acrobat Developer Center:
http://www.adobe.com/devnet/acrobat.html
Forums:
Acrobat SDK
http://forums.adobe.com/community/acrobat/acrobat_sdk
Acrobat Scripting
http://forums.adobe.com/community/acrobat/acrobat_scripting
Various topic specific forums at the AUC
http://acrobatusers.com/forum
Be well...

Similar Messages

  • Indexing and Searching PDF Files

    Hi All,
    I am trying to store and search PDF files in the oracle database.
    I can insert and index the PDF files just fine but cannot get any result. I always get No Rows.
    Here's what I am doing and the issues I am facing.
    I created a Table with fields
    ID (VARCHAR)
    NAME (VARCHAR)
    DOC (BLOB)
    I inserted the PDF file in the BLOB field through a Java program and insert worked fine as I verified by retreiving the PDF and writing to file.
    I created index using following SQL:
    create index my_index on PDF_TABLE(PDF_FLD) indextype is ctxsys.context
    parameters ('datastore ctxsys.default_datastore
    filter ctxsys.inso_filter');
    The index was created successfully without any problems.
    I ran query as follows and got no rows although the searched text is in PDF
    SELECT SCORE(1), PDF_FLD from PDF_TABLE WHERE CONTAINS (PDF_FLD, 'Table of Cotents',
    1) > 0;
    I tried alternate queries as well with no luck.
    Any ideas ??
    Thanks

    After creation the index you need execute next operations.
    first, check that your index tables conatins indexed terms. Execute
    select token_text from dr$YOUR_INDEX$i;
    Second, you will need to check the index errors table CTX_INDEX_ERRORS. This is owned by the user CTXSYS, and most users do NOT have # SELECT privilege to it by default.
    If it's OK, then check that your PDF documents is supported by INSO filter.
    Citation:
    "PDF - Portable Document Format
    Acrobat Versions 2.1, 3.0, 4.0, and 5.0 including Japanese PDF"
    (Appendix B. Supported Document Formats in Oracle Text Reference 9.2)
    For Oracle 9i you could install 9.2.0.4 patchset (it included INSO FILTER 7.5)
    P.S.
    for the beginning, you could find answers for your question about Oracle Text here
    http://otn.oracle.com/products/text
    Sorry for my English.
    Best regards, Victor Zogin.

  • How to create search index files for Java Pet Store

    Hi All,
    As you may know, the java pet store application uses for the search function a search index object, which itself uses the following files '_36.cfs', 'deletable', 'segments'.
    Now as I want to change the data of the database (delete some pets, and names, etc.), it does not have an effect on the search results. This is because the application uses the search index files and does not use the database for the search query.
    So can anybody help me and tell me how I can create this three files from my *.sql file, so I can search in my own data?
    Thank you very much for your help.
    Regards,
    Wolfgang

    gonso777 wrote:
    Solved:
    I had the same problem. It seems that the installer does not unzip three files where it should.
    With NetBeans (Using File Perspective) or just editing build.xml
    Netbeans: select build.xml/Run Target/Other Targets/unzipindexes: Tough it should work it fails in resolving $javaee.domaindir$ at least in my system, but it does suscessfully create a new directory named ${jee.domaindir} that includes lib/petstore/searchindex and three files: _36.cfs , deletable, segments.
    Copy those three files to your_glasshfish_path/domains/domain1/petstore/searchindex.
    Now you are done. I hope that you had a nice time while waiting two years for it to be answered. How is it that it is not answered anywhere else?
    Regards,
    Ramon Talavera
    www.sciencetechworks.comThanks. I didn't wait 2 years for this, but I just replied on a 2 year old post. I only recently tried the petstore app. I thought I needed to study lucene first to figure things out, it turns out there was an 'internal target' on the build file for this. Thanks a bunch!

  • Creating and parsing LDIF file

    Hi,
    Does JNDI provide any libraries to create and parse LDIF files by performing an ldap search?.
    thanks in advance

    My guess would be that when you created the war file the internal paths were incorrect. Open the war file using WinZIp and check the file paths. They should be relative to the web application directory (ie prj.java should have the file path "WEB-INF/classses")
    As an added thought you may want to put your servlet file into packages.

  • Install database , application and search index on Single machine ?

    Under SAP PROD PAK SERVER 4.3 -> Windows Server on IA32 32bit (SAP Education Products" SAP PRODUCTIVITY PAK BY ANCILE" SAP PRODUCTIVITY PAK 4.3) I am only getting the below files-
    RWDuPerfServer43P_1-10009622.ZIP
    RWDuPerfServer43P_2-10009622.ZIP
    Is this the right server software ?
    - Can we install database , application and search index on same machine ?
    - OS is windows (2003, 2008, ???), Is RWD only supported on 32 bit windows with IA processor or any processor and also on 64 bit ?
    - Where can I find the installation document or all the prerequistes ?
    - Which database is preferable SQL, oracle or DB2 , currently we have SAP system on DB2
    - Do we need to integrate the RWD with all SAP system landscapes ?
    - Also suggest the hardware requirement (RAMprocessorhard disk space) if 1 server installation is possible for all the req (Search Server + Database Server + Web Application Server + Administrator Workbench)
    Thanks
    Saurabh

    Hi Saurabh,
    These are the latest patches - 4.31 and 4.32.
    To install 4.30 you will need to go to the "Installation and Upgrade" section of Service market place and download and install this first.
    Installations and Upgrades -
       P     
           "SAP PRODUCTIVITY PAK BY ANCILE"
                SAP PRODUCTIVITY PAK 4.3
    The patches can be downloaded and installed then.
    The documentation will be within the installation files.
    Hope this helps,
    Kind Regards,
    Matthew

  • How does APEX create and save new files. What extension does it save in?

    Hi can someone help me with this question?
    How does APEX create and save new files. What extension does it save in?
    Cheers!
    VJ

    It's really too bad we can't see VJ's face when the concept sinks in. This is one of my favorite moments when teaching APEX classes. Most people love it, some people don't. If nothing else it really proves the power and performance of the Oracle database. Each page view can generate 40+ queries, yet on the average system this takes less than .04 seconds.
    Keep in mind there are no undocumented features or "Oracle Internals" that the APEX team uses to achieve this performance, just sound database design. With every feature they add they evaluate how it will be used and design the tables and indexes to most efficiently answer the query. Sometimes this means going against "purist" normalized principals.

  • How to create and save a file in flex web application ?

    Hi,
          I want to create and save a file form flex web application. Is it possible ?
    I have done some googling and found that its only possible through file referece, which needs some serverside implementation be called.
    Is it not possible with file reference to create and save file directly ?
    Can any one suggest how to over come this.
    Thanks in advance.

    Alex,
    code written with the Flex Builder 3 libraries will certainly run in Flash Player 10. However, what you need is not so much the capabilities of FP10 but the library functionalities of the Flex 4 package. And I think that Flex 4 code will only run in FP10 and not 9 (but haven't tested this as at work we have not yet been permitted to upgrade, so there is not a lot of point going for Flex 4!).
    Richard

  • How to create and sign CAB files info on Supplement Option

    To all Supplement Option subscribers:
    At the supplement option for iDevelopment Accelerators, I uploaded a zip with detailed info on how to create and sign CAB files (necessary when you are running your application on the Native IE Java Virtual Machine).
    Not all functionality provided to you through JInitiator will work (e.g. WebUtil doesn't) but the way how to make and sign a CAB file will become more easy.
    Cheers,
    Marc Vahsen
    Headstart Team

    This defeats the purpose of trying to centralize SAP-related web resources on the SAP server. Typically IIS/Apache or other non-SAP servers are under the control of IT and not the SAP BASIS group.
    We simply would like to have a centralized location to store static web files so that they are not overwritten during Portal support pack applications.

  • How to create and read text file using LabVIEW 7.1 PDA module?

    How to create and read text file using LabVIEW 7.1 PDA module? I can not create a text file and read it.
    I attach my code here.
    Attachments:
    File_IO.vi ‏82 KB

    Well my acquisition code runs perfect. The problem is reading it. I can't seem to read my data no matter what I do. My data gets saved as a string using the array to string vi but I've read that the string to array vi (which I need to convert back to array to read my data) does not work on the pda. I'm using version 8.0. So I was trying to modify the program posted in this discussion so that it would save data from my DAQ. I did that but I still can't read the data after its saved. I really don't know what else to do. All I need to do is read the data on the pda itself. I can't understand why I'm having such a hard time doing that. I found a possible solution on another discussion that talks about parsing the strings because of the bug in the "string to array" vi. However, that lead me to another problem because for some reason, the array indicators or graphs don't function on the pda. When i build the program to the pda or emulator, the array indicators are faded out on the front panel as if the function is not valid. Does this kind of help give a better picture of what I'm trying to do. Simply read data back. Thanks.

  • I downloaded the Acrobat 11 trial and  I can open the product. During the install it deleted the existing print driver and did not install a new print driver so I CAN Not create and print pdf files.

    I downloaded the Acrobat 11 trial and  I can open the product. During the install it deleted the existing print driver and did not install a new print driver so I CAN Not create and print pdf files.

    What OS? Have you tried a repair and updates from the HELP menu? The updates may be the key.

  • Create and deploy sda file in NWDS 7.0

    Hi,
    Could u pls tel me how to import , create and deploy sda file.
    I hav got an .sda file, i need to import ,made changes to that nd deploy it again in NWDS 7.0
    help me by providing some links or turorials...
    thanks
    raji

    Hi,
    As i told you earlier, there is no option to import a .sda file in NWDS 7.0.
    So you have to follow the approach which i mentioned in your earlier thread.
    https://www.sdn.sap.com/irj/scn/profile?userid=3978830
    That is the only way which you can do.
    Regards,
    Saleem

  • Indexing and searching excel file

    hai friends,
    i need to index and search the records from the excel file using lucene java
    if u ve any code for that plz give me
    thank you in advance

    gimbal2 wrote:
    I'm not even going to try and tell you just how wrong your post is.But I will! ;-)
    Ok, checking the items from [_How To Ask Questions The Smart Way_|http://www.catb.org/~esr/faqs/smart-questions.html]:
    - [_Write in clear, grammatical, correctly-spelled language_|http://www.catb.org/~esr/faqs/smart-questions.html#writewell]
    - [_Be precise and informative about your problem_|http://www.catb.org/~esr/faqs/smart-questions.html#beprecise] (especially the third item)
    - [_Be explicit about your question_|http://www.catb.org/~esr/faqs/smart-questions.html#explicit]

  • Indexing and Searching pdf files which are used as attachment in an Announcemnet list item

    Hi all,
    I am using a SharePoint 2013 online environment and trying to search and find pdf files which are attached to a announcement list item. However it does not find anything when I search for the name of the pdf file or the content of the pdf file.
    When I attach a word to the list item it gets indexed and it find the file.
    thanks and appreciate every kind of advice.

    Are you able to search for pdfs in other locations? SharePoint 2013 comes with an iFilter out of the box unlike 2010 which needed configuration.

  • How can i  create and access resource file(with .res ext) in j2me

    hi,
    i am developing an mobile international application using jsr-238 an optional api to implement the idea of internationalizationand localization.
    please tell me how can i create .res files in my application to store strings of different languages.thanks inadvance.

    Hello GG RA,
    which fileformat do you want to use (binary or ASCII)?
    There are some LabVIEW shipping examples for File I/O
    You could use the LabVIEW NI Example Finder (Help --> Find Examples..(search item "file")) to get an overview about the different file-formats in LabVIEW.
    Hope this helped out and feel free to reply if you have more questions about File I/O in LabVIEW
    Best regards
    Benjamin

  • Problem while creating text search index

    Hi,
    When I am tring to create a text search index I am getting the following:
    ORA-29855: error occurred in the execution of ODCIINDEXCREATE routine
    ORA-20000: Oracle Text error:
    DRG-10700: preference does not exist: GLOBAL_LEXER
    ORA-06512: at "CTXSYS.DRUE", line 160
    ORA-06512: at "CTXSYS.TEXTINDEXMETHODS", line 364
    ORA-06512: at "myschema.RECREATE_SEARCHINDEX", line 31
    ORA-06512: at line 2
    any idea what is causing the problem?
    saby

    Hi,
    I think Probably what has failed is some inner kind of operation that didn't handle the exception correctly, that is dropping the offending object automatically and it issue is raised (error). Therefore you'll have to manually drop the index and re-create the desired index (that might solve your problem)
    - Pavan Kumar N

Maybe you are looking for

  • Syncing shared calendar in iCal

    Hello, My wife and I use google calendar's to put all of our events and commitments in, and we "share" our respective calendars with the other so we can see what the other is up to and try and plan around each other. I use the mail app and iCal for e

  • How do I get rid of the plus signs in Pages layout?

    When I'm working on a document in Pages, the text boxes show little plus signs here and there. How do I get rid of them? Thanks.

  • Getting Java 1.4.2_10 to work along side of 1.5.0_06 in IE

    I'm trying to use IBM's Tivoli Enterprise Portal through IE but it only will take java 1.4.2_10 which shouldn't be a problem except I have clients that also need version 1.5.0_06. I've tried a lot of different settings in IE and in the two different

  • Can email be password protected on the Ipad 2?

    Can email be password protected on an Ipad 2?

  • Muse will not preview with edge animate

    Once I have published my Edge content I place it in Muse and then try to preview it using the preview button.  It will not work.  I preview my work using the Edge Animate previewer and it plays just fine.  I play the HTML file in Firefox and it works