Indexing pdf documents with indextype ctxsys.context

I have an application that stores the contents of uploaded documents in BLOB data fields. We provide web pages which search through the uploaded documents based on text entered by the user. We currently upload both MS Word .doc and HTML documents. For the HTML documents, which are made available to the public, we index the table with the following procedure:
CREATE OR REPLACE procedure WEBADMIN.index_redacted_docs is
begin
declare
cur           PLS_INTEGER;
exec_int           PLS_INTEGER;
counter          number;
begin
select count(*) into counter
from user_indexes
where index_name = 'DOCS_CTX_REDACTED_IDX';
if (counter = 1) then
ctx_ddl.sync_index (idx_name => 'docs_ctx_redacted_idx');
else
cur := DBMS_SQL.OPEN_CURSOR;
DBMS_SQL.PARSE (cur, 'create index docs_ctx_redacted_idx on documents_ctx_redacted (blob_content) ' ||
     'indextype is ctxsys.context parameters (''filter ctxsys.null_filter'')', DBMS_SQL.NATIVE);
exec_int := DBMS_SQL.EXECUTE (cur);
DBMS_SQL.CLOSE_CURSOR (cur);
end if;
exception
when others then
     DBMS_SQL.CLOSE_CURSOR (cur);
     raise;
end;
end;
We run this process after every uploaded HTML file and are able to locate documents which contain any text entered by the user. The portion of the command we use to query the documents_ctx_redacted table (blob_content is the BLOB field in this table) is (using "corn" as a sample query text):
WHERE (contains (BLOB_CONTENT, 'corn', 10) > 0)
Our customer is now asking that PDF files be uploaded as well and searched in the same manner. After the PDF files are uploaded (into the same table as the HTML files) and the index updated, with the above command ctx_ddl.sync_index (idx_name => 'docs_ctx_redacted_idx'), since the index already exists, we cannot get any rows returned with the above WHERE (contains .... ) clause. We know the text we're looking for (such as "corn") is contained in the PDF files, but the search does not find them, although it finds the HTML documents just fine. I've also tried dropping the index entirely and recreating it, but that also only finds the HTML documents but not the PDF's.
What are we doing incorrectly with the PDF files? Thanks.

We are using Oracle version 10.2 . I looked at the relevant Oracle Text documentation for that version, and the best I could glean was that PDF files are supported by the filter ctxsys.auto_filter (rather than null_filter) when creating the index. I dropped the existing null_filter index and created a new index with the auto_filter parameter, but the end result was the same. I still get no PDF records found when issuing the command (using "corn" as the text query)
WHERE (contains (BLOB_CONTENT, 'corn', 10) > 0)
although the HTML records show up fine again.

Similar Messages

  • A PDF document with information about activation for CS6 software

    Here's a PDF document with information about activation for CS6 software.
    I found it very useful for understanding how the system worked for activation of perpetual licenses, both in cases where the computer is and isn't connected to the Internet.

    Ann, with some browsers, the PDF document will download in the background rather than appear in a browser window.

  • Problem in printing pdf document with java code

    Hi All
    I want to print a pdf document with java code i have used PDFRenderer.jar to compile my code.
    Code:
    File f = new File("C:/Documents and Settings/123/Desktop/1241422767.pdf");
    FileInputStream fis = new FileInputStream(f);
    FileChannel fc = fis.getChannel();
    ByteBuffer bb = fc.map(FileChannel.MapMode.READ_ONLY, 0, fc.size());
    PDFFile pdfFile = new PDFFile(bb); // Create PDF Print Page
    PDFPrintPage pages = new PDFPrintPage(pdfFile);
    // Create Print Job
    PrinterJob pjob = PrinterJob.getPrinterJob();
    PageFormat pf = PrinterJob.getPrinterJob().defaultPage();
    pjob.setJobName(f.getName());
    Book book = new Book();
    book.append(pages, pf, pdfFile.getNumPages());
    pjob.setPageable(book);
    // System.out.println(pjob.getPrintService());
    // Send print job to default printer
    pjob.print();
    but when i am running my program i am getting error
    Exception in thread "main" java.awt.print.PrinterException: Invalid name of PrintService.
    Please anybody, knows the solution for this error?
    Thanks In Advance
    Indira

    It seems that either there is no default printer setup or you have too many printers or no printer setup at all. Try running the following code. It should print the list of available print services.
    import java.awt.print.*;
    import javax.print.*;
    public class PrintServiceNames{
         public static void main(String args[]) throws Exception {
              PrintService[] printServices = PrinterJob.lookupPrintServices();
              int i;
              for (i = 0; i < printServices.length; i++) {
                   System.out.println("P: " + printServices);
    }From the list pick one of the print service names and set it explicitly like "printerJob.setPrintService(printServices);" and then try running the program.

  • How to create a  PDF document with page curls using Adobe  CS 4?

    My  goal is to create a  PDF document with page curls. I am using Adobe  CS 4.
    1.      The document was created in Adobe InDesign  CS 4  where the page  turn (curl) transition  was applied.
    2.      Then the document was exported to .swf.
    3.     The .swf file was imported into   Adobe Acrobat Pro  to create a PDF file with  flip page or page curl transitions.
    These are the problems.
    1.      The background is not  transparent.
    2.      Page dimensions have to be increased at least an inch in width and length so that the full page can show
    3.      The command and+   will not only increases the document's  screen size. It increases the page margins.

    PDF was never designed to support the Flash page curl effect (it didn't exist back then). Anything you try (and you've tried the standard hack) will look like a hack. Personally, I don't think the effort is worth it for an effect that's much overused.

  • Support for multiple pdf document with signature

    I have a question about signatures.
    Does the PDF specification support the combination of multiple pdf documents into one pdf document with the following constraint:
    - each individual pdf document has been signed (by an X.509 certificate),
    - the individual documents can come from different sources, so they have to be 'included' or merged into the final pdf,
    - the final pdf document should not change the signatures of the individual documents.
    So each part of the document should maintain the integrity of the individual components and the signature of the individual comnponents are still valid.
    Does the pdf specification support this feature?
    With kind regards,
    Ernst Jan

    What you want is called a PDF Package (or Portable Collection in the PDFRef) and is new in PDF 1.7. You can try it out with Acrobat 8.
    Leonard

  • Tried to open a pdf document with Acrobat XI Pro Trial and sign in popped up , I keyed in my username and password but it doesnt respond its like grey with the four dots rotating forever. Any solution Please ?????

    Tried to open a pdf document with Acrobat XI Pro Trial and sign in popped up , I keyed in my username and password but it doesnt respond its like grey with the four dots rotating forever. Any solution Please ?????

    Hi higi97,
    How are you connected to the internet? Are you behind a particularly secure corporate firewall? Are you on Mac or Windows? Do you have any anti-malware software running on your machine that may be preventing applications other than your browser from connecting to the internet?
    You may try to follow the steps below:
    1.Close the Creative Cloud application.
    2.Navigate to the OOBE folder.
    Windows: [System drive]:\Users\[user name]\AppData\Local\Adobe\OOBE
    Mac OS: /Users/[user name]/Library/Application Support/Adobe/OOBE folder
    3.Delete the opm.db file.
    4.Launch Creative Cloud.
    Let us know if that helps,
    Regards,
    Rave

  • I can't open any pdf documents with Adobe Reader XI...get an error that "Adobe has stopped working" then it closes.  Any suggestions?

    I can't open any pdf documents with Adobe Reader XI...get an error that "Adobe has stopped working" then it closes.  Is this an Adobe issue, Microsoft issue, or security issue? 

    Thank you!  That seemed to do the trick.  Don't think I'd ever have figured that one out!!
           From: pwillener <[email protected]>
    To: Sharon Atkins <[email protected]>
    Sent: Monday, February 16, 2015 9:47 PM
    Subject:  I can't open any pdf documents with Adobe Reader XI...get an error that "Adobe has stopped working" then it closes.  Any suggestions?
    I can't open any pdf documents with Adobe Reader XI...get an error that "Adobe has stopped working" then it closes.  Any suggestions?
    created by pwillener in Adobe Reader - View the full discussionSince you mention Microsoft, I assume that you are on Windows. Can you open Reader by itself?  If so, try disabling Protected Mode Edit . If the reply above answers your question, please take a moment to mark this answer as correct by visiting: https://forums.adobe.com/message/7201334#7201334 and clicking ‘Correct’ below the answer Replies to this message go to everyone subscribed to this thread, not directly to the person who posted the message. To post a reply, either reply to this email or visit the message page: Please note that the Adobe Forums do not accept email attachments. If you want to embed an image in your message please visit the thread in the forum and click the camera icon: https://forums.adobe.com/message/7201334#7201334 To unsubscribe from this thread, please visit the message page at , click "Following" at the top right, & "Stop Following"  Start a new discussion in Adobe Reader by email or at Adobe Community For more information about maintaining your forum email notifications please go to https://forums.adobe.com/thread/1516624.

  • Whenever I open a pdf document with Adobe Readeron my PC, nightly automatically opens a blank tab.

    Whenever I open a local pdf document with Adobe Reader, nightly automatically opens a blank tab.

    Sorry, Mozilla isn't actively working on Win 64-bit version development at this time.
    You might want to check the Builds fora over at mozillaZine to see what other Nightly users have to say about issues with Nightly versions Win 64-bit.
    http://forums.mozillazine.org/viewforum.php?f=23

  • Creating a PDF document with visible bookmarks

    Hi.
    I'm using VB to create a PDF document with bookmarks. I have no problem with this part.
    What I want to do, though, is to program the document so that the bookmarks pane is visible when I open the document.
    Any ideas?
    All Best,
    Ethan

    I found the answer in the Interapplication Communication API Reference:
    Dim SetPageMode as Boolean = PDDoc.SetPageMode(nPageMode),
    where nPageMode has the possible values:
    0: leave the view mode as is
    1: display without bookmarks or thumbnails
    2: display using thumbnails
    3: display using bookmarks

  • PDF document with text with a β character, when uploaded changes to a !

    PDF document with a beta character, when uploaded it changes to a !  How do I correct?

    Uploaded via https://app.applyyourself.com/AYUploadDocumentation/document_97063027.pdf?AYID=995F9F8-B28 8-41DC-9DAA-70DD6A0258D&a=97063027.
    I do not think I am not viewing it thru Adobe Reader.
    This is the printed copy:
    A NOVEL GSK-3! OVER-EXPRESSING MOUSE MODELOF
    PARKINSON’S DISEASE
    1Megan Winter, 2Jonathan Wills, & 2Anita Sidhu
    1Department of Biology, Georgetown University; 2Department of Biochemistry and Molecular and Cell Biology,
    Georgetown University School of Medicine

  • Print 3D PDF document with white background

    Hi!
    Is it possible to print a view from 3D PDF document with white background?
    I could not find any possibilty to change the grey background color, neither in the standard preferences nor in the print options...
    Thanks for your help!
    Kind regards,
    Mark

    Hi,
    As you experienced all DIALOG function modules will not work in Batch because the is no connection with a frontend (PC).
    You have to get your PDF on a server so you can process them in Batch.
    Maybe you can than use the FM: ALINK_DOCUMENTS_CREATE_FILE
    Success,
    Rob

  • Which API can use to save PDF document with Adobe Reader 9?

    Hello,
        which API can use to save PDF document with Adobe Reader 9? It is said that "CosDocSaveWithParams" can't be used.
    thanks!
    jimmy

    Unless the PDF file is "READER ENABLED for SAVE" (see the adobe product pages on LiveCycle Reader extensions server)
    You are not able to call a Save at all, if the document is reader enabled then you need to use the JavaScript call to save the document,
    As you appear to be wanting to do this in a plug-in you would need to call the script from a plug-in, this can be done using the AFExecuteThisScript () function call.
    Please note that this does will fail if the document is not reader enabled for save and you need to include the Forms HFT in your plug-in,
    This plug-in would also have to be approved by Adobe to be used with the reader and as you are implementing a function that is available in the full Acrobat product this may not be a straight forward approval.
    HTH
    Malky

  • Can't open PDF documents with Adobe

    Every time I try to open up a PDF with Adobe the error message
    Adobe Acrobat Professional 8 cannot be launched at this time. You must launch at least one other suite component before launching Acrobat 8 Professional.
    We've tried reinstalling and can't figure out what the problem is... Any ideas would be great!

    Thank you!  That seemed to do the trick.  Don't think I'd ever have figured that one out!!
           From: pwillener <[email protected]>
    To: Sharon Atkins <[email protected]>
    Sent: Monday, February 16, 2015 9:47 PM
    Subject:  I can't open any pdf documents with Adobe Reader XI...get an error that "Adobe has stopped working" then it closes.  Any suggestions?
    I can't open any pdf documents with Adobe Reader XI...get an error that "Adobe has stopped working" then it closes.  Any suggestions?
    created by pwillener in Adobe Reader - View the full discussionSince you mention Microsoft, I assume that you are on Windows. Can you open Reader by itself?  If so, try disabling Protected Mode Edit . If the reply above answers your question, please take a moment to mark this answer as correct by visiting: https://forums.adobe.com/message/7201334#7201334 and clicking ‘Correct’ below the answer Replies to this message go to everyone subscribed to this thread, not directly to the person who posted the message. To post a reply, either reply to this email or visit the message page: Please note that the Adobe Forums do not accept email attachments. If you want to embed an image in your message please visit the thread in the forum and click the camera icon: https://forums.adobe.com/message/7201334#7201334 To unsubscribe from this thread, please visit the message page at , click "Following" at the top right, & "Stop Following"  Start a new discussion in Adobe Reader by email or at Adobe Community For more information about maintaining your forum email notifications please go to https://forums.adobe.com/thread/1516624.

  • PDF Documents with Functional Buttons - Works for IE and FireFox but not for Chrome and Safari

    To all,
    Good morning.  Our institution developed a PDF document with functional buttons for printing and e-mailing.  When the document is viewed via Internet Explorer and Firefox, the functional buttons work as expected.  However if the document is viewed via Chrome and Safari, the functional buttons do not work.  The workaround for those who are using Chrome and Safari, the PDF document needs to be saved to their local drive before the functional buttons work.
    Has anyone experienced this issue before?  If yes, what was your solution other than the workaround.
    Thanks.
    Babylon5

    Here's how to configure your browser to run Adobe Reader: http://helpx.adobe.com/acrobat/using/display-pdf-browser-acrobat-xi.html

  • How do I print .pdf documents with reader for windows 8?

    I have Reader for windows 8.  How can I print .pdf documents with it? 

    See FAQ: Printing from Adobe Reader for Windows 8 Tablets.

Maybe you are looking for

  • Monitoring 2k footage via SDI, Premiere CC, Decklink Studio

    Hello all, System specs: MacPro 4,1 2.26Ghz 8 core, 32 GB RAM, OSX 10.8.4 Adobe CC Resolve 9.1.5 FSI LM-2461W BM Decklink Studio 2 BM Desktop Video 9.7.7 After searching the forums, I'm here seeking the collective wisdom of the Adobe CC user base. I'

  • Multi block validation in a form using nested tables

    Hi, I have a tables that contains 3 nested tables plus some other varchar2 columns. I have created a form with 4 blocks, three of them based on views on the 3 nested tables, the forth containing the other columns from the table. I have created relati

  • Can I play multiple clips (sequentially) in QuickTime?

    For a sports event, I often shoot about 30 video clips, totaling an hour or so, pausing during timeouts etc.  My camera is a HD Panasonic, shooting AVCHD onto a SD card.  Is it possible to later put the SD card into my laptop and play the whole game?

  • LabVIEW tag control question

    I have create a tag control and add several page in the tag control. I want to disable one of the page in tag control by property node. However, it disable the tag control and also all pages. Is there any method to disable one of the page in tag cont

  • InCopy vs. Indesign

    I am currently considering whether to purchase the Indesign program or merely "settle" with the InCopy version. Might anyone give some input as regards to when it would be beneficial to choose IC i/o ID? Could I change pictures, text and to some exte