Indexing Problem with FILE_DATASTORE and .pdf files

Hello all,
Do any of you have an example showing how to index .pdf files through FILE_DATASTORE? I am able to successfully index text and .doc files but not a .pdf file. Below is the script that I use to index my files:
create index myindex on mytable(docs)
indextype is ctxsys.context
parameters ('datastore COMMON_DIR filter ctxsys.null_filter');
I am using Oracle 8.1.6
Thank you!!!
-garrett

I don't think you can index anything other than plain ASCII text, because you are not using the INSO filter.
Use preferences like this:
exec ctx_ddl.drop_preference('NO_PATH');
exec ctx_ddl.create_preference('NO_PATH','FILE_DATASTORE');
exec ctx_ddl.drop_preference('MY_LEXER');
exec ctx_ddl.create_preference('MY_LEXER','BASIC_LEXER');
exec ctx_ddl.set_attribute('MY_LEXER','MIXED_CASE', 'NO');
exec ctx_ddl.set_attribute('MY_LEXER','INDEX_THEMES','NO');
exec ctx_ddl.set_attribute('MY_LEXER','INDEX_TEXT', 'YES');
exec ctx_ddl.drop_Preference ('MY_FILTER');
exec ctx_ddl.create_Preference ('MY_FILTER','INSO_FILTER');
exec ctx_ddl.drop_section_group ('MY_SECTION');
exec ctx_ddl.create_section_group ('MY_SECTION','NULL_SECTION_GROUP');
drop index i_filenames;
create index i_filenames on filenames (filename)
indextype is ctxsys.context
parameters ('datastore NO_PATH
section group MY_SECTION
lexer MY_LEXER
filter MY_FILTER
memory 10M');
The important part is the INSO_FILTER preference.
Thomas

Similar Messages

  • Problem with saving a pdf file to computer. Continually get an error message "This document could not be saved. There was a problem reading this document (21)."

    Need advice on a file-saving issue. I'm having a problem saving a .pdf file to my computer. I continually get the error message "This document could not be saved. There was a problem reading this document (21)." This is new; the error message only recently started to pop up.

    More information about this issue can be found here:
    https://forums.adobe.com/thread/1672655
    A "quick" fix that worked for me was to uninstall Adobe... then download the base install for Adobe Reader 11.0.
    Then download each of the individual updates and run them sequentially. 
    I've installed back up to the last security update which is version 08 and have been able to do normal Save As operations.
    You will have to disable automatic updates in order to stay at version 08 until Adobe resolves this issue in a later release.
    http://www.adobe.com/support/downloads/product.jsp?product=10&platform=Windows
    Adobe Reader 11.0 - Multilingual (MUI) installer    AdbeRdr11000_mui_Std
    Adobe Reader 11.0.01 update - Multilingual (MUI) installer    AdbeRdrUpd11001_MUI.msp
    Adobe Reader 11.0.02 update - All languages    AdbeRdrSecUpd11002.msp
    Adobe Reader 11.0.03 update - Multilingual (MUI) installer    AdbeRdrUpd11003_MUI.msp
    Adobe Reader 11.0.04 update - Multilingual (MUI) installer    AdbeRdrUpd11004_MUI.msp
    Adobe Reader 11.0.05 security update - All languages    AdbeRdrSecUpd11005.msp
    Adobe Reader 11.0.06 update - Multilingual (MUI) installer    AdbeRdrUpd11006_MUI.msp
    Adobe Reader 11.0.07 update - Multilingual (MUI) installer    AdbeRdrUpd11007_MUI.msp
    Adobe Reader 11.0.08 security update - All languages    AdbeRdrSecUpd11008.msp

  • Problem with Preview and PSD files - random gray square

    Hi guys, hope you can help...
    I've got a problem with Preview and PSD files.
    If I open in Preview both an original jpg straight from my reflex and the Photoshop version of the same picture, the psd file shows a gray square (what seems to be an unrendered part of the image) in a random area of the photo (sometimes in the center). The square is quite big...
    If I zoom in or out it disappears... if I scroll to another photo and then back to the psd, the square is there again... sometimes in a different position.
    I've tried the same psd on my older iMac with leopard...and got no problem at all.
    I suspect it got something to do with my Ati...
    (this is the second iMac 27...the first went back for gray banding on the lcd screen and flickering and yellow tinge........)
    Thanks for your help.
    DAve.

    maybe I'm onto something...
    I've just found out that opening Preview in 32bit mode (instead of default 64bit) works flawlessly with any psd files. If I switch back to 64bit mode, Preview is much faster but the gray square comes back...
    It seems like the i7 is much faster than the Ati....
    Any more realistic ideas?

  • Index failed for Docs and PDF files with return code 8002

    Hi All,
    I am trying to index Word documents in a WebDAV folder in KM. When I add a text document, the indexing completes correctly and I can find the document via the search. When I add a Word document or PDF to the folder and reindex, I get the message "Index failed with return code 8002".
    How can I solve this problem?
    Kind Regards,
    Richard Middelburg

    Hi,
    I have changed the host already from localhost into the hostname of the linux server. I tried this also with the FQDN of the server.  I have entered the following entry
    Hostname linux = sap12
    In the URL generator service = http://sap12
    Must I restart the portal?
    Richard

  • Problem with color in pdf files

    My documents contain color logo.
    Whenever converted into pdf file, I always see color logo in the pdf files.
    Today, the logo came out to be in black and white.
    I repeated many times even with Adobe PDF (printer driver).
    But logo came out in B & W.
    I am wondering how to fix this problem.
    I currently use WinXP Pro SP2-Acrobat Pro 7.10.
    Note: Last week, v. 7.09 was updated to 7.10.
    I wonder whether this update contains a bug that produces B&W images.

    Please ignore this post.
    I would like to ask the moderator to delete this post.
    (Note: I found that the problem has nothing to do with pdf but with the original xls invoice documents.)

  • Problems with Contribute3/Dreamweaver8 & PDF files

    Does anyone know of problems downloading PDF files that have
    been uploaded with either Contribute 3 or Dreamweaver 8 to a
    server. I seem to be able to read these files in FireFox but not
    IE6 - don't know what I am doing wrong - HELP !!

    Maybe the Acrobat Reader association with IE has got corrupted. Why not try to download an updated reader and install it?
    Paul Whitham
    Certified Dreamweaver MX2004 Professional
    Adobe Community Expert - Dreamweaver
    Valleybiz Internet Design
    www.valleybiz.net
    "mitchellbutt" wrote:
    > Does anyone know of problems downloading PDF files that have been uploaded
    > with either Contribute 3 or Dreamweaver 8 to a server. I seem to be able to
    > read these files in FireFox but not IE6 - don't know what I am doing wrong -
    > HELP !!

  • Problems with Lion and PDF's

    I recently upgraded to Mountain Lion 10.8.2. Among other problems, when I download a PDF file from a site I use daily, I try to open it in Preview and get an error message "The file---could not be opened. It may be damaged or use a file format that Preview doesn't recognize". However, if I download the same file a second time, replacing the first, it opens just fine. The first file also won't open with Acrobat 9.0. How do I fix this? Thanks.

    Boot up holding Cmd+R, or hold the Option/Alt key and choose the Recovery partition, then use Disk Utility from there to Repair the Disk, then Repair Permissions.

  • Problem with Java and Zip Files

    Hello there everyone. I have a problem with trying to zip up directories with empty folders in them. They zip fine with my code, but according to winzip, the number of files in the archive is incorrect. It's supposed to have a total of 288 files in it, but when it's zipped up, it only says 284. I mention specifically the "empty directories" note because the zipping works fine without empty folders.
    Below is the code for the zip method:
    public static void zip(File[] list, File zipfile, File zipRoot)
            throws IOException {
        if (!zipfile.exists()) {
            zipfile.createNewFile();
        } else if (!zipfile.isFile()) {
            throw new IOException("The zip file, " + zipfile.getAbsolutePath()
                    + " is not a file.");
        }
        if (list.length == 0) {
            LOG.error("The list of files to zip up is empty.");
        }
        for (int i = 0; i < list.length; i++) {
            if (!list[i].exists()) {
                throw new IOException("The file or directory, " + list[i].getAbsolutePath()
                        + " does not exist.");
            }
        }
        FileOutputStream fos = new FileOutputStream(zipfile);
        ZipOutputStream zos = new ZipOutputStream(fos);
        for (int i = 0; i < list.length; i++) {
            if (LOG.isDebugEnabled()) {
                LOG.debug(i + ": " + list[i].getName());
            }
            String entryName = getRelativeName(list[i], zipRoot);
            if (list[i].isDirectory()) {
                if (list[i].listFiles().length == 0) {
                    // Empty directory: add an explicit entry (name ends with "/").
                    ZipEntry entry = new ZipEntry(entryName + "/");
                    zos.putNextEntry(entry);
                }
            } else {
                ZipEntry ze = new ZipEntry(entryName);
                zos.putNextEntry(ze);
                byte[] buffer = new byte[8096];
                FileInputStream fis = new FileInputStream(list[i]);
                int read = fis.read(buffer);
                if (LOG.isDebugEnabled()) {
                    LOG.debug("\tFound " + read + " bytes.");
                }
                // read == -1 on the first call means an empty file; its entry is still kept.
                while (read != -1) {
                    zos.write(buffer, 0, read);
                    read = fis.read(buffer);
                }
                fis.close();
                zos.closeEntry();
            }
        }
        zos.close();
    }
    The files look like they're there, but I need the system to be able to determine the number correctly. 
    Here's the interesting thing:  It zips the files, and then when I use the size() method for zip files in java, it says 284 files.  But when I unzip, it says 288 again.  It's like there's files missing when compressed, but when decompressed, they return.  Note that the files are actually there.  If I open the archive in a third party app such as Winzip AND Winrar AND IZarc, they all show 288 files.  Any idea what would make the number of files appear incorrectly in a zip file when zipped by java with the code above?  Thanks in advance.
    - Chris

    I figured out the problem. When zipping files in Windows using WinZip, a folder is not counted as an entry in its own right. WinZip will create the folders for files to go in, but the folder itself is not counted when you query the archive's info. You have more control over the zip file in Java, and can choose whether or not to count a folder as an entry.
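
The counting difference described above can be reproduced with a small sketch. It uses only java.util.zip; the class name ZipEntryCount and the sample entry names are made up for illustration. It builds an in-memory archive with one file and one empty directory, then counts the entries with and without directories.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;
import java.util.zip.ZipOutputStream;

public class ZipEntryCount {

    /** Returns {total entries, entries that are not directories}. */
    static int[] countEntries(byte[] zipBytes) throws IOException {
        int total = 0;
        int files = 0;
        try (ZipInputStream zin = new ZipInputStream(new ByteArrayInputStream(zipBytes))) {
            ZipEntry entry;
            while ((entry = zin.getNextEntry()) != null) {
                total++;
                // isDirectory() is true when the entry name ends with "/"
                if (!entry.isDirectory()) {
                    files++;
                }
            }
        }
        return new int[] {total, files};
    }

    /** Builds a tiny sample archive: one file plus one explicit empty directory. */
    static byte[] sampleZip() throws IOException {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        try (ZipOutputStream zos = new ZipOutputStream(bytes)) {
            zos.putNextEntry(new ZipEntry("docs/readme.txt")); // hypothetical name
            zos.write("hello".getBytes("UTF-8"));
            zos.closeEntry();
            zos.putNextEntry(new ZipEntry("docs/empty/")); // hypothetical empty folder
            zos.closeEntry();
        }
        return bytes.toByteArray();
    }

    public static void main(String[] args) throws IOException {
        int[] counts = countEntries(sampleZip());
        System.out.println("total=" + counts[0] + " files=" + counts[1]);
    }
}
```

With the sample archive this prints total=2 files=1, so whether the "file count" is 2 or 1 depends only on whether the tool counts the directory entry.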

  • Problem with downloading a PDF file from a webserver run on Linux

    Hi,
    I've written a simple functionality that manages file attachments.
    Everything works fine (attaching, downloading, deleting) when the webserver runs on Windows.
    However when I deployed the code to the Resin webserver run on Linux and use the Win browser to connect to the app, the downloading of PDF file doesn't work (uploading and downloading of txt, doc, xls, jpg files is OK).
    The downloaded PDF file is almost twice as big as original (~28KB when original is ~12KB) and it can't be open.
    I guess it is the problem of writing to the output stream of HttpServletResponse but I can't localize the problem.
    Here is the code I use for downloading file:
    private boolean downloadFile(HttpServletResponse response, String filePath,
            String originalFilename) {
        File file = new File(filePath);
        String contentType = URLConnection
                .guessContentTypeFromName(originalFilename);
        // If the content type is unknown set the default value.
        if (contentType == null) {
            contentType = "application/octet-stream";
        }
        BufferedInputStream input = null;
        BufferedOutputStream output = null;
        try {
            input = new BufferedInputStream(new FileInputStream(file));
            int contentLength = input.available();
            response.reset();
            response.setContentLength(contentLength);
            response.setContentType(contentType);
            response.setHeader("Content-disposition",
                    "attachment; filename=\"" + originalFilename + "\"");
            output = new BufferedOutputStream(response.getOutputStream());
            int bufSize = 10000;
            byte[] buf = new byte[bufSize];
            int bytesNo = 0;
            while ((bytesNo = input.read(buf, 0, bufSize)) != -1) {
                output.write(buf, 0, bytesNo);
            }
            output.flush();
            input.close();
            output.close();
        } catch (IOException e) {
            log.debug(e.getMessage());
            e.printStackTrace();
        }
    }
    Can you point out any problem?
    Thanks in advance,
    Ala

    matali wrote:
              int bufSize = 10000;
              byte[] buf = new byte[bufSize];
              int bytesNo = 0;
              while ((bytesNo = input.read(buf, 0, bufSize)) != -1) {
                   output.write(buf, 0, bytesNo);
              }
    This piece is completely wrong and doesn't work for files bigger than 10000 bytes. Replace it by
    byte[] buffer = new byte[10240]; // 10KB exactly.
    int length = 0;
    while ((length = input.read(buffer)) > 0) { // Read next 10KB of input to buffer.
        output.write(buffer, 0, length); // Write specified length of buffer to output.
    }
    Or just use output.write(input.read()) in a loop of the contentLength as you're already using BufferedInputStream/BufferedOutputStream.
    Another thing, the method is declared to return a boolean, but it actually doesn't? I would just let it throw an exception in case of failure instead of returning a boolean and let the calling method handle the exception.
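
As a rough illustration of the "throw instead of returning a boolean" suggestion, here is a minimal, servlet-free sketch of a stream copy. The class and method names (StreamCopy, copy) are made up for this example, and the in-memory streams stand in for the file and the response output stream. In a real servlet the Content-Length would more safely come from File.length() than from available(), which only reports how many bytes can be read without blocking.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.OutputStream;
import java.util.Arrays;

public class StreamCopy {

    /** Copies every byte from in to out and returns the number of bytes copied. */
    static long copy(InputStream in, OutputStream out) throws IOException {
        byte[] buffer = new byte[8192];
        long total = 0;
        int read;
        // read() returns -1 at end of stream and may fill only part of the buffer
        while ((read = in.read(buffer)) != -1) {
            out.write(buffer, 0, read);
            total += read;
        }
        out.flush();
        return total;
    }

    public static void main(String[] args) throws IOException {
        // Payload deliberately larger than the buffer, so several reads are needed.
        byte[] data = new byte[25000];
        for (int i = 0; i < data.length; i++) {
            data[i] = (byte) i;
        }
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        long copied = copy(new ByteArrayInputStream(data), out);
        System.out.println(copied + " bytes copied, identical="
                + Arrays.equals(data, out.toByteArray()));
    }
}
```

Because copy declares IOException, the caller decides how to report a failed download instead of inspecting a boolean.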

  • OBIEE 11g having problems with password protected PDF files.

    I have been able to get an analysis in OBIEE 11g to display PDF files.
    However, some of these documents contain sensitive information and must be secured. Since anyone with access to the file name
    could simply type in the proper path in the browser window, this is unacceptable. In order to try and prevent this, I created a pdf file
    that is protected with a password.
    Opening the file by itself, produces the desired results. The password is requested before the file will open.
    When I open the file through my analysis in OBIEE, Adobe reader activates, but the password is not requested and the file does not open.
    It is as if OBIEE is somehow not sensing that Adobe is asking for a password.
    Does anyone have any experience with this?

    FYI, in case anyone is interested, I found out what is going on.
    I created the original password protected PDF using Microsoft Word. I did this because I do not have a full version of Adobe Acrobat that allows me to create files.
    On a hunch, I found someone that has a full version of Acrobat, and had them create a password protected PDF file. This file worked perfectly.
    Apparently, Word is not strictly adhering to PDF guidelines, and OBIEE senses the differences, resulting in the file not opening properly.
    Something to keep in mind for anyone linking to password protected PDF files in OBIEE.

  • Problems with reading a pdf-file using Safari/Adobe Acrobat Pro

    Using Adobe Acrobat Pro 9.5.5 and Safari in Snow Leopard, I open the PDF file at http://dare.uva.nl/document/505726 and small boxes appear in the text, for instance on pages 70 and 74, on the last page, etc.
    I don't know why those small boxes appear only on my MacBook Pro with this software installed, while others don't see corrupted text. If I right-click in the PDF file and choose "Open with Adobe Acrobat Pro", the PDF file opens correctly (without small boxes)!

    As you discovered, Adobe Acrobat/Pro/Reader v.9 will crash for all managed user accounts that store their home folders on the server because it has "difficulty" reading and writing to a folder across a network, which is why it crashes shortly after it launches.
    HOWEVER, there is hope. All you need to do is tell Adobe to write all of its files to the local drive rather than the home folder on the sharepoint, as follows:
    While logged into your network-user account:
    1. Go to the "Shared" folder on the local drive. (Local HD > Users > Shared)
    2. Create a new folder named "9.0_x86" if you have an Intel or "9.0_ppc" if you have a PPC (without the quotes for either case). You will have to authenticate as an Administrator.
    3. Go to the Acrobat folder within the user's Application Support and delete the folder in it.
    (Home > Library > Application Support > Adobe > Acrobat > 9.0_x86 or 9.0_ppc)
    4. Open/launch the Terminal application found within the Utilities folder on your Mac.
    (Applications > Utilities > Terminal.app)
    5. In Terminal, enter the 1st command for Intel or the 2nd for PPC:
    ln -s /Users/Shared/9.0_x86 ~/Library/Application\ Support/Adobe/Acrobat/9.0_x86
    OR
    ln -s /Users/Shared/9.0_ppc ~/Library/Application\ Support/Adobe/Acrobat/9.0_ppc
    Adobe Acrobat 9 should now work properly. You may get the "quit unexpectedly" error message the first time you quit the application, but should only happen the one time.
    On a side note, the print spool is slow when printing very large multi-paged PDFs.
    This is a fix I found in 2009 by Dennis, can't remember where though.
    Message was edited by: Gabriel Prime

  • Problems with jdbc and .jar files.

    I am having a problem with connecting to my database when running my .jar file as an executable.
    It connects correctly when I use the java command in command prompt and it also runs correctly when using my IDE.
    Is there something that needs to be added to the manifest or in the code itself in order for the program to find the JDBC driver correctly?

    This is the error message that I get:
    java.lang.ClassNotFoundException: net.sourceforge.jtds.jdbc.Driver
    at java.net.URLClassLoader$1.run(Unknown Source)
    at java.security.AccessController.doPrivileged(Native Method)
    at java.net.URLClassLoader.findClass(Unknown Source)
    at java.lang.ClassLoader.loadClass(Unknown Source)
    at sun.misc.Launcher$AppClassLoader.loadClass(Unknown Source)
    at java.lang.ClassLoader.loadClass(Unknown Source)
    at java.lang.ClassLoader.loadClassInternal(Unknown Source)
    at java.lang.Class.forName0(Native Method)
    at java.lang.Class.forName(Unknown Source)
    at AboutFrame2$Connect.getConnection(AboutFrame2.java:109)
    at AboutFrame2$Connect.displayDbProperties(AboutFrame2.java:127)
    at AboutFrame2.<init>(AboutFrame2.java:72)
    at MainStartFile2.main(MainStartFile2.java:7)
    Error Trace in getConnection() : net.sourceforge.jtds.jdbc.Driver
    Error: No active Connection
    I think that it is because it cannot find the driver. I am going to add the path to the manifest and see if that works thank you for all the help.
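
For context: when a program is started with java -jar, the JVM takes its classpath from the jar and its manifest's Class-Path header and ignores -cp and the CLASSPATH variable, which would explain why the driver is found from the IDE but not from the executable jar. The sketch below is only a diagnostic aid; the class name DriverCheck is made up, and the jar file name in the comment is a hypothetical placeholder.

```java
public class DriverCheck {

    /** Returns true if the named class can be loaded from the current classpath. */
    static boolean isLoadable(String className) {
        try {
            Class.forName(className);
            return true;
        } catch (ClassNotFoundException e) {
            return false;
        }
    }

    public static void main(String[] args) {
        // Driver class name taken from the stack trace above.
        String driver = "net.sourceforge.jtds.jdbc.Driver";
        // Shows what the JVM actually searches; under "java -jar" this is just the jar itself.
        System.out.println("java.class.path = " + System.getProperty("java.class.path"));
        System.out.println(driver + " loadable: " + isLoadable(driver));
        // If this prints "false" under "java -jar", a manifest line such as
        //   Class-Path: lib/jtds.jar     (hypothetical path, relative to the application jar)
        // is one way to make the driver jar visible.
    }
}
```

Running this once from the IDE and once via java -jar should show the difference in java.class.path directly.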

  • Problems with syncing and transferring files

    Hi there. Im having problems with syncing my Ipod Touch and uploading files. Ive managed to transfer multiplr MP3 fils but i noticed that when i disconnectktwith my PC. All the sings are gone. What could have haplened?
    Please help.
    Thanks.

    Apologies for the misspellings.
    Basically, I tried uploading my MP3 files (one by one) to my device, then I tried uploading films. When I disconnected my device, all the songs on it were gone. I do not know exactly what happened.
    It would seem the current iTunes version is "unstable". I uploaded 50 songs one by one, only to find that they had been wiped from the device for some reason.
    any explanation for this?
    Thank you.

  • [solved] Problem with Amarok and flac-files

    When I installed Arch for the first time a couple of months ago, I was happy to find that some flac-files that didn't play in Amarok on my Debian installation now played back perfectly. However, upon trying to play these files today, Amarok refuses to play them. I get an error stating that there is no suitable demux plugin for the files. Other flacs play back fine though, and the problem files play back perfectly in mplayer.
    Another thing I would be very happy to resolve is that I have a few files that use japanese in their tags. IIRC these displayed correctly in Amarok on my Debian install, but in Arch all I see is squares where there should be japanese, despite me installing both arphic-ukai and arphic-uming packages. What am I missing?
    All help appreciated!
    Last edited by Emphrygian (2007-04-20 22:36:45)

    Your tip worked like a charm and the files now play perfectly in Amarok once again.
    I solved my second problem by installing the ttf-sazanami package from AUR and adding both sazanami gothic and sazanami mincho as substitute fonts for Sans Serif (my default font) in qtconfig (not sure that I really needed both though, but hey, it works now ).
    Last edited by Emphrygian (2007-04-20 22:35:37)

  • Windows 8 Elements 11 Problem with DNG and CR2 files

    Have a Windows 8 machine with Elements 11 which has Camera Raw 7.4 installed and showing correctly in the editor. The Browser shows thumbnails of the DNG and CR2 files, but when an attempt is made to edit them, the editor opens but does not open the file. Same thing happens if I try to open the file using the File Open from the menu.
    Also I have been through the help and forums and found an article about opening jpeg files as Camera Raw files (http://blogs.adobe.com/pselements/open-jpeg-files-in-adobe-camera-raw-in-photoshop-elements/) and this does not work either.
    Help please !

    Disclaimer:  The following applies to WINDOWS EXPLORER.  I don't use Bridge so I don't know if it's applicable at all to Bridge.
    Adobe does not provide a codec for viewing Canon raw files as thumbnails.  Canon has one for free but it only works on a 32 bit Windows system.
    Early this year I found a free download for displaying raw file thumbnails that works on an x64 Windows system.  Since that time the author realized there's a real demand for this software and has started charging for it.  In any case, I find it works very well.
    http://www.fastpictureviewer.com/codecs/
    -Noel
