How to search text in pdf file?

Hi all
I have to store the cover of a newspaper that include images and text and then should be able to search keywords in the cover.
I've read about to store in pdf format and use intermedia text.
I am just wondering the way to store and to do the search .
Thanks all

Hi,
You need store the PDF document in a BLOB column and create a CTXSYS index type.
e.g.: (.doc files)
CREATE INDEX I_DOC ON DOC_TABLE (DOC_COLUMN) INDEXTYPE IS CTXSYS.CONTEXT PARAMETERS ('SYNC (ON COMMIT)');Then you can test typing this SQL below:
select score(1) from DOC_TABLE where contains(DOC_COLUMN, 'My text', 1) > 0;In my case, i use this index for purpose to search on Word Documents (.doc)
Maybe this link help you to create an index type using FILTERS, in order to search on PDF files:
http://www.oracle.com/technology/products/text/htdocs/altfilters.htm
Cheers

Similar Messages

  • Search text in PDF file

    I would like to text search in pdf file, through java (VJ++), is it possible through java.io, i'm getting junk text.
    also tried to add COM wrapper through VJ++, but file is not getting loaded ?? any examples ??
    Thank you

    any ideas
    searching for PDF
    help required

  • How to control text for PDF Files in the BI7.0 Broadcaster

    Hi ,
    I am running a web template in the BI7.0 Broadcaster. The output prints in the PDF format. In the precalculation tab I have set up 4 variables(VAR1,VAR2 .. etc) . The main folder where these PDF files get stored I can control the text but dont know how to change the PDF file names which fall into this folder. All the PDF files have the technical name and number(like XXX_00000,XXX_00001 etc.) How can I control the text for these files.
    Thanks,
    Kal

    HI Kal,
    it is not possible to get any influence on the filenames. If you use Filternavigation the filename is generated to be unique.
    best regards,
    kai

  • How to search in several pdf files at once

    how is it possible to make one pdf file starting from 20 pdf's in order to search for an item in one operation.
    I do not need to change the contenance (pro edition off adobat?)

    Adobe Reader has the ability to search multiple pdf's. Just open it, select Search from the Edit menu and select All PDF Documents in from the Where would you like to search? option group.
    You could also set up a search folder in Finder. Add a search category of "Kind" is "PDF." Limit the scope to whatever folder holds them all.

  • How to search text in pdf

    Dear All,
                 i just want to search particular text in pdf and get the page no where text situated.
    Please do the needfull.
    Regards,
    Parthasarathy.S

    Hi,
        im using it in desktop only windows xp. im using visual studio 2005 and VB.net for development..can you help me with some sample codes...
    Thanks in advance.
    Parthasarathy.S

  • How to search for all PDF files on my hard drive

    How can I search for all of a file type on my hard drive without getting documents that happen to have the text in them?  i.e. PDF or DOC or XLS
    Thank you
    Peg

    Put the search term in quotes, and do not forget the .  (dot)
    Results (partial list) of Spotling search for ".pdf". There is a section listing PDF documents.

  • How to highlight text in pdf file programmatically

    I am using below mentioned code to highlight text programmaticaly using VB.net But required text is getting selected but doen't get permanently highlighted. Can anybody help me out.
    Dim gAvDoc As
    Acrobat.CAcroAVDoc 
    Dim myhitlist As
    Acrobat.CAcroHiliteList 
    Dim mypdpage As
    Acrobat.CAcroPDPage 
    Dim acroPageView As
    Acrobat.CAcroAVPageView 
    Dim textSelect As
    Acrobat.CAcroPDTextSelect 
    Dim gPDFPath As
    String
    Dim gApp AsAcrobat.AcroApp 
    gPDFPath =
    "C:\Test.pdf"
    gApp = CreateObject("AcroExch.App") 
    gApp.Show()
    gAvDoc = CreateObject("AcroExch.AVDoc") 
    gAvDoc.Open(gPDFPath,
    "")acroPageView = gAvDoc.GetAVPageView()
    acroPageView.GoTo(1)
    myhitlist = CreateObject(
    "AcroExch.HiliteList")myhitlist.Add(50, 10)
    mypdpage = acroPageView.GetPage()
    textSelect = mypdpage.CreateWordHilite(myhitlist)
    gAvDoc.SetTextSelection(textSelect) 
    gAvDoc.ShowTextSelect()
    gAvDoc.Close(True) 

    Patrick on writting saving code also its not hightlighting the text. Text gets selected but it doen't get highlighted with yellow color...
    Dim gAvDoc As
    Acrobat.CAcroAVDoc 
    Dim myhitlist As
    Acrobat.CAcroHiliteList 
    Dim mypdpage As
    Acrobat.CAcroPDPage 
    Dim acroPageView As
    Acrobat.CAcroAVPageView 
    Dim textSelect As
    Acrobat.CAcroPDTextSelect 
    Dim gPDFPath As String 
    Dim gApp As Acrobat.AcroApp 
    Dim mypddoc As Object
    gPDFPath ="C:\Test.pdf"
    'Initialize Acrobat by creating App object
    gApp = CreateObject("AcroExch.App")mypddoc = CreateObject(
    "AcroExch.PDDoc") 
    gApp.Show()
    mypddoc.Open(gPDFPath)
    gAvDoc = CreateObject("AcroExch.AVDoc") 
    gAvDoc = mypddoc.OpenAVDoc("testing")acroPageView = gAvDoc.GetAVPageView()
    acroPageView.GoTo(1)
    myhitlist = CreateObject(
    "AcroExch.HiliteList")myhitlist.Add(50, 10)
    mypdpage = acroPageView.GetPage() 
    textSelect = mypdpage.CreateWordHilite(myhitlist)
    gAvDoc.SetTextSelection(textSelect)
    gAvDoc.ShowTextSelect()
    mypddoc.Save(PDSaveFull, gPDFPath)
    gAvDoc.Close(True)

  • Editing text from pdf file

    how to edit text from pdf file?

    Adobe Reader does not allow editing the text of a PDF document. You will need to get Acrobat on your Windows or Mac to do that.

  • How can I prevent a PDF file from being copied, printed or downloaded? Students should only be able to view the text and and not distribute it in any way.

    How can I prevent a PDF file from being copied, printed or downloaded? Students should only be able to view the text and and not distribute it in any way.

    You can prevent it from being printed by applying a security policy to it
    in Acrobat. The rest can't be prevented, unless you spend  a LOT of money
    on DRM protection.

  • How to Extract the Highlight Text in PDF File

    Hi Scripters,
    i want know, how to extract the hightlight text in pdf files for text only format for (*.txt) file extension save.
    regards
    baby

    Hi,
    Okay i'll try do best.
    thanks for your reply.
    Regards
    Baby

  • How to use OCR Font A type by the time of writing some text into Pdf fil

    Hi,
    I am generating one pdf file in java. How can I use OCR Font A for text of pdf file ..Please can any one help where can I get OCR Font A and how to use that one in java ... I want to write some text into pdf file and that text should use OCR Font A family ...
    Thanks.

    This document shows how to disable OCR during conversion; just do the opposite: https://forums.adobe.com/docs/DOC-3062

  • How to write a unicode text in pdf file

    Dear Friends,
    I am a beginner in acrobat pdf plug-in development. I was trying to write a unicode text (Tamil text) into pdf file.
    Using same api I am able to write english text in time-roman, areal etc fonts. But I am not able to write tamil texts.
    The code is as below:
            memset(&pdeFontAttrs, 0, sizeof(pdeFontAttrs));
            pdeFontAttrs.name = ASAtomFromString("Latha");
            pdeFontAttrs.type = ASAtomFromString("TrueType");
            pdeFont    = PDEFontCreateFromSysFont(                                        \
                            PDFindSysFont(&pdeFontAttrs, sizeof(pdeFontAttrs), 0),    \
                            kPDEFontCreateEmbedded);
            pdeText = PDETextCreate();
            PDETextAdd(pdeText, kPDETextRun, 0, (ASUInt8 *)buffer, _tcslen(buffer),
                                    pdeFont, &gState, sizeof(gState), NULL, 0, &textMatrix, NULL);
            PDEContentAddElem(pdeContent, kPDEAfterLast, (PDEElement)pdeText);
            PDPageSetPDEContent(pdPage, gExtensionID);  
            PDPageReleasePDEContent (pdPage, gExtensionID);
    KIndly assume that PDEGraphicsState and PDETextMatrix are set properly set, I am not pasting entire code to avoid complexity.
    Thank you,
    Safiq

    Dear lrosenth,
    I went through some codes/suggestions in internet and I found that I need to have cmap file and cid font file for the respective font since pdf doesn't support unicode fonts directly.
    Can you help me to know where can I get cmap file and cid font file for tamil language font Latha(TrueType) microsoft font.
    Regards,
    Safiq

  • How do I open a PDF file with Adobe Reader if it is in iBooks

    I have a number of PDF files in my iBooks library.  I just downloaded Adobe Reader for the iPad.  I can't see how I can open those PDF files.  The only thing I seem to be able to do with them is to open them from iBooks directly.  Doing that does not provide me with the features that Adobe Reader would provide (such as searching for particular text, etc.).

    Use iTunes to save the PDF FROM iBooks, and to add them to Adobe Reader.
    Connect your iPad to your computer. Bring up iTunes on your computer.
    To save a PDF FROM iBooks, click on "Books" in the Library section on the left. Select the PDF you want to save and right-click and choose Show in Finder (Show in Explorer in Windows). Go up one level in your computer hierarchy and you'll see all the "Books" (PDFs and eBooks) in iBooks.  You can leave them here if you want to view the PDFs in iBooks in the future, or drag them to the Desktop.
    To move the PDF TO Adobe Reader:
    Return to iTunes. Click on the iPad on the left in Devices. Click on the Apps tab. Scroll down the the File Sharing section as shown below. Click on Adobe Reader. Click the Add button and select the PDFs you identified in the previous step. They'll be added to Adobe Reader

  • How do I combine already pdf files in pack?

    How do I combine 2 pdf files in adobe pack?

    So, if you have a PDF pack account, make sure to log in.
    Online, you would go to https://cloud.acrobat.com/combinepdf, by clicking on the "Combine Files" tab.
    Next, click the large blue button that says, "Select Files to Combine."
    Wait for the shadow-box window to appear, and select the PDF files you want to combine, either by selecting them in your computer's filesystem, from the pop-up window, or by dragging the files into the shadow-box.
    Alternatively, you may switch to Acrobat.com files, and upload any files that you have stored there.
    Click on the blue button stating continue.
    Enter a name in the box with light gray text stating "Enter combined PDF file name"
    When done, press combine.
    You should see a light grey status box in the top right stating the status of the combining, and when finished, you will see the file appear in the section toward the bottom half of your browser stating "Combined Files" - it should be the document to the furthest left-side. (If your lucky, it is already there.)
    Hopefully, this will help you!
    -Alex
    Like what you see? Is it a correct answer? If so, you should vote a correct answer on this answer.
    You, too, can be an awesome user - just remember - see something, say something!@

  • Why can't I "Save as Text" a pdf file received as an email attachment?

    I can "Save as text" a pdf file which I have created in my own computer (that is, it goes into MS notebook that I then can Copy and Save as an MS Word file) but not when I receive a pdf as an email attachment. (The file is saved, but it is empty.) Why would I want to convert my own pdf back to text? Well, in case I no longer have the original Word document I suppose, but the thing is "Save as text" works with my pdf, but not with those I recieve from others. How come? Thanks!

    Is this a scanned PDF? If so, it must first be OCR'd.

Maybe you are looking for

  • Convert this number to a date

    Hi, from my database there is a field for date and it is a number like this: 1295944970.   This corresponds to a date displayed as 1/25/2011 displayed on the website.   Does anybody know how to convert the data 1295944970 from the database date field

  • Custom Cluster and role of INDX in that???

    Hi All Gurus, I need to create a custom cluster and then import and export the data to DB. I saw some threads where i saw INDX being used. Can anybody tellme what is this INDX table and how does it play a in creation of custom cluster also how do i c

  • Business Support Services

    As a business customer with fios internet and telephone, what support services are available to my business?  I am speaking about 'on site' support;  inside wiring, local network issues, printer connections, etc ??  Does my 'business' account offer m

  • Restriction needed for incorrect 543- O goods movement

    Dear All, I have a Subcontracting BOM like 100 PC of Material A is produced by 100 KG of material B. Suppose, Vendor end stock is 500 Kg in a single Batch. Then at the time of GR in MIGO if I am giving GR quantity of Material A is 100 PC then system

  • MY BLUETOOTH WONT TURN ON HELP

    bluetooth stopped working. it wont turn on anymore. ive tried turning it on but no response.