Unable to perform content search in PDF files
Hi All,
I am able to search the file based on content in .doc and .txt however the search result has not included the .pdf files despite it includes the content I am searching for.
Do I need set any domain properties.
Thankx.
Krrish7.
Perform a FileManager.updateDocument and supply the id of the document to update along with a definition containing the new file name (Attributes.NAME).
For example ... to rename document.out to document.pdf
ManagersFactory session = ....
FileManager fileM = session.getFileManager();
Item doc = fileM.resolvePath("/path/to/document.out", null);
// rename to document.pdf
fileM.updateDocument(doc.getId(),
new NamedValue[]
ClientUtils.newNamedValue(Attributes.NAME, "document.pdf")
}, null);
Note also that you could add a new format from Enterprise Manager for ".out" files, if all ".out" files were of type pdf.
cheers
Matt.
Similar Messages
-
CF perform word search on PDF files?
Can CF MX (6.1 or 7) perform a word search of PDF documents?
What I would like to do, at the minimum, is have CF search
PDF files located in a directory for a specific word, and return a
list of files that have that word (or phrase) in them.
am I asking too much?
Thanks for any and all help.
RussYes. Use the Verity search engine that comes with
ColdFusion. -
Merge/append the contents of a pdf file to an MS Word document in JAVA
Hi experts,
I have a requirement to merge/append the contents of a pdf file to an ms word document(Not to be done by including pdf file objects in word). Please suggest some java libraries or ways to do it using java. Any pointers would be appreciated.
Regards,
SubinOdonnells52 you will still need to install Adobe Acrobat Professional prior to being able to make edits to PDF or utilize Acrobat. By default the installation files have been saved to your download folder.
If you continue to face difficulties with the Adobe Download Assistant then please see Troubleshoot Adobe Download Assistant.
You can also download the installation files directly from Download Acrobat products | Standard, Pro | XI, X if you are unable to locate the downloaded installation files. -
Windows 7 32 bit search of PDF files does not work
I have installed Windows 7 on a 32 bit Dell Laptop and I can no longer search the content of my PDF files using Microsoft's search in Windows Explorer which I need to do. I’ve tried following the advice listed on several sites on line and nothing seems to work. My 64bit processor required a download of Adobe PDF iFilter 9 for 64-bit platforms and those systems work fine. All the write-ups suggest that Reader XI for 32 bit Windows 7 has the filter built in and Microsoft indexing indicates it is there but I still can't search for words in a PDF like I could using Windows XP.
Any advice would be greatly appreciated! Thanks in advance for any help.I will never understand why but in the end I rebuilt my 32 bit dell laptop from scratch and the pdf files can now be searched.
I cannot search them on a mapped drive as I was able to with Windows XP because now they must be indexed and windows 7 will seems not to allow a mapped location to be indexed which must be done to make the pdf files searchable so I have had to move the files to the local drive.
My Windows 7 64 bit systems can search the mapped drives just fine without needing to be indexed. Again I will never understand why this works and the 32 bit machine does not. -
Problem searching some PDF files in Acrobat Reader – Non-ASCII characters
Acrobat Reader cannot search some .pdf files. I have put an example document up on Scribd here.
Any attempt to search for any word that can be clearly seen to be in the document fails with “No matches were found.”
This example document is NOT a scanned document – words and characters can be selected.
A hex display tool shows that the characters in a PDF document that can be successfully searched are in the ASCII/1252 range (A=0x41, etc).
Copying and pasting characters in the example document to a hex display tool shows that the characters in the document are not in the ASCII range.
For example the letters A to Z in the example document are in the range ‘A’ = 0xDF (decimal 223), ‘B’ = 0xDE (decimal 222), through to ‘Z’ = 0xC6 (decimal 198).
However, characters in these non-ASCII ranges are displayed perfectly by Acrobat Reader, as can be see if the example document is opened.
Therefore, as Acrobat Reader knows what these characters are, it doesn’t seem unreasonable to say that it should be able to search for and find them.
Tests were performed using Acrobat Reader X v10.1.4.
Can anyone say what this problem is?Hi Pat, thanks for your reply.
Your reference to the title of that page being 'HARNESSES' indicates that, when you view that document in Adobe Reader, you are seeing 'HARNESSES', not
"ØßÎÒÛÍÍÛÍ". And that the remainder of the document is similarly being displayed in readable English language.
Yes as you say, you can search for 'ß' and get hits on 'A' (to use that as an example) in the example document.
But the need to form a word to be searched for into whatever code mapping this is using (for example having to enter "ØßÎÒÛÍÍ" for HARNESSES - I'm not even sure how that would be entered from a keyboard) doesn't seem to be very convenient.
Its clear the example document is using some code mapping other than ASCII / Windows-1252 (which has 'A' as 0x41). But it is also clear that Adobe Reader knows what that mapping is, and knows to use it, as its displaying (for example) 'A' for the code 0xDF.
So I guess the question is - why isn't Adobe Reader's knowledge of this mapping being extended to its search input? -
Hi,
I've got allmost every thing working now
except that searches on PDF files ddon't
produce the deisred results.
The filter seems on only search the pdf file
for infomation that one would seem in the
document info thru the acrobat reader!!
It doesn't seem to index the contents of the
pdf document as it does w/ other formats like
exel and word :(
Do I need to do any additional setup to crete
a more comprehendive index on these pdf files?
cheers,
VijayHi,
We have working intermedia successfully after
some fixes with tnsnames.ora and listner.ora..
This is for your reference.
1. You may need to change listner.ora and tnsnames.ora for creation of external procedure processes
2. Change listner.ora to include parameter
LD_LIBRARY_PATH
3. Restart listner process
Below is sample files
Regards,
Yogesh
Database support
Citibank,
NewYork, NY 10048
# LISTENER.ORA Configuration File:/export/opt/UNPACKAGED/oracle/8.1.6.0/sparc-solaris2/product/network/admin/listener.ora
# Generated by Oracle configuration tools.
# Modified Yogi 05/18/00
LISTENER =
(DESCRIPTION_LIST =
(DESCRIPTION =
(ADDRESS_LIST =
(ADDRESS = (PROTOCOL = IPC)(KEY = EXTPROC))
(ADDRESS_LIST =
(ADDRESS = (PROTOCOL = TCP)(HOST = ertdev9-1)(PORT = 1521))
SID_LIST_LISTENER =
(SID_LIST =
(SID_DESC =
(SID_NAME = PLSExtProc)
(ORACLE_HOME = /export/opt/UNPACKAGED/oracle/8.1.6.0/sparc-solaris2/product)
(PROGRAM = extproc)
(envs=LD_LIBRARY_PATH=/export/opt/UNPACKAGED/oracle/8.1.6.0/sparc-solaris2/product/lib:/export/opt/UNPACKAGED/oracle/8
.1.6.0/sparc-solaris2/product/ctx/lib )
(SID_DESC =
(GLOBAL_DBNAME = emdev1)
(ORACLE_HOME = /export/opt/UNPACKAGED/oracle/8.1.6.0/sparc-solaris2/product)
(SID_NAME = emdev1)
(envs=LD_LIBRARY_PATH=/export/opt/UNPACKAGED/oracle/8.1.6.0/sparc-solaris2/product/lib:/export/opt/UNPACKAGED/oracle/8
.1.6.0/sparc-solaris2/product/ctx/lib:/export/opt/UNPACKAGED/oracle/8.1.6.0/sparc-solaris2/product/ctx/bin)
# TNSNAMES.ORA Configuration File:/export/opt/UNPACKAGED/oracle/8.1.6.0/sparc-solaris2/product/network/admin/tnsnames.ora
# Generated by Oracle configuration tools.
# Modified Yogi 05/18/00
EMDEV1 =
(DESCRIPTION =
(ADDRESS_LIST =
(ADDRESS = (PROTOCOL = TCP)(HOST = ertnj.ssmc.com)(PORT = 1521))
(CONNECT_DATA =
(SERVICE_NAME = emdev1)
EXTPROC_CONNECTION_DATA =
(DESCRIPTION =
(ADDRESS_LIST =
(ADDRESS = (PROTOCOL = IPC)(KEY = EXTPROC))
(CONNECT_DATA =
(SID = PLSExtProc)
(PRESENTATION = RO)
<BLOCKQUOTE><font size="1" face="Verdana, Arial">quote:</font><HR>Originally posted by Vijay ([email protected]):
Hi,
I've got allmost every thing working now
except that searches on PDF files ddon't
produce the deisred results.
The filter seems on only search the pdf file
for infomation that one would seem in the
document info thru the acrobat reader!!
It doesn't seem to index the contents of the
pdf document as it does w/ other formats like
exel and word :(
Do I need to do any additional setup to crete
a more comprehendive index on these pdf files?
cheers,
Vijay<HR></BLOCKQUOTE>
null -
how do i search a pdf file on acrobat.com?
Currently, you cannot search the PDF content when it is placed on Cloud.
You need to use the Adobe Reader to search the PDF.
The Reader can be integrated with Acrobat.com and then you can open your files in Reader application and with ctrl-F you can search any word.
You can do some more search with multiple files using Advanced Search.
In the Reader application, choose Edit > Advanced Search.
Link to install Reader:
Adobe - Adobe Reader download - All versions
Regards,
Anoop -
To get the content of a pdf file in a particular position in text format
I am troubling with geting the content of a pdf file in a particular position.I got the code to get the content of a pdf document as whole in text format.But i only need the content at a particular area.
i am using PDFTextStripper class in pdfBox jar to get the content as whole.
pls send some sample code
pls help me
Edited by: thomas00 on Sep 21, 2007 2:55 AM
Edited by: thomas00 on Sep 21, 2007 3:08 AMpls any one reply
-
Looking for a free iOS 4 app that can search through .pdf files or spreadsheets
Looking for a free iOS 4 app that can search through .pdf files or spreadsheet
ThanksHey there
"pdf creator" for iPad works flawlessly for me working with pdf files
It takes care of all my needs
I'm not sure about sending via Wifi or Bluetooth but I send them via e- mail all the time
Possibly it could handle your needs as well
Just type it into the App Store search field and the first one that comes up is the one I use
Jump on over there and read up on it before buying and see if it will help you
Hope this helps
Regards -
To view the flash technology content in this pdf file please install this version of flash player
I am getting an error while opening a pdf file I am using Windows 7 with adobe 10.1.7 installed error message " to view the flash technology content in this pdf file please install this version of flash player"
Both ActiveX and plugin? What versions?
-
How to show content in a pdf file on swf
I have a project Flash, that i want show content in a pdf file on swf file, but i don't know to show it
So, who body help me ?
Thanks!you can load your pdf into your browser using
navigateToURL(new URLRequest("yourpdf.pdf")); -
Unable to highlight text in my PDF files
Iam unable to highlight text in my PDF files ,The tool shows in the toolbar and i am able to select it .But when i go to the related text which i wish to highlight ,nothing happens . kindly advise, what am i doing wrong. The comment tool is working fine , and i am able to put text comments in the box.But i need to highlight certain text also.
.I am using Adobe reader version 10.1. I reinstalled the program two times but still the highlight tool does not works .May be that the PDF files doesn't contain any text. What did you get when you use File > Save As > Text...
-
Can we Read/Display the content of Word/PDF file in Flex 3/4 ?
Hello All,
Can we read/display the content of Word/PDF file in Flex 3 or Flex 4?. I have one word file containing Arabic and English content with some settings like Bold, Color, Align etc. I want to display the content of this word file as it is in the flex web application.
Awaiting for prompt reply.
Thanks and Regardsthank you for your immediate reply, but,
sorry, this does not work.
With this code:
<cfpdf action = "read" source = "dok_1.pdf" name =
"mypdf">
<cfdump var="#mypdf#"/>
I get this result:
Everything, but no text of the document.
PDFDocument
Application name of application
Author bimbam Verlag GmbH
CenterWindowOnScreen [empty string]
ChangingDocument Allowed
Commenting Allowed
ContentExtraction Allowed
CopyContent Allowed
Created D:20080710
DocumentAssembly Allowed
Encryption No Security
FilePath [empty string]
FillingForm Allowed
FitToWindow [empty string]
HideMenubar [empty string]
HideToolbar [empty string]
HideWindowUI [empty string]
Keywords [empty string]
Language [empty string]
Modified [empty string]
PageLayout SinglePage
Printing Allowed
Producer [empty string]
Properties [empty string]
Secure Allowed
ShowDocumentsOption [empty string]
ShowWindowsOption [empty string]
Signing Allowed
Subject [empty string]
Title Rheinische Angler-Zeitschrift
TotalPages 1
Trapped [empty string]
Version 1.3
Maybe i do not understand the cfpdf tag the right way.
What i want is a kind of pdf-to-text conversion.
Do I have to use the processddx action? I do not think so.
But there is a property DocumentText .. ? -
Geting the content of a pdf file in a particular position
I am troubling with geting the content of a pdf file in a particular position.I got the code to get the content of a pdf document as whole in text format.But i only need the content at a particular area.
i am using PDFTextStripper class in pdfBox jar to get the content as whole.
pls send some sample code
pls help mepls any one reply
-
Is it possible to extract the contents of any PDF file using Adobe PDF SDK?
Is it possible to extract the contents of any PDF file using Adobe PDF SDK?
For Example: There is one pdf file. Let us say xxx.pdf with 32 pages. I am interested in only in a topic present at 10th page. Can I extract this information and save it into another pdf file (means new pdf file)?Thanks Irosenth,
I am actually interested in extract the page and create a new PDF with that page. But still there is a catch that on which basis the page needs to extract either on PAGE number OR on Bookmark basis.
But here in this scenario assume I am looking for the PDF file, now I want to save only page 5. How can I extract page 5 automatically/programmatuically? Or in simple word how can I get the reference link of page 5?
Here I am not getting clear picture that Do I need both SDK Adobe & Acrobat to achieve this requirement. And more over you have mentioned that SDK itself is free. But on Adobe side it is mentioned that it is available by license only with this I have another doubt: To work my desktop/system application with Adobe PDF library, this library needs to distribute with the application. So in this case will it be chargeable for each and every deployment.
Could you please provide me the link from where I can download the SDK? So that I can do some excerise with SDK to figure out the exact flow of functionality to work with my application.
Maybe you are looking for
-
The correct way to delete old transport request record ?
Dear all, We want to delete old transport request record before 2008 (two years before) of our system. We learn that we should delete the records in the below two path: /usr/sap/trans/data /usr/sap/trans/cofiles Our target is that we will not see the
-
Strange error when using the ParserAdapter class
Hi Gurus: in my app, i created a ParserAdapter class ParserAdapter pa = new ParserAdapter(); Then I set the ContentHandler and ErrorHandler pa.setContentHandler(ContentHandeler ch); pa.setErrorHandler(ErroHandler er); and in my main() I have String x
-
Satellite Pro M30 Is there an ExpressCard adapter for PCMCI?
I currently have a PCMCI card and would like to get use with a new Sattelite Pro laptop, unfortunatly it only comes witha ExpressCard slot. Is there an ExpressCard adapter for PCMCI cards?
-
I am really confused about where to place SAML assertions - should it go into the wsse Security Header, contained in the SOAP header? or Should it go into the SOAP body? If it will be in the SOAP Header, what would the SOAP body contain? would it be
-
Hi, Like many other users have posted I have been having problems with very slow downloads in the last few days and connection problems to the itunes store where the rest of the internet was fine. After trying disabling firewalls, upgrading to IE7 et