How Best to Search pdf Files Over the Web

I have a large number of pdf files. (about 73,000). These are scans of old newspapers that have been OCR'd and saved as pdf. I work in a library and need to find the best way to make these text-searchable through my library's website.
Do I need to create an index?
What I would like to do is have a search box on my web page where a user can enter a keyword, and pull up the pages that contain that word.
Any suggestions on how best to do this are greatly appreciated!!
Thanks!

Google can do it for you if you use their search engine as the basis of your library search capabilities.

Similar Messages

  • How can i open pdf files from the web without saving them

    Windows 7
    Firefox 5
    Adobe Reader 9
    when i want to see a web based pdf file i have to save it before opening it instead of it just opening in a pdf window.

    You can't.
    Those PDFs are only accessible on the Mac.
    They can not even be accessed by iOS.

  • Lately pdf files from the Web do not display properly; what is wrong and how can I fix it so I don't have to use IE?

    I am a professor and frequently need to open pdfs of academic journal articles from electronic databases. Within the last two or three weeks, when I try to do this using Firefox, only the first page of the pdf displays and the rest are blank. In addition, a bar appears across the top with the message "This pdf file may not display properly." So I have had to switch to Internet Explorer to open the files and do my research. What is going on and how can I fix it so I can open pdf files from the Web in Firefox and have them display properly?

    My question is simple - why would you put an application in production that has a lot of bugs? To replace a good and steady Adobe product? My customers will believe that I put a pathetic fallible form online. They are not sophisticated on computers so I just tell them to use IE. Ugh
    I must agree with pgwebgirl!

  • How to attach a PDF file TO the form so that recipients can access the file as a resource?

    How to attach a PDF file TO the form so that recipients can access the file as a resource?

    Sorry we do not support this. What you can do though is add a Formatted Text element and add a link to a PDF. To insert a link add some text, select the text and right click on the text (or look under the Insert menu in the top right of the scene)
    Gen

  • Can any one tell me how to attach a pdf file to the mail through workflow

    I have a smart which i am able  to convert it to a pdf file...now.. can any one tell me how to attach a pdf file to the mail through workflow

    Hi,
    To create the task for attachment
    Use the BOR SELFITEM and method NOTES_APPEND.
    The out come of this task contain a link called attachment with a clip attached. Clisk on that icon and choose the type of attachment u want . RAW , EXCEL , TXT , PDF... Then using the import icon u can attach the document u like.
    But the TYPE : OBJ.
    Similarly using NOTE_DISPLAY method u can display the documents u like.
    In any work item u have the facility to attach any atttchment for further circulation .
    Attchment @ WORKITEM
    1.Click the workitem for which you want to create the attchment
    2.Press create attachment
    3. Add the attachment u like (PDF)
    Reward points for useful answer.
    Richard A

  • How do I read PDF files from the SARS website

    How do I read PDF files from the SARS website
    Got the latest version xi.0.08 of Abobe Reader
    Operating system is win XP Sp3

    You need Adobe Reader or Adobe Acrobat or other PDF reading application installed on your system. Then just double click on the form or publication wanted and follow the prompts as needed.

  • Visual Studio 2013. Viewing a PDF file via the Web Browser control in Windows 8.1

    I am writing a Windows Forms application in which I need to view PDF files via the Web Browser control. When the control is given the Navigate() command, Windows takes over and asks which program I want to open the file with. All was fine with VS 2010 and
    XP. I have un-installed "PDF Complete", which was the default PDF viewer, but I now get asked for an alternative.
    Thanks in advance.
    Mike Saunders

    Hi Mike,
    It seems that we could use Internet Explorer to open a local .PDF file if it has the Adobe Reader. I doubt that the web control would be related to it. Like the MVP's suggset in the following thread, maybe it would be related
    to certain plug-in tools in your side like the adober reader.
    Reference:
    https://social.msdn.microsoft.com/Forums/en-US/a717117e-41ae-4775-9c28-d5a1042b396e/pdf-is-not-opening-in-web-browser-control-in-vbnet?forum=vbgeneral
    Best Regards,
    Jack
    We are trying to better understand customer views on social support experience, so your participation in this interview project would be greatly appreciated if you have time. Thanks for helping make community forums a great place.
    Click
    HERE to participate the survey.

  • I converted a pdf file to the web but it moved me the picture to the left of the screen

    Hello Friends
    I converted a pdf file to the web but it moved me the picture to the left of the screen
    Why does this happen?
    have a nice day!

    thanksssssssssssssssssssssssssssssssss@@!!!!
    You were helpful to me
    And another small question
    In my web(http://josephones.com/)
    At bottom left corner
    You can see four text fields
    How can I make this contact form?

  • Adobe 8 - Unexpected Corrupt or incorrect file type warning when viewing a PDF file from the Web

    Hi,
    On downloading a PDF file from my web site using Adobe 8 on the client machine I expect the file to open automatically however I receive encounter the "File Download" dialog. It has the following warning message: "The file you are downloading cannot be opened by the default program. It is either corrupted or it has an incorrect file type. As a security precaution, it is recommended that you cancel the download."
    This issue did not occur within previously tested versions of Adobe Reader (e.g. Version 5.1).
    On investigating the issue I discovered the factor that results in Adobe 8 failing to open the file automatically is the Content-Type = "application/pdf; Charset=utf-8". However, if I intercept the response and alter the Content-Type to contain only "application/pdf;" the issue on Adobe 8 does not occur.
    I have even tested when the Content-Type contains an empty Charset (i.e. "application/pdf; Charset=") but this issue still occurs. In addition setting the Charset to other encodings (e.g. "ISO-LATIN-7", "iso-8859-1") the issue occurs.
    QUESTION(s):
    1. Why is it necessary for Adobe 8 when opening PDF files from the web to expect no Charset within the Content-Type?
    2. If this was not an issue within previous versions of Adobe surly this is a bug?
    Since I cannot ensure that the Response's Content-Type does not contain a Charset at all please can someone suggest either a work-around or more importantly a solution to the issue?
    Please Note:-
    1. Within Adobe, Edit -> Preferences -> Internet option it's compulsory for our users to have "Display PDF in browser" ticked!
    2. We cannot prevent a Charset from being present within the Content-Type of the response sent to the client, hence this issue.
    Many Thanks in advance,
    Martin.

    Bill,
    This is exactly the same behavior I observed when I rendered a PDF stored in a database through a .NET page. If I chose to save it, it would open fine after the save, but if I chose to open it in the browser without saving it, I would get the error.
    The problem was not with the PDF or Acrobat. It was with the way I was delivering the PDF. Putting the following code before my binarywrite fixed the issue completely:
    Response.ClearHeaders()
    Response.ClearContent()
    Response.ContentType = "application/octet-stream"
    Response.AddHeader("content-disposition", "attachment; filename=" + saveFilename)
    where saveFileName is the name you want to give the PDF.
    You can use 'inline' instead of 'attachment' above if you just want the PDF to open (no option to save first). You can also use 'pdf' instead of 'octet-stream' (not sure the difference).
    The trouble seems to be that .NET puts some info in the header and/or content by default prior to anything your code says to write. If you don't clear this, IE has an issue with it. Other browsers seem to ignore it.
    Bottomline is that this is an issue the website programmer needs to fix and is not something the user can fix by changing some setting(s).
    I hope this helps.

  • Bug:Downloading pdf file from the web,file name in dialog box populates with incorrect file name

    Hi Gurus
    When downloading any pdf file from the web using adobe Reader X the file name dialog box populates with incorrect file name
    compared to the saved file name of the document downloaded.
    Clarity
    1. Download any pdf file from the web
    2. Click File, Save as note the populated File Name
    3. Compare the populated file name with the saved file name of the documant yuo are downloading.
    4. The file Name is incorrect...
    Is there any work around on this issue?Can someone please help?..
    OS:Windows7
    Reader: Adobe Reader X
    I've tried a couple of other versions and still get the same results...

    Are you downloading the file (right-click, save link as), or just opening it by clicking it?
    If the latter, then what happens is the file is loaded to the temporary internet files folder by IE and assigned a temporary name, which is then used when you're trying to save it.
    You can test this out by going to File - Document Properties before trying to save and looking at the file's location.

  • When attempting to open a hyperlink to a PDF file on the web from a Microsoft WORD for Mac 2011 (14.3.9) document, Safari 7.0 instead displays the file as text?

    When attempting to open a hyperlink to a PDF file on the web from a Microsoft WORD for Mac 2011 (14.3.9) document, Safari 7.0 instead displays the file as text?

    As seen in http://answers.microsoft.com/en-us/mac/forum/macoffice2011-macword/has-the-word- 2011-for-mac-invisible-toolbars/018a3ab6-0570-4ad5-abf8-5b6427fdde3e?msgId=e111b f0a-0e32-4fa3-9536-f349dad8439d
    and it worked for me:
    1. Quit Word
    2. In the Finder's menu bar, select Go > Go to folder and type or paste: ~/Library/Preferences/
    3. Click on Go
    4. Locate the preference file com.microsoft.Word.plist, then Option-drag it to the desktop to create a backup copy
    5. Go to Applications/Utilities and open Terminal
    6. Paste the following bold command at the $ prompt (it's a single line):
         defaults write com.microsoft.Word 14\\Toolbars\\Show_HIToolbar -boolean TRUE
    7. Press Return and then quit with Command Q
    8. Start Word and test. If the fix works, trash the backup file in the Desktop file. Otherwise, restore it.
    In the original source the author also mentions the change in Word 2008

  • I am having a problem where pdf files on the web (i.e., links in a Word doc) open after an extended time and only as gobbldygook ( a file containing a series of characters and letters that make no sense).  This also happens for another Mac user coworker

    Hi There:  I am having a problem where pdf files on the web (i.e., links in a Word doc) open after an extended time and only as gobbldygook ( a file containing a series of characters and letters that make no sense).  This also happens for another Mac user coworker in my office, while the PCs don't have this problem...  Any help/suggestions for a fix would be most appreciated! 

    Just adding more info - MacBookPro running 10.5.8 and using Safari as the browser.  The problem comes and goes - sometimes the linked Word files will open OK, n others its just a strring of crazy characters... 

  • HT5678 I have problems saving pdf files in the web when I use Safari. I don't have this problem with firefox

    I have problems saving pdf files in the web when I use Safari.  I don't have this problem with firefox.

    do they save ok if Safari - Preferences - Security - Allow all other plug-ins is unchecked ?
    If so, likely an adobe or other pdf plugin
    Re-check the setting above & quit any browsers,
    Look in this folder by triple-clicking the line below, then ctrl-clicking it & choosing Services - Open
    /Library/Internet Plug-Ins/
    remove anything with pdf in the filename & test again
    Sometimes, plug-ins can be in the User Library folder :
    ~/Library/Internet Plug-Ins/

  • Opening PDF files on the web with IPad

    Hi everyone
    I am having difficulty opening some (but not all) pdf files on the web with my IPad. When I click on the icon it comes up with approx half of the first page of the document, but nothing else. It also does not provide an "open in" bar. Any thoughts on what is going on here?
    With thanks, Mike.

    Thank you for following up. We often have people reporting similar problems and I suggest they follow up with the website owner but I rarely hear back from the original poster so I'm never sure that this is indeed the problem. Ironically, serving a document for mobile is almost easier than serving it to a desktop browser because there are no plugins to get in the way. That said, a lot of website owners have gone through pains to do plugin detection and user authentication and authorization and mobile represents another case that they would need to handle on top of what they already do.

  • Problem of opening PDF files on the web

    I have installed the latest version of Acrobat Reader, but I still couldn't read some PDF files on the web, what should i do?

    Hi wings57638543,
    1. Which browser are you using?
    2. Are there some specific files that you are unable to load in the browser? Can you share the link?
    3. In Reader DC, go to Edit->Preferences->General and click on "Select As Default PDF Handler" button. Then try loading those PDFs.
    Thanks.

Maybe you are looking for