Copying text from PDF and no spaces

I get a PDF of Investor's Business Daily from their site.  It is copy protected, but I use PDFKey to unlock it so I can copy text.  When I copy text and paste it elsewhere, there are often no spaces between words.  At first I thought it had to do with the justification they used.  However, if I use Preview the spaces are there!  So why the difference?  I use Acrobat Pro rather than Preview for other reasons.  Is there some setting in preferences?
Second question.  On this site, at the upper right, is a box and magnifying glass.  If I use that to search the forums I get results for ALL forums.  Is there any way to limit the search to just the forum you're in?

Steve,
I think you misunderstand.  Since IBD posts most of the articles on their webpage, and that is not copy protected, I could just copy from there.  Also, since I am not republishing the articles but merely reorganizing them for my own use, this does NOT break copyright law (check the laws about copying copyrighted material for your own personal use).
You must own a copy of the material being reproduced.          check, I am a subscriber
Purpose of copying - for your own private use.                       check
Copies cannot be lent or shared with anyone.                         check
The work being copied must be a legal (i.e. non-pirate) copy.   check
Beyond this there are also fair use and research and study laws that, if they applied to my use, would still permit copying the material.
That aside, I am NOT asking for help in using PDF cracking software.  In fact, Preview copies the material from the original file just fine.
All that PDFKey does is flip one bit in the file; it does not change spaces or the spacing between words anywhere in the file.
I am just asking how copying the same material from one source file, using two different PDF reading apps, can give different results.
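
One possible explanation, offered as an assumption rather than anything confirmed in this thread: a PDF often stores positioned glyphs without explicit space characters, so each viewer has to infer word breaks from the gaps between glyphs. Two viewers using different gap thresholds would then produce different text from the same file. The Ruby sketch below uses invented glyph positions and hypothetical threshold values; it does not reflect the actual logic of Acrobat or Preview.

```ruby
# Each glyph is modeled as a hash with its character, start position and
# width, in arbitrary units. The numbers are invented for illustration;
# a real PDF derives them from font metrics and text-positioning operators.

# Rebuild a string from positioned glyphs, inserting a space whenever the
# gap between two consecutive glyphs exceeds gap_threshold.
def rebuild_text(glyphs, gap_threshold)
  text = +""
  glyphs.each_cons(2) do |left, right|
    text << left[:char]
    gap = right[:x] - (left[:x] + left[:width])
    text << " " if gap > gap_threshold
  end
  text << glyphs.last[:char] unless glyphs.empty?
  text
end

# "to be" in tightly justified type: the word gap is only about 1.5 units.
glyphs = [
  { char: "t", x: 0.0,  width: 3.0 },
  { char: "o", x: 3.2,  width: 5.0 },
  { char: "b", x: 9.7,  width: 5.0 },
  { char: "e", x: 14.9, width: 5.0 }
]

strict_viewer  = rebuild_text(glyphs, 2.0) # hypothetical threshold
lenient_viewer = rebuild_text(glyphs, 1.0) # hypothetical threshold

puts strict_viewer
puts lenient_viewer
```

With these numbers the stricter threshold yields "tobe" and the looser one yields "to be", which is the kind of difference described above.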

Similar Messages

  • Copy text from PDF and paste to imessage

    Before iOS 7 I was able to copy text from a PDF file and paste it normally into a text message or iMessage.  Now when I do the same thing it doesn't paste the actual text I copied; it pastes an HTML attachment instead.  Have I changed a setting, or is this just a bug that needs to be fixed?  Please help.

    I get the PDF in an email and open it to view it.  I'm not sure what the PDF opens in automatically, but that is the way I've been doing it since I've had the iPhone 4, and I haven't had issues until iOS 7.
    I just now tried opening a file in the default view and then switched to opening it in Adobe Reader, and there I can copy and paste normally.  So now I have to do an extra step every time?

  • How can I copy text from PDF and include the source filename in the pasted selection?

    I'm a biologist and frequently cut-and-paste notes from PDFs of scientific articles.  I name all of the PDF articles with their PubMed ID, a short unique identifier (e.g. 19397482.pdf).  When I take notes, I will select a few sentences from the PDF and then paste them into a text editor for later reference. 
    Can anyone suggest a method or script that would allow me to paste the copied text with the PubMed filename included in a single action?  I would want the pasted output to look something like this, with the filename appended to the end:
    Of the transcripts that were significantly different, there was a greater number of transcripts that were down-regulated in the IVC embryos (380) than the number of transcripts that were up-regulated (208).  [20668257.pdf]
    This would really help me to properly cite information sources during the writing process.  I know there are bibliography managers that might be able to do something like this, but I prefer to read the PDF articles directly in Preview and select the text as I am reading. 
    Thanks very much for any suggestions / ideas.
    jjw

    To copy and paste in a single action:
    tell application "Preview" to activate
    tell application "System Events" to tell process "Preview"
        -- Get the PubMed ID:
        get the title of the front window
        set thePubMedID to word 1 of result
        -- Copy the selected text to the clipboard:
        keystroke "c" using {command down} -- ⌘C
        delay 0.25 -- adjust if necessary
        -- Add the PubMed ID to the contents of the clipboard:
        set theNotes to the clipboard
        set the clipboard to (theNotes & space & "[" & thePubMedID & ".pdf]")
    end tell
    tell application "Notational Velocity" to activate
    tell application "System Events"
        -- Paste the contents of the clipboard to the end of the Notational Velocity document
        key code 125 using command down -- ⌘↓
        keystroke return & return
        keystroke "v" using {command down} -- ⌘V
    end tell
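
The string handling at the core of the script above can be sketched in Ruby. This only illustrates the output format; the method name `annotate_with_source` and the sample window title are hypothetical, and it assumes the window title begins with the PDF's file name.

```ruby
# Build the note text: the copied selection followed by the source file
# name in square brackets. Assumes the window title starts with the file
# name (for example "20668257.pdf"), possibly followed by other text.
def annotate_with_source(selection, window_title)
  file_name = window_title.split.first.to_s
  pubmed_id = File.basename(file_name, ".pdf")
  "#{selection.strip} [#{pubmed_id}.pdf]"
end

note = annotate_with_source(
  "Of the transcripts that were significantly different ...",
  "20668257.pdf (page 3 of 12)" # hypothetical window title
)
puts note
```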

  • Problems when i copy text from Pdf and paste on Word

    In the PDF document the text is in perfect condition, but when I copy the text and paste it into a Word document the characters change into random crazy characters like: "()*"*&!(!*"(!"(!)"( )*"()!*("!&("@*")(!*@"!*@(
    How do I fix this?

    I have the same problem when copying the PDF into a Word file. I tried Save as RTF doc. and it is still just symbols.
    It could be a font problem, because it has some weird Gill Sans and Futura fonts. I am sending a picture as an example.
    The option of exporting as TIFF and then applying OCR is interesting, but it is still kind of slow when I have a 100-page document. If the fonts are the problem, is there any way I could select the whole text and apply a known font like Arial to it?
    Thanks for any info!
    Cheers,
    Sebastian

  • Copying text from PDF to Pages

    I am trying to copy text from a PDF file into Pages. After I paste the copied text into my new Pages document, the spacing between most of the words is lost,
    for example:
    "Copying text from PDF to Pages" is imported as "CopyingtextfromPDFtoPages"
    Does anyone know how to correct this?
    iMac   Mac OS X (10.4.7)

    Rishi,
    Welcome to Apple Discussions.
    After reading your post, I tried to duplicate this problem. I opened a PDF, selected a sentence, then copied it to the clipboard. I then opened Pages, selected the blank template, then pasted in the text. It pasted perfectly.
    Does this problem happen with all text in a PDF? With different PDFs?
    -Dennis

  • Can't Copy Text From PDF Within Gmail Preview

    Hi,
    If I click on a pdf attachment while inside gmail, I get the expected preview window of that PDF. However, I can't copy text from it. If I press Command-C, it just makes a beeping noise letting me know nothing is being copied. If I select text and right-click, there's no 'copy' option. Also, in the top toolbar, Copy is greyed out in the Edit menu. However, if I just hold down Command-C for a couple seconds the text will copy anyways (while hearing a dozen audible beeps). I can copy text without issue using Firefox from the same PDF attachment preview inside Gmail.
    Thanks for any help.
    Yosemite 10.10.1
    Safari Version 8.0 (10600.1.25.1)

    Having the same issue as well. I had this problem before Yosemite too, and it seems to be Safari related. Copy to clipboard in the Gmail preview works fine in Chrome and Firefox, just not in Safari. As you indicated, if you hold Command-C long enough it will eventually copy; it just forces you to listen to the series of annoying bongs.
    Also there's no copy to clipboard in the context menu as illustrated below:
    http://i.imgur.com/adyyJO4.png
    Pretty annoying really, I'm not about to download a dozen attached PDFs to copy to clipboard, so now I'm sitting in the Delta lounge with my laptop bonging away as I wait for Safari to copy to clipboard and everyone looks at me like I'm a moron.
    NOTE: No I do not have acrobat or any other software loaded besides vanilla Safari. This has gone on on my MBP, Mac Pro and now this brand new MBP I got Monday as a company upgrade, it literally has nothing installed on it at the moment.

  • Copying text from pdf with embedded font

    I have tried everything to copy and paste text from a PDF into Word. I think it comes over garbled because it has an embedded font. I have downloaded the font, tried opening the file in several other apps, and viewed it as HTML to copy and paste ...
    Does anyone have a trick they can share with me before I poke my eyes out?
    thank you

    Thanks for your prompt reply.
    As i said i have the font installed on my system. for your reference,
    following is the link to the pdf file. also the second link is the link to
    the fonts used. Kindly help me to sort this issue.
    https://www.yousendit.com/download/T2dkcHBEVEh0QTIwYjhUQw
    https://www.yousendit.com/download/T2dkcHBFQXBrYUJYd3NUQw

  • Help: Copy text from PDF to AI or MS Word

    Hello,
    I'm new to these boards, so if topics like this have been brought up before, I'm sorry, and I'd be glad if someone could redirect me to them.
    I've also searched for a solution through Google and did not find anything that would fix this, so I came here as my last option.
    I have received a PDF document from a company about pad printing. I'd like to copy some text from this PDF to Illustrator or Word so I can edit some things. I could retype it all, but the document has over 55 pages and I'm leaning towards easier solutions than retyping everything.
    The issue I have is that when I try to copy text from this document: http://rapidshare.com/files/384536794/one_page.pdf.html
    all I get is some encrypted-looking symbols...
    If anyone could direct me to a guide or post a solution for getting text from this document into AI or MS Word, I'd appreciate it.
    Thanks in advance.
    Stefan
    Sorry for my bad English.

    I don't know why you want to use RapidShare for reference; did you know there have been many problems with it? Actually, I hate RapidShare.
    You can do this easily with Acrobat, as has already been suggested. I would like to introduce another, free approach.
    1. Download AnyBizSoft PDF to Word Converter, which is a totally free desktop application.
    2. Install and launch the app.
    3. Import the PDF files. You can add many files at a time and also choose particular pages of a file.
    4. Set the output folder and click Convert.
    Work done. I hope this is of some assistance.

  • Copying text from pdf, "square" characters

    Using SQL 2005 standard edition.
    I have an issue with reports generated as PDF. When copying text from the PDF (from a cell in a table row), the characters are pasted as "squares" into Notepad, Word, etc. Each character is just a small square. I am using the Arial font and not doing anything out of the ordinary. The problem is also inconsistent: it doesn't happen in all cells or tables, only in particular ones. Could it be data specific?
    Anyone come across this before?

    It is not data specific but format specific. Similar issues are commonly encountered. The reason is that the format on the clipboard is not well recognized by Notepad or Word. You can refer to this article:
    http://support.microsoft.com/kb/212318
    It says: "Microsoft Word does not directly support conversion from Adobe Portable Document Format (PDF). PDF is the format in which documents are saved for viewing and exchange in the Adobe Acrobat Reader and Adobe Acrobat Exchange for Windows."
    For text that does not copy and paste successfully, you may use a tool from Adobe to extract the data from the PDF file into a recognized format such as HTML. I know of one tool that can do this conversion:
    http://www.adobe.com/products/acrobat/access_onlinetools.html
    Hope this answers your question. Please remember to mark the replies as answers if they help, and unmark them if they provide no help.

  • Applescript or workflow to extract text from PDF and rename PDF with the results

    Hi Everyone,
    I get supplied hundreds of PDFs which each contain a stock code, but the PDFs themselves are not named consistently, or they are supplied as multi-page PDFs.
    What I need to do is name each PDF with the code which is in the text on the PDF.
    It would work like this in an ideal world:
    1. Split PDF into single pages
    2. Extract text from PDF
    3. Rename PDF using the extracted text
    I'm struggling with part 3!
    I can get a text file containing just the code (I'm extracting the code with a call to BBEdit).
    I did think about using a variable for the name, but the rename function doesn't let me use variables.
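
For step 3, one option is to do the rename in a short script rather than in a rename action that only accepts fixed names. A minimal Ruby sketch, assuming the stock code has already been extracted into a string; the method name `rename_with_code`, the sample code, and the file names are hypothetical:

```ruby
require "fileutils"
require "tmpdir"

# Rename pdf_path to "<code>.pdf" in the same folder and return the new path.
# Raises if the target already exists, so no file is silently overwritten.
def rename_with_code(pdf_path, code)
  safe_code = code.strip.gsub(%r{[/:]}, "_") # keep path separators out of names
  raise ArgumentError, "empty code" if safe_code.empty?

  target = File.join(File.dirname(pdf_path), "#{safe_code}.pdf")
  raise "#{target} already exists" if File.exist?(target)

  FileUtils.mv(pdf_path, target)
  target
end

# Demonstration in a temporary folder with an empty stand-in file.
new_name = nil
renamed_ok = false
Dir.mktmpdir do |dir|
  original = File.join(dir, "scan_0001.pdf")
  FileUtils.touch(original)
  new_path = rename_with_code(original, "HB-AB_123456\n") # code as read from a text file
  new_name = File.basename(new_path)
  renamed_ok = File.exist?(new_path) && !File.exist?(original)
end
puts new_name
```

The trailing newline in the sample code stands in for what reading a one-line text file would return, which is why the method strips it first.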

    Hello
    You may also try the following AppleScript, which is a wrapper around a RubyCocoa script. It will ask you to choose the source PDF files and a destination directory. Then it will scan the text of each page of the PDF files for the predefined pattern and save the page as a new PDF file in the destination directory, named with the string extracted by the pattern. Pages that do not contain a string matching the pattern are ignored. (Ignored pages, if any, are reported in the result of the script.)
    Currently the regex pattern is set to:
    /HB-.._[0-9]{6}/
    which means HB- followed by any two characters, an underscore, and six digits.
    Minimally tested under 10.6.8.
    Hope this may help,
    H
    _main()
    on _main()
        script o
            property aa : choose file with prompt ("Choose pdf files.") of type {"com.adobe.pdf"} ¬
                default location (path to desktop) with multiple selections allowed
            set my aa's beginning to choose folder with prompt ("Choose destination folder.") ¬
                default location (path to desktop)
            set args to ""
            repeat with a in my aa
                set args to args & a's POSIX path's quoted form & space
            end repeat
            considering numeric strings
                if (system info)'s system version < "10.9" then
                    set ruby to "/usr/bin/ruby"
                else
                    set ruby to "/System/Library/Frameworks/Ruby.framework/Versions/1.8/usr/bin/ruby"
                end if
            end considering
            do shell script ruby & " <<'EOF' - " & args & "
    require 'osx/cocoa'
    include OSX
    require_framework 'PDFKit'
    outdir = ARGV.shift.chomp('/')
    ARGV.select {|f| f =~ /\\.pdf$/i }.each do |f|
        url = NSURL.fileURLWithPath(f)
        doc = PDFDocument.alloc.initWithURL(url)
        path = doc.documentURL.path
        pcnt = doc.pageCount
        (0 .. (pcnt - 1)).each do |i|
            page = doc.pageAtIndex(i)
            page.string.to_s =~ /HB-.._[0-9]{6}/
            name = $&
            unless name
                puts \"no matching string in page #{i + 1} of #{path}\"
                next # ignore this page
            end
            doc1 = PDFDocument.alloc.initWithData(page.dataRepresentation) # doc for this page
            unless doc1.writeToFile(\"#{outdir}/#{name}.pdf\")
                puts \"failed to save page #{i + 1} of #{path}\"
            end
        end
    end
    EOF"
        end script
        tell o to run
    end _main
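
The pattern can be tried on its own before running the whole script. A small plain-Ruby sketch (no RubyCocoa needed); the sample page strings and the method name `stock_code` are made up for illustration:

```ruby
# Return the first stock code found in the page text, or nil if none.
# The default pattern is the one used in the script above: "HB-", any two
# characters, an underscore, then six digits.
def stock_code(page_text, pattern = /HB-.._[0-9]{6}/)
  page_text[pattern]
end

pages = [
  "Item: HB-XY_004217 Qty: 12", # hypothetical page text
  "Cover sheet, no code here",
  "Ref HB-7Q_1234567"           # seven digits: the first six still match
]

pages.each_with_index do |text, i|
  code = stock_code(text)
  puts(code ? "page #{i + 1}: #{code}" : "page #{i + 1}: no match")
end
```

Because the pattern is not anchored, a longer run of digits still matches on its first six. If that matters for the real stock codes, appending `(?![0-9])` after `{6}` would reject such cases.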

  • Copying text from emails and Word into a GoLive 9 webpage

    Most of my web work involves copying and pasting text from emails and/or Word onto a page created in GoLive and uploading it to the web. In the past this has never been a problem with previous versions of GoLive. Now with GoLive 9, I constantly get extra characters appearing such as "" where almost every space is located between sentences. Foreign characters often appear as strange symbols whereas they copied over perfectly (French and German-type accents etc) before.
    When I copy something I now have to go to a menu item [type/HTML/paragraph format] to turn "paragraph" off which helps get rid of the extra formatting not wanted (often everything appears in a large bold font, even though it wasn't like this in the original document). Also until I turn off "paragraph" I am unable to align any of the text as it has a mind of its own, regardless of what the webpage is specifying, (ie left alignment, centred etc).
    Is there any way to copy and paste text without having to edit and reformat it, just as one used to do in all previous versions of GoLive?

    That is an interesting idea but is still really a lot of unnecessary mucking around when GoLive 9 should do all this automatically with no fuss at all. I have been using GL since its early PageMill days and never had this problem before.
    I understand what you mean by using a blank page - I might give it a try.
    I found CS2 was getting slower and slower on my PowerPC. I now have an Intel iMac. Sometimes I would go to open a GL file and I could make a cup of tea waiting for it to open (admittedly it was a large file with lots of table cells and photos). Now I can open the same file in a few seconds using GL9. I looked up the Adobe support for 10.5 and it was listed as having issues which wouldn't be resolved. Photoshop Elements 4 was listed the same yet I had had no problems with it. A friend had huge problems with InDesign when he changed to 10.5. I guess it varies from computer to computer.
    Only yesterday my GL9 refused to open and thanks to these forums, we found out that yesterday's Opera browser update was the culprit. Having had such quick success using these forums I thought I would give this annoying copying problem a try and see if there was some setting I could change or something I could do to correct the fault.

  • Copying text from PDF files in another language

    When I try to copy text from a PDF file that is written in Greek, the minute I paste it into another document the text is not recognized and comes out as a jumbled mess. How can I solve this problem?

    Into what other document type?  Word?  Do you have a Greek font selected when you paste?  Or a Unicode font?

  • Copying text from PDF created using print to PDF function in OS X

    I use a MacBook Pro with Mac OS X Lion, and Microsoft Word 2008 for Mac and Adobe Acrobat Pro.
    For some reason, when I use the Print to PDF function to export a PDF of a Word document, then open it with Acrobat Reader or Acrobat Pro 9, select and copy text, and paste it into a word processor (including Word 2008), the resulting text is gibberish. It looks like some sort of encoding issue, but I can't understand that, since it's all happening on the same Mac! I have also tried this with Preview as the PDF reader, but I still get gibberish.
    The issue first started occurring with Snow Leopard, and all software is patched, but no dice.
    I've attempted to work around this by using all of the different PDF options under the print dialog, and by saving the doc as a PDF, but I still get the same thing.
    I've also tried copying and pasting the text into Pages, then saving it as a PDF and trying to read it ... again, no luck.  I was able to output the file directly from Pages to Preview and save it from there, but it really doesn't seem like this should be necessary, given that the functionality is built into the OS.
    Anybody else have experience with this? I have just one user that needs to copy and paste text from the doc, so it's a real pain to have to maintain separate PDF and Word versions.
    Thanks!
    D

  • Can not copy text from pdf with unicode font embedded.

    Hello,
    I have a PDF with a Unicode font embedded in it. The PDF has no restrictions. When I try to copy some text from the PDF and then paste it into Word, the pasted text is not the same as what I copied from the PDF; it changes.
    I have tried several programs that convert PDF to Word or PDF to Excel, but none can do the job. What could the problem be? Kindly guide me.
    Thanks & Regards-
    rashmi

  • How to read text from PDF and HTML

    I have a solution for reading text from a .txt file, but I didn't find code for PDF and HTML.
    I don't want to convert the PDF to txt.
    Please help me ...

    Reading from a file is always the same. Using the same strategy you used for a .txt file will allow you to read a .pdf file.
    Of course, in itself that will be useless, because PDF files have a special internal structure.
    HTML files are plain text, so they can be read the same way as .txt files.
    What are you trying to accomplish with the files you are reading?
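
The point of the reply can be shown with a short Ruby sketch: both kinds of file can be opened and read the same way, but only the HTML yields readable text directly. The sample file contents are invented, and the tag stripping is a deliberately crude regular expression, not a real HTML parser. Getting the text out of a real PDF requires a PDF parsing library, which is not shown here.

```ruby
require "tmpdir"

# True if the data starts with the "%PDF-" marker that PDF files begin with.
def pdf_data?(bytes)
  bytes.start_with?("%PDF-".b)
end

# Crude text extraction for simple HTML: drop tags, collapse whitespace.
def html_to_text(html)
  html.gsub(/<[^>]*>/, " ").gsub(/\s+/, " ").strip
end

html_text = nil
looks_like_pdf = nil
Dir.mktmpdir do |dir|
  html_path = File.join(dir, "page.html")
  pdf_path  = File.join(dir, "doc.pdf")
  File.write(html_path, "<html><body><p>Hello <b>world</b></p></body></html>")
  File.binwrite(pdf_path, "%PDF-1.4\n% stand-in bytes, not a complete PDF\n")

  # Both files are read with the same kind of call; only the HTML content
  # is usable as text without further parsing.
  html_text      = html_to_text(File.read(html_path))
  looks_like_pdf = pdf_data?(File.binread(pdf_path))
end

puts html_text
puts looks_like_pdf
```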
