Smart quotes and em-dashes in epub file

When I create an epub file from my InDesign book file, I find that sometimes (not always, thought!) my smart quotes and em-dashes revert to gibberish.
For example, when I open the epub file, the em-dash is fine, but in chapter 2, it changes to —.
Does anyone know how I might correct this?
Thanks so much!

It sounds like somethings wrong with the chapter's character encoding.  The character encoding is declared in the header of the ePub HTML. From ID it's:
<?xml version="1.0" encoding="UTF-8" standalone="no"?>
This test has UTF-8 declared in the first line of its source code:
http://www.zenodesign.com/forum/encodetest/UTF8.html
And this has no encoding
http://www.zenodesign.com/forum/encodetest/None.html
Most browsers let you override or choose the character set, so in Safari if you set the character set to View>Text Encoding>default, you'll see a difference between the above links, but if you set it to Unicode (UTF-8) they both display correctly. In FireFox it's View>Character Encoding.
You can check and edit the contents of the .epub file by changing the extension from .epub to .zip, and then extracting the contents.

Similar Messages

  • Convert smart quotes and other high ascii characters to HTML

    I'd like to set up Dreamweaver CS4 Mac to automatically convert smart quotes and other high ASCII characters (m-dashes, accent marks, etc.) pasted from MS Word into HTML code. Dreamweaver 8 used to do this by default, but I can't find a way to set up a similar auto-conversion in CS 4.  Is this possible?  If not, it really should be a preference option. I code a lot of HTML emails and it is very time consuming to convert every curly quote and dash.
    Thanks,
    Robert
    Digital Arts

    I too am having a related problem with Dreamweaver CS5 (running under Windows XP), having just upgraded from CS4 (which works fine for me) this week.
    In my case, I like to convert to typographic quotes etc. in my text editor, where I can use macros I've written to speed the conversion process. So my preferred method is to key in typographic letters & symbols by hand (using ALT + ASCII key codes typed in on the numeric keypad) in my text editor, and then I copy and paste my *plain* ASCII text (no formatting other than line feeds & carriage returns) into DW's DESIGN view. DW displays my high-ASCII characters just fine in DESIGN view, and writes the proper HTML code for the character into the source code (which is where I mostly work in DW).
    I've been doing it this way for years (first with GoLive, and then with DW CS4) and never encountered any problems until this week, when I upgraded to DW CS5.
    But the problem I'm having may be somewhat different than what others have complained of here.
    In my case, some high-ASCII (above 128) characters convert to HTML just fine, while others do not.
    E.g., en and em dashes in my cut-and-paste text show as such in DESIGN mode, and the right entries
        &ndash;
        &mdash;
    turn up in the source code. Same is true for the ampersand
        &amp;
    and the copyright symbol
        &copy;
    and for such foreign letters as the e with acute accent (ALT+0233)
        &eacute;
    What does NOT display or code correctly are the typographic quotes. E.g., when I paste in (or special paste; it doesn't seem to make any difference which I use for this) text with typographic double quotes (ALT+0147 for open quote mark and ALT+0148 for close quote mark), which should appear in source code as
        &ldquo;[...]&rdquo;
    DW strips out the ASCII encoding, displaying the inch marks in DESIGN mode, and putting this
        &quot;[...]&quot;
    in my source code.
    The typographic apostrophe (ALT+0146) is treated differently still. The text I copy & paste into DW should appear as
        [...]&rsquo;[...]
    in the source code, but instead I get the foot mark (both in DESIGN and CODE views):
    I've tried adjusting the various DW settings for "encoding"
        MODIFY > PAGE PROPERTIES > TITLE/ENCODING > Encoding:
    and for fonts
        EDIT > PREFERENCES > FONTS
    but switching from "Unicode (UTF-8)" to "Western European" hasn't solved the problem (probably because in my case many of the higher ASCII characters convert just fine). So I don't think it's the encoding scheme I use that's the problem.
    Whatever the problem is, it's caused me enough headaches and time lost troubleshooting that I'm planning to revert to CS4 as soon as I post this.
    Deborah

  • Combining smart quotes and inch marks?

    Dilemma! I have smart quotes turned on because I need them. I also need to be able to use inch marks in the same document. I have searched the Help files with no luck. Can anyone tell me if there's a way to have both smart quotes and inch marks in the same Pages document?
    Thanks in advance!
    Powermac G5 & Powerbook 15" Mac OS X (10.4.8)
    Powermac G5 & Powerbook 15"   Mac OS X (10.4.8)  

    You can indeed combine smart quotes and the character that indicates inches, by using the appropriate inch mark character from the system-wide Character Palette. To do this, you will need access the Input Menu at the top right of the menu bar. If you don't have the Input Menu visible, go into the International System Preferences, in the Input Menu tab, check the "Show Input Menu in Menu Bar" option at the bottom of the panel, and check "On" for the Character Palette. This will put a little flag on the right side of your menu bar -- this is the Input Menu.
    From the Input Menu, select the Character Palette. With the palette open, choose the Punctuation category. You will be presented with a wide array of punctuation characters, including proper inch and foot marks (which are actually "prime" and "double prime" marks, and are angled, unlike straight quotes). Select in Pages where you want the character to go, and then double-click it, and it will be inserted. The prime and double-prime marks are not considered quotes by the system, and so won't be curlified.

  • Pasting smart quotes and apostrophes in code view.

    Sine upgrading to Dreamweaver CS5, I haven't been able to copy/paste smart quotes and apostrophes into code view without them automatically being converted to straight quotes.
    For example, the following sentence (notice the curly quotes):
    John’s new song is called “DW Blues”
    would get pasted into Code View as:
    John's new song is called "DW Blues"
    Notice the smart quotes and apostrophe are replaced with single and double ticks, or "straight quotes."  While this seems like a minor detail, it's extremely important to our writers and editors to have them appear on the website exactly as typed.
    If I do the same copy/paste in Design View (doc type is XHTML Transitional), it appears as:
    John's new song is called &quot;DW Blues&quot;
    The characters are still replaced, and the straight quotes are then entity encoded (as expected).
    This doesn't happen with other valid UTF-8 characters like ™, ®, —, etc., or with any other code editors I've used, including DW CS3.
    Is there a hidden preference somewhere to disable this "feature," or is it just a bug?
    Please help!

    It's now 4 years since jsparacio posted this, and I just wanted to let everyone know that I had -- and am still having -- the exact same problem with Dreamweaver CS5 (running first under Windows XP, then Windows 7, and now again with Windows 8.1). So it's not just Macs that are affected.
    FWIW, I have set my DW CS5 Paste preferences to the 3rd of 4 options available
        1 - Text Only
        2 - Text With Structure
        3 - Text With Structure Plus Basic Formatting
        4 - Text With Structure Plus Full Formatting
    in the EDIT > PREFERENCES > Copy/Paste Preferences dialog box.
    But the Paste Special command ignores this setting, giving me only the first 2 options from which to choose, with option 2 the default selection for Paste Special operations (options 3 and 4 are grayed out, and can't be selected).
    According to David Sawyer McFarland's _Dreamweaver CS5: The Missing Manual_ (O'Reilly Media, 2010), the reason these are grayed out is because I am pasting unformatted ASCII text which I generated in a program editor called "UltraEdit":
        "... Choose EDIT > PASTE SPECIAL to open the Paste Special window. Here, you can choose which of the four techniques you wish to use ... sort of. You're limited to what Dreamweaver can paste. For non-Microsoft Office products, you can use only the first two options--the others are grayed out--whereas you can choose from any of the four with text copied from Word or Excel." (McFarland, p. 81)
    Regardless of such restrictions, standard copy-and-paste (CTRL+C followed by CTRL+V) works just fine for me using Dreamweaver CS4 (i.e., I have never needed to use the Paste Special command), but with DW CS5, neither Paste command (CONTROL+V or CTRL+SHIFT+V) works properly with typographic/curly/smart quotes.
    All typographic quotes -- ASCII-0146 and ASCII-0147 (double quote marks); plus ASCII-0145 and ASCII-0146 (single quote marks, for quotes within a quote) -- are converted to inch (&quot; is entered in the code) and foot (' is entered in the code) marks when I copy-and-paste text with these characters into Dreamweaver's Design View.
    When I copy this same plain ASCII text directly into the code (rather than using Design View), typographic double open & close quotes are converted to the inch (") mark, and typographic single open & close quotes are converted to the foot (') mark.
    The beginning of this week, I installed Dreamweaver CS5, ver. 11.0, Build 4909 under Windows 8.1 OS on my new Ultrabook. I was hoping that under Windows 8.1, DW CS5's handling of typographic quotes might improve so that I can actually use this program that I purchased 4 years ago. Alas, no such luck: I continue to have the same problem I had when I first upgraded to Dreamweaver CS5 back in August 2010 (then running under Windows XP on my desktop computer).
    Back in August 2020, when I first asked about fixes, I was told to change the Title/Encoding setting of Page Properties to "Western European" -- which I tried, but it didn't work then, and it doesn't work now ... and even if it did, it wouldn't be a proper fix for the problem as I have plenty of good reasons for wanting my HTML page Title/Encoding set to Unicode (UTF-8), not Western European ("charset=iso-8859-1").
    The ability to copy-and-paste typographic quotes is such a big deal for me that I chose back in August 2010 to revert to Dreamweaver CS4, which I've been using ever since.
    It is *very* frustrating that, 4 years later, I still can't use this program, and shall be reverting to DW CS4, yet again.
    I continue to be completely flummoxed by this. Every other program with which I am familiar converts non-typographic quotes to typographic/curly/smart quotes ... never have I seen the process automated in reverse!

  • "Smart Quotes" and Extensions

    I'm working on a text processing extension and want to implement a "Convert special characters to safe HTML entities" (that's just a working title!) menu option.  When I copy and paste some blocks of text from Word into the editor, then right-click my selection and choose my extension to process that text, the text that gets sent via the xml packet to my handler converts (in this case) Word's smart quotes (&#8220 and &#8221) to question marks.  This is before my handler processes anything.  I'm just using cfdump to look at the data.  Am I missing some intermediary processing that needs to be done somehow?  I can't figure out where it would be done since I don't have any control over the selected editor content until my handler is fired.
    Thanks!
    Andy

    The encoding is utf-8 (which is the default in CF 9 (and maybe CF 8 too), but I explicitly set it anyway, and still no luck.  It seems like Word Smart Quotes are actually seen by ColdFusion as 3 different characters.  I don't know that Smart Quotes are actually representable as specific entities, though.  When I copy a closing smart quote out of Word into a cfm file and so something like <cset q = "{smart quote here}">, and then loop over that variable one character at a time and output the ascii values, I get 3 values: 226, 8364, 65533.
    Here's a test that contains Smart Quotes that I just copied from MS Word into this editor, it will be interesting to see how they are represented after I post this message:
    “Test”
    Andy

  • Macros in Acrobat (Changing "smart quotes" to straight quotes in PDF files)

    Is there a way to do macros in Acrobat? We need a macro that will change all of the smart quotes in a pdf file to straight quotes so that they will convert properly when we create the final version of a form that goes to our Xerox printers. We have a macro in Word that does this, but aren't sure how to do it in Acrobat.
    Thanks for the help!
    Ken K. - 2191

    This would be extremely advanced programming for Acrobat; there is
    automation and something a bit like macros but it can't touch the page
    contents. It would have to be a plug-in in C/C++, and it is
    complicated by font encoding and subset issues.
    The subset issue would probably be a killer. In many, perhaps most,
    cases the PDF file that uses smart quotes will have a font that
    includes ONLY the smart quotes, and left out the straight quotes
    because they weren't needed.
    Aandi Inston

  • I need the find bar to stop matching smart quotes for straight ones.

    Some recent update of Firefox caused the find bar to match smart quotes (“ and ”) for the straight quote ("). Additionally, it now matches smart single quotes and apostrophes (‘ and ’) for the straight single quote (').
    I need to disable this matching so that a search for a straight quote will only yield a straight quote result. Also, if any other character matching was added, I'd like to disable that too. I need exact results.

    Hi Chuck, this could be coincident with the launch of the "per-tab" find bar with its new UI design. However, I didn't see a specific but for the change in the tracking system (Bugzilla).
    If this is a major pain point for you, to help track it down, could you do a comparison between Firefox 24 and Firefox 25? You can download the Portable builds of old versions of Firefox and run them without disturbing your main installation of Firefox. (However, you might not be able to run more than one at a time.) Those two would be here:
    * [http://sourceforge.net/projects/portableapps/files/Mozilla%20Firefox%2C%20Portable%20Ed./Mozilla%20Firefox%2C%20Portable%20Edition%2025.0.1/ Portable Firefox 25.0.1]
    * [http://sourceforge.net/projects/portableapps/files/Mozilla%20Firefox%2C%20Portable%20Ed./Mozilla%20Firefox%2C%20Portable%20Edition%2024.0/ Portable Firefox 24.0]
    Does it work the way you want in either of those?

  • Smart quotes won't work

    Hello there dear community,
    for some reason, Pages will not put quotation marks as I want them to be. As I am from Germany and therefore gernerally write in German, quotation marks are not “ ” but „ ” . I have not figured out how to get Pages to do that. I have already changed spelling language to German, enabled and disabled Smart Quotes, and set the System Preferences to the format I'd like the marks to be, but nothing happens.
    What am I doing wrong?
    Thank you.

    Pages '08 and '09 are unaware of the Operating System on this point.
    You get the described characters because the app is running in German.
    At this time, you
    may just disable the quotes substitution feature.
    Uncheck the box which is checked here.
    I will search in the app resources if I may change the set of substitution quotes.
    Yvan KOENIG (VALLAURIS, France) dimanche 30 janvier 2011 17:52:29
    I got it.
    The resource is stored in :
    Macintosh HD:Library:Application Support:iWork '09:Frameworks:SFWordProcessing.framework:Versions:A:Resources:German.lproj:Aut oCorrect.plist
    To change the behavior, you must edit the pointed set of characters.

  • CharConversionException when displaying smart quotes

    My servlets are throwing a CharConversionException when I attempt to display "smart quotes." The quotes and other symbols (i.e. long hyphen) are entered into an online form when a user composes something in Word 2000 then copies/pastes it into a text field. The servlet is able to get the parameter and update the MS SQL 7 database but not display the value back in the confirmation page. I am running this application on iPlanet web server v.6 SP 1. It seems to be related to my servlet engine as it doesn't have this problem when using the servlet runner (srun)--that will display the characters as a question mark. This is a set of �smart quotes� and a long�hyphen

    problem solved:
    1.- I changed the response content type of my Servlet to:
    response.setContentType("text/html;charset=\"windows-1252\"");
    2.- In my DB I changed the type of the field to the "nText" type which support more characters like Unicode, etc

  • Epub interpreting apostrophes and em dashes as Äô and Äî

    I am using InDesign CS4 to create an ePub. I have one chapter that is interpreting apostrophes and em dashes as Äô and Äî, upon exporting to ePub format. If I export chapter document separately from the book these characters are displayed correctly, but if I export entire book these characters display correctly in all but this chapter. This ePub does include a TOC Style. When I delete text containing styles that are associated with the TOC style and export, characters are displaying correctly.

    The only reason I even know about this is
    because I’m able to see the catastrophe in my
    original text included in their responses.
    That doesn't necessarily mean that your outbound mail is appearing strange to the reader. I would advise you to check that and verify that you really have a problem.
    If your readers do see strange characters, it is often because you are sending rich text (which is really html mail) and they are using MS Outlook. Some possible fixes are in this note:
    http://homepage.mac.com/thgewecke/woutlook.html
    PS Getting rid of all smart punctuation (curly quotes and apostrophe's, special dashes) may also be helpful.

  • The starting quote marks at the end of a line in my epub file are separated from the words quoted.

    The starting quote marks at the end of a line in my epub file are routinely separated from the word quoted which appears on the next line. The formatting contractor accurance.com who prepared this epub file tells me that this is a feature of Adobe Digital Edition 2.0 and that there is nothing they can do to correct this. Are they correct, or else how can this problem be overcome?

    I'm not sure about buying css style sheets. 
    You don't need one for each book; if you want to have a series of books all with the same general format,
    then they should all use the same style sheet on a given device, but maybe different style sheets for different devices.
    All the books will then have one consistent look on a given device, and a different consistent look on another device.
    ADE only runs on PCs and Macs.  There are lots of different ereader software apps out there for running on different devices.
    For a dedicated device, you are stuck with the reader the device provides.
    For tablets you can download different ereader apps, for example Bluefire (my favourite, ios, Android), Aldiko (android) and DReader and DL Reader (windows RT).
    I don't know if the readers/apps are derived from the same root as ADE in any way and likely to show similar bugs.
    If it is not DRM, you should certainly try on Calibre, which is the most popular (free) ereader app for computers.
    If you care to create a very short non DRM extract (complete with css) and post it somewhere on the web I'll have a quick look.
    If you have inclination to get more involved yourself,
    there is free software called sigil (https://code.google.com/p/sigil/) that will help you get off the ground and experiment.

  • Find open and close smart quotes

    Dear scripter,
    Here I am trying find whether all the open double smart quotes(") are closed with close double smart quotes("). Here is my small work to find how many open and close quotes.
    myopenqu=app.activeDocument.search("^{", false, false, "^{");
    myclosequ=app.activeDocument.search("^}", false, false, "^}");
    myopenqulength = myopenqu.length;
    myclosequlength = myclosequ.length;
    if(myopenqulength!=myclosequlength)
    alert("Quotes not matched\n"+ myopenqulength +" Open quotes found\n" + myclosequlength +" Close quotes found")
    It works well, the script just alert if I have 25 open quotes and 23 close quotes
    Quotes not matched
    25 Open quotes found
    23 close quotes found
    Now I want to find the quote set where the 24,25th set close quotes are missing?
    Is there any way to find the misisng quotes
    Thanks in advance
    regards
    a r u l
    vpublish.net

    You'd have to look for unmatched quotes, in this case two cases of two open quotes without an intervening closed quote. This is difficult to script in CS2 because it doesn't have GREP natively and because of footnote problems. In CS3/4 it should be possible. But even then it will work only if you don't have quotes within quotes.
    Peter

  • Find and replace smart quotes with straight quotes?

    I understand I can turn off smart quotes so that I can type straight quotes, but I need to replace hundreds of curly smart quotes with straight quotes, is there a feature that will let me do this? I am using FM8.
    Thanx,
    Willian

    I am using FM9....so I don't know if the same shortcuts apply, but this is what I found out last week.
    Use the Find and Replace tool:
    With smart quotes turned off and the Num Lock key turned on:
    Alt0147 will give you beginning quotation marks
    Alt0148 will give you ending quotation marks
    In the Find box use ALT0147 or ALT0148 for the beginning or ending quotes. When you click in the box and type
    one of the shortcuts the correct quote will be shown in the box.
    In the replace box type the regular straight quotes on your keyboard.
    I was thrilled that it would work!...course you do have to do them separately and be careful not to replace the curly quotes
    that you want to leave in your document.
    Hope this helps using FM8....
    ls

  • CF 7.02 Access Encoding and Smart Quotes

    The latest version of ColdfFusion with the updater applied
    (7,0,2,142559 ) does not properly handle smart quotes pasted from
    MS Word using UTF-8 as the charset. This worked on CF7 pre updater
    and CF 6. Is anyone aware of a fix for this ?

    France19 wrote:
    > The latest version of ColdfFusion with the updater
    applied (7,0,2,142559 ) does
    > not properly handle smart quotes pasted from MS Word
    using UTF-8 as the
    > charset. This worked on CF7 pre updater and CF 6. Is
    anyone aware of a fix for
    > this ?
    sure? i thought you had to use access for unicode to get
    unicode support in access.

  • Smart quotes, M-dash, N-dash, etc. not showing up in pdf

    I am using InDesign CS4 on a Mac 10.6.3 and I noticed that when using HelveticaNeuv TT font there are certain type elements that don't show up when I export to pdf. The things that aren't showing up on the pdf are things like quotes, apostrophes, M-dash - and there's probably more that I haven't noticed yet. I have used this font a lot before with my older mac and version CS3 with no problems. Don't have a clue what's going on...I really like this font and it's used extensively in a lot of my older documents so I really need to fix this problem. Thanks for your help!

    You might be using the Helvetica Neue on the Mac from your system. That version is 'tweaked' by Apple, and has been known to cause issues before. Try googling for "Helvetica Neue InDesign", there have been questions (and answers) about this font before.

Maybe you are looking for