Pre-InDesign – preparing text for ePub

This is not exactly an InDesign issue but I can’t imagine where else I might start this discussion!
I have to convert a number of printed books to ePub. The titles are mostly fiction, (they are the usual simple designs with one-column layouts and without illustrations). I will be using a ScanSnap S1500M to scan and digitize the pages on an iMac with the software ABBYY FineReader Express.
After scanning I need to manually check the text for scanning mistakes, remove folios and running heads and ensure that, apart from chapter headings and paragraphs the text runs on. I then subsequently need to place this text into InDesign for paragraph and character styling, adding the prelims, the cover, and adding metadata, before exporting to ePub from InDesign CS5.5.
I know how to do the InDesign part but am looking for solutions for after scanning, that is, having digitised the text, which programs and workflows are recommended ? Would you use MSWord and keep some styling or maybe a program like BBEdit, that strips out all codes and brings it back to basic text?
Thanks, Derek

After scanning I need to manually check the text for scanning mistakes, remove folios and running heads and ensure that, apart from chapter headings and paragraphs the text runs on.
Assuming you have a rectangular text block and that folios, etc. are outside it, I would recommend you scan to PDF, batch crop the entire PDF (as with Acrobat Pro), and then do your OCR. That way you'll save yourself a lot.
Though it's been years since I've used ABBYY FineReader; maybe it already solves this problem for you?

Similar Messages

  • InDesign Tagged Text for Cross-reference Entries

    I'm transforming XML to InDesign Tagged Text. The XML has index codes. For regular page entry type entries I'm having no problem outputting to the appropriate InDesign Tagged Text markup. However, I cannot figure out how to code cross-reference type entries. The document "Using Adobe InDesign Tagged Text CS5 Tagged Text" is extremely limited in its usefulness as it does not list all possible values for tag type tags, etc. I've tried dozens of tag combos and guesses at values. None have worked. Also, for some reason, even though I can create a "See x" type reference in the InDesign document, when I export to InDesign Tagged Text to look at the code, those tags are not included in the export.
    Does anyone have a more definitive list of possible IDTT index tag values?

    I have been exporting various things to IDTT to see what the result would be, with nothing really helpful as a result. I'll try hyperlinks, but reading of the Adobe guide to InDesign Tagged Text and also just looking at the InDesign scripting object model leads me to believe that there must be specific tags to create index-specific cross-reference tags.

  • Help with numbering lists in Indesign CS5.5 for epub file

    I am working with a book in Indesign CS5.5 to export it as an epub file. I have been working on formatting the text and I have run into a small problem. Throughout the book there are multiple lists within the texts that need to be numbered. Each list, however, must start over at number 1 and not be continued on from the previous list. The lists are all unrelated and I need them to be completely separate. I used "Type", "Bullets and Numbering", then "Apply Numbering" to begin with. This had each list in the correct order. However, I also created a paragraph style with the correct font, size, indentation, and spacing that must be applied to all the lists before exporting. When I do this, everything keeps its correct formatting except that some of the lists continue their numbering from the previous list. For example, on page 1 I might have a list of four things all formatted correctly and number 1 through 4. On page 3, I have another list, of completely unrelated items. I therefore would like this list to be started over at number 1, but when I apply the paragraph style to it, it starts at 5, picking up where the last list left off. I need to keep these as the assigned paragraph style but it messes up the numbering. If anyone could help me figure out how to keep this from happening I would greatly appreciate it!

    If you don't mind cracking open the ePub and editing template.css, then adding rules like:
    h2 {
              page-break-before: always;
    should force each paragraph whose style is mapped to an h2 tag on to a new page. Whether or not it works will depend on how good the ePub reader is at implementing CSS. It seems to work for Digital Editions and also Kindle, if you convert ePub to Kindle format.
    This doesn't help at all if you are trying to force sections into separate XHTML files, though.

  • Using Indian language (Mangal) font for ePub in InDesign CS6?

    Hi,
    I created one Indian language epub in Indesign CS6 wiht "mangal font". See correct file for your ref.
    But when I set text in Indesign & export that document for ePub & check in Adobe Digital Edition. I saw one character has missing. see below image:
    Why this happened? Could any budy help on this topic? I need solution for this. or tell me which devnagari font has use for Adobe digital Edition?

    Hi Ellis,
    Please find download link below:
    https://www.dropbox.com/s/nsi1yr76ial5cog/test_files.zip
    Inside of zip are test document, font, InDesign file & picture of original text for your ref.

  • Need help with Italics for ePub and InDesign CS5

    Could someone help a me out. I just can not find the answer to this and I know it should be simple, I am just missing it.
    I am creating an ePub from CS5 InDesign. All I want to do is have the italics that are shown in the text box in InDesign, show up in the epub file.
    I would prefer not to say what I have done (I think I have tried everything, but obviously I did them all wrong, so no point listing them) and I am hoping someone can do a simple, step by step guide to how to have italics show up in the ePub file.
    Sorry if this question is beneath everyone, this is really annoying that I can not figure this out on my own.

    Yup, done this.
    In InDesign, the text is Italic. It has the Character Style Italic associated with it.
    But when I export to ePub, it is not italic.
    Here is the HTML from the xhtml file for a single line of what is supposed to be italic text,
    <p class="normal normal-override-2" xml:lang='en-us'><span class="italic italic-override">Words that should be Italic.</span></p>
    The above line does not show up as italic in any of the 3 readers I am using to test with.
    Now, in the css sheet, the span.italic-override does not have anything associated to font-style. If I manually add the font-style italic, then re-pack everything, it works.
    It just seems to me a bit absurd to have to export the epub, unpack it, open it in dreamweaver, add the font-style, and repack it just to have something as simple as italic work.
    So, I am missing something, I get that... I just can't figure out what!

  • InDesign CS4 crashes when I export for epubs

    Every time I choose "Export book for digital editions...." in the book menu, the program sometimes (but not always) exports to the epub format -- but then InDesign crashes and closes.  Sometimes there is an epub saved; sometimes there is not.
    This is the error message I get:  "Runtime error:  R6025 - pure virtual function call".  Makes no sense to me, and it happens every time, no matter what other programs I do or do not have open.
    I'm running Win XP SP3 and have plenty of memory.
    Thanks.

    Good morning (for me, at least). OK, let's try to catch up.
    I am still not 100% clear what your actual problem is. Perhaps you can start by restating it clearly and unambiguously.
    Has it changed since June?
    You might post a screenshot of the way you have the TOC set up.
    But the first step is to figure out if it happens with a brand new document that is untainted by your existing document. Perhaps something in your current document is making InDesign misbehave. So make sure that TOC export works right from a fresh document.
    After that, then the task is to figure out what about your document is causing the problem. Exporting to IDML is a good step to clearing corruption, but is not 100% definitive. But if it works, great. Other steps including using the Pages (panel) > Move Pages function to move pages between documents, and also exporting individual stories to InDesign Tagged Text and then re-importing them into new documents. You can also export some items as snippets, though this is likely to have the same effect as IDML, but somewhat more self-contained (think "IDML Export of this page only.")
    In short, this is Divide and Conquer. The problem could be in a story, in a style, on a page, in an image, etc., etc.
    Additionally, you might want to consider ponying up $40 and engaging Adobe Support: http://adobe.com/go/supportportal.
    Unfortunately support for CS4 is somewhat waning (it is 3 versions old!), and Adobe Support is not always stellar at solving problems quickly. Still, they are an option to consider.
    Also, I'm definitely not an EPUB person. Hopefully when you post the details of what exactly you're doing some of the EPUB folks will still be paying attention to this thread and can point out any obvious issues they are aware if ("Hey! It's a known bug if you use the letter é in your EPUB TOC! Just remove it and you're good!" or whatever).
    You ask:
    Do you know what I can do to fix the stack problem?  Is that an Adobe problem or a Windows problem?  I'm waaay beyond the limits of my expertise here.
    It's not a "problem" per se -- the stack is a data structure that stores state about functions calls made by programs. Under normal circumstances, when a program crashes -- that is, when the operating system detects that a program has done something it should never do, such as tried to access a region of memory that does not belong to it, the operating system forcibly terminates the program -- the stack reflects information about the function that was currently executing, and the function that called it, and the function that called it, etc., etc., typically 20-50 levels deep.
    Depending on exactly why the program faulted, sometimes this information is not available. For instance, if the program failure that caused the crash involved corruption on the stack.
    Often the stack can give us useful clues about the nature of the crash, and those clues can help us avoid the crash. Simple example: "Oh, this crash is related to the third party extension Frobozz Magic Font! Turn it off!"
    I just posted a question about the corrupted stack in the Microsoft forum, and they replied only Adobe can fix it.  It's apparently not a Microsoft problem.
    They are, to a first order, correct. When InDesign crashes, it's an Adobe problem. The stack issue is a low-level diagnostic and is unlikely to be a real problem. But certainly no one could fix it without altering the InDesign program, and only Adobe can do that.
    I'm getting pretty nervous about this; I have clients screaming for their books, and I'll have to refund some money if this isn't solved.  Whatever you can do would really be appreciated!
    I think you need to present us with a lot more information about your problem.
    Precise steps to reproduce it.
    Screenshots showing how you have it set up.
    Copies of files that exhibit it.
    That we can try to reproduce it, see if it is specific to your machine or your document, give recommendations on how you can work around or avoid it, etc.
    And seperately, aside from all of those things, is getting the problem fixed. Though if it happens in CS4 and not in CS5.5, well, then it's unlikely that a fix will be forthcoming. Adobe just isn't updating CS4 any more. They're hardly updating CS5 and CS5.5, and CS6 is on the horizon.
    Oh, and if you're willing to throw money at the problem, you can look at upgrading. EPUB support has changed dramatically in CS5.5 (for the better, I am told, but I don't speak from personal experience), so it's likely that any problems you saw with it under CS4 will be very very different [and hopefully gone!] in CS5.5.

  • InDesign CS 5.5: Custom Footnote and style for ePub

    Hello
    i am wondering if there is away to:
    create custom footnote, so instead of normal number i want to InDesign to create it with parentheses.
    when i paste the text for that note at the bottom of the page, InDesign add extra space between the number and the text, is there any way to decrease the space
    i have a book that has so many chapters and InDesign keeps counting the footnote like (1,2,3,4,.......66) and so on, is there any way to control the footnote so every chapter start with number (1)?
    regards

    Omar Saleh wrote:
    Hello
    i am wondering if there is away to:
    create custom footnote, so instead of normal number i want to InDesign to create it with parentheses.
    when i paste the text for that note at the bottom of the page, InDesign add extra space between the number and the text, is there any way to decrease the space
    i have a book that has so many chapters and InDesign keeps counting the footnote like (1,2,3,4,.......66) and so on, is there any way to control the footnote so every chapter start with number (1)?
    regards
    Hi, Omar:
    In Type > Document Footnote Options, you can set a parenthesis or other prefix and suffix character. Adjust the space after the footnote number by choosing a separator; if you choose a tab, set the tab stop position for the footnote paragraph style. Create a footnote paragraph style if you don't have one, and specify it in the Footnote Formatting area of the Footnote Options dialog box.
    Search Google for terms like "InDesign footnote formatting" without quotes for details.
    I'm not sure how much of the formatting will be retained in an ePub.
    HTH
    Regards,
    Peter
    Peter Gold
    KnowHow ProServices
    Message was edited by: peter at knowhowpro

  • How do I import an index into Indesign for epub and keep the links live?

    What is the best way to import a subject index (the indexer uses skyindex, and has an option to output index links to indesign para numbers), but he currently provides a word file for import into Indesign 5.5 with dead page numbers. We need to retain that for print publishing, but change the index numbers for epub, to links in both directions. Whats the best way to tackle this? I hope someone out there has done this!

    Page numbers do not exist in epub.
    Have a read through this:
    http://www.pigsgourdsandwikis.com/2010/07/creating-index-for-epub-with-indesign.html
    Bob

  • Why do InDesign CS 5.5 quit when i try to export for ePub?

    Everytime i try to export my document for ePub, InDesign cs 5.5 quit. This is a book i made for print, but i would like to go forth and back from indesign to Adobe digital editions to make the changes that are needed end then use Dreamweaver for working with the CSS. But i can't seem to get an ePub out of the document! And I haven't got a clue what's stopping it for exporting...

    I suspect that you ask this question in the wrong forum, you are in the Adobe Captivate forums, no publishing to epub. Perhaps you'll need the InDesign forum?
    Lilybiri

  • Export for epub missing in my Indesign CS5

    I have just installed In Design CS as part of the Adobe design premium suite. My problem is, Export for epub menu option is not visible. Hence I'm unable to export any epub. How can I bring back this option. Please help.
    Refer the attached screengrab.

    Did you check your "Applications/Adobe InDesign CS5/Scripts" directory for additional directories "XHTML For Digital Editions" and "Export as XHTML"?
    Be aware that both subdirectories "Resources" and "startup scripts" should be present with a bunch of additional scripts.
    Uwe

  • InDesign Help | Export content for EPUB | CS6

    This question was posted in response to the following article: http://helpx.adobe.com/indesign/using/export-content-epub-cs6.html

    The Kindle whitepaper is out of date. In particular, it seems that Amazon does not accept Calibre-created.mobi files -  .mobi files created by converting from .epub to .mobi in Calibre - and insists on using Kindlegen to create .mobi files. In my experience, however, Calibre does a better job of converting from .epub to .mobi than Kindlegen, and I do not know why .mobi files created by Calibre are unacceptable.

  • InDesign CC: Text in editable text fields disappearing after exporting

    Hi,
    I'm having an issue of text disappearing from editable text fields after exporting the document as an interactive pdf. I've created the text that needs to be editable in acrobat with InDesign CC. So for example, I want to have an editable text field that already says "I'm Mary." I want users to have an ability to change "Mary" to "Tom" or "Ben" in Acrobat. So the pdf should have an editable text form saying "I'm Mary" sentence when users open the pdf in Acrobat. However, after exporting, the text fields remains but the text itself "I'm Mary." is gone. I don't understand why this happens because the text is there when previewed in SWF preview window in InDesign.
    The below is the list of steps I took.
    STEP 1: Create some texts (For example, m with the text frame tool in InDesign CC
    STEP 2: Right click the frame, and click on "Interactive > Convert to Text Field"
    OR
    STEP 2: Select the text frame, open "Buttons and Forms" window and change the type as "Text Field"
    STEP 3:  Export the document as an interactive pdf. The setting is default.
    Result: When I open the pdf in Acrobat, the text field is there, but it is blank.
    Please help me to resolve this issue. I know this is possible because a person before me at my job did it.
    Thank you!

    As far as I know, to have a pre-populated text form [I'm Mary] appear in the text field, you will need to set the Default Value of the text field in Acrobat. You can do that by selecting the property of the field > Options > Default Value

  • Indesign CS5 Crashes on Epub Export

    I have used the epub export feature in Indesign CS5 successfully for several books. However I have one title that I have been trying to export for epub. It gets about 2/3 of the way through (as measured by the progress bar) and then Indesign crashes. I am wondering if there is a known issue and (hopefully) a work around. The book exports to PDF just fine.
    Thanks for any advice,
    Chuck

    I have also had a crash on export problem recently. I went through the suggestions in this thread and searched others.
    Then I stepped through a few things in my INDD.
    Deleted the suspect page. Export worked.
    Brought the page back. Export crashed.
    Looked at all the text on the page. I had 4 boxes with bulleted text.
    I converted the bullets to text and the export went ahead with no problem.
    I turned the bulleted list back into a real bulleted list and the export crashed again.
    Go figure.

  • Scrollable Frame solution for EPUB Fixed Layout

    Hi,
    I have found a solution (or, rather a hack) which allows us to create scrolling frames for EPUB Fixed Layouts.
    You can find the exported EPUB, as well as my InDesign file on my Dropbox: EPUB scroll.
    This solution has some weak points and has only been tested in iBooks on an iPad with iOS 8, but here's how it works:
    Create two text frames (one as big as your text requires, and one which will be the container [tip: make the container a bit wider than the text frame])
    Cut the big text frame, select the smaller container frame, and Edit->Paste into
    Right click on the container, select Object Export Options, choose EPUB and HTML and enter "subchapter" as epub:type:
    Open a text editor and create a new CSS file. For CSS code, see below
    Export as Fixed layout EPUB. In the export dialog box, select the CSS tab and add the previously created CSS file
    Done!
    CSS code:
    div[*|type = "subchapter"] {
    position: relative;
    div[*|type = "subchapter"] > div {
    overflow: auto;
    As you see, steps 1 and 2 are essentialy the same as for creating a scrollable frame folio overlay.
    It’s not perfect, and this solution is just a proof-of-concept, but it should get you going.
    An obvious problem is that we use basic HTML scrollbar, which is not being displayed in e-book readers such as iBooks on the iPad. Hence, the user has now idea how long the scrollable content stretches. This could probably be solved with a more complex, JS-based solution.
    Also, the gradient feather at the bottom (which is intended to create a "fade-out effect) seems not to export well. This one should be easy to solve, placing another rectangle with a gradient in it. But I’m not sure if transparency exports well, and haven’t tested it.
    Feel free to use, and improve, this hack!
    Thanks,
    /Jacob

    Thank you Uwe,
    I look and hold you aware of your solution in an environment EPUB
    Patrick
    Dr Patrick Dhont
    Le 29 janv. 2015 à 11:52, Laubender <[email protected]> a écrit :
    Scrollable Frame solution for EPUB Fixed Layout
    created by Laubender <https://forums.adobe.com/people/Laubender> in InDesign EPUB - View the full discussion <https://forums.adobe.com/message/7143179#7143179>
    @Papo – then I would suggest not a scrollable frame, but an MSO you can click through showing the whole text.
    1. Do some text frames the same size
    2. Thread all the text frames to one story
    3. Align the text frames
    4. Make a MSO out of it:
    The text frames are still threaded and text can easily flow between them.
    5. Do some navigation buttons to control the MSO states:
    "Go To First State", "Go To Next State", "Go To Previous State" etc.
    Later you can change the text (add or remove, do different formatting) flowing in that story through the states.
    A very flexible solution. Ok, it's not a scrollable frame, but something like that…
    Uwe
    If the reply above answers your question, please take a moment to mark this answer as correct by visiting: https://forums.adobe.com/message/7143179#7143179 and clicking ‘Correct’ below the answer
    Replies to this message go to everyone subscribed to this thread, not directly to the person who posted the message. To post a reply, either reply to this email or visit the message page:
    Please note that the Adobe Forums do not accept email attachments. If you want to embed an image in your message please visit the thread in the forum and click the camera icon: https://forums.adobe.com/message/7143179#7143179
    To unsubscribe from this thread, please visit the message page at , click "Following" at the top right, & "Stop Following"
    Start a new discussion in InDesign EPUB by email <mailto:[email protected]software.com> or at Adobe Community <https://forums.adobe.com/choose-container.jspa?contentType=1&containerType=14&container=50 01>
    For more information about maintaining your forum email notifications please go to https://forums.adobe.com/thread/1516624 <https://forums.adobe.com/thread/1516624>.

  • How do I import an InDesign tagged text file into multiple pages and export as .ps or .pdf using Jav

    I have an InDesign tagged text file I've translated from .xml. I need to automate the following steps:
    1 - access specific InDesign template (eg. ABC_template.ind)
    2 - import tagged text file into InDesign
    3 - autoflow text to END of document (normally around 3-5 pages)
    4 - save document as either .ps or .pdf file
    5 - where the input file stub name matches the output stub name (eg., OrigName.txt outputs as OrigName.pdf).
    I would like to completely automate this whole process using JavaScript (because I don't know anyone that knows AppleScript). I've automated the first part using a perl script. I've been trying to find sample snipits of JavaScript that would do one or more of the items listed above, but am having a hard time finding what I need.
    Please, I'm desperate!! Can any of you InDesign scripting guru's out there help me??
    Thanks in advance!!
    LindaD

    Hi Linda,
    I might be able to help you out. You can contact me by email (click on my user name for the address), or if you post your email here.

Maybe you are looking for