Hidden formatting held over from Word doc conversion

Our company is shifting our product manuals from Word docs to InDesign.  I'm a newbie to InDesign, and I have just converted my first attempt using File>Place.  This was a moderate size file (15MB), with a lot of embedded images, diagrams, and formatting.  Although some of the text formatting didn't convert properly, the clean up has been easy.  The biggest issues I've found are residual page and section breaks that needed to be manually removed from my text frames.
My boss, however, is hesitant and concerned that there might be some "hidden" formatting that carried over from Word that will ultimately undo all my progress and wreck havoc with the new ID document when we least expect it.  Is there anything I should be looking for in particular?

ML,
This topic was just discussed at InDesignSecrets.com in the last two podcasts.
Go to:http://indesignsecrets.com/category/podcasts
You will find out almost everything you want to know about importing Word files. Especailly check out the part on "Maggy-ing" the Word file, this might be what your boss was concerned with.

Similar Messages

  • Remove Header and footer from Word Docs

    Is it possible to remove the header/footer from Word docs?
    The html output from ctx_doc.markup gets the header/footer and (specialy)page numbers all messed up.
    []'s
    thanks

    Please define 'messed up'. I'll reproduce here and then try to find a cause/solution. I checked a couple of other forums and Metalink with no good discussion on the topic.
    -Ron

  • How do I make hearts and symbols disappear from Word Doc converted from pdf?

    This is the first time I've tried to convert a document.  I am in a new job and I'm stuck.  How do I convert pdf to word and not get symbols all over the Word doc?

    Appears to be a font related issue.
    Regardless, all you can do with the circumstances you describe is to use MS Word to perform cleanup of the export content.
    Alternatively print the PDF to paper and use that as a source for transcription into a fresh word processor file.
    Be well...

  • How can I keep the formatting when converting from Word to pages?

    How can I keep the formatting when converting from Word to pages?

    Use only formatting that is supported (in the same manner) in both applications. You'll need to find which that is through trial and error. Beyond those formatting options, you'll need to do some corrections, as Fruhulda suggests.
    Regards,
    Barry

  • Strange bug: Acrobat 9 deletes words upon conversion from Word doc

    Situation: converting from MS Word 2007 (SP2) to Acrobat 9 Standard (v 9.2.0) in WinXP.
    I've tried this repeatedly. The first word in multiple headings disappear when converted to PDF (they remain in the Word doc, but don't exist in the PDF that was produced from the doc).
    Example: "Correlation Analysis" becomes " Analysis".  This is happening to the text of every "Heading 2" style heading. All the other headings are unaffected.  Does anyone know why this is happening?
    This is insidious because we assume that the finished doc matches the original one. So now I have to also proof the Adobe conversion for unexpected deleted words?
    In the meatime, I'll return to Adobe 8 to do these conversions.

    I see a similar problem being reported in this posting:
    http://forums.adobe.com/message/2370280
    No resolution there either.
    Regarding your suspicion that it may be related to Word, what contradicts that is the fact that the full text of the header appears in the resulting PDF bookmark!!  However, the first word of the text is missing from the heading in the PDF (though it remains in the source Word doc).
    Also, I am NOT referring to the page header or footer.  I'm referring the text heading in the body of the document. In my case, only the "Header 2" styled headings are affected.

  • How do I save formatting from Word docs to pages?

    Hi
    Often when I open a word document in Pages I get a document warning pop up box.  Often I find that basic things from the Word doc have been changed.  For example, things that were in Bold text in the Word doc are no longer in Bold in the Pages file.  How do I prevent this from happening as the document moves back and forth between myself ( a Pages user) and a clien (who has Word)?

    Hello
    As far as I know, Word create faux-style when a bold or an italic version of a font isn't available.
    In such case, bold or italic will be dropped in Pages.
    From my point of view it's a good choice but it may hurt some users.
    Yvan KOENIG (VALLAURIS, France) lundi 13 juin 2011 16:18:30
    iMac 21”5, i7, 2.8 GHz, 4 Gbytes, 1 Tbytes, mac OS X 10.6.7
    Please : Search for questions similar to your own before submitting them to the community
    To be the AW6 successor, iWork MUST integrate a TRUE DB, not a list organizer !

  • Add sig to word doc conversion from pdf

    Why can't I add a signature to the word doc I just converted from .pdf?

    Thank you so much for getting back to me, Dennis!
    Glad you are all set.
    Please don't hesitate to reach out if you have further questions.
    Kindest regards, Stacy

  • Question re Create PDF from Word doc

    Hi
    I am trying to create a pdf from a Word document (Word 2007, Acrobat 8 Professional), and am having lots of trouble with images,fonts and document overhead. With the fonts, i generate the PDF by saying Create Pdf from the plugin in Word. I specify in the Preferences that fonts are not to be embedded. I then open the PDF and access PDF Optimiser -> Audit Space Usage, and it says that fonts take up like 20% of the document, however there are only like 10 lines of text in Arial (9pt), and 6 titles also in Arial (Bold). I dont understand why Font is taking up so much space considering that i have elected not to embed fonts (if i go to the Fonts section in the Optimiser, it shows both embedded and unembedded panes blank).
    With the images, i have 2 jpegs in the footer, that are compressed (JPEG -> Low). The other thing i am struggling with is the headers and footers, if i generate the PDF from the Word doc with Headers/Footers, then images take up over 15% of the document. Not sure if there is a way to add headers and footers within minimum impact of filesize? I have tried checking everything in the 3 categories of Dicard Objects, Discard User data and Clean Up. but the Document Overhead remains at 35%.
    What else can i do to get these filesizes reduced?I have been researching this for days and have not come across anything that has helped.I have tried to PDF print, i have changed the font to like Courier, ensured no thumbnails or bookmarks, ensured JPEGs are not embedded in the doc, tried Save As, Save As under a different filename - basically anthing you can find on the net i have found and tried, but still cant fix this!!!! please, if anyone knows acrobate 8 better than me / or knows what the problem is, please advise????
    Thanks very much.

    The colors and size in a graphic needs to be done in a graphics editor. What type of editor would depend on the use of vector graphics versus bitmap. Sometimes vector graphics are larger than bitmaps if you are using a lot of lines that would display better as just a splotch of color. Such are the variations between vector and bitmap graphics, but important if you are looking for size reduction. For a bitmap, I would do the sizing and color depth with IrfanView, but you should be able to do that with PhotoShop if you have it. Vector graphics can be adjusted in Illustrator. The size of vector graphics is not an issue since they are scalable, but the size of a bitmap is important since your are looking at individual pixels and that depends on size. The point is that if you can adjust the color depth and size for the desired pixel resolution, the bitmap is optimized for the conversion to PDF from WORD.
    As I mentioned, the smallest file size job options should minimize font storage in the PDF. Checking with the PDF Optimizer does not always give you all of the fonts. I am not sure why. It is better to check the font tab in the document properties to see what has been embedded. There is a preflight macro to embed fonts, you might check to see if there is one to delete fonts (I have not checked on that). Sometimes you can play with the reprint of a PDF, but that is not an option that is generally recommended, particularly if you have any tagging or such. Of course, tagging can really bloat a PDF, but is needed for a variety of reasons such as format for saving back to WORD (not a great workflow), accessability, and related issues. In another topic, there is some discussion of the purposes of tagging and bookmarks. However, the tags and bookmarks take space if that is really an issue for you. The latter are avoided if you use the print to the Adobe PDF printer and do not use PDF Maker in the PDF creation process. Again, there is a trade-off here in terms of size and functionality, particularly accessibility compliance.
    Not sure I am helping as I run on, but sort what might be useful for you.

  • Unique issue with PDF to WORD .doc conversion with Acrobat Pro - any ideas?

    I have been unable to solve the following issue when converting (save as...) PDF documents to Microsoft Word .doc using numerous methods. This could either be an issue that would be fixed in Acrobat Pro itself, or in MS Word - posting to the Adobe forums first.
    PREFACE: I am attempting to use the converted .doc file with translation applications/software. Google Translator Toolkit is what I use the most, but ALL other translators are having this very same issue with the .doc file. --The source PDFs are product information from drug manufacturers in various countries that I need to have translated to English. I do not have access to their source documents, as they do not provide their own source docs for obvious reasons.
    ALSO: I cannot use Google Translator toolkit to translate from PDFs directly - if you do that, it will attempt to translate a PDF and then export in an .html file, but it does not get the exact spacing of the sentences correctly, which leads to errors in translating - key things such as "can take with alcohol" and "do not take with alcohol". So that's out!
    I am not having any problems with the resultant .doc file in MS Word itself. It looks right, the spacing matches the original PDF source perfectly, prints correctly, etc... Reference here on a product info sheet from Austria in German:
    The problem: This is a screenshot from Google Translator Toolkit - the right side of the image - the spacing in the lettering from the .doc file I am uploading is not being read correctly, resulting in untranslated gibberish. (Note: this isn't a problem with the translation applications or software -- all are having this issue with .doc files converted from .pdf - this issue isn't present with any old .doc file that wasn't converted from a .pdf) -- It's definitely got something to do with some kind of embedded data in the .doc file that I cannot isolate!!)
    My settings in Adobe Pro (convert from PDF to .doc):
    Page layout: Flowing Text (this prevents the resultant .doc from having all of those text boxes, which also don't then work in translators)
    Include comments: True
    Include images: True
    Run OCR if needed: True
    Notes:
    -I have run OCR text recognition on the source PDF files in it's specific language.
    -I have edited the accessibilty of the PDF and have run the tag recognition and quick checks (to see if they solved the issue, which it did not - tagged or untagged, same problems!)
    -I have exported the .doc BACK to PDF using MS Word's function, which results in a great looking tagged PDF. THEN I re-saved this new PDF back as a .doc - same issue.
    -I have tried saving the PDF in all of the other formats that the translators accept. All have different issues. The only one that works consistently is saving to a .txt (plain)... The best is a .doc to .doc conversion, with all the original spacing. (I am not spending hours reformatting a .txt translation in word)...
    I can't seem to find where this spacing data is in the .doc file!!!! (Changing the fonts, sizes, margins -- doesnt fix this either). I have tried so many methods...
    Any thoughts on other things to try in Adobe Pro (or Word)?
    EDIT: Here's an additional tidbit of info that may be the key to this... There's some kind of coding that is in the .doc that Adobe Pro converted from the source PDF that doesnt display in Word, but that is being seen by the translation programs....... I have no idea what these are, but I want to remove them!
    Message was edited by: KaotikADC

    I would suggest you look at the fonts that are being used. It may be a font issue that is not properly being read by the translation program.

  • Acrobat 9 Pro PDF is not retaining original page settings from Word doc

    The document I am trying to put into PDF format (from Word 2000, using Acrobat 9 Pro (trial version)) drops my original Word page settings (5.5" x 8.5") and defaults to 8.5" x 11" in the PDF.  Using "Crop Pages" does not remedy the situation - the doc just stays at the larger size in Acrobat/PDF. Can anyone tell me what I might be doing wrong?

    Change the paper size in the printer properties before creating the PDF. This is the same as putting the proper paper in a physical printer.

  • From Word DOC to Acrobat PDF... borders and lines get screwed up.

    I've been looking this up for days now and have yet to find a solution.
    I have a MS Word document that has a bunch of tables with borders and sometimes just text with borders for stylistic reasons. In Word, it all looks perfect.
    But, when I convert it to PDF using Acrobat, many of those borders get screwed up in various random ways.
    Sometimes certain borders get thicker, sometimes thinner, sometimes invisible altogether. Sometimes it's only certain lines of the same border (just the left side, or just the top). I see no real pattern to what's causing it to only happen in certain cases only.
    What makes it extra annoying is that if I zoom in or out a certain amount, the borders will look the way they are supposed to look. Some will look perfect at 100% zoom only, and some will look perfect at 75% zoom only.
    This has been driving me insane. Does anyone have any idea how I can fix this?
    Any help would be greatly appreciated.

    Converting Word (table) to pdf - lines screwed up - googled as far back as 2004.
    BUG STILL exists. HELP/FIX PLEASE?
    http://www.pcreview.co.uk/forums/missing-table-lines-conversion-pdf-t878406.html
    http://forums.adobe.com/thread/305508 
    Trying to convert any word doc with tables (& shading) to PDF
    - basic table, black borders throughout
    - shaded headings, black outline border
    - shaded subheadings, black outline border 
    However when convert to PDF:
    - 'displays' NO top cell border for some/all shaded rows
    - shows diff thickness lines
    - each conversion, diff lines missing/incorrectly sized
    - however converted pdf prints perfectly fine 
    Adobe know about the bug, per PRMW's (Paul's) post on 2009-07-15 15:44:34, however only offered a painful time consuming workaround using non-freeware Adobe Pro:
    http://acrobatusers.com/forum/pdf-creation/word-pdf-table-lines-missing-or-faded#comment-7 8139
    - "It is not feasable to edit 200+ tables in the PDF every time the PDF is generated, as we maintain the original in word.
    - "This complete issue seems to have been passed off by Adobe as no problem and that there is a work around. I consider this an unsatisfactory response from a major product supplier. 
    Microsoft TechNet & NitroPdf said it's an Adobe issue & to contact Adobe to fix the bug. 
    Tried, but proble exists:
    * Word 2010 > File  > Save & Send > Create PDF/XPS Document
    * Word 2010 > Save As > Pdf
    * Word 2010 > Print > PrimoPdf  (even tried properties > advanced > dpi 300/600/2400) > Custom
    * Word 2010 > Print > doPDF v7  (even tried 'high quality images)
    * Word 2010 > Print > PDFCreator
    * Word 2010 > Print > CutePdf Writer      (even worse)
    * Nitro Pdf Reader  > Convert From File > (even worse)
    * www.pdfonline.com > Word to Pdf         (even worse)
    * www.wordtopdf.com > email: Sorry, an unexpected conversion failure occurred when converting your file. 
    Software:
    * Word 2010 - tried with .docx & .doc (97 to 2003)
    * Adobe Reader 8.2.6 (freeware), then upgraded to Adobe Reader X 10.0.1 (freeware)
    * GhostScript 9.01 w32 (freeware)
    * CutePdf Writer (freeware)
    * PrimoPdf (freeware)
    * Nitro Pdf Reader 1.4.0.11 (freeware)
    * doPDF 7.2.361 (freeware)
    * PDFCreator 1.2.0 (opensource - www.pdfforge.org) 
    Seems to display better at 300%, but lines still not right (even at 2400%), but who views pdf's at this zoom?
    Message was edited by: shell_l_d

  • Copy/paste text from Word doc or PDF to Robohelp

    Bolded words, indentions, paragraphs, line spacing, font, and
    text size amongst other things are lost when I do a simple
    copy/paste... Any way to relieve this?
    Thanks

    Hi
    I'd like to answer this in a slightly different way..
    It's a good idea to format anything you paste into RH in RH
    using paragraph styles etc. Hardcoded formatting won't be under
    stylesheet control which is very bad news.
    In fact I generally paste stuff from a word doc into notepad
    first to remove any formatting soI have "clean" text to begin with.
    That way you ensure consistency when styles are updated etc
    as well maintaining consistency with what's already there.

  • Acrobat X Converts Non-Bold Font to Bold From Word Doc

    I had used Acrobat 8 at work for over three years with no problems converting any MS Office documents to PDF files.  I've purchased Acrobat X Pro for my use at home and now every time I try to convert a Word file (either a doc or a docx), it changes the font on the first page to bold and leaves the second page normal, as the original font in the Word file.  I'm not using an especially unusual font (tried it with both Californian and Antiqua, with the same results) and the font shows as embedded in the Acrobat properties, so I don't think that's the problem.
    Has anyone else experienced this problem and, if so, is this just one of those "improvements" that renders the software useless, or is it possible this is something they hope to fix soon?

    I'm not sure what you mean by "checking the font for changes".  In the Word file, only items that are supposed to be bold show as bold; in Adobe, the first page shows entirely as bold and the properties show that I have both regular, italic, and bold fonts - which is correct, because the original Word file has both regular, italic, and bold fonts in it - just not where they are supposed to be (at least on the first page). Second page is fine.  And, as I mentioned, "converting it from Word 2007 using Save As PDF" - that *is* using the plug in.  My point is that whether using the Word PDF plug in or converting directly using Acrobat X created the problem.  I say "created" because this morning, it seems to be working fine.  No rhyme. No reason. Not doing anything differently.  I do know that I've seen a similar problem in a PDF letter I received from a colleague who was using Acrobat X (while I was still using 8).  She reviewed the original Word file and, like mine, it was fine, it was just in converting that the problem arose.  So I've decided that the solution involves rebooting one's 'droid.  It seems that 21st century solutions are the same as they were a quarter of a century ago.  Oh joy.
    Thanks to all for the input, though.

  • Import Glossary from Word Doc?

    Hello!
    Is it possible to import the contents of a Word doc into a
    project's glossary, rather than having to create from scratch
    and/or cutting & pasting into the Glossary?
    I saw nothing about this in LiveDocs or the Forum...
    If I have a Word doc, what styles should be applied to the
    term and to the definition?
    Thanks in advance!
    Kathy

    Hi Kathy
    Yes, it's possible. However, keep in mind that the Glossary
    file exists as a simple ASCII text file. Thus, no formatting is
    allowed. I might suggest the following approach:
    Add a single term to the Glossary using RoboHelp. Then sneak
    behind the scenes and open the ProjectName.GLO file with Windows
    Notepad. Note the structure. Now format the text in your Word
    document to follow the example you just looked at. You could then
    select it all, copy and paste into the .GLO file.
    Cheers... Rick

  • Acrobat Hangs Creating PDF from Word Doc - Possible Solution

    For the last several months, I had been working on converting a 20MB Word document into a PDF. The Word doc has external hyperlinks as well as many internal hyperlinks. It also has many sections and uses the TOC generator. I found that when I converted it to a PDF, whether I was using PDF Maker in Word or directly in Acrobat, the process would hang. In Acrobat it would hang either at 5% or 10%. In Word, the hang would occur before it processed the bookmarks.
    I believe I found the solution. When using PDF Maker, you are prompted about SmartTagging the document and how it can be time consuming. I always selected "No". Apparently, even with selecting "No", just having this prompt appear is part of the problem. When I went into the settings for PDF Maker and turned off SmartTagging there, so that when creating the PDF you are not prompted, the PDF creation is successful everytime.
    It took me many months of aggravation before I discovered this fix. I can replicate the issue everytime I let it prompt me about SmartTagging, so there is definitely some quirk going on there.
    HTH,
    Merg

    Hi,
    I'm facing the same problem, pdf maker hangs while "converting codes", this happens once in a while, i'm doing a .net batch conversion of about 2000 documents. (MS Word 2003 acro prof 7)
    Can you tell me where in the settings i find  samrt tagging
    (i'm am using only a fixed set of headings to create tags)
    Huubo

Maybe you are looking for

  • How to install office 2010 on bootcamp windows 7

    Hi, I purchased office 2010 for home and business. I have to download the software from the net, but when I do, I can't run the installation. I double click the icon, then it askes me specify the program I want to use... First of all, I don't know wh

  • Text name in ps

    I need the text name(RSTXT-TDNAME) of a component in ps, i can see for example: 30000000013760001, the three first characters are the mandant and the 4 in the end are the position, and in the middle i think that it is the network, but it doesn´t suit

  • Podcast archive only appearing under music

    Radio 4's In Our Time have recently made their whole archive available to download. I've been subscribing to the podcast for some years. However, when I import the new, archived In Our Time programmes they're separated as podcasts under genre in Musi

  • Slow startup on G4 Cube

    Startup on my G4 Cube 450mhz running 10.4.11 has become abnormally slow. Once running, I don't find that applications are having any running issues. What hard drive maintenance/procedures are recommended to possibly remedy this issue?

  • Need help for transfer itune library to new computer.

    I bought I new computer and I want to transfer my itunes library from my old computer to my new one. i have been able to get my music and apps but I can't get my movies and tv shows. I can't find how to back them up on to cd and I tried redownloading