Highlight File Format and PDF Documents with Chinese and English characters

I'm a developer working on an application that makes use of the Highlight File Format / external highlight server capabilities of Adobe Reader.
The highlighting worked correctly until we started to introduce pdf documents that were scanned to recognize Chinese in addition to English.
The xml file seems to have the correct values in it. For example, if the 10 characters to highlight are at position 41 on the first page. The "xml" file has the <loc pg=0 pos=41 len=10>.
If the document is scanned for English only, it works fine. That is, the highlight starts at character 41. If the same document is scanned for Chinese and English, the highlight starts at character 22.
Has anyone had a similar experience? Do you know a solution?

Hi,
I don't know about BIP and the specifics of your context, but here are some general answers for the XLIFF format:
<?xml version = '1.0' encoding = 'utf-8'?>
Can we change encoding to 'ISO-8859-1' as soon as we convert the file format ?If you also save the file to ISO-8859-1 as well yes. But you can do this only for languages supported by Latin-1. It makes sense to keep the files in UTF-8.
A-2) he section <header><skl><internal-file> contains a huge string
which seeem to be binary ... What is this ? can we delete it ?That's likely to be the skeleton file. The data used to rebuild the original format after translation.
Most likely it should stay there.
A-3) Can we have one XLF file with muliple <file> sections (one per language to translate to ) ?
This can be very useful for us to manage one only translation file per report template.Yes but no: yves you can have several <file> elements in an XLIFF document, but no: they must be for the same language pair. XLIFF is designed to work with bi-lingual files, not multi-lingual files.
A-4) the most important section for translation is included in the <trans-unit> tag .
Each one has a distinct id like ""49e41f8f" ... Can we replace this by a more meaningfull value ?Those id attributes are used by the filter to merge back the data after translation. You should preserve them.
A-5) the language format is like "en-US" (language code + territory code).
Is it case sensitive ? No it is not case-sensitive (en-us == en-US). The values of xml:lang are not case sensitive (unlike other XML attribute values).
en-US is just the recommended notation.
The XLIFF specification are here:
http://docs.oasis-open.org/xliff/xliff-core/xliff-core.html
Hope this helps,
-ys

Similar Messages

  • We created a pdf document with comments and hyperlinks. On iPad, comments/hyperlinks disappear.

    Using Acrobat Pro 9, we created a pdf with comments and hyperlinks to be shared with a large community on the iPad, through a server. Once the document is opened on the iPad, all the comments and hyperlinks disappear. Are we missing a tool on the iPad or is there a different way to create this file? Thank you.

    Is everyone on the iPad using the latest version of Adobe Reader mobile for iOS (10.6 at the moment)? You can see all the comments (including drawing markups) as well as view the notes attached to text markup with this version.

  • Generating word and PDF documents

    How can I generate Word and pdf documents with JDeveloper ?
    Iwan

    You already got answers on your previous thread:
    Create *.doc and *.pdf files with jDeveloper

  • Need information about FI document  with header and no items

    Hello Experts!
    I need your help for understanding the following subject :
    How is it possible when using F.13 to obtain a fi document with header without items sometimes,
    and fi document with header and items anothers times?
    We didn't saw/found customizing topics about that ( to enable or not header without items) Some exists?
    The only difference in the two cases are the document's type involded which are not the same.
    Regards,
    Josiane

    Hi,
    We verify  and compare everything.  it's the same everywhere except the document type of the invoice
    first case, F.13 create conciliation fi document with header and no items
    invoice ( doc. type YB )
    payment ( doc type ZT)
    conciliation ( doc type XZ )
    2th case, F.13 create conciliation fi document with header   and none items
    invoice ( doc. type YT )
    payment ( doc type ZT )
    conciliation ( doc type XZ )
    We match / compare  every fields and value  between invoice of 1st and 2tch case,  every fields and value between payment  of 1st and 2th case.
    ( we create  tests case in own integration system for reproduce  )
    We alse match / compare customizing of invoice type document YB and YT and  they seems identical...
    The procedure is the same with F.13 to do conciliation in both cases.
    we got same amount, same currency etc etc.
    We still didn't understand why with invoice type YT the conciliation document created is not the same as invoice type YB ...
    Nobody have idea ?

  • Is there the way to use "Highlight File Format" with Acrobat Reader 9.0 or later?

    hi
    Do anyone know the way to use "Highlight File Format" with Acrobat Reader 9.0 later or substitutional way to highlight a PDF file as well as highlight file?
    I want to highlight some words by indicating character's location(from n to m) using Acrobat Reader 6, 7, 8, and 9.
    I use highlight xml file to highlight words in Acrobat 6, 7, and 8.
    http://www.adobe.com/devnet/pdf/pdfs/HighlightFileFormat.pdf
    According to this documents, "External Window Handler Plugin" is required to use highlight file.
    But I cannot find this plugin in Acrobat Reader 9.
    So Acrobat Reader doesn't recognize highlight file.
    Well, I know that there is another way to highlight pdf file. It is called "PDF Open Parameters".
    However this parameters cannot indicate location of characters but words which is to be searched.
    This parameter does not work perfectly to substitute a highlight xml file.
    To satisfy what i want to do, the best solution is that Acrobat Reader 9.0 recognizes highlight xml file.
    But I think it is impossible because of lack of plugin.
    Do anyone know the way to recognize a highlight xml file to 9.0 or another way to highlight some words?
    thanks,
    Asatamax

    The feature is disabled by default in reader 9
    (1) Open Adobe Reader
    (2) Click Edit > Preferences > Search, and
    (3) check the box labelled Enable search highlights from external highlight server.
    (response found here: http://support.dtsearch.com/dts0117.htm)
    I can't imagine why Adobe choose to disable it. It's a bad joke for us at UNESCO (http://unesdoc.unesco.org)

  • Tried to open a pdf document with Acrobat XI Pro Trial and sign in popped up , I keyed in my username and password but it doesnt respond its like grey with the four dots rotating forever. Any solution Please ?????

    Tried to open a pdf document with Acrobat XI Pro Trial and sign in popped up , I keyed in my username and password but it doesnt respond its like grey with the four dots rotating forever. Any solution Please ?????

    Hi higi97,
    How are you connected to the internet? Are you behind a particularly secure corporate firewall? Are you on Mac or Windows? Do you have any anti-malware software running on your machine that may be preventing applications other than your browser from connecting to the internet?
    You may try to follow the steps below:
    1.Close the Creative Cloud application.
    2.Navigate to the OOBE folder.
    Windows: [System drive]:\Users\[user name]\AppData\Local\Adobe\OOBE
    Mac OS: /Users/[user name]/Library/Application Support/Adobe/OOBE folder
    3.Delete the opm.db file.
    4.Launch Creative Cloud.
    Let us know if that helps,
    Regards,
    Rave

  • I am getting this response when I am trying to open a file: Before viewing PDF documents in this browser you must launch Adobe Reader and accept the End User License Agreement, then Quit and relaunch the browser."  What do I do?

    I am getting this response when I am trying to open a file: Before viewing PDF documents in this browser you must launch Adobe Reader and accept the End User License Agreement, then Quit and relaunch the browser.”  What do I do?  I have opened this up in the past without a problem. 

    Back up all data.
    If Adobe Reader or Acrobat is installed, there should be a setting in its preferences such as Display PDF in Browser. I don't use those applications myself, so I can't be more precise. Deselect that setting, if it's selected. Otherwise do as follows.
    Triple-click anywhere in the line of text below on this page to select it, the copy the selected text to the Clipboard (command-C):
    /Library/Internet Plug-ins
    In the Finder, select
    Go ▹ Go to Folder
    from the menu bar, or press the key combination shift-command-G. Paste into the text box that opens (command-V), then press return.
    From the folder that opens, remove any items that have "Adobe" or “PDF” in the name. You may be prompted for your login password. Then quit and relaunch Safari, and test.
    The "Silverlight" web plugin distributed by Microsoft can also interfere with PDF display in Safari, so you may need to remove it as well, if it's present. The same goes for a plugin called "iGetter," and perhaps others.
    If you still have the issue, repeat with this line:
    ~/Library/Internet Plug-ins
    If you don’t like the results of this procedure, restore the items from the backup you made before you started. Relaunch Safari again.

  • I want to share PDF documents with others. These documents have been edited (mostly highlights) by me. I want to share these documents without the changes I've made. How can I do this?

    I want to share PDF documents with others. These documents have been edited (mostly highlights) by me using Preview. I want to share these documents without the changes I've made. How can I do this? Do I have to manually erase all of the changes I've made. The people I want to share it with also have macs.
    Isaac

    You can move the iTunes folder to a separate drive on your pc an then configure each account to use this drive / folder as iTunes library ... The problem is, that iTunes by default stores everything in your personal music folder which is separate for each user account in windows (and by default is on "C" drive).
    I create a separate partition on every pc/laptop (and map a drive name) where I store music, videos and other mass data. This will also keep the "C" drive small, which is backed up from time to time and so my backup is also small.

  • PDF Documents with Functional Buttons - Works for IE and FireFox but not for Chrome and Safari

    To all,
    Good morning.  Our institution developed a PDF document with functional buttons for printing and e-mailing.  When the document is viewed via Internet Explorer and Firefox, the functional buttons work as expected.  However if the document is viewed via Chrome and Safari, the functional buttons do not work.  The workaround for those who are using Chrome and Safari, the PDF document needs to be saved to their local drive before the functional buttons work.
    Has anyone experienced this issue before?  If yes, what was your solution other than the workaround.
    Thanks.
    Babylon5

    Here's how to configure your browser to run Adobe Reader: http://helpx.adobe.com/acrobat/using/display-pdf-browser-acrobat-xi.html

  • How can my laptop access the downloaded files like mp3 and pdf documents which are in my iphone 4G?

    how can my laptop access the downloaded files like mp3 and pdf documents which are in my iphone?

    Yes you will need to NAT at some point to go from private to public address space. Here is a basic configuration if you are interested:
    interface F8
    ip nat inside
    interface G0
    ip nat outside
    ip access-list standard NAT
     permit 192.168.11.0 0.0.0.255
    ip nat inside source list NAT interface G0 overload

  • How can I open and listen to the PDF documents with audio in my iPad?

    How can I open and listen to the PDF documents with audio in my iPad?

    You need to use a PDF reader that supports multimedia. Adobe Reader 10.3 does not. Adobe has not stated whether it will support multimedia in the future.
    In the meantime, you could buy an application that does like PDF Expert. Read about it here:
    Finding the Best Tablet PDF Reader

  • I converted a PDF document to word and it came out all jumbled. How do I correct spelling and typo?

    I converted my pdf document to word and the words come out all jumbled. I thought the document would would come out like the original. what can I do with this document which will take forever to fix?

    Hi buklova,
    Is it happening with this file only or with every files?
    Would you send your PDF document to [email protected] as an email attachment? 
    I will check it from my end.Please add the link to this forum post for reference.
    Regards,
    Florence

  • Problem in printing pdf document with java code

    Hi All
    I want to print a pdf document with java code i have used PDFRenderer.jar to compile my code.
    Code:
    File f = new File("C:/Documents and Settings/123/Desktop/1241422767.pdf");
    FileInputStream fis = new FileInputStream(f);
    FileChannel fc = fis.getChannel();
    ByteBuffer bb = fc.map(FileChannel.MapMode.READ_ONLY, 0, fc.size());
    PDFFile pdfFile = new PDFFile(bb); // Create PDF Print Page
    PDFPrintPage pages = new PDFPrintPage(pdfFile);
    // Create Print Job
    PrinterJob pjob = PrinterJob.getPrinterJob();
    PageFormat pf = PrinterJob.getPrinterJob().defaultPage();
    pjob.setJobName(f.getName());
    Book book = new Book();
    book.append(pages, pf, pdfFile.getNumPages());
    pjob.setPageable(book);
    // System.out.println(pjob.getPrintService());
    // Send print job to default printer
    pjob.print();
    but when i am running my program i am getting error
    Exception in thread "main" java.awt.print.PrinterException: Invalid name of PrintService.
    Please anybody, knows the solution for this error?
    Thanks In Advance
    Indira

    It seems that either there is no default printer setup or you have too many printers or no printer setup at all. Try running the following code. It should print the list of available print services.
    import java.awt.print.*;
    import javax.print.*;
    public class PrintServiceNames{
         public static void main(String args[]) throws Exception {
              PrintService[] printServices = PrinterJob.lookupPrintServices();
              int i;
              for (i = 0; i < printServices.length; i++) {
                   System.out.println("P: " + printServices);
    }From the list pick one of the print service names and set it explicitly like "printerJob.setPrintService(printServices);" and then try running the program.

  • How to create a  PDF document with page curls using Adobe  CS 4?

    My  goal is to create a  PDF document with page curls. I am using Adobe  CS 4.
    1.      The document was created in Adobe InDesign  CS 4  where the page  turn (curl) transition  was applied.
    2.      Then the document was exported to .swf.
    3.     The .swf file was imported into   Adobe Acrobat Pro  to create a PDF file with  flip page or page curl transitions.
    These are the problems.
    1.      The background is not  transparent.
    2.      Page dimensions have to be increased at least an inch in width and length so that the full page can show
    3.      The command and+   will not only increases the document's  screen size. It increases the page margins.

    PDF was never designed to support the Flash page curl effect (it didn't exist back then). Anything you try (and you've tried the standard hack) will look like a hack. Personally, I don't think the effort is worth it for an effect that's much overused.

  • Storing MS-Word and PDF documents in database uploaded through APEX

    I want to allow users to upload MS-Word and PDF documents they have created / edited outside of the APEX app for storage in the database along with the other app data.
    I know that I should use a file browse item in a region for this purpose.
    I have a some questions:
    1. What database datatype is best for storing MS-Word and PDF docs? CLOB?
    2. How can the user see that the file has been uploaded? In other words, how can they see that the filename has been uploaded? I want to show this on the page for later viewing. I have uploaded a document, but later, I have no way to see that the document has in fact been uploaded. From the APEX page, the file browse item does not indicate whether or not a file has previously been uploaded.
    3. How can the user download the previously uploaded document?
    4. Is there a way to give users the capability to edit a document in an APEX app where they can do MS-Word-like editing (font size, font type, bold, italics, etc.)?
    I know I've asked a lot here and I appreciate your input.
    -Reid

    In my case, I am uploading screenshots of my application and then displaying them. You can feel free to use it if you want, the code was mostly taken from examples of file uploads posted by others. Here is some code to get you started:
    File Table description
    CREATE TABLE FILE_UPLOAD
       (name           VARCHAR2(4000) PRIMARY KEY,
        subject        VARCHAR2(4000),
        id             NUMBER,
        blob_content   BLOB,
        mime_type      VARCHAR2(4000),
        PAGE_ID        NUMBER);File Upload Procedure
    IF ( :P20_FILE_NAME is not null ) THEN
         INSERT INTO FILE_UPLOAD (id, NAME, SUBJECT, BLOB_CONTENT, MIME_TYPE, PAGE_ID)
          SELECT ID,:P20_FILE_NAME,:P20_SUBJECT, blob_content, mime_type, :P20_PAGE_ID
                FROM APEX_APPLICATION_FILES
                WHERE name = :P20_FILE_NAME;
       DELETE from APEX_APPLICATION_FILES WHERE name = :P20_FILE_NAME;
      END IF;Download Image Procedure Definition
    CREATE OR REPLACE PROCEDURE download_my_file(p_file in number) AS
            v_mime  VARCHAR2(48);
            v_length  NUMBER;
            v_file_name VARCHAR2(2000);
            Lob_loc  BLOB;
    BEGIN
      SELECT MIME_TYPE, BLOB_CONTENT, name,DBMS_LOB.GETLENGTH(blob_content)
             INTO v_mime,lob_loc,v_file_name,v_length
        FROM FILE_UPLOAD
       WHERE id = p_file;
        -- set up HTTP header
        -- use an NVL around the mime type and
        -- if it is a null set it to application/octect
        -- application/octect may launch a download window from windows
      owa_util.mime_header( nvl(v_mime,'application/octet'), FALSE );
        -- set the size so the browser knows how much to download
      htp.p('Content-length: ' || v_length);
        -- the filename will be used by the browser if the users does a save as
      htp.p('Content-Disposition:  attachment; filename="'||replace(replace(substr(v_file_name,instr(v_file_name,'/')+1),chr(10),null),chr(13),null)|| '"');
        -- close the headers           
      owa_util.http_header_close;
        -- download the BLOB
      wpg_docload.download_file( Lob_loc );
    end download_my_file;
    /make sure to grant the procedure rights for use by anyone
    GRANT EXECUTE ON download_my_file TO PUBLICDisplay Image Procedure Definition
    create or replace PROCEDURE "DISPLAY_IMAGE"
      inID NUMBER
    AS
      vMIME VARCHAR2(48);
      vLENGTH NUMBER;
      vFILENAME VARCHAR2(2000);
      vBLOB BLOB;
    BEGIN
      SELECT MIME_TYPE, BLOB_CONTENT, NAME, DBMS_LOB.GETLENGTH(BLOB_CONTENT)
        INTO vMIME, vBLOB, vFILENAME, vLENGTH
      FROM FILE_UPLOAD
      WHERE ID = inID;
      owa_util.mime_header(nvl(vMIME, 'application/octet'), FALSE);
      htp.p('Content-length: ' || vLENGTH);
      owa_util.http_header_close;
      wpg_docload.download_file(vBLOB);
    END;
    /Again, make sure everyone has rights to execute the procedure.
    GRANT EXECUTE ON DISPLAY_IMAGE TO PUBLICThis doesn't create an editor inside the browser, it just creates a simple upload/display process. For a WYSIWYG editor that handles PDF or Docs, you'll need something far more sophisticated, and I can't suggest anything there.

Maybe you are looking for

  • Does anyone know of an API that can gain access to system resource info?

    I wish to include cpu and memory usage monitors in an application but i don't know any way of obtaining that info. Does any one know a way of doing it?

  • HR Birthdate Issue

    We are trying to use the HR portion but have run into a birthdate issue. We have an employee born in 1948 but the interface will not allow the date to be entered. If we key in 1950 and do the calendar popup, 1948 appears but it sticks as 2048. It the

  • 91 Days and counting and Photo Stream still grayed out in iCloud CP

    Well, in the past 11 days, my daughter produced twin boys. However, Apple hasn't fixed the crummy attempt at iCloud for 91 days now. 91 Days and counting and Photo Stream still grayed out in iCloud CP since problem first discussed with Apple. No help

  • I have a problem in connecting to internet via WiFi.

    I have a problem in connecting to internet via WiFi. This week I received my new "iPhone 6 and 6 plus". Problem is the new phones doesn't connect to internet via my home WiFi  while I don't have any problem on my other devices. The phones connect to

  • Strange message when I try to update my ipod with songs...

    Everytime I try to update my iPod with a song, it appears in the playlist but when I unplug the iPod to listen to the song it disapears. Also I get this strange message. It says: The file or directory \ipod_Control\iTunes\Temp File 4 is corrupt and u