Non-Latin is converted into entity reference in "Copy Text" of Extract

The Japanese (other than Latin) text which I copied in a clipboard is converted into entity reference when I perform "copy text" in a letter layer using Extract for Brackets.
Is this problem a known thing? This does not rise in Extract of Creative Clouds, Dream Weaver.
An example:
Oh, it is → あ
아 → 아
أ نا ← أنا

Hi Bernard,
This is really a 'browser issue' (which is probably not what you wanted to hear). Since it's up to the browser how it handles copy/paste.

Similar Messages

Can not edit Word docs after converting from a ExportPDF file. The PDF file is converted into a non-editable Word Doc image.

Converting a PDF file (scanned copy Adobe Reader X) to a Word Doc using ExportPDF conversion. The Word Doc created is a image file which is not editable. I need an editable Word Doc file.

Hi robertw,
Can you please tell me more about how the PDF was created? (If you don't know, choose File > Properties, and look for the PDF Producer on the Description tab.)
If the PDF was created by a third-party PDF creator, it may be that it's not well-written, and that can affect the quality of the conversion. See Will Adobe ExportPDF convert both text and form... | Adobe Community.
That said, you may be able to select the text by triple-clicking in the Word document.
Please let us know how it goes.
Best,
Sara

Ignoring entity references

Does anyone know how to get the xmlparser wto ignore(pass through without decoding) character entity references?
I'm using XML as a mechanism to move inserts, updates, and deletes from non-oracle databases into Oracle. The
XML is put in an A.Q. and then parsed on the Oracle side into sql statements.
We have entity references like ° that we would like the parser to leave alone. We could then insert the reference
into the database and Web browsers would decode it at retrieval time. Note - we are not using the 9i XML datatype to
store this data - it is parsed into varchar columns. Currently the parse knows only about the 4 or 5 standard references.
When it sees one that is not in the DTD it pukes.

Entity reference means that you use some encoding that does not have
e.g. character � then you must declare entity references in dtd like:
<!ENTITY uacute "�"> 
and in xml:
<myTextElement>Perú</myTextElement>.
but if you have the character in xml like this
<myTextElement>Per�</myTextElement>
then you must add encoding to <?xml version="1.0" encoding="some-enc-that-has-eacute">accordingly.
Anyway, to get rid of these kind of problems you should stick to utf-8 :-)
BTW. Did I answer the right question?
Kullervo

How to send non-latin unicode characters from Flex application to a web service?

Hi,
I am creating an XML containing data entered by user into a TextInput. The XML is sent then to HTTPService.
I've tried this
var xml : XML = <title>{_title}</title>;
and this
var xml : XML = new XML("<title>" + _title + "</title>");
_title variable is filled with string taken from TextInput.
When user enters non-latin characters (e.g. russian) the web service responds that XML contains characters that are not UTF-8.
I run a sniffer and found that non-printable characters are sent to the web service like
<title>��</title>
How can I encode non-latin characters to UTF-8?
I have an idea to use ByteArray and pair of functions readMultiByte / writeMultiByte (to write in current character set and read UTF-8) but I need to determine the current character set Flex (or TextInput) is using.
Can anyone help convert the characters?
Thanks in advance,
best regards,
Sergey.

Found tha answer myself: set System.useCodePage to false

Entity Reference difficulties

I'm trying to generate an XML so that it shows the Entity References. I need help figuring what I'm doing wrong. Complete source is included below.
When I run the included source, I get:
<?xml version="1.0"encoding="UTF-8"?>
<xmlTest>
<polish text1="�"/>
</xmlTest>But what I really want is:
<?xml version="1.0" encoding="UTF-8"?>
<xmlTest>
<polish text1="& # 0 2 2 5 ;"/>
<note value="Note that the forum is too smart for my special formatting. Please disregard the spaces in between each character in the previous element. This current element is just a note is is also to be ignored."/>
</xmlTest>Source:
import java.io.*;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.transform.OutputKeys;
import javax.xml.transform.Transformer;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.dom.DOMSource;
import javax.xml.transform.stream.StreamResult;
import org.w3c.dom.*;
* @author pclement
public class XmlTransformerEntityIssue {
    /** Creates a new instance of XmlTransformerEntityIssue */
    public XmlTransformerEntityIssue() {
    public static void main(String a[]) throws Exception {
        XmlTransformerEntityIssue test = new
XmlTransformerEntityIssue();
        test.execute();
    public void execute() throws Exception {
        Document doc = generateTempDoc();
        StringBuffer text = transform(doc);
        doc = null;
        System.out.println(text);
        File file = File.createTempFile("xkgjfhfhf", ".xml",
new File("c:/temp/"));
        FileWriter w = new FileWriter(file);
        w.write(text.toString());
        w.flush();
        w.close();
        file = null;
    private Document generateTempDoc() throws Exception {
        DocumentBuilderFactory factory =
DocumentBuilderFactory.newInstance();
        factory.setValidating(false);
        DocumentBuilder docBuilder =
factory.newDocumentBuilder();
        Document doc = docBuilder.newDocument();
        Element xmlTest =
(Element)doc.createElement("xmlTest");
        // try polish
        Element node = (Element)doc.createElement("polish");
        node.setAttribute("text1", "\u00E1");
        // append element
        xmlTest.appendChild(node);
        node = null;
        doc.appendChild(xmlTest);
        return doc;
    private StringBuffer transform(Document doc) throws
Exception {
        TransformerFactory tFactory =
TransformerFactory.newInstance();
        Transformer transformer = tFactory.newTransformer();
        DOMSource source = new DOMSource(doc);
        StringWriter writer = new StringWriter();
        StreamResult result = new StreamResult(writer);
        transformer.transform(source, result);
        doc = null;
        return writer.getBuffer();
}

Turns out, that the character '�' CANNOT be
e represented raw in UTF8. As UTF8 uses the first
128 chars for English, it uses the rest for special
formatting. It is quite well documented.I don't know what you mean by "raw" here. Your characterization of UTF-8 is wrong, although Microsoft's description is correct. It is possible to represent á in UTF-8, it just takes two bytes. I suppose you might call this "special formattting" but I certainly don't. That's just how UTF-8 represents characters.
http://msdn.microsoft.com/library/default.asp?url=/lib
rary/en-us/dnxml/html/xmlencodings.aspYou might want to read this article too:
http://skew.org/xml/tutorial/
I feel it describes things better for Java users.
After having read MS's site, I then went to
O'Reilly's Java Internationalization book. On pages
166 and 167 is where the following block of code came
from:
byte bytes[] = text.getBytes("UTF8");
return new String(bytes);
I don't have the book. Perhaps there's a context where that code is meaningful. There are certainly plenty of contexts where it is not.
I realize that it's a lot easier to read and comment
than to do a POC (proof of concept). But I'm not
sure why you say that the original XML is fine!Looks like I was wrong about that. Apparently you were not generating the original XML correctly. Still, you shouldn't have to resort to hacks like that to get non-ASCII characters into XML. It looks like you read it from a UTF-8 encoded file, incorrectly using your system's default encoding. In that case the O'Reilly hack would reverse that error. At least it would for that particular character. If it had been a Chinese character it probably wouldn't have worked.
The new code runs quite well and comes from O'Reilly
- a source I trust.It's hard to argue with working software.

Coding in non-latin languages

I manage a website that is translated into 100+ languages and they keep adding more. Right now I have to add Sinhala, I have the font, but the problem is that when I copy from the PDF or Word to dreamweaver the text is converted into gibberish as I am in code view. Does anyone have experience working in these type of languages? The other problem is that I do not speak these languages, so I cannot simply type it in. I have to rely on the translated text to copy it over.
I should also mention that the text is within variables that are on a php includes page, (i.e. $header_description = "sample text where the translation goes.";)
Dreamweaver CC on Windows 7

I have built a cms (content management system) that can be used with multiple languages. Although the maximum number of languages that have been used is four, the system can(in theory) handle any number of languages using any alphabet. I have worked with Vietnamese, Japanese, Chinese Simplified, Spanish and German, but not Sinhala.
I do not use font files for any language. I use the UTF8 character set, which, as far as I know, has it all. The translators (who are in their native countries) paste the text into form textareas. I assume they work from MS Word often. We have not tried pasting from a PDF, which doesn't sound like a good approach.
Some people convert the text with non-latin characters to html entities, but I think that is a really bad idea. If your entire web workflow is set to UTF-8, there should not be a problem.

Escaping standard XML entity references

I've been trying to find a way to escape the greater than sign as well as other chars in my xsl stylesheet. I've tried several tips from the postings, but none seem to work. What I need is for my xml to the characters escaped in the same way as the & & ampersand and < < less than chars are. I run into problems when I try to get the ampersand in next to other characters without a separating ;
Thanks in advance.

Vassago (guest) wrote:
: 1) The XML-parser for Java does not expand entity references,
: such as &[whatever], instead all values are null. How to fix
: this? Code example would be greatly appreciated.
: 2) It seems you cannot have international character (such as
: swedish characters, w) as values for internal entities, i.e.
: <!ENTITY Ouml "Y">
: How to solve this problem?
1) You probably have a simple error defining/using your entities
since we've a number of regression tests that handle entity
references fine. A simple example is:
<?xml version="1.0"?>
<!DOCTYPE po [
<!ENTITY status "beta">
]>
<po foo="bar">
<comment> Alpha, then &status </comment>
</po>
2) What do you set your character set encoding to be?
Oracle XML Team
http://technet.oracle.com
Oracle Technology Network
null

ALV QUANTITY field Geting converted into decimals ?

Hi All ,
I have developed an alv report .In that the quantity field is made editable after editing the quantity when i save it .It gets converted into decimals.Suppose if i give 77 its getting reflected as .077 all the field name ,table name ,reference field name, reference table name, quantity field data type has been passed but the problem persists.If there is any one whos has come across this kind of scenario please share your thoughts on this.
Best Regards,
Sreeram

Hi Sreeram,
I am not gettin this issue. See below code :
If you are gettiing the same issue with the below code then it has something to do with your user settings.
Go to Menu Item System -> User Profile -> Own Data -> Defaults ->Decimal Notation.
Set it to 1,234,567.89 and save. Now try the program again.
REPORT z_test.
TYPE-POOLS: slis.
TYPES : BEGIN OF ty_ekpo,
          ebeln TYPE ebeln,
          ebelp TYPE ebelp,
          ktmng TYPE ktmng,
          menge TYPE bstmg,
        END OF ty_ekpo.
DATA: it_ekpo TYPE TABLE OF ty_ekpo,
      it_fieldcat TYPE slis_t_fieldcat_alv.
SELECT ebeln ebelp ktmng menge
FROM ekpo
INTO TABLE it_ekpo
UP TO 20 ROWS.
PERFORM add_fieldcat USING 'EBELN'.
PERFORM add_fieldcat USING 'EBELP'.
PERFORM add_fieldcat USING 'KTMNG'.
PERFORM add_fieldcat USING 'MENGE'.
CALL FUNCTION 'REUSE_ALV_GRID_DISPLAY'
EXPORTING
   i_callback_program                = sy-repid
   it_fieldcat                       = it_fieldcat
TABLES
    t_outtab                          = it_ekpo
EXCEPTIONS
   program_error                     = 1
   OTHERS                            = 2
IF sy-subrc <> 0.
MESSAGE ID sy-msgid TYPE sy-msgty NUMBER sy-msgno
          WITH sy-msgv1 sy-msgv2 sy-msgv3 sy-msgv4.
ENDIF.
FORM add_fieldcat USING p_fieldname TYPE slis_fieldcat_alv-fieldname.
DATA: wa_fieldcat TYPE slis_fieldcat_alv.
CLEAR wa_fieldcat.
wa_fieldcat-fieldname = p_fieldname.
wa_fieldcat-tabname = 'IT_EKPO'.
wa_fieldcat-ref_fieldname = p_fieldname.
wa_fieldcat-ref_tabname = 'EKPO'.
IF p_fieldname EQ 'KTMNG'.
    wa_fieldcat-edit = 'X'.
ENDIF.
APPEND wa_fieldcat TO it_fieldcat.
ENDFORM.                    " ADD_FIELDCAT
Regards,
Jovito

How to Push the into SAP ( XML data converted into IDOC )

Hi
i am getting XML file from Non SAP system.I need to push XML data into SAP on daily basis with out using XI as the middleware.
I know if i get text file will use BDC's or LSMW. But i am getting data in XML format and then i need to converted into IDOC format and stored in to sap data base tables.
Thanks for advance.
srini

Is the XML an IDOC-XML or custom XML that you need to post as an IDOC??
If it is IDOC-XML you need to defined XMLFile port to process the IDOC-XML without any mapping.
If it is a custom XML, parse the XML data into an internal table (as required) & continue with BDC or IDOC posting as you wish.
Check for XML parsing programs..
-Siva Maranani

Songs can't be convert into Ring Tone

There seems to be a problem with itunes or with the songs buying online. When i bought the songs and try to convert into ring tones it say "this song can no longer be covert into a Ring tones" i bought about 5 to 6 songs and all of them says the same thing, actully the first one i bought works then none of the rest works. Now can anyone tell me how solve this problems. thanks

go here
http://www.dyohanan.com/apple/iphone-ringtone-maker/
the trial software allows you to make 3 ringtones, and they actually work. i have 2 new FREE ringtones on my iPhone, the actual version only cost $15, so thats a pretty good deal. Check it out.

We cannot type Polish (non-latin) characters in WebDynpro applications

We cannot type Polish (non-latin) characters in WebDynpro application (in runtime) because 'Browser Help Shortcuts' are fired.
To type a polish character in polish keyboard you need to press AltGr + letter (ie. AltGr + a/c/e/s/o/l/z/x/n). To type an uppercase polish character you need to press AltGr + Shift + letter. This comination is in fact the same as pressing Alt + Ctrl + Shift + letter (because AltGr produces Alt + Ctrl) and it fires some of 'Browser Help Shortcuts'. For example AltGr + Shift + O should produce a letter O with a dash on it's top but instead it fires 'Show nesting of HTML containers'.
We tried to turn off sap-wd-lightspeed, but then other key combinations are reserved for u2018Browser Help Shortcutsu2019.
We need to be able to use AltGr + Shift + a/c/e/s/o/l/z/x/n in runtime.
Product: SAP NW 7.11 SP04
WebDynpro for Java
I hope there is a somewhere a hidden parameter that solves our problem Maybe we're in some kind of debug mode?
Thanks for your help!!

The funny thing is that bold font [when message unread in message list] shows OK, ie in greek, but when i click on unread message, it is assumed to have been read, so it changes over to medium [non bold] and the encoding changes as well into the one that is not greek and thus unreadable. In ~/.sylpheed/sylpheedrc the fonts are:
widget_font=
message_font=-microsoft-sylfaenarm-medium-r-normal-*-*-160-*-*-p-*-iso8859-7
normal_font=-monotype-arial-medium-r-normal-*-12-*-*-*-*-*-iso8859-7
bold_font=-monotype-arial-bold-r-normal-*-12-*-*-*-*-*-iso8859-7
small_font=-monotype-arial-medium-r-normal-*-12-*-*-*-*-*-iso8859-7
In /etc/gtk, for gtk1.2 apps the file refering to greek encoding [el] seems to be fine [exactly the same as in slackware 9.1].

How do I get rid of non-latin fonts?

Hi -
Using FontBook I've isolated all the non-latin fonts into one collection. I had Photoshop shut down during this. I deactivated the fonts individually and then did the same to that non-latin collection. But when I launch Photoshop - all those fonts still show up on the font list. There have always been a few in MS Word - and they are still there - not deactivated.
Eventually I want to delete these useless fonts from my over loaded hard drive - but I can't even seem to isolate them and deactivate them using Font Book.
What is the correct procedure for deleting fonts - or at least deactivating them so they don't show up on any app's font list.
TIA your experience,
j2
PS. late 2011 MBPro, Mavericks, CS6 Adobe apps.

Entity Reference embedded within colspec attribute name-table col. heading

My team is working on a project that involves converting MS Word documents to XML. The XML is applied to a stylesheet (*.xsl) which generates a *.pdf document. Is it valid to include an xml entity reference within a colspec attribute name? For example, the xml document would contain the following code for a table:
<t id="t1109681052">TABLE 1.0 My Favorite Restaurants</t>
<tbl id="tbl921133606" val="1.0">
<TblHdg num="1">Table 1.0 My Favorite Restaurants </TblHdg>
<colspec id="colspec1148420628" name=" Restaurant Name" width="0.61875"/>
<colspec id="colspec201089067" name="Type of Authentic Cuisine" width="1.61875"/>
<colspec id="colspec201089069" name="Location" width="1.61875"/>
</tbl>
Is the following code valid in XML to produce the results in the sample table below when the XML is applied to a stylesheet (xsl)?
<colspec id="colspec201089067" name="Type of "Authentic" Cuisine" width="1.61875"/>
The desired table and column titles would be as follows:
Restaurant Name ----     Type of Authentic Cuisine ----     Location
Le Chantecler --------------->French     The Negresco Hotel
Promenade des Anglais Nice France
La Coupola -------------------->French     The Mirabeau Hotel
1 Princess Grace Ave
Monte Carlo Monaco
What is needed in the XML code to produce the table column heading with quotes embedded around the word "Authentic"?
Please advise. Thanks so much and have a prosperous and memorable holiday.

SOLUTION RESOLVED FOR THIS MESSAGE.

Embedded glyphs for alpha not working for non-Latin letters

Even after reading through all the posts here, I'm still
confused about why embedding is necessary for alphas on text and
how embedding works, so maybe someone can sort it out and help me
with my problem:
I have a FLA that is trying to present the same script in
(the user's choice of) several languages -- including non-Latin
alphabets like Korean, Japanese, Russian, Chinese and Arabic. I'm
using the Strings library to load translations into my text movie
clips on the stage.
The language stuff works great except that alpha tweens
weren't working. So I selected the movie clip symbols (three of
them) in the library and told Flash to embed all glpyhs. Each of
these symbols is using Trebuchet MS, so my thinking is that I'm not
exceeding the 65K limit because I'm embedding the same glyphs for
each. (But I'm not sure if that limit is for the SWF or per
symbol.) I only have one other font in the FLA and for that one,
I'm just embedding English alpha characters.
(Perhaps as an aside, I also included these fonts Stone Sans
and Trebuchet MS as fonts in my library, but I don't understand
whether this is absolutely necessary or if embedding glyphs
eliminates the need for these.)
So with those glyphs embedded, my text alpha tweens work
beautifully. However -- and this is where I begin to shed tears --
all the Korean and Cyrillic text is gone. It's just punctuation and
no characters. I don't have Chinese, Japanese or Arabic text loaded
yet, but I imagine that these would suffer from the same problem.
What am I doing wrong? Do I need to create more than one
movie to achieve my multilanguage goal? (Or worse, render each
language separately? -- yuck!) In my old version of Flash (just
up'd to CS3) I could tween alpha text with no problem, so why is
all this embedding even necessary?
Thanks,
Tim

Is this just impossible? Or really hard?

Non-Latin Character Display

I have several Java web applications on Tomcat where non-Latin characters function properly with only one exception. Non-Latin characters, Chinese in this case, can be displayed properly thorough the JSTL message tag. The related configuration are followings:
HTML: <meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
JSP: <%@ page contentType="text/html;charset=UTF-8" language="java" %>
The applications can take Chinese character inputs correctly with a filter of converting request character encoding to "UTF-8".
The only problem is that Chinese characters don't displayed properly when they are directly typied on a JSP file. I have set the Eclipse file text encoding to utf-8 and the characters are shown correctly in the IDE.
I believe that is a TC configuration related issue. After having "set JAVA_OPTS= -Dfile.encoding=UTF-8" in the catalina.bat file, nothing has changed.
How to solve this problem?
~~~~~~~~~~~~~~~~~~~~~~~~~~~
Solved!
Just need to have the JSP page tag on the related JSP file.
Message was edited by:
vwuvancouver

I have several Java web applications on Tomcat where non-Latin characters function properly with only one exception. Non-Latin characters, Chinese in this case, can be displayed properly thorough the JSTL message tag. The related configuration are followings:
HTML: <meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
JSP: <%@ page contentType="text/html;charset=UTF-8" language="java" %>
The applications can take Chinese character inputs correctly with a filter of converting request character encoding to "UTF-8".
The only problem is that Chinese characters don't displayed properly when they are directly typied on a JSP file. I have set the Eclipse file text encoding to utf-8 and the characters are shown correctly in the IDE.
I believe that is a TC configuration related issue. After having "set JAVA_OPTS= -Dfile.encoding=UTF-8" in the catalina.bat file, nothing has changed.
How to solve this problem?
~~~~~~~~~~~~~~~~~~~~~~~~~~~
Solved!
Just need to have the JSP page tag on the related JSP file.
Message was edited by:
vwuvancouver

Non-Latin is converted into entity reference in "Copy Text" of Extract

Similar Messages

Maybe you are looking for