Accented (Hungarian) characters - puzzle.

Hi all,
I have a .csv file with English words and their Hungarian equivalent - the .csv file is
fine and appears with the correct accents in both LibreOffice Calc and with gedit
and the command line (i.e. $> more my_file.csv) in bash.
==========================
consumption,fogyasztás
consumption,tüdővész
precious,értékes
precious,drága
===========================
I then create an external table
=================================
CREATE TABLE Hungarian_dict
(
  Angol VARCHAR2(50),
  Magyar VARCHAR2(50)
)
ORGANIZATION EXTERNAL
(
  type oracle_loader
  default directory MY_FILES
  access parameters
  (
    records delimited by newline
    fields terminated by ','
    missing field values are null
  )
  location ('SZOTAR.csv')
)
reject limit unlimited;
============================
The problem is that, for example, the result for the word "consumption"
Select * from hungarian_dict where angol = 'consumption';
appears as follows when exported from SQL Developer (it is also gibberish in SQL*Plus).
==========================
pol@pol-laptop:~/Downloads/personal/hungary$ more export.tsv
"ANGOL"     "MAGYAR"
"consumption"     "fogyasztás"
"consumption"     "tüdÅ‘vész"
pol@pol-laptop:~/Downloads/personal/hungary$
===========================
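The garbled "Å‘" in the export above is consistent with the two UTF-8 bytes of "ő" (C5 91) being interpreted as Windows-1252 somewhere between the file and the client. A minimal Python sketch of that effect follows; Python is used only for illustration and the function name is hypothetical. It does not show where in the Oracle tool chain the misreading happens.

```python
def misread_utf8_as_cp1252(text: str) -> str:
    """Encode text as UTF-8, then decode those same bytes as Windows-1252."""
    return text.encode("utf-8").decode("cp1252")


if __name__ == "__main__":
    # UTF-8 bytes of o-double-acute, shown as hex
    print("ő".encode("utf-8").hex())
    # The same bytes viewed through a Windows-1252 lens
    print(misread_utf8_as_cp1252("ő"))
```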
My question is: how do I get Oracle to display the correct characters? Do I have to
reload my table with different NLS_LANG settings? Or change NLS_LANGUAGE in the
database?
===========
SQL> show parameter NLS_LANGUAGE;
NAME                    TYPE     VALUE
nls_language               string     AMERICAN
SQL>
===============
at the bash prompt, if I type "locale" the result is
=====================================
en_IE.utf8
=====================================
for everything.
Should I create my table with a different character set and collation for the
Hungarian language field?
TIA,
Paul...

Firstly, thanks to both Srini and Orafad - just a follow-up (puzzling).
NLS_CHARACTERSET     WE8MSWIN1252
> Without CHARACTERSET clause, datafile encoding needs to match the database character set.
> But I guess the data you have is in UTF-8.
OK, I put CHARACTERSET UTF8 in the access parameters for my table.
I also specified NVARCHAR for my input fields.
This appears to give the correct characters - i.e. I ran these queries
SELECT * from angol_magyar where angol = 'handkerchief';
SELECT * from angol_magyar where angol = 'glove';
I get these results
"ANGOL"                  "MAGYAR"
"handkerchief"           "zsebkendő"      <- note o-double-acute - no problems with either SQL Developer or exporting to text - more in bash OK.
"glove"                  "kesztyű"        <- note u-double-acute - ditto.
But, when I run these queries
SELECT * from angol_magyar where magyar LIKE '%ő%';
SELECT * from angol_magyar where magyar LIKE '%ű%';
i.e. with the o-double-acute or the u-double-acute surrounded by % signs - which I thought would
just bring back all the words with the double-acute accented characters, these queries in fact bring back
any magyar word with o - i.e. "o" on its own and o-acute and o-umlaut and o-double-acute.
This is puzzling to say the least. Any ideas?
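One possible factor, offered as an assumption rather than a diagnosis: the database character set quoted in this thread, WE8MSWIN1252, has no code point for ő or ű, so an ordinary quoted literal containing them cannot be represented in that character set, whereas á, é, ö and ü can. The sketch below uses Python's cp1252 codec as a stand-in for WE8MSWIN1252 to check which Hungarian letters are representable. How Oracle substitutes the unrepresentable ones is not shown here.

```python
def representable(ch: str, codec: str = "cp1252") -> bool:
    """Return True if ch can be encoded in the given single-byte codec."""
    try:
        ch.encode(codec)
        return True
    except UnicodeEncodeError:
        return False


# The accented lowercase letters of the Hungarian alphabet
HUNGARIAN_ACCENTED = "áéíóöőúüű"

if __name__ == "__main__":
    for ch in HUNGARIAN_ACCENTED:
        print(ch, representable(ch))
```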
> To verify data stored, you could use:
> select column, dump(column, 1016) from table where suitable_condition_to_pick_row ... ;
For the query
SELECT magyar, dump(magyar, 1016) from angol_magyar where (angol = 'handkerchief' or angol = 'glove');
The result is (gibberish to me):
"kesztyű"     "Typ=1 Len=14 CharacterSet=AL16UTF16: 0,6b,0,65,0,73,0,7a,0,74,0,79,1,71"
"zsebkendő"     "Typ=1 Len=18 CharacterSet=AL16UTF16: 0,7a,0,73,0,65,0,62,0,6b,0,65,0,6e,0,64,1,51"
Also, how do I have different collations for the English and Hungarian fields in my table?
Paul...
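The DUMP output in the post above is a list of byte values in hexadecimal, and AL16UTF16 is Oracle's name for big-endian UTF-16, so each pair of bytes is one character. A small Python sketch (the helper name is hypothetical, and Python is used only to illustrate the decoding) turns the two byte lists back into text. The final pairs 1,71 and 1,51 are U+0171 (ű) and U+0151 (ő), which suggests the double-acute letters are stored correctly.

```python
def decode_dump(hex_bytes: str, encoding: str = "utf-16-be") -> str:
    """Turn a comma-separated list of hex byte values into text."""
    raw = bytes(int(value, 16) for value in hex_bytes.split(","))
    return raw.decode(encoding)


if __name__ == "__main__":
    # Byte lists copied from the DUMP output in the post
    print(decode_dump("0,6b,0,65,0,73,0,7a,0,74,0,79,1,71"))
    print(decode_dump("0,7a,0,73,0,65,0,62,0,6b,0,65,0,6e,0,64,1,51"))
```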

Similar Messages

  • Hungarian characters not displaying properly

    What is the fix for ColdFusion MX 6.1 when it does not
    display foreign characters properly? I have 2 links: one is just an
    HTML page, which displays my Hungarian characters properly, and when
    I change it to a ColdFusion page it does not display them properly.
    HTML Page
    http://www.etlsoft.hu/demo1/index6.htm
    ColdFusion Page
    http://www.etlsoft.hu/demo1/index6.cfm

    FYI I was able to solve the problem by adding this tag.
    <cfprocessingdirective pageencoding = "UTF-8" />

  • Using accents/special characters in keyboard WAY TOO SLOW!

    I often write in French, so many words have accents (a short example: l'été passé j'ai dû boire l'amère potion de la déception... A short sentence, 5 accents. It can be worse).
    I'm typing pretty fast, and already getting frustrated because the french keyboard doesn't have the ' included, which would save so much time in contractions. (Would probably be added automatically if I could use the corrector, but I can't, cause switching from french to english all the time, I just get stupid jumble all the time).
    But the worst of the worst is each time I need to type an accented letter, I lose 2.5 seconds! Because that's how long it takes for the accents options to pop-up after I hold my finger on the letter...
    It's completely ridiculous!
    Typing a short 3 letter word like "été" takes over 6 seconds!
    It makes me so mad!
    I almost completely stopped writing my french emails or Facebook updates and such from my iPod Touch and prefer to wait until I'm on my MBP! Is that normal when you have the chance to own the Touch?!
    I contacted Apple about this many months ago but, like any complaint I've made in the past, never heard back from them.
    Has anyone else had such a delay with the accents/special characters keyboard (popups)?
    Is there something I can do?

    Simply copy and paste. Just did it and it worked. Perhaps I'm not understanding something. What do you mean by "insert a font."

  • DOMparser throws IOException when encounters Hungarian Characters

    Hoi!
    I wrote a piece of code that extracts some
    information from an XML document into a vector of Java classes, using the oracle.xml.parser.v2.DOMParser.
    And it worked. Or seemed to work...
    But when I put some articles in the XML file
    in Hungarian, the parser threw IOException.
    If I replace the Hungarian characters with
    English "equivalents" (á -> a etc.), it works.
    I don't know. If XML is made up of Unicode characters, what's the problem with it?
    (The hex code of á was E1 in my text editor,
    as I'm using Win NT :(. )
    can I modify the xml prolog somehow?
    I'd rather not write a conversion program
    from a text file to another.
    Any ideas?
    and here's the code:
    DOMParser theParser = new DOMParser();
    XMLStreamToParse = XMLes.class.getResourceAsStream(xmlDocPath);
    theParser.setValidationMode(false);
    try{
    theParser.parse( XMLStreamToParse );
    //this throws IOException
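On the question about the XML prolog: an XML file with no encoding declaration (and no byte order mark) is read as UTF-8, and a lone E1 byte is not valid UTF-8. Declaring the file's real encoding in the prolog is the usual remedy. The sketch below illustrates this with Python's standard parser as a stand-in, not with oracle.xml.parser.v2. It assumes ISO-8859-1, where E1 is á; a file written on a Hungarian Windows system may instead need windows-1250.

```python
from xml.dom.minidom import parseString
from xml.parsers.expat import ExpatError

# 0xE1 is "á" in ISO-8859-1 and Windows-1250, but not valid UTF-8 on its own.
BODY = b"<word>\xe1</word>"
DECLARED = b'<?xml version="1.0" encoding="ISO-8859-1"?>' + BODY


def parses(document: bytes) -> bool:
    """Return True if the XML bytes parse without a well-formedness error."""
    try:
        parseString(document)
        return True
    except ExpatError:
        return False


def text_of(document: bytes) -> str:
    """Return the text content of the root element."""
    return parseString(document).documentElement.firstChild.data


if __name__ == "__main__":
    print(parses(BODY))       # no declaration: bytes are treated as UTF-8
    print(parses(DECLARED))   # declaration tells the parser how to decode E1
    print(text_of(DECLARED))
```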

    > What are you using as your test client?
    The test client is WebStone 1.0. WebStone always downloads the whole response, and reports the size of the response in bytes. From this I can see that when the IO exception occurs, WebStone is unable to read the whole response, as it reports a smaller size.
    So, I do not think the problem is that the client has prematurely aborted its download. WebStone doesn't work that way. I think something has gone awry on the server side, and this worries me.

  • Sap script - Problem while printing Hungarian characters during Unicode

    Hello Friends,
    We made Unicode changes, i.e. a version upgrade from 4.6C to ECC, and because of that Hungarian characters in SAPscript are printing as junk characters, e.g. #####.
    Can anyone please help me with this issue? We face the same issue in the SO10 print preview.
    Thank you,
    Regards,
    Phaneendra

    Hi Phaneedhra,
    the below thread explains the font issue for Hungary..
    SAPScript with Hungarian characters on it.  When we review the output online it looks good, but when we print it we get a ‘#’ in place of Hungarian characters.  I found the following link in SDN and am wondering if you can help.
    Hungarian Character Printing problem
    Problem in displaying special chars
    Prabhudas

  • Fail to pass special Hungarian characters using WSDL

    Dear All,
    I'm using WLS 7.0SP1, the webservice is generated with the ant task in rpc-style.
    The client is written in VB6 with MS SoapToolkit3.
    The simple method that receives and returns a
    string fails when the input or output
    contains special Hungarian characters.
    Can anyone help how to solve the problem?
    Thank you,
    Peter

    Hello,
    Thank you for the help, I set the property and it's working fine,
    the characters appear as they should!
    Meanwhile I realized that the failure was because of some
    unneeded '\0' characters at the back of the strings.
    Thank you again,
    Peter
    Bruce Stephens wrote:
    Hello,
    On the server, is the VM locale set to "en"?
    Try setting the following system property on the server startup:
    weblogic.webservice.i18n.charset="utf-8"
    Could you post a SOAP trace?
    Thanks,
    Bruce

  • Problem inserting text with special Hungarian characters into MySQL database

    When I insert text into my MySQL db, the special Hungarian
    characters (ő,ű) change into "?".
    When I check the
    <cfoutput>#FORM.special_character#</cfoutput> it gives
    me the correct text, things go wrong just when writing it into the
    db. My hosting provider said the following: "please try to
    evidently specify "latin2" charset with "latin2_hungarian_ci"
    collation when performing any operations with tables. It is
    supported by the server but not used by default." At my former
    hosting provider I had no such problem. Anyway how could I do what
    my hosting provider has suggested. I read a PHP related article
    that said use "SET NAMES latin2". How could I do such thing in
    ColdFusion? Any suggestion? Besides I've tried to use UTF8 and
    Latin2 character encoding both on my pages and in the db but with
    not much success.
    I've also read a French language message here in this forum
    that suggested to use:
    <cfscript>
    setEncoding("form", "utf-8");
    setEncoding("url", "utf-8");
    </cfscript>
    <cfcontent type="text/html; charset=utf-8">
    I've changed the utf-8 to latin2 and even to iso-8859-2 but
    didn't help.
    Thanks, Aron

    I read that it would be the most straightforward way to do
    everything in UTF-8 because it handles well special characters so
    I've tried to set up a simple testing environment. Besides I use CF
    MX7 and my hosting provider creates the dsn for me so I think the
    db driver is JDBC but not sure.
    1.) In Dreamweaver I created a page with UTF-8 encoding set
    the Unicode Normalization Form to "C" and checked the include
    unicode signature (BOM) checkbox. This created a page with the meta
    tag: <meta http-equiv="Content-Type" content="text/html;
    charset=utf-8" />. I've checked the HTTP header with an online
    utility at delorie.com and it gave me the following info:
    HTTP/1.1, Content-Type: text/html; charset=utf-8, Server:
    Microsoft-IIS/6.0
    2.) Then I put the following codes into the top of my page
    before everything:
    <cfprocessingdirective pageEncoding = "utf-8">
    <cfset setEncoding("URL", "utf-8")>
    <cfset setEncoding("FORM", "utf-8")>
    <cfcontent type="text/html; charset=utf-8">
    3.) I wrote some special Hungarian chars
    (<p>őű</p>) into the page and they displayed
    well all the time.
    4.) I've created a simple MySQL db (MySQL Community Edition
    5.0.27-community-nt) on my shared hosting server with phpMyAdmin
    with default charset of UTF-8 and choosing utf8_hungarian_ci as
    default collation. Then I created a MyISAM table and the collation
    was automatically applied to my varchar field into which I stored
    data with special chars. I've checked the properties of the MySQL
    server in MySQL-Front prog and found the following settings under
    the Variables tab: character_set_client: utf8,
    character_set_connection: utf8, character_set_database: latin1,
    character_set_results: utf8, character_set_server: latin1,
    character_set_system: utf8, collation_connection: utf8_general_ci,
    collation_database: latin1_swedish_ci, collation_server:
    latin1_swedish_ci.
    5.) I wrote a simple insert form into my page and tried it
    using both the content of the form field and a hardcoded string
    value and even tried to read back the value of the
    #FORM.special_char# variable. In each case the special Hungarian
    chars changed to "q" or "p" letters.
    Can anybody see something wrong in the above or
    have an idea to test something else?
    I am thinking about trying this same page against a db on my
    other hosting provider's MySQL server.
    Here is the link to the form:
    http://209.85.117.174/pages/proba/chartest/utf8_1/form.cfm
    Thanks, Aron

  • Printing Hungarian characters fails on Zebra Printer

    Hi guys,
    we are trying to print hungarian characters like the unicode U+0150 (LATIN CAPITAL LETTER O WITH DOUBLE ACUTE). I have loaded the latest Zebra drivers into our ECC 6.0, loaded the Swiss 721 Font into the printer, but still it doesn't work. Right now I'm a little stuck on the topic, the last thing I have tried was to change the codepage of the printer driver, but even when I use the one that works with our laser printers, the characters aren't printed or they are replaced by other characters in the codepages. Can anyone help us quite fast ;-)?
    Regards,
    Michael

    Hi Micheal,
    we are also facing same problem..
    when we raised note to SAP.. this is the response we got..
    After consulting with SAP, I found out that our SAP system doesn't support printing Hungarian characters because our SAP system configuration uses CP8000, which is a single codepage.  CP8000 doesn't support Latin2 characters. Hungarian is in the Latin2 group.
    To be able to print Hungarian characters, we need to switch to Unicode system configuration.
    see the below Links it may help for some ...
    Hungarian Character Printing problem
    Problem in displaying special chars
    Prabhudas

  • Aterm 1.0.1: no accented greek characters

    After a pacman -Syu a new version of aterm, namely 1.0.1, got installed. But then I cannot type greek accented characters in this new version and every program opened via aterm, eg vim. Greek characters type OK, only accents don't show. I don't know where the problem resides, I downgraded to aterm 1.0.0 and added aterm to IgnorePkg in /etc/pacman.conf. Now everything works as it should. Anyone knows the solution to this?

    Hi Dimos,
    as with most containers, GlassFish has a default charset encoding configured in a central configuration file (sun-web.xml). See the corresponding document for details:
    http://docs.sun.com/app/docs/doc/820-4502/beavn?l=en&a=view
    However, it seems I misunderstood your first posting: I thought you were experiencing that problem with the current production release, not with the EA-version.
    I'm not sure if the developers changed anything about the charset-handling in the current EA-version, but as there have been some issues with that until at least 1.0.2, it wouldn't be completely unlikely.
    You could try to find out if this is a general issue by switching your application (or a copy)/workspace to English and see if the problem persists. If not, it is very likely to be a charset problem with the current EA-version.
    -Udo

  • Files with accented / special characters dissapear / not visable

    I did everything which was suggested in thread http://discussions.apple.com/thread.jspa?threadID=875688&threadrate=2&messageid= 4170111#4170111 but have another problem which may be new:
    All the files appeared when I first used the USB HD on my Airport Extreme, but a few weeks ago I tried to play a song via iTunes and suddenly all my files containing special or accented characters were not recognized anymore. For example "josé gonzales - heartbeat.mp3" was gone...! Also in Finder.
    I connected my hard drive directly to my Mac and all the files were visible again! I did the account trick with the password and moved all the files into the shared folder, but all the files with accented characters were still not visible. I can't seem to find a solution. Has anyone had the same problem?
    Qryn

    Actually, The inso filters do not even work for me for PDF file versions 1.2+. Is anyone else experiencing the same difficulties ? How often does Oracle update the INSO filters to keep up with vendors (Adobe/Microsoft) coming out with newer versions of their document products ?

  • Jar (or Zip) file with problems in accents, special characters

    Hi! I have a servlet that creates a jar (or zip) file and then sends it. My problem is that when I create an entry whose filename has special characters such as accents, then unjar or unzip the file, it brings a lot of garbage characters. For example, the next entry:
    Exámenes.doc
    when unzipped is:
    Ex?�menes.doc
    I've tried a lot of things:
    -Setting the locale to ES, MX
    -Replacing all the letters with special characters with its unicode (like s.replace('á','\u00E1')
    -Trying to convert it to UTF8 (new String(path.getBytes(),"UTF8") )
    -Replacing the file separator char (according to a workaround that I found in the bug database)
    But nothing of this worked, alone or together. I've read that this is (or was) a bug in the API, but don't know if a solution has been found.
    Any help will be greatly appreciated!

    It's not clear what you are asking. Maybe this will help
    http://www.cfdev.com/code_samples/code.cfm/CodeID/83/Java/Simple_Ant_build_xml_Build_Task

  • Problem inserting special Hungarian characters into db

    Hi,
    I've posted this question in the database connection forum
    but put it here too because I don't know where to fit better.

    Some new info about the advancements in my project:
    I've tried to make the insertion at a third hosting
    provider's MySQL server with my 'everything is UTF-8' test case and
    IT'S DONE! There are my lovely spec chars :-)
    Then I checked the char encoding according to -Per's tip in
    all of the test MySQL dbs I have used so far and it reported
    'CHARSET=utf8 COLLATE=utf8_hungarian_ci', so this part seems to me
    OK.
    I asked the hosting provider where my production app should
    run about the db driver and they told me it's JDBC (I still don't
    know what version of Jconnect) and they are ready to append
    &characterSetResults=UTF-8 to the JDBC url connection string
    (somebody gave this tip too as a possible solution) but they asked
    me to provide the complete connection string to be used for my
    datasource. I've tried to compose it in my localhost development
    environment in ColdFusion Admin but it gave me a Connection
    verification failed error. So I think I did something wrong and
    need help to write the correct connection string that can be passed
    to the hosting provider. So the connection string structure I tried
    to use in the JDBC URL field of the datasource area of CFAdmin is
    something like this:
    jdbc:mysql://someipaddresshere/mydbname&characterSetResults=UTF-8
    How can it be corrected?
    Thanks, Aron
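One thing that stands out in the string above: in a URL the first parameter is introduced by "?" and only later ones by "&", whereas the attempted string puts "&" straight after the database name. The sketch below builds a MySQL JDBC URL in that shape. It is an illustration in Python with a hypothetical helper name; the host and database are the placeholders from the post, and whether characterSetResults alone cures the "q"/"p" problem on the production host is not confirmed.

```python
from urllib.parse import urlencode


def mysql_jdbc_url(host: str, database: str, **properties: str) -> str:
    """Build jdbc:mysql://host/database?key=value&key=value."""
    base = f"jdbc:mysql://{host}/{database}"
    if not properties:
        return base
    # "?" starts the property list; urlencode joins the rest with "&"
    return base + "?" + urlencode(properties)


if __name__ == "__main__":
    print(mysql_jdbc_url("someipaddresshere", "mydbname",
                         characterSetResults="UTF-8"))
```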

  • A few hungarian characters appear as #  in SAP main menu

    Hi,
    In the user menu favourites, a few Hungarian characters appear as #. It's all fine when I log in with English as the language.
    But when I log in with Portuguese, a few characters are displayed as #.

    Thanks for the information.
    Regards,
    pavan

  • Hungarian characters not printing correctly

    Hi All,
    I am working with scripts and trying to print the value of a variable through a variable which is coming from print program.
    For example : &gs_text001&
    variable &gs_text001& has value Kezdési időpont
    Kezdési is printed as it is
    but időpont is printed as id#pont
    Does any of you have any idea about this problem?
    Thanks,
    Mark

    BASIS need to set up printers for printing in Hungarian. It is inline with the character sets that a printer can support with particular fonts. Better search for OSS notes for more details

  • Accents/Special Characters

    I'm looking for a grep or script that will allow me to put accents on letters.  For instance, taking e and making it ē with the macron above it.  Our authors indicate they want this by indicating it this way:
    e[set macron over e]
    At this point, I have simple find/replaces locating all of those and replacing them with the ē.  It's the same for every letter asking for a specific accent above.
    I'm hoping to figure out a grep or script that will keep my find/replace list from getting too big.  At this point, I've got 35 find/replace just to locate these, and my list is growing every time I encounter another accent.  Considering how much each letter has...
    Anybody know of a simple way to do this?
    Thank you!

    Hi Matthew,
    Which option do you use to replace your text in the Find dialog?
    Otherwise use "Clipboard Contents, Formatted" in the "Change To" option.
    Clipboard contents are used to insert the most recent copy into your active document.
    Please find the screenshot. I hope it will be helpful....
    Thanks in advance
    BEGINNER
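As an alternative to a growing find/replace list, the markup described in the question, a letter followed by "[set ACCENT over LETTER]", can be handled by one pattern plus a small table of accents. The sketch below is in Python rather than a layout application's GREP, and the accent names in the table are assumptions about how authors spell them. It appends the Unicode combining mark to the letter and normalizes to the precomposed character where one exists.

```python
import re
import unicodedata

# Combining marks for a few accent names; extend as needed.
# The names are assumptions about the wording used in manuscripts.
COMBINING = {
    "macron": "\u0304",
    "acute": "\u0301",
    "grave": "\u0300",
    "circumflex": "\u0302",
    "diaeresis": "\u0308",
    "double acute": "\u030b",
}

# A letter, then "[set <accent> over <same letter>]"
MARKUP = re.compile(r"(\w)\[set ([a-z ]+?) over \1\]")


def apply_accents(text: str) -> str:
    """Replace accent markup with the accented letter."""
    def replace(match: re.Match) -> str:
        letter, accent = match.group(1), match.group(2)
        mark = COMBINING.get(accent)
        if mark is None:
            return match.group(0)  # leave unknown accents untouched
        return unicodedata.normalize("NFC", letter + mark)
    return MARKUP.sub(replace, text)


if __name__ == "__main__":
    print(apply_accents("e[set macron over e]"))
```

Adding a new accent then means adding one entry to the table instead of one find/replace per letter.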
