Conversion to UTF-8

Hello everybody,
I've recently been doing some i18n work on an API that I maintain. I was surprised that conversion to UTF-8 sometimes produces a result buffer that contains more characters than I need (after the terminating 0 there are meaningless chars). Maybe I'm wrong (please tell me if I am, so that I don't ask stupid questions ;)
Anyway, I tried simply cutting the byte[] buffer at the first occurrence of 0, and the back transformation worked fine.
Is there a simple fix for this issue?
Thanks in advance

Thanks for the answers!
First, I would like to note that I do know what UTF-8 encoding is (in particular, that one character can take more than one byte).
Next: I've solved my problem. It turned out that the String being converted sometimes contained a '\0' character, which ended up in the output. I had assumed that the converted string ends at the '\0' character and that the bytes after it are not needed.
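A minimal sketch of the behaviour described above, assuming the API in question is Java (the post mentions byte[] and String). Java strings are not NUL-terminated, so an embedded '\0' is encoded like any other character, and getBytes returns an array exactly as long as the encoded text. The class and helper names are hypothetical and only illustrate the point.

```java
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

class NulInString {
    // Encode text as UTF-8; the returned array is exactly as long as the encoded text.
    static byte[] encode(String text) {
        return text.getBytes(StandardCharsets.UTF_8);
    }

    // Hypothetical helper: keep only the bytes before the first 0x00, if there is one.
    static byte[] cutAtFirstNul(byte[] bytes) {
        for (int i = 0; i < bytes.length; i++) {
            if (bytes[i] == 0) {
                return Arrays.copyOf(bytes, i);
            }
        }
        return bytes;
    }

    public static void main(String[] args) {
        byte[] encoded = encode("ab\0cd");
        System.out.println(encoded.length);                // 5: the NUL is ordinary data
        System.out.println(cutAtFirstNul(encoded).length); // 2: everything from the NUL on is dropped
    }
}
```

Cutting at the first zero byte therefore discards real data whenever the source String legitimately contains '\0'. It is only safe if the zero is known to be a terminator.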
Anyway, now I have another problem.
If I try to convert this string to upper case (these are hex char codes)
0xC3 0x8D 0x77 0x2E 0x96 0x99 0x20 0xE6 0x86 0x70 0x6B 0x46 0x79 0x4B 0x54 0xF8 0x4B 0x9E 0x3C 0x2A 0x53 0x55 0xA8 0x47 0x85 0x8F 0xC3 0x67 0xFC 0xDF
the result is a String with 1 character more! Is this correct?
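A likely explanation, sketched here under the assumption that this is Java and that each hex code above is one char of the String: the last code, 0xDF, is the German sharp s ('ß'), and String.toUpperCase maps it to the two-character sequence "SS". The class name and the way the String is built are illustrative assumptions.

```java
import java.util.Locale;

class UpperCaseLength {
    // The char codes from the post, one Java char per code (an assumption about how the String was built).
    static final int[] CODES = {
        0xC3, 0x8D, 0x77, 0x2E, 0x96, 0x99, 0x20, 0xE6, 0x86, 0x70,
        0x6B, 0x46, 0x79, 0x4B, 0x54, 0xF8, 0x4B, 0x9E, 0x3C, 0x2A,
        0x53, 0x55, 0xA8, 0x47, 0x85, 0x8F, 0xC3, 0x67, 0xFC, 0xDF
    };

    static String fromCodes(int[] codes) {
        StringBuilder sb = new StringBuilder();
        for (int code : codes) {
            sb.append((char) code);
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        String original = fromCodes(CODES);
        String upper = original.toUpperCase(Locale.ROOT);
        System.out.println(original.length());                 // 30
        System.out.println(upper.length());                    // 31
        System.out.println("\u00DF".toUpperCase(Locale.ROOT)); // SS
    }
}
```

So the extra character is expected: case mapping is not guaranteed to preserve length, and code should not assume that the upper-cased String has the same number of chars as the original.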

Similar Messages

  • Conversion from UTF-16 to UTF-8 in XI

    Hi,
      Data from the source system (MDM) sometimes arrives in XI in UTF-16 format. My target system is R/3, which uses UTF-8. Here's the scenario:
    MDM->MQ Queue-> Local JMS Queue-> XI->R/3
    I am using the sender JMS adapter to receive the data from the local JMS queue and the receiver IDOC adapter to send the IDOC into R/3. I am using ABAP mapping for this scenario.
      Since the target system uses UTF-8 and the data sometimes arrives in UTF-16, how can I convert it from UTF-16 to UTF-8 in the sender JMS adapter?
    Please advise.
    A reply with details would be appreciated.
    BR
    Soumya

    Hi Soumya,
    You can do this in an adapter module in the JMS sender adapter.
    obj = inputModuleData.getPrincipalData();
    msg = (Message) obj;
    XMLPayload xmlpayload = msg.getDocument();
    xmlpayload.getContent();
    // convert the content from UTF-16 to UTF-8, then
    xmlpayload.setContent();
    Hope this works.
    Cheers,
    Reddy
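The conversion step itself can be sketched in plain Java, assuming the payload content is available as a byte array. The class and method names are hypothetical and nothing here is part of the XI adapter API. The UTF-16 decoder honours a byte-order mark if one is present and otherwise assumes big-endian. Malformed input is reported instead of being silently replaced.

```java
import java.nio.ByteBuffer;
import java.nio.charset.CharacterCodingException;
import java.nio.charset.CodingErrorAction;
import java.nio.charset.StandardCharsets;

class Utf16ToUtf8 {
    // Decode UTF-16 bytes strictly and re-encode the text as UTF-8.
    static byte[] transcode(byte[] utf16) {
        try {
            String text = StandardCharsets.UTF_16.newDecoder()
                    .onMalformedInput(CodingErrorAction.REPORT)
                    .onUnmappableCharacter(CodingErrorAction.REPORT)
                    .decode(ByteBuffer.wrap(utf16))
                    .toString();
            return text.getBytes(StandardCharsets.UTF_8);
        } catch (CharacterCodingException e) {
            // The input was not valid UTF-16; let the caller decide how to handle it.
            throw new IllegalArgumentException("input is not valid UTF-16", e);
        }
    }

    public static void main(String[] args) {
        byte[] utf16 = "<a>\u00e9</a>".getBytes(StandardCharsets.UTF_16);
        byte[] utf8 = transcode(utf16);
        System.out.println(new String(utf8, StandardCharsets.UTF_8));
    }
}
```

Note that this only changes the bytes. If the XML declaration inside the payload names UTF-16 as its encoding, it would have to be adjusted as well.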

  • Convert a UTF-16 format file to UTF-8 format

    Hi all
          I have an XML file in UTF-16 format that I have to convert to UTF-8 format so that
    it can be opened with Internet Explorer. Right now, when I open the file, it shows the error "Whitespace is not allowed at this location. Error processing resource 'file"
                    The file content is given below
                 < ? x m l   v e r s i o n = " 1 . 0 "   e n c o d i n g = "
    u t f - 8 " ? >
    < _ G L B - R G T X _ O R D E R S C P G >
         < I D O C   B E G I N = " 1 " >
             < E D I _ D C
    4 0   S E G M E N T = " 1 " >
                 < T A B N A M > E
    D I _ D C 4 0 < / T A B N A M >
                 < M A N D T > 1
    5 2 < / M A N D T >
                 < D O C N U M > 0 0 0 0 0 0
    0 0 0 2 7 5 9 7 2 2 < / D O C N U M >
                 < D O C R
    E L > 6 2 0 < / D O C R E L >
                 < S T A T U S > 3
    0 < / S T A T U S >
                 < D I R E C T > 1 < / D I R
    E C T >
                 < O U T M O D > 2 < / O U T M O D >
             < E X P R S S / >
                 < T E S T / >
          < I D O C T Y P > O R D E R S 0 5 < / I D O C T Y P >
                 < C I M T Y P > / G L B / R G T X _ O R D E R S
    C P G < / C I M T Y P >
                 < M E S T Y P > O R D C
    H G < / M E S T Y P >
                 < M E S C O D / >
         < M E S F C T / >
                 < S T D / >
    < S T D V R S / >
                 < S T D M E S > O R D C H G <
    / S T D M E S >
                 < S N D P O R > S A P G E 7 < /
    S N D P O R >
                 < S N D P R T > L S < / S N D P R
    T >
                 < S N D P F C / >
                 < S N D P R
    N > G E 7 Q A R 3 1 5 2 < / S N D P R N >
                 < S N
    D S A D / >
                 < S N D L A D / >
                 < R
    C V P O R > A 0 0 0 0 0 0 0 4 0 < / R C V P O R >
      < R C V P R T > L S < / R C V P R T >
                 < R C V
    P F C > L S < / R C V P F C >
                 < R C V P R N > R
    L E C P G W D I 1 < / R C V P R N >
                 < R C V S A
    D / >
                 < R C V L A D / >
                 < C R E D
    A T > 2 0 0 7 0 6 0 5 < / C R E D A T >
                 < C R E
    T I M > 1 5 4 1 5 3 < / C R E T I M >
                 < R E F I
    N T / >
                 < R E F G R P / >
                 < R E F
    M E S / >
                 < A R C K E Y / >
                 < S E
    R I A L > B U S 2 0 1 2       1 1 0 0 0 0 0 3 0 1 < / S E R
    I A L >
             < / E D I _ D C 4 0 >
             < E 1 E D K
    0 1   S E G M E N T = " 1 " >
                 < C U R C Y > E U
    R < / C U R C Y >
                 < W K U R S > 1 . 0 0 0 0 0 <
    / W K U R S >
                 < Z T E R M > Z 0 0 1 < / Z T E R
    M >
                 < B S A R T > N B < / B S A R T >
       < B E L N R > 4 5 2 0 0 2 2 0 7 2 1 1 0 0 0 0 0 3 0 0 < /
    B E L N R >
                 < R E C I P N T _ N O > 0 1 0 0 0 0
    0 0 0 5 < / R E C I P N T _ N O >
                 < A B R V W _
    B E Z > B U S 2 0 1 2       1 1 0 0 0 0 0 3 0 0 < / A B R V
    W _ B E Z >
                 < _ G L B - R G T _ C H A N G E F L
    A G   S E G M E N T = " 1 " >
                     < Q U A L F >
    0 0 2 < / Q U A L F >
                     < F L A G > X < / F L
    A G >
                 < / _ G L B - R G T _ C H A N G E F L A G
    >
                 < _ G L B - R G T _ C P G D A T A   S E G M E
    N T = " 1 " >
                     < L A N G U > e n < / L A N G
    U >
                     < C O N S I G N _ F L A G > S < / C O N
    S I G N _ F L A G >
                     < I N D I C A T > S < /
    I N D I C A T >
                 < / _ G L B - R G T _ C P G D A
    T A >
             < / E 1 E D K 0 1 >
             < E 1 E D K 1 4 
    S E G M E N T = " 1 " >
                 < Q U A L F > 0 1 4 <
    / Q U A L F >
                 < O R G I D > Z 2 0 0 < / O R G I
    D >
             < / E 1 E D K 1 4 >
             < E 1 E D K 1 4  
    S E G M E N T = " 1 " >
                 < Q U A L F > 0 0 9 < /
    Q U A L F >
                 < O R G I D > 0 0 3 < / O R G I D >
             < / E 1 E D K 1 4 >
             < E 1 E D K 1 4   S E
    G M E N T = " 1 " >
                 < Q U A L F > 0 1 3 < / Q U
    A L F >
                 < O R G I D > N B < / O R G I D >
       < / E 1 E D K 1 4 >
             < E 1 E D K 1 4   S E G M E
    N T = " 1 " >
                 < Q U A L F > 0 1 1 < / Q U A L F
    >
                 < O R G I D > Z 2 0 1 < / O R G I D >
    < / E 1 E D K 1 4 >
             < E 1 E D K 0 3   S E G M E N
    T = " 1 " >
                 < I D D A T > 0 1 2 < / I D D A T >
                 < D A T U M > 2 0 0 7 0 5 1 8 < / D A T U M >
           < / E 1 E D K 0 3 >
             < E 1 E D K 0 3   S E G
    M E N T = " 1 " >
                 < I D D A T > 0 1 1 < / I D D
    A T >
                 < D A T U M > 2 0 0 7 0 5 1 8 < / D A T U
    M >
             < / E 1 E D K 0 3 >
             < E 1 E D K A 1  
    S E G M E N T = " 1 " >
                 < P A R V W > A G < / P
    A R V W >
                 < P A R T N > 2 0 0 0 0 1 2 < / P A R
    T N >
                 < T E L F 1 > + 4 1 2 1 9 2 4 1 8 6 8 < /
    T E L F 1 >
                 < T E L F X > + 4 1 2 1 9 2 4 1 8 6
    7 < / T E L F X >
                 < B N A M E > P l a n t   O p
    e r .   B u y e r < / B N A M E >
                 < P A O R G >
    Z 2 0 0 < / P A O R G >
                 < O R G T X > E u r o p
    e < / O R G T X >
                 < P A G R U > 0 0 3 < / P A G
    R U >
             < / E 1 E D K A 1 >
             < E 1 E D K A 1 
    S E G M E N T = " 1 " >
                 < P A R V W > L F < /
    P A R V W >
                 < P A R T N > 0 1 0 0 0 0 0 0 0 5 <
    / P A R T N >
                 < N A M E 1 > I n t e r n a t i o
    n a l   A g r o . < / N A M E 1 >
                 < S T R A S >
    1 4   R u e   d e   S a h o n e < / S T R A S >
    < S T R S 2 > C h r i s t < / S T R S 2 >
                 < P F
    A C H > 1 5 5 < / P F A C H >
                 < O R T 0 1 > L y
    o n < / O R T 0 1 >
                 < P S T L Z > 1 0 2 0 4 < /
    P S T L Z >
                 < L A N D 1 > F R < / L A N D 1 >
               < T E L F 1 > 0 8 1 < / T E L F 1 >
    < S P R A S > E < / S P R A S >
                 < B N A M E > A
    l f r e d o < / B N A M E >
                 < I L N N R > 7 6 5
    4 3 2 1 5 4 3 2 1 5 < / I L N N R >
                 < S P R A S
    _ I S O > E N < / S P R A S _ I S O >
                 < _ G L B
    - R G T _ T A X C O D E S   S E G M E N T = " 1 " >
            < S P T N U M > 3 4 5 < / S P T N U M >
        < C Y T A X N U M > 4 5 6 < / C Y T A X N U M >
            < O T R I N F O > F R 9 8 7 1 2 5 6 4 4 4 2 < / O T
    R I N F O >
                 < / _ G L B - R G T _ T A X C O D E
    S >
             < / E 1 E D K A 1 >
             < E 1 E D K A 1  
    S E G M E N T = " 1 " >
                 < P A R V W > E K < / P
    A R V W >
                 < N A M E 1 > F R   P L   N e u t r a
    l i a < / N A M E 1 >
                 < N A M E 2 > G T   C o p
    y   R e f e r e n c e   F a c t o r y < / N A M E 2 >
          < S T R A S > G l o b e   S t r e e t   Z 2 0 3 < / S
    T R A S >
                 < O R T 0 1 > N e u t r a l i a < / O
    R T 0 1 >
                 < P S T L Z > 9 9 9 9 9 < / P S T L Z
    >
                 < L A N D 1 > F R < / L A N D 1 >
    < B N A M E > A l e s s a n d r o   V e r r e s c h i < / B
    N A M E >
                 < I L N N R > 5 0 0 0 2 4 3 0 0 0 4 9
    7 < / I L N N R >
                 < _ G L B - R G T _ T A X C O
    D E S   S E G M E N T = " 1 " >
                     < C R N U M
    > 2 9 8 < / C R N U M >
                     < S P T N U M > 3 9
    8 < / S P T N U M >
                     < C Y T A X N U M > 4 9
    8 < / C Y T A X N U M >
                 < / _ G L B - R G T _ T
    A X C O D E S >
                 < _ G L B - R G T _ C P G M I D
      S E G M E N T = " 1 " >
                     < S M T P _ A D D
    R > a l e s s a n d r o . v e r r e s c h i @ n e s t l e .
    c o m < / S M T P _ A D D R >
                 < / _ G L B - R G
    T _ C P G M I D >
             < / E 1 E D K A 1 >
             < E
    1 E D K A 1   S E G M E N T = " 1 " >
                 < P A R V
    W > L S < / P A R V W >
                 < P A R T N > 0 1 0 0 0
    0 0 0 0 5 < / P A R T N >
                 < I L N N R > 7 6 5 4
    3 2 1 5 4 3 2 1 5 < / I L N N R >
             < / E 1 E D K A 1
    >
             < E 1 E D K A 1   S E G M E N T = " 1 " >
         < P A R V W > W E < / P A R V W >
                 < N A M
    E 1 > F R   P L   N e u t r a l i a < / N A M E 1 >
        < N A M E 2 > G T   C o p y   R e f e r e n c e   F a c
    t o r y < / N A M E 2 >
                 < S T R A S > G l o b e
      S t r e e t   Z 2 0 3 < / S T R A S >
                 < O R T
    0 1 > N e u t r a l i a < / O R T 0 1 >
                 < P S T
    L Z > 9 9 9 9 9 < / P S T L Z >
                 < L A N D 1 > F
    R < / L A N D 1 >
                 < _ G L B - R G T _ E 1 E D K
    A 1   S E G M E N T = " 1 " / >
             < / E 1 E D K A 1 >
             < E 1 E D K A 1   S E G M E N T = " 1 " >
       < P A R V W > R G < / P A R V W >
                 < N A M E
    1 > G L O B E   R e f e r e n c e   F r a n c e < / N A M E
    1 >
                 < S T R A S > S t r e e t   1 < / S T R A S
    >
                 < O R T 0 1 > C i t y < / O R T 0 1 >
         < P S T L Z > 1 2 3 4 5 < / P S T L Z >
                 <
    L A N D 1 > F R < / L A N D 1 >
                 < I L N N R > 5
    0 0 0 2 4 3 0 0 0 7 0 1 < / I L N N R >
                 < _ G L
    B - R G T _ T A X C O D E S   S E G M E N T = " 1 " >
              < C O T A X N U M > 1 2 3 4 < / C O T A X N U M >
                     < C R N U M > 1 2 3 4 5 6 7 8 < / C R N U M
    >
                     < S P T N U M > 1 0 0 5 1 9 6 2 < / S P T
    N U M >
                     < C Y T A X N U M > 1 0 0 5 1 9 6 2
    < / C Y T A X N U M >
                     < O T R I N F O > 0 0
    0 0 0 0 0 5 6 7 8 9 < / O T R I N F O >
                     < J
    U R C O D E > 1 0 0 5 1 9 6 2 < / J U R C O D E >
      < / _ G L B - R G T _ T A X C O D E S >
             < / E 1 E
    D K A 1 >
             < E 1 E D K 0 2   S E G M E N T = " 1 " >
                 < Q U A L F > 0 0 1 < / Q U A L F >
    < B E L N R > 4 5 2 0 0 2 2 0 7 2 < / B E L N R >
       < D A T U M > 2 0 0 7 0 5 1 8 < / D A T U M >
    < U Z E I T > 1 4 0 4 4 0 < / U Z E I T >
             < / E 1
    E D K 0 2 >
             < E 1 E D K 1 8   S E G M E N T = " 1 "
    >
                 < Q U A L F > 0 0 1 < / Q U A L F >
       < T A G E > 3 0 < / T A G E >
                 < P R Z N T >
    2 . 0 0 0 < / P R Z N T >
             < / E 1 E D K 1 8 >
       < E 1 E D P 0 1   S E G M E N T = " 1 " >
                 <
    P O S E X > 0 0 0 1 0 < / P O S E X >
                 < A C T I
    O N > 0 0 2 < / A C T I O N >
                 < P S T Y P > 0 <
    / P S T Y P >
                 < M E N G E > 1 0 0 . 0 0 0 < / M
    E N G E >
                 < M E N E E > E A < / M E N E E >
             < B M N G 2 > 1 0 0 . 0 0 0 < / B M N G 2 >
         < P M E N E > E A < / P M E N E >
                 < V P R
    E I > 1 0 < / V P R E I >
                 < P E I N H > 1 < / P
    E I N H >
                 < N E T W R > 1 0 0 0 < / N E T W R >
                 < N T G E W > 8 0 0 < / N T G E W >
    < G E W E I > K G M < / G E W E I >
                 < M A T K
    L > P 2 0 < / M A T K L >
                 < B P U M N > 1 < / B
    P U M N >
                 < B P U M Z > 1 < / B P U M Z >
           < B R G E W > 1 0 0 0 < / B R G E W >
                 <
    W E R K S > Z 2 0 3 < / W E R K S >
                 < E 1 E D P
    0 4   S E G M E N T = " 1 " >
                     < M W S B T >
    0 . 0 0 < / M W S B T >
                 < / E 1 E D P 0 4 >
             < E 1 E D P 2 0   S E G M E N T = " 1 " >
           < W M E N G > 1 0 0 . 0 0 0 < / W M E N G >
           < A M E N G > 0 . 0 0 0 < / A M E N G >
       < E D A T U > 2 0 0 7 0 5 1 8 < / E D A T U >
         < E Z E I T > 1 4 5 0 0 0 < / E Z E I T >
       < _ G L B - R G T _ S L I D 0 1   S E G M E N T = " 1 " >
                         < E T E N R > 0 0 0 1 < / E T E N R >
                   < / _ G L B - R G T _ S L I D 0 1 >
       < / E 1 E D P 2 0 >
                 < E 1 E D P 1 9   S E G
    M E N T = " 1 " >
                     < Q U A L F > 0 0 1 < / Q
    U A L F >
                     < I D T N R > 0 0 0 0 0 0 0 0 0 0
    4 3 0 0 0 0 9 8 < / I D T N R >
                     < K T E X T
    > C a r t o n s   -   C e r e a l s   M i l k   2 4 x 4 5 0 
    g < / K T E X T >
                 < / E 1 E D P 1 9 >
        < E 1 E D P 1 9   S E G M E N T = " 1 " >
      < Q U A L F > 0 0 3 < / Q U A L F >
                     < I D
    T N R > 0 7 6 1 2 2 9 7 0 1 6 1 1 9 < / I D T N R >
        < / E 1 E D P 1 9 >
             < / E 1 E D P 0 1 >
    < E 1 E D S 0 1   S E G M E N T = " 1 " >
                 < S
    U M I D > 0 0 2 < / S U M I D >
                 < S U M M E > 1
    0 0 0 < / S U M M E >
                 < S U N I T > E U R < / S
    U N I T >
             < / E 1 E D S 0 1 >
             < E 1 E D S
    0 1   S E G M E N T = " 1 " >
                 < S U M I D > 0 0
    5 < / S U M I D >
                 < S U M M E > 0 < / S U M M E
    >
             < / E 1 E D S 0 1 >
         < / I D O C >
    < / _ G L
    B - R G T X _ O R D E R S C P G >

    Hi Saurabh,
    your XML is not well-formed because there are spaces in the prolog between the first character '<' and the '?'. For some reason there are spaces between all characters, which makes your document unreadable for parsers, including IE. To convert to UTF-8, you can use an XSLT mapping with the element "output" and the attribute "encoding":
    <?xml version="1.0" encoding="UTF-8"?>
    <xsl:stylesheet version="1.0" xmlns:xsl="http://www.w3.org/1999/XSL/Transform">
         <xsl:output encoding="UTF-8"/>
         <xsl:template match="/">
              <xsl:copy-of select="."/>
         </xsl:template>
    </xsl:stylesheet>
    Regards,
    Udo
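The "spaces" between all characters are probably what UTF-16 text looks like when a tool reads it one byte per character: every ASCII character is followed by a zero byte. The following Java sketch reproduces that effect, assuming little-endian UTF-16 without a byte-order mark (the post does not say which variant the file uses). The class and method names are hypothetical.

```java
import java.nio.charset.StandardCharsets;

class SpacedOutXml {
    // Encode text as UTF-16LE, then read it back one byte per character,
    // the way a tool that does not expect UTF-16 would.
    // NUL characters are replaced by spaces, as many viewers display them.
    static String asSeenByteWise(String text) {
        byte[] utf16le = text.getBytes(StandardCharsets.UTF_16LE);
        return new String(utf16le, StandardCharsets.ISO_8859_1).replace('\0', ' ');
    }

    // Decoding with the matching charset recovers the text; re-encoding yields UTF-8 bytes.
    static byte[] toUtf8(byte[] utf16le) {
        return new String(utf16le, StandardCharsets.UTF_16LE).getBytes(StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        System.out.println(asSeenByteWise("<?xml version"));
        byte[] fixed = toUtf8("<?xml version".getBytes(StandardCharsets.UTF_16LE));
        System.out.println(new String(fixed, StandardCharsets.UTF_8));
    }
}
```

The first line printed shows the same pattern as the file content above, including three spaces where the original text has one. If that is what happened, the file has to be re-encoded so that the bytes match the utf-8 declaration.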

  • Pacman: Pathname can't be converted from UTF-8 to current locale

    Hello Forum,
    I am stuck with a problem to which I cannot find any solution after spending half the day trying to solve it:
    I have a package which I installed from AUR -> MyCustomRepository, working nicely on my main Arch system. Now I would like to add the package to a live CD using the archiso scripts. When I add it to my packages.both list, it downloads fine.
    However when installation into chroot should start pacman gives me the following error:
    (378/378) loading package files [######################] 100%
    error: error while reading package /var/cache/pacman/pkg/pd-extended-0.43.4-1-x86_64.pkg.tar.xz: Pathname can't be converted from UTF-8 to current locale.
    error: failed to commit transaction (libarchive error)
    Errors occurred, no packages were upgraded.
    My system only uses en_US.UTF-8 as locale, so I don't really understand why any conversion should be necessary.
    I read about a similar error somewhere, with a reply stating it had to do with packaging. No solution was given, however. Can this be the case? And if yes, why do I not have any problems installing on my main system? And more importantly, how can I fix it? I don't really know where to look now.
    I already tried including a different package from AUR through my repository, which worked as expected.
    The package is pd-extended from AUR with a slight modification to make the .desktop file actually start pdextended.
    Any help is greatly appreciated.
    Thank you,
    Dominik
    # Maintainer: <fero dot kiraly at gmail.com>
    # Contributor: Dominik Schmidt-Philipp <[email protected]>
    pkgname=pd-extended
    pkgver=0.43.4
    pkgrel=1
    pkgdesc="PureData Extended 0.43-4 version"
    url="http://puredata.info/"
    arch=('i686' 'x86_64' )
    license=('BSD')
    depends=('libdc1394' 'desktop-file-utils' 'libiec61883' 'fftw' \
    'jack' 'tk' 'freeglut' 'libquicktime' 'libdv' 'gsl' 'imagemagick' \
    'ftgl' 'libgl' 'hicolor-icon-theme' 'lua51')
    makedepends=('dssi' 'git' 'libv4l' 'tcllib' 'subversion' 'swig' 'automake' 'curl')
    conflicts=('pdp' 'zexy' 'puredata')
    provides=( 'pd-extended' 'pd-gem' 'pdp' 'zexy')
    optdepends=('puredata-utils' 'pd-vanilla')
    replaces=(pd-extended)
    backup=()
    options=('!makeflags' '!libtool')
    install=pd-extended.install
    source=('change_gem_configure_file.patch' 's_inter.c.patch' 'tclpd.Makefile.patch' 'makefile.am.patch')
    md5sums=('63a36c51706eaed98611aa4b4bb6ed0d' '8b748d09e8bdf04540ba579d0a6b5318' '463de23166cef2dcf6c7f6989e636213' '5938b12a232004de15f5a383f95cedca' )
    build() {
    unset CFLAGS
    unset LDFLAGS
    unset INCLUDES
    #downloading pd-extended----------------------------------------------------------
    msg "Begin SVN checkout for pd-extended 0.43.4"
    cd $srcdir
    svn checkout https://pure-data.svn.sourceforge.net/svnroot/pure-data/branches/pd-extended/0.43 $pkgname
    msg "SVN checkout done or server timeout"
    cd ..
    #FIXES -----------------------------------------------------------------------------
    # fix #1 lua
    #patch $srcdir/$pkgname/externals/loaders/pdlua/src/Makefile ./repair_lua_lib_number.patch
    #fix #2 -- GEM configure.ac
    patch $srcdir/$pkgname/externals/Gem/configure.ac ./change_gem_configure_file.patch
    #fix #3 -- repair wish8.6 (for tcl8.6)
    patch $srcdir/$pkgname/pd/src/s_inter.c ./s_inter.c.patch
    #fix #4 -- repair version of tcl (8.6)
    patch $srcdir/$pkgname/externals/loaders/tclpd/Makefile ./tclpd.Makefile.patch
    #fix #5 -- repair binary name to pd-extended (from pd)
    #patch $srcdir/$pkgname/packages/Makefile ./program_name.patch
    patch $srcdir/$pkgname/pd/src/Makefile.am ./makefile.am.patch
    # 64 bit architecture---------------------------------------------------------------
    if [ "$CARCH" = "x86_64" ]; then
    # fix -fPIC issue in PDP
    sed -e "s|CFLAGS =|CFLAGS = -fPIC|" \
    -i $srcdir/$pkgname/externals/pdp/opengl/Makefile.config || return 1
    # fix -fPIC issue in pddp
    sed -e "s|DEFINES =|DEFINES = -fPIC|" \
    -i $srcdir/$pkgname/externals/miXed/Makefile.common || return 1
    # setting additional variable
    FPIC_FLAG="-fPIC"
    else FPIC_FLAG=""
    fi
    #MAKE --------------------------------------------------------------------------
    cd "$srcdir/$pkgname/packages/linux_make" || return 1
    make BUILDLAYOUT_DIR=$srcdir/$pkgname/packages \
    GEM_EXTRA_CXXFLAGS="$FPIC_FLAG" \
    prefix=/usr \
    install || return 1
    }
    package() {
    cp -urv $srcdir/$pkgname/packages/linux_make/build/usr $pkgdir/
    mv $pkgdir/usr/share/man/man1/pd.1 $pkgdir/usr/share/man/man1/pdextended.1
    rm $pkgdir/usr/share/man/man1/pdreceive.1
    rm $pkgdir/usr/share/man/man1/pdsend.1
    rm $pkgdir/usr/include/m_pd.h
    rm $pkgdir/usr/bin/pd-gui.tcl
    cd $srcdir/$pkgname/
    # PD License
    install -Dm644 pd/LICENSE.txt $pkgdir/usr/share/licenses/pd-extended/LICENSE.txt
    cd packages/
    install -p linux_make/default.pdextended $pkgdir/usr/lib/pd-extended/
    # Gnome menu support
    install -d $pkgdir/usr/share/icons/hicolor/128x128/apps
    install -p -m0644 linux_make/pd-extended.png $pkgdir/usr/share/icons/hicolor/128x128/apps/
    install -d $pkgdir/usr/share/icons/hicolor/48x48/apps
    install -p -m0644 linux_make/48x48/pd-extended.png $pkgdir/usr/share/icons/hicolor/48x48/apps/pd-extended.png
    install -d $pkgdir/usr/share/applications/
    install -p linux_make/pd-extended.desktop $pkgdir/usr/share/applications/
    sed -i "s/^Exec=.*/Exec=pdextended/" $pkgdir/usr/share/applications/pd-extended.desktop
    # mime
    install -d $pkgdir/usr/share/mime/packages/
    install -p linux_make/pd-extended.xml $pkgdir/usr/share/mime/packages/
    install -d $pkgdir/usr/share/icons/hicolor/128x128/mimetypes
    install -p linux_make/text-x-puredata.png $pkgdir/usr/share/icons/hicolor/128x128/mimetypes
    # files for /etc
    cd "$srcdir/$pkgname"
    install -d $pkgdir/etc/bash_completion.d/
    install -p scripts/bash_completion/pd $pkgdir/etc/bash_completion.d
    # emacs mode for .pd files
    install -d $pkgdir/usr/share/emacs/site-lisp/
    install -p scripts/pd-mode.el $pkgdir/usr/share/emacs/site-lisp/
    # Pd-related scripts
    install -p scripts/pd-diff $pkgdir/usr/bin/
    install -p scripts/config-switcher.sh $pkgdir/usr/bin/
    }
    # vim:set ts=2 sw=2 et:

    I chrooted into the work dir, uncommented en_US.UTF-8 in /etc/locale.gen, ran locale-gen, and now get this inside the chroot:
    [root@dArch2 releng]# chroot work/x86_64/root-image/
    [root@dArch2 /]# locale
    LANG=en_US.UTF-8
    LC_CTYPE="en_US.UTF-8"
    LC_NUMERIC="en_US.UTF-8"
    LC_TIME="en_US.UTF-8"
    LC_COLLATE="en_US.UTF-8"
    LC_MONETARY="en_US.UTF-8"
    LC_MESSAGES="en_US.UTF-8"
    LC_PAPER="en_US.UTF-8"
    LC_NAME="en_US.UTF-8"
    LC_ADDRESS="en_US.UTF-8"
    LC_TELEPHONE="en_US.UTF-8"
    LC_MEASUREMENT="en_US.UTF-8"
    LC_IDENTIFICATION="en_US.UTF-8"
    LC_ALL=
    This looks alright to me. Any more suggestions?
    It's the first time I use a custom-repo and I set it up only for this purpose. So yes, it's up to date and can be used outside the chroot. Also a different package I put from AUR into my custom-repo installs fine and doesn't give this error.
    How else could I test the custom-repo?

  • Gdk-WARNING **: Error converting from UTF-8 to STRING

    I installed Arch64 a few days ago. In my old setup I used to work with two applications: Eclipse CDT and WingIDE.
    I tried to start these applications locally (they require Java and Python, but they are binary releases) and I get these errors, which I can't solve by myself with googling. Reporting snapshots of my desktop too.
    [j_zar@Surprise eclipse]$ ./eclipse
    (eclipse:7864): Gdk-WARNING **: Error converting from UTF-8 to STRING: Could not open converter from "UTF-8" to "ISO-8859-1"
    (eclipse:7864): Gdk-WARNING **: Error converting from UTF-8 to STRING: Conversion from character set "UTF-8" to "ISO-8859-1" is not supported
    (eclipse:7864): Pango-WARNING **: /usr/lib/pango/1.6.0/modules/pango-basic-fc.so: wrong ELF class: ELFCLASS64
    (eclipse:7864): Pango-WARNING **: Failed to load Pango module '/usr/lib/pango/1.6.0/modules/pango-basic-fc.so' for id 'BasicScriptEngineFc'
    (eclipse:7864): Pango-WARNING **: /usr/lib/pango/1.6.0/modules/pango-basic-fc.so: wrong ELF class: ELFCLASS64
    (eclipse:7864): Pango-WARNING **: /usr/lib/pango/1.6.0/modules/pango-basic-fc.so: wrong ELF class: ELFCLASS64
    (eclipse:7864): Pango-WARNING **: /usr/lib/pango/1.6.0/modules/pango-basic-fc.so: wrong ELF class: ELFCLASS64
    (eclipse:7864): Pango-WARNING **: /usr/lib/pango/1.6.0/modules/pango-basic-fc.so: wrong ELF class: ELFCLASS64
    (eclipse:7864): Pango-WARNING **: /usr/lib/pango/1.6.0/modules/pango-basic-fc.so: wrong ELF class: ELFCLASS64
    (eclipse:7864): Pango-WARNING **: pango_shape called with bad font, expect ugly output
    (eclipse:7864): Pango-WARNING **: pango_font_get_glyph_extents called with null font argument, expect ugly output
    (eclipse:7864): Pango-WARNING **: pango_font_get_metrics called with null font argument, expect ugly output
    (eclipse:7864): Pango-WARNING **: /usr/lib/pango/1.6.0/modules/pango-basic-fc.so: wrong ELF class: ELFCLASS64
    (eclipse:7864): Pango-WARNING **: /usr/lib/pango/1.6.0/modules/pango-basic-fc.so: wrong ELF class: ELFCLASS64
    (eclipse:7864): Pango-WARNING **: /usr/lib/pango/1.6.0/modules/pango-basic-fc.so: wrong ELF class: ELFCLASS64
    (eclipse:7864): Pango-WARNING **: /usr/lib/pango/1.6.0/modules/pango-basic-fc.so: wrong ELF class: ELFCLASS64
    (eclipse:7864): Pango-WARNING **: /usr/lib/pango/1.6.0/modules/pango-basic-fc.so: wrong ELF class: ELFCLASS64
    (eclipse:7864): Pango-WARNING **: /usr/lib/pango/1.6.0/modules/pango-basic-fc.so: wrong ELF class: ELFCLASS64
    (eclipse:7864): Pango-WARNING **: /usr/lib/pango/1.6.0/modules/pango-basic-fc.so: wrong ELF class: ELFCLASS64
    (eclipse:7864): Pango-WARNING **: /usr/lib/pango/1.6.0/modules/pango-basic-fc.so: wrong ELF class: ELFCLASS64
    (eclipse:7864): Pango-WARNING **: /usr/lib/pango/1.6.0/modules/pango-basic-fc.so: wrong ELF class: ELFCLASS64
    (eclipse:7864): Pango-WARNING **: /usr/lib/pango/1.6.0/modules/pango-basic-fc.so: wrong ELF class: ELFCLASS64
    (eclipse:7864): Pango-WARNING **: /usr/lib/pango/1.6.0/modules/pango-basic-fc.so: wrong ELF class: ELFCLASS64
    (eclipse:7864): Pango-WARNING **: /usr/lib/pango/1.6.0/modules/pango-basic-fc.so: wrong ELF class: ELFCLASS64
    (eclipse:7864): Pango-WARNING **: /usr/lib/pango/1.6.0/modules/pango-basic-fc.so: wrong ELF class: ELFCLASS64
    (eclipse:7864): Pango-WARNING **: /usr/lib/pango/1.6.0/modules/pango-basic-fc.so: wrong ELF class: ELFCLASS64
    (eclipse:7864): Pango-WARNING **: /usr/lib/pango/1.6.0/modules/pango-basic-fc.so: wrong ELF class: ELFCLASS64
    (eclipse:7864): Pango-WARNING **: /usr/lib/pango/1.6.0/modules/pango-basic-fc.so: wrong ELF class: ELFCLASS64
    (eclipse:7864): Pango-WARNING **: /usr/lib/pango/1.6.0/modules/pango-basic-fc.so: wrong ELF class: ELFCLASS64
    (eclipse:7864): Pango-WARNING **: /usr/lib/pango/1.6.0/modules/pango-basic-fc.so: wrong ELF class: ELFCLASS64
    (eclipse:7864): Pango-WARNING **: /usr/lib/pango/1.6.0/modules/pango-basic-fc.so: wrong ELF class: ELFCLASS64
    (eclipse:7864): Pango-WARNING **: /usr/lib/pango/1.6.0/modules/pango-basic-fc.so: wrong ELF class: ELFCLASS64
    (eclipse:7864): Pango-WARNING **: /usr/lib/pango/1.6.0/modules/pango-basic-fc.so: wrong ELF class: ELFCLASS64
    (eclipse:7864): Pango-WARNING **: /usr/lib/pango/1.6.0/modules/pango-basic-fc.so: wrong ELF class: ELFCLASS64
    (eclipse:7864): Pango-WARNING **: /usr/lib/pango/1.6.0/modules/pango-basic-fc.so: wrong ELF class: ELFCLASS64
    (eclipse:7864): Pango-WARNING **: /usr/lib/pango/1.6.0/modules/pango-basic-fc.so: wrong ELF class: ELFCLASS64
    (eclipse:7864): Pango-WARNING **: /usr/lib/pango/1.6.0/modules/pango-basic-fc.so: wrong ELF class: ELFCLASS64
    (eclipse:7864): Pango-WARNING **: /usr/lib/pango/1.6.0/modules/pango-basic-fc.so: wrong ELF class: ELFCLASS64
    (eclipse:7864): Pango-WARNING **: /usr/lib/pango/1.6.0/modules/pango-basic-fc.so: wrong ELF class: ELFCLASS64
    (eclipse:7864): Pango-WARNING **: /usr/lib/pango/1.6.0/modules/pango-basic-fc.so: wrong ELF class: ELFCLASS64
    (eclipse:7864): Pango-WARNING **: /usr/lib/pango/1.6.0/modules/pango-basic-fc.so: wrong ELF class: ELFCLASS64
    (eclipse:7864): Pango-WARNING **: /usr/lib/pango/1.6.0/modules/pango-basic-fc.so: wrong ELF class: ELFCLASS64
    (eclipse:7864): Pango-WARNING **: /usr/lib/pango/1.6.0/modules/pango-basic-fc.so: wrong ELF class: ELFCLASS64
    (eclipse:7864): Pango-WARNING **: /usr/lib/pango/1.6.0/modules/pango-basic-fc.so: wrong ELF class: ELFCLASS64
    (eclipse:7864): Pango-WARNING **: /usr/lib/pango/1.6.0/modules/pango-basic-fc.so: wrong ELF class: ELFCLASS64
    (eclipse:7864): Pango-WARNING **: /usr/lib/pango/1.6.0/modules/pango-basic-fc.so: wrong ELF class: ELFCLASS64
    (eclipse:7864): Pango-WARNING **: /usr/lib/pango/1.6.0/modules/pango-basic-fc.so: wrong ELF class: ELFCLASS64
    (eclipse:7864): Pango-WARNING **: /usr/lib/pango/1.6.0/modules/pango-basic-fc.so: wrong ELF class: ELFCLASS64
    (eclipse:7864): Pango-WARNING **: /usr/lib/pango/1.6.0/modules/pango-basic-fc.so: wrong ELF class: ELFCLASS64
    (eclipse:7864): Pango-WARNING **: /usr/lib/pango/1.6.0/modules/pango-basic-fc.so: wrong ELF class: ELFCLASS64
    (eclipse:7864): Pango-WARNING **: /usr/lib/pango/1.6.0/modules/pango-basic-fc.so: wrong ELF class: ELFCLASS64
    (eclipse:7864): Pango-WARNING **: /usr/lib/pango/1.6.0/modules/pango-basic-fc.so: wrong ELF class: ELFCLASS64
    (eclipse:7864): Pango-WARNING **: /usr/lib/pango/1.6.0/modules/pango-basic-fc.so: wrong ELF class: ELFCLASS64
    (eclipse:7864): Pango-WARNING **: /usr/lib/pango/1.6.0/modules/pango-basic-fc.so: wrong ELF class: ELFCLASS64
    (eclipse:7864): Pango-WARNING **: /usr/lib/pango/1.6.0/modules/pango-basic-fc.so: wrong ELF class: ELFCLASS64
    (eclipse:7864): Pango-WARNING **: /usr/lib/pango/1.6.0/modules/pango-basic-fc.so: wrong ELF class: ELFCLASS64
    (eclipse:7864): Pango-WARNING **: /usr/lib/pango/1.6.0/modules/pango-basic-fc.so: wrong ELF class: ELFCLASS64
    (eclipse:7864): Gdk-WARNING **: Error converting from UTF-8 to STRING: Conversion from character set "UTF-8" to "ISO-8859-1" is not supported
    (eclipse:7864): Gdk-WARNING **: Error converting from UTF-8 to STRING: Conversion from character set "UTF-8" to "ISO-8859-1" is not supported
    (eclipse:7864): Gdk-WARNING **: Error converting from UTF-8 to STRING: Conversion from character set "UTF-8" to "ISO-8859-1" is not supported
    (eclipse:7864): Gdk-WARNING **: Error converting from UTF-8 to STRING: Conversion from character set "UTF-8" to "ISO-8859-1" is not supported
    (eclipse:7864): Gdk-WARNING **: Error converting from UTF-8 to STRING: Conversion from character set "UTF-8" to "ISO-8859-1" is not supported
    (eclipse:7864): Pango-WARNING **: pango_cairo_font_get_scaled_font called with bad font, expect ugly output
    [j_zar@Surprise wingide]$ ./wing
    (wing.py:7922): Gdk-WARNING **: Error converting from UTF-8 to STRING: Could not open converter from 'UTF-8' to 'ISO-8859-1'
    (wing.py:7922): Gdk-WARNING **: Error converting from UTF-8 to STRING: Conversion from character set 'UTF-8' to 'ISO-8859-1' is not supported
    Can anybody help with this?
    Last edited by J_Zar (2007-10-30 11:48:28)

    There has been talk about putting the config files that fix this kind of problem into a separate package, together with a helper script for running 32-bit programs.
    For a quick fix:
    Link
    Grab the 3 pango files, put them somewhere, and edit pangorc so that it points to the paths of the other 2 files.
    Start the program with
    export PANGO_RC_FILE="/path/to/pangorc"
    program
    For the string error, if it comes from iconv, add
    export GCONV_PATH=/opt/lib32/usr/lib/gconv
    to the previous command.
    Note: you can't start 64-bit programs that use pango/iconv from the terminal in which you exported these variables.

  • Unable to convert from "UTF-8" during oracle upgrade

    Hi,
    I'm trying to upgrade from 9.2.0.1 to 9.2.0.6. While running runInstaller, I am getting the following error:
    ERROR: Unable to convert from "UTF-8" to "ISO-8859-1" for NLS!
    SuSE-7, redhat-2.1AS, redhat-2.1, UnitedLinux-1.0, redhat-3 or SuSE-8
    Can anyone tell me how to fix this problem?
    thanks,
    GK

    Hi Vinod,
    I have already checked that and found the package is available in my system.
    # pkginfo - SUNWuiu8
    system      SUNWuiu8 Iconv modules for UTF-8 Locale
    Regards,
    Chintan Sheth

  • How to convert xml utf 16 to utf 8

    Is it possible to convert an XML file from UTF-16 to UTF-8 using ABAP? I am using ECC 6.0.
    I would appreciate your input.

    Hmm, interesting. I thought it should be straightforward, but the two solutions I could think of seem a bit convoluted. The first way is probably to use the iXML library, where the starting point is the class CL_IXML. You can find the SAP documentation [here|http://help.sap.com/saphelp_nw04/helpdata/en/86/8280d212d511d5991b00508b6b8b11/frameset.htm].
    Then there's a "manual approach": Use OPEN DATASET to read the UTF-16 file, then modify the XML encoding attribute and save it as UTF-8 file. Not straightforward, because the tempting command option [OPEN DATASET .. LEGACY TEXT MODE CODE PAGE|http://help.sap.com/abapdocu_70/en/ABAPOPEN_DATASET_MODE.htm] doesn't work. Per ABAP help:
    For the specification of the code page cp, a character-type data object is expected that must contain - at the time of execution of the statement - the label of a non-Unicode page from the column CPCODEPAGE in the database table TCP00. A Unicode page must not be specified.
    Darn, looks like they expect most Unicode files to be UTF-8. But that might be the reason you want to convert it...
    So use the following steps:
    1. Open the file as a binary file (the only option for UTF-16, see [here|http://help.sap.com/abapdocu_70/en/ABAPOPEN_DATASET_ENCODING.htm]) via [OPEN DATASET file FOR INPUT IN BINARY MODE|http://help.sap.com/abapdocu_70/en/ABAPOPEN_DATASET.htm] and read the content into an XSTRING using [READ DATASET|http://help.sap.com/abapdocu_70/en/ABAPREAD_DATASET.htm].
    2. Convert it to a string using the utility class [CL_ABAP_CONV_IN_CE|http://help.sap.com/saphelp_nw04/helpdata/en/79/c554afb3dc11d5993800508b6b8b11/frameset.htm], see the example [here|http://wiki.sdn.sap.com/wiki/display/Snippets/ABAPCodePage+Conversions].
    3. Replace the encoding markup for UTF-16 in the XML with a reference to UTF-8.
    4. Write the XML content back to a file using [OPEN DATASET file|http://help.sap.com/abapdocu_70/en/ABAPOPEN_DATASET.htm] FOR [OUTPUT|http://help.sap.com/abapdocu_70/en/ABAPOPEN_DATASET_ACCESS.htm] IN [TEXT MODE|http://help.sap.com/abapdocu_70/en/ABAPOPEN_DATASET_MODE.htm] [ENCODING UTF-8|http://help.sap.com/abapdocu_70/en/ABAPOPEN_DATASET_ENCODING.htm] and [TRANSFER|http://help.sap.com/abapdocu_70/en/ABAPTRANSFER.htm].
    Maybe somebody has a shorter way...
    Cheers, harald
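
    The four steps above are not specific to ABAP. As a minimal sketch of the same idea in Java (not ABAP; the class name Utf16XmlToUtf8 and the two command-line arguments are made up for illustration), the file is read as raw bytes, decoded as UTF-16, the encoding declaration is patched, and the result is written as UTF-8. It assumes the XML declaration literally names UTF-16 and that the whole file fits in memory:

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;

final class Utf16XmlToUtf8 {

    // Decode UTF-16 bytes, patch the XML declaration, return UTF-8 bytes.
    static byte[] convert(byte[] utf16Bytes) {
        // The UTF-16 charset honours a byte order mark and assumes
        // big-endian when no mark is present.
        String xml = new String(utf16Bytes, StandardCharsets.UTF_16);
        // Rewrites only the first encoding="UTF-16" it finds; this sketch
        // assumes that first match is the one in the XML declaration.
        String patched = xml.replaceFirst(
                "encoding\\s*=\\s*([\"'])[Uu][Tt][Ff]-16\\1",
                "encoding=\"UTF-8\"");
        return patched.getBytes(StandardCharsets.UTF_8);
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 2) {
            System.err.println("usage: Utf16XmlToUtf8 <input.xml> <output.xml>");
            return;
        }
        Path in = Paths.get(args[0]);
        Path out = Paths.get(args[1]);
        Files.write(out, convert(Files.readAllBytes(in)));
    }
}
```

    Patching the declaration matters because a parser that trusts encoding="UTF-16" in a file that is now UTF-8 will typically reject it.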

  • Convert a UTF-8 string to ISO-8859-1 string

    Hello. As you can see from my other post, I am working on internationalization. I could not find an appropriate entry in the forum already.
    I want to convert form data (submitted from an HTML UTF-8 charset page) from the UTF-8 format to ISO-8859-1 format. How do I do that?
    I.e.
    String utfFormat="&#35222;&#32884;&#32773;";
    String isoFormat="";
    // Do magic here
    System.out.println(isoFormat); // out: "&#12375;&#12390;&#12398;" (or whatever it is)
    Can you help?
    Dailysun

    As I said in the other thread (did you read that, BTW?), you shouldn't have to bother with actual character-set conversions. You just tell the reader that wraps the InputStream (e.g. an InputStreamReader) what the Charset is when you read the data in, and the writer that wraps the OutputStream what Charset to use when you write it out.
    What you're doing is escaping characters by replacing them with numeric entity references--the opposite of what you asked in the other thread. The process is just as simple: cast the char to an int, convert that to a string with String.valueOf(int), and add the "&#" and ";". You can use a regex-based approach like I did over there, but going in this direction, it will be just as easy without them.
    Hiwa, check out that other thread; I think you'll find it amusing (in light of that second link you posted).
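
    The escaping described above (number, then "&#" in front and ";" behind) could look roughly like the sketch below. The class and method names are invented for illustration, and it walks code points rather than single chars so that a character outside the Basic Multilingual Plane becomes one reference instead of two; that is a small deviation from the char-by-char cast described in the reply.

```java
final class NumericEntityEscaper {

    // Replace every non-ASCII character with a decimal numeric
    // character reference such as &#35222; and keep ASCII as it is.
    static String escapeNonAscii(String text) {
        StringBuilder out = new StringBuilder();
        for (int i = 0; i < text.length(); ) {
            int codePoint = text.codePointAt(i);
            if (codePoint < 128) {
                out.append((char) codePoint);
            } else {
                out.append("&#").append(codePoint).append(';');
            }
            // Advance by one or two chars depending on the code point.
            i += Character.charCount(codePoint);
        }
        return out.toString();
    }

    public static void main(String[] args) {
        // prints &#35222;&#32884;&#32773;
        System.out.println(escapeNonAscii("\u8996\u8074\u8005"));
    }
}
```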

  • Urgent converting to UTF-8

    I want to convert from the AL16UTF16 character set to the UTF8 character set.
    I tried to use the convert function but I couldn't get it to work.
    For example, if I have the text "ali" and I want to convert it from
    the AL16UTF16 character set to the UTF8 character set, what do I have to write? Please help, it is urgent.
    Regards and thanks in advance

    Create a new db instance with the appropriate character set,
    then use the exp/imp utilities for the character set migration.
    Regards
    Singh

  • How to get the data converted in UTF-16 format  downloading(local&appl)

    Dear Friends,
    I have an interface program. When it downloads to the desktop and to the application server, some of the characters
    (e.g. in the employee name) are not displayed properly and are not downloaded correctly. How can I trace whether this is a UTF-16 issue or not? If it is a UTF issue, how can we tackle it?
    Could anybody please let me know, as I am not familiar with solving UTF-16 issues.
    The issue was given to me as below.
    For some personnel numbers the names are displayed as below:
    11450113 = Hänggi
    11450117 = Hässig-Wehrli .
    Could anyone let me know how I can approach this issue?
    Thanks & regards

    check this link:[https://wiki.sdn.sap.com/wiki/display/ABAP/DownloadFileinUTF-8encoding]

  • Convert data from ANSI to UTF-8 format

    Hello,
    We have data in ANSI format (binary) which we need to convert into UTF-8 format. We are able to do it using FM's GUI_UPLOAD and GUI_DOWNLOAD but as the amount of data is huge we are planning to get the conversion done using background job. But, we are not able to get it yet. Can anybody suggest ways to resolve this? Can class CL_ABAP_CONV_X2X_CE be used for the same?
    Thanks...
    With Regards,
    Mukul Kulkarni

    Hello,
    If you have the data in Binary mode, you can use the addition IN BINARY MODE while uploading.
    For downloading in UTF-8, you can use the addition TEXT MODE ENCODING UTF-8.
    BR,
    Suhas

  • Problem in converting value in UTF-8

    Hello experts,
    I have to store my text field's value in UTF-8 format. Before running the insert query I use the following code to convert the value to UTF-8, but it gets stored like this, e.g. in &#2348;&#2368;&#2337; format for the value "&#2348;&#2368;&#2337;". I want it to be inserted exactly as I type it in my text field.
    Code for converting to UTF-8:
    String name=request.getParameter("txtname");
    byte arra[]=name.getBytes("ISO8859-1");
    String nameM=new String(arra ,"UTF-8");
    Please help me out...

    Go through all the links at this URL; maybe they will help you:
    http://java.sun.com/javase/technologies/core/basic/intl/faq.jsp#core-encoding
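
    For context, the getBytes/new String round trip in the question can only repair one specific situation: UTF-8 bytes that were decoded as ISO-8859-1 somewhere along the way. The sketch below (hypothetical class name, no servlet API involved) simulates that situation and shows the repair. If the browser has already sent numeric character references, which the stored values suggest but which is only an assumption about the poster's setup, no byte-level conversion will turn them back into characters; the page and the request would have to be handled as UTF-8 in the first place.

```java
import java.nio.charset.StandardCharsets;

final class MisdecodedUtf8Repair {

    // Undo a wrong ISO-8859-1 decode of bytes that were really UTF-8.
    static String repair(String misdecoded) {
        // ISO-8859-1 maps each char 0..255 back to the identical byte,
        // so this recovers the original byte sequence without loss.
        byte[] originalBytes = misdecoded.getBytes(StandardCharsets.ISO_8859_1);
        return new String(originalBytes, StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        String typed = "\u092C\u0940\u0921"; // the three characters from the question
        // Simulate a server that reads the UTF-8 request bytes as ISO-8859-1.
        String asSeenByServer = new String(
                typed.getBytes(StandardCharsets.UTF_8), StandardCharsets.ISO_8859_1);
        System.out.println(repair(asSeenByServer).equals(typed)); // prints true
    }
}
```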

  • How to convert UTF-16 to UTF-8

    data source is 'ъѓъѓ№ѓфчр Фюыр№ 80Ъ                     ', it is Ukraine.
    I want to remove the blanks, but no matter which SAP keyword I use, it doesn't work. I checked the hexadecimal value of the spaces in the text above: it is 00A0, but the system only regards 0020 as a space. From what I found on the internet, the spaces in the text are encoded in UTF-16 while the system is UTF-8, and 00A0 is extended ASCII, so 00A0 is not recognized as a space in the SAP system.
    My question is: in this situation, how can I remove the spaces?

    Hi Eric,
    This document might help you:
    Link: [how to convert UTF-16 to UTF-8|How to convert xml utf 16 to utf 8]
    -Dileep .C
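
    A side note on the question: U+00A0 is NO-BREAK SPACE, a character distinct from U+0020 SPACE in every Unicode encoding, so the fix is less about UTF-16 versus UTF-8 than about treating that character as a blank too. A minimal sketch of the idea, written in Java rather than ABAP and with made-up names, is to map it to an ordinary space before trimming:

```java
final class NoBreakSpaceCleaner {

    private static final char NO_BREAK_SPACE = '\u00A0';

    // Turn every no-break space into a normal space, then trim.
    // Note that no-break spaces inside the text become normal spaces too.
    static String trimIncludingNbsp(String text) {
        return text.replace(NO_BREAK_SPACE, ' ').trim();
    }

    public static void main(String[] args) {
        String padded = "80\u00A0\u00A0\u00A0";
        // Brackets make the absence of trailing blanks visible.
        System.out.println("[" + trimIncludingNbsp(padded) + "]");
    }
}
```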

  • Dilemma converting arbitrary encoding to UTF-8

    Here's my dilemma: I recently modified our webapp to use UTF-8 encoding across the board, since data with special characters that users added to the content management backend was being displayed incorrectly in ISO-8859-1. It works great for Strings we get from the database, since it uses UTF-8. The problem now is that there are also files that consist of html chunks that get added to pages when they're rendered by the jsps. Those files aren't always UTF-8 encoded, so characters are displaying incorrectly in those parts of the page.
    The problem is that we don't know what encoding the html chunks are, some are ISO-8859-1, some are Windows-1252, etc. There are hundreds of them, and the users use all kinds of programs to generate the files, Frontpage, Dreamweaver, etc. so there's no common encoding used. I'm trying to modify the code that reads those files so it converts the text to UTF-8 for display, but without knowing what encoding the file is in, how can you do the conversion properly? Here's the code I have currently:
            ByteArrayInputStream contentInput = file.getContent();
            // wrap byte stream in UTF-8 character stream
            BufferedReader br = new BufferedReader(new InputStreamReader(contentInput, "UTF-8"));
            StringBuffer outputBuffer = new StringBuffer("");
            String readString;
            // stop at end of stream so that null is never appended
            while ((readString = br.readLine()) != null) {
                outputBuffer.append(readString);
            }
    We get a ByteArrayInputStream from the third party API, which I wrap in a UTF-8 encoded BufferedReader. The problem is that, for instance, this character '�', when encoded in the file as ISO-8859-1, gets garbled when converted to UTF-8.
    My question is: Is there a way to convert text to UTF-8 without knowing the encoding of the file? I suspect the answer is no, but I'm really hoping it's yes, since the alternative is re-encoding hundreds to thousands of files in the db, then retraining hundreds of users to always save files as UTF-8. (You can't see my brain spasming at the thought of that, but trust me, it is ;P).

    As an update, in case anyone else runs into this same problem:
    I used the SmartEncodingInputStream from uncle_alice's link, and it works just well enough to solve my problem. The only encoding that it guessed correctly was UTF-8. But it guessed windows-1252 for US-ASCII, windows-1252, and ISO-8859-1. Since 1252 is a superset of ASCII and, apart from the rarely used C1 control codes, of ISO-8859-1, decoding with 1252 handles all the characters from those encodings correctly. All the content I tested with was decoded correctly, presumably because it all uses one of those four encodings. The one snag I hit was that the SmartEncodingInputStream doesn't reset the InputStream after it reads it, so I have to do it manually after getting the guessed encoding. Here's the code I used:
            // Get the file content
            ByteArrayInputStream contentInput = file.getContent();
            StringBuffer outputBuffer = new StringBuffer("");
            // wrapper around the input stream that guesses the encoding of the stream
            SmartEncodingInputStream smartIS = null;   
            // use a 8k buffer, and a default encoding of windows-1252
            smartIS = new SmartEncodingInputStream(contentInput, SmartEncodingInputStream.BUFFER_LENGTH_8KB,
                    Charset.forName("windows-1252"));
            String charsetName = smartIS.getEncoding().name();      // get the name of the encoding guessed
            contentInput.reset();       // reset the position to the beginning of the stream
            byte[] contentBuffer = new byte[8192];
            int bytesRead = 0;
            while( (bytesRead = contentInput.read(contentBuffer, 0, 8192)) > 0 ) {
                // decode each chunk with the encoding guessed by the SmartEncodingInputStream
                outputBuffer.append(new String(contentBuffer, 0, bytesRead, charsetName));
            }
            contentInput.close();
    I left out the try/catch blocks for readability. I get the ByteArrayInputStream from a library call, and end up with the file contents encoded in UTF-8 in outputBuffer.
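
    If only UTF-8 and the windows-1252 family have to be told apart, which is what the results above amount to, a similar effect can be sketched with the JDK alone: try a strict UTF-8 decode and fall back to windows-1252 when the bytes are not valid UTF-8. This is not the SmartEncodingInputStream algorithm, just a rough stand-in with an invented class name, and it will misjudge any other encoding.

```java
import java.nio.ByteBuffer;
import java.nio.charset.CharacterCodingException;
import java.nio.charset.Charset;
import java.nio.charset.CodingErrorAction;
import java.nio.charset.StandardCharsets;

final class Utf8OrCp1252Decoder {

    // Decode as UTF-8 if the bytes are well-formed UTF-8,
    // otherwise treat them as windows-1252.
    static String decode(byte[] bytes) {
        try {
            return StandardCharsets.UTF_8.newDecoder()
                    // REPORT makes invalid input throw instead of
                    // silently inserting replacement characters.
                    .onMalformedInput(CodingErrorAction.REPORT)
                    .onUnmappableCharacter(CodingErrorAction.REPORT)
                    .decode(ByteBuffer.wrap(bytes))
                    .toString();
        } catch (CharacterCodingException notUtf8) {
            return new String(bytes, Charset.forName("windows-1252"));
        }
    }

    public static void main(String[] args) {
        byte[] latin1Bytes = {(byte) 0x63, (byte) 0x61, (byte) 0x66, (byte) 0xE9};
        byte[] utf8Bytes = "caf\u00e9".getBytes(StandardCharsets.UTF_8);
        // Both inputs decode to the same four-character string.
        System.out.println(decode(latin1Bytes).equals(decode(utf8Bytes)));
    }
}
```

    Decoding the whole byte array in one call also avoids cutting a multi-byte UTF-8 sequence in half at a buffer boundary.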

  • Applescript for converting UTF-8 (styled )to ASCII

    Does anyone know of an applescript built to change text encoding from UTF-8 to standard ASCII 7 bit?
    I found something called TEC_OSAX1.3.3 on the macscripter forum but was unable to get it to work (I think because it is a classic application). Any help on this would be appreciated as well (can't open the readme file).
    I have a standard UTF-8 XML file that I want to prepare to upload to a web site but I need all the extended UTF-8 characters converted into the standard ASCII characters.
    I have enough AppleScript experience to do very basic operations but not enough to build something more complex like this.
    Any help would be appreciated.
    Thanks
    Jesse
    Mac book pro   Mac OS X (10.4.2)  
    I-Mac G5   Mac OS X (10.4.2)  
    I-Mac G5   Mac OS X (10.4.2)  

    I have a standard UTF-8 XML file that I want to
    prepare to upload to a web site but I need all the
    extended UTF-8 characters converted into the standard
    ASCII characters.
    There's no way to convert "extended" UTF-8 into ASCII, since the latter doesn't contain the required characters, unless you are talking about converting them into NCRs (numeric character references) like & #1234;. If that's the idea, there's an app called UnicodeChecker which can do it.
    Sometimes UTF-8 can be converted to ISO-8859-1. Is that what's required?
    Why exactly do you need to do this? Is your server one of those few which are (mis)configured to only support Latin-1? If so, this can usually be fixed by other means, like an .htaccess file in your web space.
    If you are just talking about one or a few files, you can simply open with TextEdit set to UTF-8 and then save in the new encoding.
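
    Whether a given text survives a conversion to ASCII or ISO-8859-1 can be checked before converting anything. The sketch below is in Java rather than AppleScript, and the class and method names are invented for illustration; it only asks the target charset whether it can represent every character.

```java
import java.nio.charset.StandardCharsets;

final class EncodableCheck {

    // True if every character of the text exists in 7-bit ASCII.
    static boolean fitsAscii(String text) {
        return StandardCharsets.US_ASCII.newEncoder().canEncode(text);
    }

    // True if every character of the text exists in ISO-8859-1 (Latin-1).
    static boolean fitsLatin1(String text) {
        return StandardCharsets.ISO_8859_1.newEncoder().canEncode(text);
    }

    public static void main(String[] args) {
        String sample = "caf\u00e9";
        System.out.println("ASCII: " + fitsAscii(sample));
        System.out.println("Latin-1: " + fitsLatin1(sample));
    }
}
```

    Text that fails the ASCII check but passes the Latin-1 check is the "sometimes" case mentioned above; anything that fails both needs NCRs or a server that accepts UTF-8.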

Maybe you are looking for

  • Is there a border component in the latest sdk build??

    hey guys... im trying to use the spark border component and its giving me an error in the latest sdk... the one im using right now is 4.0.0.13210 the code i have is <s:Group xmlns:fx="http://ns.adobe.com/mxml/2009                xmlns:s="library://ns

  • XML show form error

    Hello All, I am trying to develop an XML form wherein Edit form i have an HTML editor and in Show form i want to display the message that in edited in Edit form. But when i run the Show form, i text is displayed like this: <FONT color=#242424><STRONG

  • Error message: "The iTunes Music Libary.itl is locked..."

    Hi Heres the error message: "The iTunes Music Libary.itl is locked, on a locked disk, or you do not have permission for the file" One of my friends had this problem but he cant remeber what he did to change it like this or how to change it back, can

  • Printing fails, says low toner on old Laserjet

    Under Fedora 15, Firefox will not print to HP Laserjet 6MP connected to JetDirect EX Plus3. It says low toner. Other programs print successfully, CUPS control panel also says no toner left. However I believe this 12 year old printer/jetdirect combo h

  • Conference mic suggestions

    I'm looking for some recomendations on a mic that would work well for a conference room environment. I have G4 with an iSight camera setup in our conference room. I want to be able to give presentations to people in the conf room from remote location