Filtering out non-English characters
Anyone know of a way to use the Junk Mail filters to filter out email that has non-English characters in the subject line? I get a lot of spam that has either Asian or Russian characters in the subject and body of the email.
Thanks, Jim
Jimbot,
You might want to take a look at Junk Matcher. It lets you flag mail based on the character sets used, and it integrates seamlessly with Mail's spam filter. Once you have set it up to match your preferences, it should improve Mail's spam filter a lot:
http://junkmatcher.sourceforge.net/
Andreas
Similar Messages
-
Hello, I have read several times that since Java uses Unicode, it solves the problems of non-English characters automatically or something like that.
But my app is not working as expected. Would someone help please?
I have a client/server combo written in Java. The server can send messages in English or Japanese. The Japanese messages are hard-coded as String literals in the server source code. On the client side, they are displayed on a JEditorPane. But the Japanese characters are all garbled. The OS on the server side and client side are, of course, different.
My supposition, which is obviously wrong as it is not working, is that since both ends of the communication are Java apps, I need not worry about any encoding conversions for String literals.
Can someone suggest what is wrong here? How is the required encoding/decoding supposed to be done?
When I didn't worry about non-English characters, I did the following, which WORKED.
// SENDER side
Socket socket ;
PrintWriter out = new PrintWriter(socket.getOutputStream(),true);
String outMessage = "my message";
out.println(outMessage);
//RECEIVER
Socket socket ;
BufferedReader in = new BufferedReader(new InputStreamReader(socket.getInputStream()));
String inMessage = in.readLine();
When non-English characters are involved, I did the following, which DID NOT WORK. Could someone please correct me?
// SENDER side
Socket socket ;
PrintWriter out = new PrintWriter(socket.getOutputStream(),true);
String outMessage = "my message";
String utfString = new String(outMessage.getBytes(),"UTF-8");
out.println(utfString);
//RECEIVER
Socket socket ;
InputStreamReader ins = new InputStreamReader(clientSocket.getInputStream(),"UTF-8");
BufferedReader in = new BufferedReader(ins);
String inMessage = in.readLine();
The received message is still garbled.
-
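A note on the failing version above: new String(outMessage.getBytes(), "UTF-8") does not convert anything. It takes bytes in the platform default charset and re-reads them as UTF-8, and the PrintWriter built directly on the socket stream still encodes with the platform default. A minimal sketch of one way to make the charset explicit on both ends follows. It uses in-memory streams as stand-ins for socket.getOutputStream() and socket.getInputStream(), and the class and method names (Utf8Wire, send, receive) are made up for illustration.

```java
import java.io.BufferedReader;
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.io.OutputStream;
import java.io.OutputStreamWriter;
import java.io.PrintWriter;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;

// Hypothetical helper, not part of the original post.
class Utf8Wire {
    // Sender: wrap the raw stream in a writer that encodes as UTF-8.
    static void send(OutputStream rawOut, String message) {
        PrintWriter out = new PrintWriter(
                new OutputStreamWriter(rawOut, StandardCharsets.UTF_8), true);
        out.println(message); // autoflush is on, so the line is written immediately
    }

    // Receiver: decode the raw bytes with the same charset the sender used.
    static String receive(InputStream rawIn) {
        try {
            BufferedReader in = new BufferedReader(
                    new InputStreamReader(rawIn, StandardCharsets.UTF_8));
            return in.readLine();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        // Unicode escapes keep the sample independent of the source file encoding.
        String japanese = "\u3053\u3093\u306b\u3061\u306f";
        ByteArrayOutputStream wire = new ByteArrayOutputStream();
        send(wire, japanese);
        String received = receive(new ByteArrayInputStream(wire.toByteArray()));
        System.out.println(japanese.equals(received));
    }
}
```

If the Japanese literals are already wrong inside the compiled class, the source file encoding passed to javac (the -encoding option) would be the next thing to check; the sketch sidesteps that by using Unicode escapes.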
Odd number of non-english characters get broken in windows-chrome and ff
I developed a JNLP applet that prints out the user input.
When I enter an odd number of non-English characters (e.g. Chinese), the Chrome and Firefox browsers print the last character as a question mark.
input : 가
output : 가��
I checked on the Java console that the character is correct.
It must be a bug in the communication between the applet and the Chrome browser.
IE prints it out correctly.
I can work around the issue by appending a white space in the applet and removing it in JavaScript.
Does anyone have any clue about the issue?
The code is as follows.
MainApplet.java
public class MainApplet extends JApplet implements JSInterface { //, Runnable {
    public int stringOut(String sData) {
        OutData = sData;
        return 0;
    }
}
JS file
function TSToolkitRealWrapper()
{
    var OutData;
    var OutDataNum;
}
var TSToolkit = new TSToolkitRealWrapper();
var attributes = { id:'TSToolkitReal', code:'tradesign.pkitoolkit.applet.MainApplet', width:100, height:100 };
var parameters = { jnlp_href: getContextPath() + '/download/pkitoolkit.jnlp',
                   separate_jvm:true, classloader_cache:false };
TSToolkitRealWrapper.prototype.stringOut = function(str)
{
    var nRet = TSToolkitReal.stringOut(str);
    this.OutData = TSToolkitReal.OutData;
    return nRet;
}
HTML
<SCRIPT language=javascript>
<!--
function StringOut(form)
{
    var data = form.data.value;
    var nRet = 0;
    var base64Data;
    nRet = TSToolkit.stringOut(data);
    if (nRet > 0)
        alert(nRet + " : " + TSToolkit.GetErrorMessage());
    else
        form.data1.value = TSToolkit.OutData;
}
-->
</SCRIPT>
Edited by: user13496918 on 2013. 3. 20 7:29 PM
Edited by: user13496918 on 2013. 3. 20 7:39 PM
Edited by: user13496918 on 2013. 3. 20 9:17 PM
Edited by: user13496918 on 2013. 3. 20 9:18 PM
"I checked on java console that the character is correct."
So it isn't a Java problem.
"It must be bug in communication of applet to chrome browser."
So tell the people who make the Chrome browser.
"IE prints out correctly."
That's a change. I've just spent nine days tracking down an IE applet problem and I'm not finished yet.
Please omit the boldface next time. We can read. Boldface doesn't help; it makes it worse.
-
Support issue for non-English characters (in html forms)
Hi group!
I just want to post an issue here and see if anyone else has the same problem. First off, I'm running Windows XP MCE, but the French version (not the English version). This may help find out where the problem really is.
Second, I know a bit of HTML and such, and I'm referring to HTML character entities in this thread; there's a fairly complete list here for reference: http://www.faqs.org/docs/htmltut/characterentitiesfamsupp69.html
I noticed that some, but not all, non-English characters written in a textarea (which is, basically, a multi-line input box) don't pass well, or at all, to the server when the form is sent from Safari. Most of the time, the content of the textarea is truncated: it starts at the beginning and ends where the first accented character is met.
The most used French accents (é, à) are usually interpreted correctly by Safari (but may, once in a while, produce that bug too), but ô and î don't do as well.
Oddly, this bug doesn't happen all the time and doesn't "crash" in the same manner every time.
So I started a thread just to see if anyone else is having issues with non-English characters, mostly in forms. Flash/Shockwave probably works, but I'm not sure; I have not tested it yet.
Acer Aspire 5044 Windows XP Turion 1.8GHz, 1Gb SDRam, ATI 200M xpress
Yes, it is a known issue. I also noticed that it sometimes works, but most of the time it does not. It will hopefully be solved in the future. According to http://www.apple.com/safari/download/ changes that will come include:
# Support for International users
# International text input methods
# Advanced text (contextual forms, international scripts)
Sony Vaio Windows XP
-
Non-english characters in sqlplus
Friends,
9.0.1,10.2.0.1
Can non-English characters be displayed at the sqlplus prompt?
I have to print data with some Hindi characters, such as names, from a report generated via proc.
What is the workaround for this?
Thanks
Hi,
I have RH Linux 7.3.
Here is my situation.
I have some tables with columns in which I need to store data in Hindi.
The database character set (nls_characterset) is US7ASCII.
The datatype of the columns is varchar2.
The tables are accessed through Pro*C, and the Pro*C code generates the report.
The data in these reports needs to be printed out, and the corresponding data has to be in Hindi.
So I have to do two things:
1) On querying the tables, the user should see Hindi data along with English data.
2) The printed report should also contain Hindi data.
Questions:
1) If I change the character set with the command:
alter database character set AL32UTF8;
and the OS locale to UTF-8,
would these changes serve the purpose?
2) Or do I need to recreate the database with AL32UTF8 and exp/imp all the data?
3) Any other advice/option?
Thanks
-
Non-English characters in URL for rwservlet
I'm having a problem when I try to use non-english characters in a URL request to generate a report.
This works fine:
http://...rwservlet?report=r1.jsp&m1=Fred
But if I try Fréd (e with an acute accent), the report does not return any data, even though the SQL by itself would find data.
I tried UTF-8 encoding:
http://...rwservlet?report=r1.jsp&m1=Fr%C3%A9d
ISO 8859-1 encoding:
http://...rwservlet?report=r1.jsp&m1=Fr%E9d
Or just spelling it out (not sure what that gets encoded as):
http://...rwservlet?report=r1.jsp&m1=Fréd
But nothing works. Any ideas?
Thanks, Andreas
Suggestions
1) Try with NLS_LANG as
SWEDISH_SWEDEN.WE8DEC
2) Make a paramform and enter via paramform (unencoded)
(This is just for testing purpose)
3) Change machine locale to swedish and try
4) Which reports version is this ?
Please see
BUG 2713695 - NLS CHARACTERS FOR PARAMETERS CHANGE TO QUESTION MARKS WHEN PASSED ON URL BAR
Get in touch with Support to see if this is the issue and if "yes" get a one-off patch.
[ All Docs for all versions ]
http://otn.oracle.com/documentation/reports.html
[ Publishing reports to web - 10G ]
http://download.oracle.com/docs/html/B10314_01/toc.htm (html)
http://download.oracle.com/docs/pdf/B10314_01.pdf (pdf)
[ Building reports - 10G ]
http://download.oracle.com/docs/pdf/B10602_01.pdf (pdf)
http://download.oracle.com/docs/html/B10602_01/toc.htm (html)
[ Forms Reports Integration whitepaper 9i ]
http://otn.oracle.com/products/forms/pdf/frm9isrw9i.pdf
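For reference, the two escaped forms tried in the question are exactly what Java's java.net.URLEncoder produces for Fréd under each charset, so the escaping itself looks right. Which charset rwservlet expects is not stated in this thread and probably depends on the server's NLS settings (that part is an assumption). A small sketch, with a made-up ParamEncoding class:

```java
import java.io.UnsupportedEncodingException;
import java.net.URLEncoder;

// Hypothetical helper for illustration only.
class ParamEncoding {
    // Percent-encodes a parameter value using the named charset.
    static String encode(String value, String charsetName) {
        try {
            return URLEncoder.encode(value, charsetName);
        } catch (UnsupportedEncodingException e) {
            throw new IllegalArgumentException(e);
        }
    }

    public static void main(String[] args) {
        String name = "Fr\u00e9d"; // Fréd, written with an escape to avoid source-encoding issues
        System.out.println(encode(name, "UTF-8"));      // prints Fr%C3%A9d
        System.out.println(encode(name, "ISO-8859-1")); // prints Fr%E9d
    }
}
```

Typing the raw character into the address bar leaves the choice of encoding to the browser, which is why that third variant is the least predictable of the three.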
-
Non English characters conversion issue in LSMW BAPI Inbound IDOCs
Hi Experts,
We have some fields in the customer master LSMW data load program that can
contain non-English characters. We are facing issues with the conversion of
non-English characters in the LSMW BAPI method. The LSMW read and
conversion steps show the non-English characters properly, without any
issue. While creating inbound IDOCs, most of the non-English characters are
replaced with '#', which causes issues in creating customer master data in the
system. In our scenario, the customer data has non-English characters in
the first name, last name and address details. Does any specific setting
need to be made on our side? Please suggest how to resolve this issue.
Thanks
Rajesh Yadla
If your language needs Unicode, then you need to change the encoding option in SAP: on the initial screen, use Customize local layout (ALT F12) options 118 --> Encoding ....
-
Prevent Non-English Characters on JSP forms
I was hoping to get any programming tips/ideas to prevent users from entering non-english text on web-forms.
Any feedback would be greatly appreciated. Thanks.
I have a jsp page something like:
<tr>
<td colspan=2> </td>
<td colspan=2>
<textarea name="title" cols="<%=cols%>" rows="3" wrap><%= form.getTitle()%></textarea>
</td>
</tr>
When the user submits the page, I do the form validation in the Java form handler. I was hoping that I could somehow compare the ASCII codes of the characters to ensure the user is entering only English characters.
The following is the code I have written in the Java form handler:
for (int i =0 ; i < title.length() ; i++) {
    char c = title.charAt(i);
    System.out.println("c = " + c + ", ascii = " +(int)'c');
    if (int(c) > 127) {
        setErrorMessage(ID.QUESTION.TITLE, "Non-English characters are not allowed. Please enter the required information only in English.");
    }
}
But for some reason that I am not able to debug, no matter what character I enter, English or non-English, its ASCII equivalent, i.e. the int(c) value getting printed out, is always 99. Moreover, even if I enter a non-English character, System.out prints its English equivalent... if that makes any sense...
I hope I was able to explain my problem... Any help/feedback would be greatly appreciated.
Thanks.
-
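Two things stand out in the loop above. (int)'c' casts the character literal 'c', whose code is 99, rather than the variable c, which would explain why 99 is always printed. And int(c) is not valid Java; a char can be compared with 127 directly. A minimal sketch of the check, with a hypothetical AsciiCheck helper that is not part of the original form handler:

```java
// Hypothetical helper, not from the original post.
class AsciiCheck {
    // Returns the index of the first character above 127, or -1 if the text is pure ASCII.
    static int firstNonAscii(String text) {
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);   // the variable, not the literal 'c'
            if (c > 127) {             // a char compares as a number without any cast
                return i;
            }
        }
        return -1;
    }

    public static void main(String[] args) {
        System.out.println((int) 'c');                    // prints 99, whatever the input was
        System.out.println(firstNonAscii("Title"));       // prints -1
        System.out.println(firstNonAscii("Caf\u00e9"));   // prints 3
    }
}
```

In the form handler, a result other than -1 would be the point at which to call setErrorMessage. Note that this treats every accented letter as "non-English", which may or may not be what is wanted.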
PDF generation for Non English Characters from ADF
Hi
We are using the piece of code below to generate a PDF from an ADF managed bean. It works fine. However, for non-English characters (e.g. Japanese, Vietnamese, Arabic) it puts '???' in the PDF.
I found a blog post on this:
https://blogs.oracle.com/BIDeveloper/entry/non-english_characters_appears
However, we are not using the BI Publisher product. We are using its APIs.
Can anyone tell us where we need to set up fonts: within ADF, WebLogic, or on the server?
The input parameters are:
a) XML data
b) InputStream, i.e. the RTF template
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.InputStream;
import oracle.apps.xdo.XDOException;
import oracle.apps.xdo.template.FOProcessor;
import oracle.apps.xdo.template.RTFProcessor;
public static byte[] genPdfRep(String pOutFileType, byte[] pXmlOut, InputStream pTemplate) {
    byte[] dataBytes = null;
    try {
        // Process the RTF template to convert it to XSL-FO format
        RTFProcessor rtfp = new RTFProcessor(pTemplate);
        ByteArrayOutputStream xslOutStream = new ByteArrayOutputStream();
        rtfp.setOutput(xslOutStream);
        rtfp.process();
        // Use the XSL template and the data from the VO to generate the report and return its bytes
        ByteArrayInputStream xslInStream = new ByteArrayInputStream(xslOutStream.toByteArray());
        FOProcessor processor = new FOProcessor();
        ByteArrayInputStream dataStream = new ByteArrayInputStream((byte[])pXmlOut);
        processor.setData(dataStream);
        processor.setTemplate(xslInStream);
        ByteArrayOutputStream pdfOutStream = new ByteArrayOutputStream();
        processor.setOutput(pdfOutStream);
        byte outFileTypeByte = FOProcessor.FORMAT_PDF;
        processor.setOutputFormat(outFileTypeByte); //FOProcessor.FORMAT_HTML
        processor.generate();
        dataBytes = pdfOutStream.toByteArray();
    } catch (XDOException e) {
        e.printStackTrace();
    }
    return dataBytes;
}
Appreciate your help.
Thanks,
Abhijit
Fonts are defined in the template you use to generate the PDF. Your application adds the data, and both are processed by the FOP processor. Now there are two possible causes of the '???':
1. the data you send to the template already contains the '???'
2. the template can't digest the data (the special characters) and puts '???' in the PDF.
Before going on, you have to find out which one is your problem. If the second is the problem, you had better ask in a FOP forum, as you will have to solve it by changing the template.
Timo
-
Non english characters in DN cannot be retrieved
We are using Netscape Directory Server 4, protocol V3. We have a problem related to non-English characters appearing in the RDN.
We publish LDAP entries using values from the database. For example, we have published an entry to LDAP that, based on the DB values, should have a DN like: ou=Liege BELGIUM ... LGG1a, <other components of DN>. However, when we call the Netscape search API (searching against the uid attribute, which does not have non-English characters), the search returns the entry, but when we then call the getDN() method on the returned LDAP entry, it only returns Li instead of the complete DN value.
It seems the entry is corrupted in LDAP. I wanted to delete the corrupted entry and re-create a new one to test. I tried many ways, but none of them worked. I think this is because the DN is corrupted, so there is no key value to identify the LDAP entry for any operation (modify, delete).
Your help and insights are much appreciated.
Thanks.
Han Shen
LDAP uses the UTF8 encoding. You must store data in the directory using the UTF8 encoding. This includes DN values. This also means that if you want to be able to view the values in your native character set and font, you must use an application that can convert the UTF8 LDAP data back to the native character encoding. The directory console by default should work for LATIN-1 (ISO 8859) languages if the LOCALE is set correctly.
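To illustrate the point about UTF8: a strict decoder rejects an accented character that was written as a single Latin-1 byte, which would be consistent with a DN that stops at "Li". The sketch below assumes the intended value was "Liège" and that it was published as ISO 8859-1 bytes; both are guesses, as the thread does not say. DnEncodingCheck is a made-up name.

```java
import java.nio.ByteBuffer;
import java.nio.charset.CharacterCodingException;
import java.nio.charset.CodingErrorAction;
import java.nio.charset.StandardCharsets;

// Hypothetical helper for illustration only.
class DnEncodingCheck {
    // True if the bytes form a well-formed UTF-8 sequence.
    static boolean isValidUtf8(byte[] bytes) {
        try {
            StandardCharsets.UTF_8.newDecoder()
                    .onMalformedInput(CodingErrorAction.REPORT)
                    .onUnmappableCharacter(CodingErrorAction.REPORT)
                    .decode(ByteBuffer.wrap(bytes));
            return true;
        } catch (CharacterCodingException e) {
            return false;
        }
    }

    public static void main(String[] args) {
        String ou = "Li\u00e8ge"; // assumed original spelling, with the grave accent
        // Encoded as UTF-8, the accented letter becomes two bytes and the value decodes cleanly.
        System.out.println(isValidUtf8(ou.getBytes(StandardCharsets.UTF_8)));      // prints true
        // Encoded as ISO 8859-1, it is the single byte 0xE8, which is not valid UTF-8 here.
        System.out.println(isValidUtf8(ou.getBytes(StandardCharsets.ISO_8859_1))); // prints false
    }
}
```

A check like this, run on the value before it is published, would at least show whether the bytes going into the directory are well-formed UTF8.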
-
Non English characters in BIP email
Hi, my report contains Japanese characters. When I view the output in HTML format, it is displayed properly. But when I click the Send button, enter email parameters like To, Cc, Bcc, Subject, etc. and send it, the Japanese characters are not displayed properly in the mail I receive. The same problem occurs for Spanish and Portuguese text, and in general for all non-English characters. I am using Oracle Business Intelligence Publisher Release 10.1.3.4. If someone has faced a similar issue, kindly help. Thanks in advance
-
Non-English characters not displaying correctly - Serious Issue
My corporate email is on a Lotus Domino server with Lotus Traveler installed.
I have set my PlayBook (with OS 2) up to synchronize with the corporate email through Active Sync (see http://alturl.com/qh3nn), which works perfectly.
I have, however, noticed that in some emails special non-English characters are displayed correctly, but in others they are displayed as a black diamond with a question mark inside.
This is of course a serious issue, as most non-English-speaking countries use some special characters.
When trying to understand this problem, how can I analyse the emails and see what character set is being used?
And of course, better still: has someone solved this?
I am having the same problem. Is there any update available?
-
How to retrieve non-english characters from a query
Hello,
My apologies if this post is not in its proper place, but I was a bit confused where to add it.
I'm running a query using SQL Developer on a table that contains company names from many different countries, and one of the checks I need to make to ensure data consistency is to search for all rows in which the company name contains special or non-English characters (such as ç, ã, ä).
I don't know what I can use to do this. I tried to collate using NLS_SORT, but it didn't work.
Is there some way to select only the rows that contain these special or non-English characters, excluding from the results the rows that only have English characters? Please bear in mind that we have many languages in this table.
The field I would like to make the conditions on is VARCHAR2.
Please let me know if there is any extra information I should provide you so that you can help me.
Thank you in advance for the help.
Regards,
Luís
Hi Luis,
"My apologies if this post is not in its proper place, but I was a bit confused where to add it."
This is the Forum for the SQL Developer Data Modeler product.
I suggest you try using the SQL and PL/SQL Forum.
David
-
Encoding non english characters with utf 8 on jsp (Critical!!)
I am inserting hebrew characters from JSP into oracle db and everything is fine until this point. But when I try to retrieve the information from the database, the characters are not displayed properly (I get some garbage characters). I am sure that the data stored in the database is correct, but not sure why there is a problem in displaying the data in the JSP.
I came across a thread on TSS
http://www.theserverside.com/discussions/thread.tss?thread_id=28944
and followed the suggestions given there like having
<%@ page contentType="text/html; charset=UTF-8" pageEncoding="UTF-8" %>
<META http-equiv="Content-Type" content="text/html; charset=UTF-8">
and also this
<%
//Some JDBC and sql statement query UTF-8 data and then ...
String str = rs.getString("utf8_data");
str = new String(str.getBytes("ISO-8859-1"),"UTF-8");
%>
<%= str %>
Now the data getting displayed is partly correct; that is, some characters are still coming out as squares.
Any ideas will be of great help.
I also suspect the database charset for this issue. But what I don't understand is why only certain Hebrew characters are stored properly and others are corrupted.
Also, can anyone let me know how I can view the non-English characters present in the database directly, as TOAD is not able to display them?
-
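For what it is worth, the getBytes("ISO-8859-1") trick in the snippet above is lossless only when the string was mis-decoded as exactly ISO-8859-1 somewhere between the database and the JSP. If a different charset is involved (the database charset or the driver's client charset, for instance), some bytes may not survive, which might be why only some letters come out as squares; that is an assumption, not something the thread confirms. A small sketch of the round trip, with hypothetical names:

```java
import java.nio.charset.StandardCharsets;

// Hypothetical helper for illustration only.
class MojibakeRepair {
    // Simulates a layer that read UTF-8 bytes but decoded them as ISO-8859-1.
    static String misdecode(String original) {
        return new String(original.getBytes(StandardCharsets.UTF_8), StandardCharsets.ISO_8859_1);
    }

    // The repair from the post: recover the raw bytes, then decode them as UTF-8.
    static String repair(String garbled) {
        return new String(garbled.getBytes(StandardCharsets.ISO_8859_1), StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        String hebrew = "\u05e9\u05dc\u05d5\u05dd"; // four Hebrew letters, two UTF-8 bytes each
        String garbled = misdecode(hebrew);
        System.out.println(garbled.length());               // prints 8: one char per byte
        System.out.println(repair(garbled).equals(hebrew)); // prints true
    }
}
```

If the same round trip does not restore the text in the real application, the damage is probably happening earlier, for example when the data was inserted.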
Only VBA does not recognize non-English characters
Hello guys,
I have a new laptop with Windows 8.1, bought in the USA, and I'm having difficulties with Excel VBA (Office 365 University x64, bought in the Czech Republic, Central Europe). VBA does not recognize non-English characters (particularly "ř" and "ů"), which causes problems when running some code that I wrote earlier on my previous laptop (Windows 7, bought in the Czech Republic, with the same Office).
The problem with non-English characters has occurred only in VBA so far; otherwise I can use these characters normally in Excel cells, Word... I tried installing both the English and Czech versions of Office, with no change. I also installed the Czech proofreading tools and set everything to Czech in Office. The location and language preferences in Windows are also set to Czech. And it is not a font problem. I also noticed that when I try to look up these characters using Ctrl+F, it changes the original ř to r after a search, and again this is only an issue in VBA.
Thank you very much for any help.
Tom
Hi Tom,
VBA for Excel can only recognize ASCII codes from 0 to 255. If you use other special characters like "ř" or "ů", it will return 63 (?). To use this kind of character, you have to use the ChrW function to convert a decimal code to the character.
http://msdn.microsoft.com/en-us/library/ee177465.aspx
For example, the hex and decimal codes for these two characters are as follows:
     Hex   Dec
ř    0159  345
ů    016F  367
So to get these two characters in VBA, you could code as below:
ChrW(&H159) or ChrW(345)
ChrW(&H16F) or ChrW(367)
You can get the hex code of a character by looking it up in the system character map (in the Win8.1 start view, search for "character map"), then convert the hex code to a decimal code yourself.
Range("A1").Value = ChrW(&H159) & ChrW(&H16F)
Range("A1").Value = ChrW(345) & ChrW(367)