Java Regular Expression for double spaces

I am using the regex ([\\s\\s]+) to find double spaces, but if a single segment (text line) contains two or more double spaces, it reports only one of them.
Matcher m = Pattern.compile(regexp).matcher(Segment);
while (m.find())
    p.println(" ERROR :: Punctuation (Double Dot ) Error");
Please suggest a solution.
Thanks,
amit

Hi,
I understood the question like this:
You have a String, and you want to find every run of two or more spaces anywhere in it.
I have attached sample code for this problem. Please have a look at it.
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class FindDoubleSpacesInString {
     private static final String DOUBLE_SPACE_STRING = "tetett etetete etetete     etete et e tet ";

     /**
      * @param args command-line arguments (unused)
      */
     public static void main(String[] args) {
          // \s{2,} matches a run of two or more whitespace characters
          String doubleSpaceRegExPattern = "\\s{2,}";
          Pattern doubleSpacePattern = Pattern.compile(doubleSpaceRegExPattern);
          Matcher doubleSpaceMatcher = doubleSpacePattern.matcher(DOUBLE_SPACE_STRING);
          // find() resumes after the previous match, so each run is reported once
          while (doubleSpaceMatcher.find()) {
               System.out.println("double space found");
          }
     }
}
Please let me know whether it resolves your problem.
Regards,
Loga
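
A note on why the original pattern misbehaves: [\\s\\s]+ is a character class, so listing \s twice changes nothing. It behaves like \s+ and also matches single spaces. The sketch below compares the two patterns; the class name, the countMatches helper, and the sample segment are made up for illustration.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class DoubleSpaceCount {
    // Hypothetical helper: counts how many times the regex matches in the text.
    static int countMatches(String regex, String text) {
        Matcher m = Pattern.compile(regex).matcher(text);
        int count = 0;
        while (m.find()) {
            count++;
        }
        return count;
    }

    public static void main(String[] args) {
        // Whitespace runs of length 2, 1 and 3
        String segment = "one  two three   four";
        System.out.println(countMatches("[\\s\\s]+", segment)); // every whitespace run
        System.out.println(countMatches("\\s{2,}", segment));   // only runs of 2 or more
    }
}
```

With this sample the first pattern matches all three whitespace runs, while \\s{2,} matches only the two runs that are at least two characters long.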

Similar Messages

Java regular expression for CSV?

I found several regular expressions on the internet for parsing/splitting CSV data lines. However, none of them work with the Java regular expression API. Is there a regular expression that tokenizes CSV fields with the Java regexp API?

If the licensing of the above solution is too restrictive for you...I'm sure there are other types of parsers out there that do that type of thing.
In the meantime, here is some code I cooked up (no GPL...use it freely) that might get you started.
Don't know that it handles everything, but I never said it would...
Please READ and let me know what changes could be made. I'm always looking for improvements in my understanding of regular expressions...
import java.util.regex.*;
import java.util.*;
import java.util.List;
public class Example
{
   final static Pattern CSV_PATTERN;
   final static Pattern DOUBLE_QUOTE;
   static
   {
      String regex = "(?: ([^q;]+) | (?: q ((?: (?:[^q]+) | (?:qq) )+ ) q) );?";
      //                       1          2          a           b       3    4
      // So, pretend your quote character is q
      // (you can change it to \" later when you understand what's going on.)
      // This regex (when applied iteratively) matches a token that:
      // 1) contains no quote marks or separators (;) whatsoever (in group 1)
      //                       or
      // 2) starts with a QUOTE, then contains either
      //    a) no quotes at all inside or
      //    b) double quotes (to escape a quote)
      // 3) and ends with a QUOTE.
      // 4) and is followed by a separator (optional for the last value)
      // Note that (a) and (b) are captured in group 2 of the regex.
      CSV_PATTERN = Pattern.compile(regex, Pattern.COMMENTS);
      DOUBLE_QUOTE = Pattern.compile("qq");
   }
   /**
    * Attempts to parse Excel CSV stuff...
    * @param text the CSV text.
    * @return a list of tokens.
    */
   public static List<String> parseCsv(String text)
   {
      Matcher csvMatcher = CSV_PATTERN.matcher(text);
      Matcher doubleQuotes = DOUBLE_QUOTE.matcher("");
      List<String> list = new ArrayList<String>();
      while (csvMatcher.find())
      {
         if (csvMatcher.group(1) != null)
         {
            // The first one matched.
            list.add(csvMatcher.group(1));
         }
         else
         {
            // Group 2 matched: collapse each escaped quote (qq) to a single q.
            doubleQuotes.reset(csvMatcher.group(2));
            list.add(doubleQuotes.replaceAll("q"));
         }
      }
      return list;
   }
}
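
As the comments suggest, the q placeholder can be swapped for a real double quote once the structure is clear. The following is a sketch of that substitution, not the original poster's code: the class name and sample line are invented, and like the q version it silently skips empty fields.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class QuotedCsvExample {
    // Same shape as the q-based pattern, with q replaced by \" (an adaptation).
    static final Pattern CSV_PATTERN = Pattern.compile(
            "(?: ([^\";]+) | (?: \" ((?: [^\"]+ | \"\" )+) \" ) );?",
            Pattern.COMMENTS);

    static List<String> parseCsv(String line) {
        List<String> fields = new ArrayList<>();
        Matcher m = CSV_PATTERN.matcher(line);
        while (m.find()) {
            if (m.group(1) != null) {
                // Unquoted field
                fields.add(m.group(1));
            } else {
                // Quoted field: a doubled quote stands for one literal quote
                fields.add(m.group(2).replace("\"\"", "\""));
            }
        }
        return fields;
    }

    public static void main(String[] args) {
        // The raw line is: a;"b;c";"say ""hi"""
        for (String field : parseCsv("a;\"b;c\";\"say \"\"hi\"\"\"")) {
            System.out.println("[" + field + "]");
        }
    }
}
```

The separator stays ; as in the post; for comma-separated data the ; in both places would become a comma.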

Java regular expression for Arabic

I want to use Java regular expressions to evaluate some strings in Arabic.
Can somebody tell me how to match Arabic characters?

I have this code:
String poem="��";
 //String m1="\\p?";
 String m1= "\\p{�}";
 Matcher m =
 Pattern.compile(m1)
 .matcher(poem);
 while(m.find()) {
 for(int j = 0; j <= m.groupCount(); j++)
 System.out.print("[" + m.group(j) + "]");
 System.out.println();
 }
I get the error:
Exception java.util.regex.PatternSyntaxException: Unknown character property name {?} near index 2
\p?
If it is hard to help with Arabic specifically, can someone post code showing how to match Arabic, Chinese, or any other non-Latin text with a regex?
I need to match Strings in Arabic, so any pointer would help.
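
One possible approach, offered as a sketch rather than a definitive answer: java.util.regex accepts Unicode block names, and \p{InArabic} covers the Arabic block (U+0600 to U+06FF). The `?` in the error message suggests the Arabic characters were lost to the source file's encoding before compilation, so the example writes them as \u escapes. The class and method names are illustrative.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class ArabicMatch {
    // \p{InArabic} is the Unicode "Arabic" block, U+0600 to U+06FF.
    static final Pattern ARABIC_RUN = Pattern.compile("\\p{InArabic}+");

    // Returns every maximal run of Arabic-block characters in the text.
    static List<String> arabicRuns(String text) {
        List<String> runs = new ArrayList<>();
        Matcher m = ARABIC_RUN.matcher(text);
        while (m.find()) {
            runs.add(m.group());
        }
        return runs;
    }

    public static void main(String[] args) {
        // \u escapes keep the source file ASCII-only, whatever the compiler encoding is.
        String text = "abc \u0633\u0644\u0627\u0645 123 \u0634\u0639\u0631";
        System.out.println(arabicRuns(text).size() + " Arabic runs found");
    }
}
```

The same idea works for other scripts by changing the block name, for example \p{InHiragana}, and \p{L} matches a letter in any script.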

Re:java regular expression for website

Hi All,
I am using JDeveloper 11.1.2.3.0.
My requirement is that I have a website attribute, and I need a regular expression that restricts it
to formats such as www.google.com or www.oracle.com.
Thanks,
Sandeep

Hi Sandeep,
You can use the code below for website validation.
<af:inputText label="" id="time" simple="true" value="" contentStyle="width:100px;" maximumLength="100">
 <af:validateRegExp pattern="^www[.][a-z]{1,15}[.](com|org)$"
 messageDetailNoMatch="Website must be like www.google.com"
 hint="Website Format: www.google.com"/>
 </af:inputText>
You can change the pattern to suit your requirement.
Thanks
Prabhat
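
To see what that pattern accepts before wiring it into the page, it can be tried in plain Java. This is only a quick check of the same regex; the class and method names are made up.

```java
import java.util.regex.Pattern;

class WebsiteFormatCheck {
    // Same pattern as in the af:validateRegExp tag above.
    static final Pattern WEBSITE = Pattern.compile("^www[.][a-z]{1,15}[.](com|org)$");

    static boolean isValid(String site) {
        return WEBSITE.matcher(site).matches();
    }

    public static void main(String[] args) {
        String[] samples = { "www.google.com", "www.oracle.com", "google.com", "www.Oracle.com", "www.example.net" };
        for (String s : samples) {
            System.out.println(s + " -> " + isValid(s));
        }
    }
}
```

Note that the pattern allows lowercase letters only, at most 15 of them, and only the .com and .org suffixes, so names with digits, hyphens, or other top-level domains are rejected.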

Perl Regular expression to java Regular Expression

Hi all,
How can I write a Java regular expression for the Perl code below,
where data.html is my original HTML file
and data2.html is the output file?
open(FPR, "data.html") || die("Could not open data file");
while ($line=<FPR>) {
    $content .= $line;
}
close(FPR);
open(FPR, ">data2.html") || die("Could not open data2 file");
# clean white spaces
$content =~ s/[\n\r\0 ]//g;
# divide data by td
$rxp='<tr.*?><td.*?>(.*?)<\/.*?td><td.*?>(.*?)<\/.*?td><td.*?>(.*?)<\/.*?td><td.*?>(.*?)<\/.*?td><td.*?>(.*?)<\/.*?td><td.*?>(.*?)<\/.*?td><td.*?>(.*?)<\/.*?td><td.*?>(.*?)<\/.*?td><\/.*?tr>';
while ($content=~ m/$rxp/g) {
    print FPR "\n".$1."\t".$2."\t".$3."\t".$4."\t".$5."\t".$6."\t".$7."\t".$8."\t";
    print FPR " ";
}
close(FPR);
Can you help in this regard?
Thanks

I am able to retrieve only one row in this format from the data.html file:
<trvalign=middlebordercolor=#ffffff><tdwidth='40'CLASS='tdbgpricespagecolorgrey'><fontface='Arial,Helvetica,sans-serif'size='2'>SB</td><tdwidth="23"Class=tdbgpricespagecolorgrey><fontface='Arial,Helvetica,sans-serif'size='2'>USAirways</td><tdwidth="34"Class=tdbgpricespagecolorgrey><fontface='Arial,Helvetica,sans-serif'size='2'>MIA</td><tdwidth="31"Class=tdbgpricespagecolorgrey><fontface='Arial,Helvetica,sans-serif'size='2'>LGW</td><tdwidth="23"Class=tdbgpricespagecolorgrey><fontface='Arial,Helvetica,sans-serif'size='2'>USAirways</td><tdwidth="34"Class=tdbgpricespagecolorgrey><fontface='Arial,Helvetica,sans-serif'size='2'>LGW</td>
But I need the output in this format:
<fontface='Arial,Helvetica,sans-serif'size='2'>SB <fontface='Arial,Helvetica,sans-serif'size='2'>USAirways <fontface='Arial,Helvetica,sans-serif'size='2'>MIA <fontface='Arial,Helvetica,sans-serif'size='2'>LGW <fontface='Arial,Helvetica,sans-serif'size='2'>USAirways <fontface='Arial,Helvetica,sans-serif'size='2'>LGW <fontface='Arial,Helvetica,sans-serif'size='2'>MIA 
<fontface='Arial,Helvetica,sans-serif'size='2'>CS <fontface='Arial,Helvetica,sans-serif'size='2'>USAirways <fontface='Arial,Helvetica,sans-serif'size='2'>MIA <fontface='Arial,Helvetica,sans-serif'size='2'>LON <fontface='Arial,Helvetica,sans-serif'size='2'>USAirways <fontface='Arial,Helvetica,sans-serif'size='2'>LON <fontface='Arial,Helvetica,sans-serif'size='2'>MIA 
How can I rewrite the code to achieve this?
Here is my Java code:
import java.io.*;
import java.util.*;
import java.util.regex.*;
public class parseHTML {
    public static void main(String[] args)
    {
        try
        {
            BufferedReader in = new BufferedReader(new FileReader("C:\\data.html"));
            PrintWriter out = new PrintWriter(new FileWriter("C:\\data1.html"));
            String aLine = null;
            String abc=null;
            String pattern1 ="<tr.+?><td.+?>(.+?)</.+?td><td.+?>(.+?)</.+?td><td.+?>(.+?)</.+?td><td.+?>(.+?)</.+?td><td.+?>(.+?)</.+?td><td.+?>(.+?)</.+?td><td.+?>(.+?)</.+?td><td.+?>(.+?)</.+?td><td.+?>(.+?)</.+?td><td.+?>(.+?)</.+?td><td.+?>(.+?)</.+?td>++";
            Pattern p1 = Pattern.compile(pattern1);
            while((aLine = in.readLine()) != null)
            {
                abc=aLine.replaceAll("(\n|\t|\r)","").replaceAll(" ","");
                Matcher m1 = p1.matcher(abc);
                if(m1.find())
                {
                    System.out.println("the value is...."+m1.group());
                    out.print(m1.group());
                }
                m1.reset(aLine);
            }
            in.close();
            out.close();
        }
        catch(IOException exception)
        {
            exception.printStackTrace();
        }
    }
}
Thanks
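
A possible translation, sketched under assumptions: the Perl script matches against the whole file and loops with m//g, whereas the Java code above matches one line at a time and calls find() only once per line. The sketch takes the file content as a String (reading the file is left out), assumes every row is closed with </tr>, and uses one pattern per row and one per cell instead of a fixed cell count. All names are illustrative.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class TableRows {
    static final Pattern ROW = Pattern.compile("<tr[^>]*>(.*?)</tr>", Pattern.CASE_INSENSITIVE);
    static final Pattern CELL = Pattern.compile("<td[^>]*>(.*?)</td>", Pattern.CASE_INSENSITIVE);

    // Returns one tab-separated line per table row.
    static List<String> extractRows(String html) {
        // Same cleanup as the Perl s/[\n\r\0 ]//g
        String content = html.replaceAll("[\\n\\r\\x00 ]", "");
        List<String> rows = new ArrayList<>();
        Matcher row = ROW.matcher(content);
        while (row.find()) {                 // while, not if: visit every row
            List<String> cells = new ArrayList<>();
            Matcher cell = CELL.matcher(row.group(1));
            while (cell.find()) {
                cells.add(cell.group(1));
            }
            rows.add(String.join("\t", cells));
        }
        return rows;
    }

    public static void main(String[] args) {
        String html = "<tr valign=middle>\n<td width='40'>SB</td>\n<td width=\"23\">US Airways</td>\n</tr>\n"
                + "<tr><td>CS</td><td>LON</td></tr>";
        for (String line : extractRows(html)) {
            System.out.println(line);
        }
    }
}
```

Because spaces are stripped first, as in the Perl version, cell text such as "US Airways" comes out as "USAirways". For anything beyond a fixed, known table layout an HTML parser is likely a safer choice than regexes.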

Regular Expressions and Double Byte Characters ?

Is it possible to use Java Regular Expressions to parse
a file that will contain double byte characters ?
For example, I want a regular expression to match the following line
tag="double byte stuff" id="double byte stuff"

The comments on the bytes/strings were helpful. Thanks.
But I'm still confused as to what matching pattern could be used.
For example a pattern like:
[A-Za-z]
I assume would not match any double byte characters.
I also assume the following won't work either:
[\\p{Alpha}]
because it is a POSIX class, which covers US-ASCII only.
So how do you say "match the tag, then take any characters,
double byte, ASCII, whatever, then match the text tag", per the
original example?
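
One way to approach this, as a sketch: Java regexes operate on chars rather than bytes, so a negated class such as [^"]* accepts any character between the quotes, and \p{L} matches a letter in any script. The sample text (written as \u escapes) and the names below are made up for illustration.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class TagIdMatch {
    // [^"]* means "anything except a quote", whatever script the text is in.
    static final Pattern TAG_ID = Pattern.compile("tag=\"([^\"]*)\"\\s+id=\"([^\"]*)\"");

    // Returns { tag value, id value }, or null if the line does not match.
    static String[] parse(String line) {
        Matcher m = TAG_ID.matcher(line);
        return m.find() ? new String[] { m.group(1), m.group(2) } : null;
    }

    public static void main(String[] args) {
        String line = "tag=\"\u65E5\u672C\u8A9E\" id=\"\u6771\u4EAC\"";
        String[] parts = parse(line);
        System.out.println(parts[0].length() + " chars in tag, " + parts[1].length() + " chars in id");

        // \p{L} matches letters from any script; [A-Za-z] does not.
        System.out.println("\u65E5\u672C\u8A9E".matches("\\p{L}+"));
        System.out.println("\u65E5\u672C\u8A9E".matches("[A-Za-z]+"));
    }
}
```

The file still has to be read with the right charset (for example through an InputStreamReader with an explicit encoding) so that the bytes are decoded into the intended characters before matching.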

Regular Expression for PathName???

Anyone have a "ready to go" regular expression for detecting a pathname?
for example I need to detect the following:
myfile.txt
./myfile.txt
../my-file.ini
/home/my-home/myFile.foo
etc.
Now, in a perfect world, it could also do Windows (or ANY OS for that matter) pathnames (though this is not terribly important for my case at least).
TIA,
/m

import java.util.regex.*;
/**
 * @author Ian Schneider
 */
public class FileRegex {
    static Pattern pattern;
    /** Creates a new instance of FileRegex */
    public FileRegex() {
    }
    public Pattern getPattern() {
        // Compile lazily, once, and reuse the Pattern afterwards.
        if (pattern == null) {
            pattern = Pattern.compile("([\\/]?(\\w+|\\.|\\.\\.)[\\/])*(\\w+)\\.?(\\w+)?");
        }
        return pattern;
    }
    public String[] parts(String path) {
        Matcher m = getPattern().matcher(path);
        if (m.find()) {
            // Group 1: last directory component, group 3: file name, group 4: extension
            return new String[] { m.group(1),m.group(3),m.group(4) };
        }
        return null;
    }
    public boolean matches(String path) {
        return getPattern().matcher(path).matches();
    }
    public static final void main(String[] args) throws Exception {
        FileRegex regex = new FileRegex();
        String[] files = {
            "myfile.txt",
            "../myfile.txt",
            "./myfile.txt",
            "/a/b/c/myfile.txt",
            "/a/../myfile.txt",
            "myfile"
        };
        for (int i = 0, ii = files.length; i < ii; i++) {
            System.out.println( files[i] + " match " + regex.matches(files[i]));
            String[] pieces = regex.parts(files[i]);
            if (pieces != null)
                System.out.println(" path : " + pieces[0] + " file : " + pieces[1] + " ext : " + pieces[2]);
        }
    }
}
I will leave it to you as an exercise to add support for spaces in path names, different separator characters, etc.

Improving Java Regular Expression Compile Time

Hi,
Just wondering if anyone here knows how I can improve the compile time of a Java regular expression?
The following is a fragment of my code, which I used to measure the running time.
Calendar rightNow = Calendar.getInstance();
System.out.println("Compile Pattern");
startCompileTime = rightNow.getTimeInMillis();
Pattern p = Pattern.compile(reg, Pattern.CASE_INSENSITIVE);
rightNow = Calendar.getInstance();
endCompileTime = rightNow.getTimeInMillis();
Below is a fragment of my regular expression:
(?:tell|state|say|narrate|recount|spin|recite|order|enjoin|assure|ascertain|demonstrate|evidence|distinguish|separate|differentiate|secern|secernate|severalize|tell apart) me (?:about|abou|asti|approximately|close to|just about|some|roughly|more or less|around|or so|almost|most|all but|nearly|near|nigh|virtually|well-nigh) java
My regular expression is a very long one and Pattern.compile just takes too long. The worst case I have experienced is 2949342 milliseconds.
Any idea how I can optimise my regular expression so that the compilation time is acceptable?
Thanks in advance

My regular expression is a very long one and the
Pattern.compile just take too long. The worst case
that I experience is 2949342 milliseconds.
Wow, that's pretty pathological. I was going to tell you that you were measuring something wrong, because I had written a test program that could compile a 1 Mb "or" pattern (10,000 words, 100 bytes per) in under 200 ms ... but then I noticed that your patterns have two "or" components, so reran my test, and got over 14 seconds to run with a smaller pattern.
My guess is that the RE compiler, rather than decomposing the RE into a tree, is taking the naive approach of translating it into a state machine, and replicating the second component for each path through the first component.
If you can create a simple hand-rolled parser, that may be your best option. However, it appears that your substrings aren't easily tokenized (some include spaces), so your best bet is to break the regexes into pieces at the "or" constructs, and use Pattern.split() to apply each piece sequentially.
import java.util.Random;
import java.util.regex.Pattern;
public class RegexTest
{
    public static void main(String[] argv) throws Exception
    {
        long initial = System.currentTimeMillis();
        String[] words = generateWords(10000);
//      String patStr = buildRePortion(words);
//      String patStr = buildRePortion(words) + " xxx ";
        String patStr = buildRePortion(words) + " xxx " + buildRePortion(words);
        long startCompile = System.currentTimeMillis();
        Pattern pattern = Pattern.compile(patStr, Pattern.CASE_INSENSITIVE);
        long finishCompile = System.currentTimeMillis();
        System.out.println("Number of components = " + words.length);
        System.out.println("ms to create pattern = " + (startCompile - initial));
        System.out.println("ms to compile = " + (finishCompile - startCompile));
    }
    // Generates random 20-letter uppercase words to use as alternatives.
    private final static String[] generateWords(int numWords)
    {
        String[] results = new String[numWords];
        Random rnd = new Random();
        for (int ii = 0 ; ii < numWords ; ii++)
        {
            char[] word = new char[20];
            for (int zz = 0 ; zz < word.length ; zz++)
                word[zz] = (char)(65 + rnd.nextInt(26));
            results[ii] = new String(word);
        }
        return results;
    }
    // Joins the words into one non-capturing alternation: (?:w1|w2|...)
    private static String buildRePortion(String[] words)
    {
        StringBuffer sb = new StringBuffer("(?:");
        for (int ii = 0 ; ii < words.length ; ii++)
            sb.append(ii > 0 ? "|" : "")
              .append(words[ii]);
        sb.append(")");
        return sb.toString();
    }
}
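
The idea of applying the pieces one after another can be sketched as follows. This is a variation on the suggestion above, not the same technique: it uses Matcher.region() and lookingAt() rather than Pattern.split(), and the class name, method name, and shortened word lists are invented. Each piece is compiled separately, so no single pattern contains two large alternations.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class SequentialMatch {
    // Returns true if the parts match one after another and together cover the whole input.
    static boolean matchesInSequence(String input, Pattern... parts) {
        int pos = 0;
        for (Pattern part : parts) {
            Matcher m = part.matcher(input);
            m.region(pos, input.length());   // only look at the unmatched remainder
            if (!m.lookingAt()) {            // must match exactly at the region start
                return false;
            }
            pos = m.end();
        }
        return pos == input.length();
    }

    public static void main(String[] args) {
        Pattern verbs = Pattern.compile("(?:tell|state|say)", Pattern.CASE_INSENSITIVE);
        Pattern me = Pattern.compile(" me ");
        Pattern preps = Pattern.compile("(?:about|around|roughly)", Pattern.CASE_INSENSITIVE);
        Pattern java = Pattern.compile(" java");

        System.out.println(matchesInSequence("Tell me about java", verbs, me, preps, java));
        System.out.println(matchesInSequence("tell me java", verbs, me, preps, java));
    }
}
```

One caveat: unlike a single regex, this never backtracks from a later piece into an earlier one, so longer alternatives should be listed before their prefixes (for example "nearly" before "near").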

Logical AND in Java Regular Expressions

I'm trying to implement logical AND using Java Regular Expressions.
I couldn't figure out how to do it after reading Java docs and textbooks. I can do something like "abc.*def", which means that I'm looking for strings which have "abc", then anything, then "def", but it is not "pure" logical AND - I will not find "def.*abc" this way.
Any ideas how to do it?
Baken

First off, looks like you're really talking about an "OR", not an "AND" - you want it to match abc.*def OR def.*abc right? If you tried to match abc.*def AND def.*abc nothing would ever match that, as no string can begin with both "abc" and "def", just like no numeric value can be both 2 and 5.
Anyway, maybe regex isn't the right tool for this job. Can you not simply programmatically match it yourself using String methods? You want it to match if the string "starts with" abc and "ends with" def, or vice-versa. Just write some simple code.
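
Reading the question as "the string contains both abc and def, in either order", here are two sketches: plain String methods, as suggested above, and a regex with two lookaheads, which is one common way to express AND in a single pattern. The class and method names are illustrative.

```java
import java.util.regex.Pattern;

class ContainsBoth {
    // Each lookahead checks one condition from the start of the string without consuming input.
    static final Pattern BOTH = Pattern.compile("(?=.*abc)(?=.*def).*", Pattern.DOTALL);

    static boolean viaRegex(String s) {
        return BOTH.matcher(s).matches();
    }

    static boolean viaStringMethods(String s) {
        return s.contains("abc") && s.contains("def");
    }

    public static void main(String[] args) {
        String[] samples = { "abc then def", "def then abc", "abc only" };
        for (String s : samples) {
            System.out.println(s + " -> " + viaRegex(s) + " / " + viaStringMethods(s));
        }
    }
}
```

If "starts with abc and ends with def, or vice versa" is what is wanted instead, String.startsWith and String.endsWith can replace contains in the second method.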

Java – Regular Expressions – Finding any non digit byte in a multiple byte

Hello,
I’m new to Java and regular expressions; I’m trying to write a regular expression that will find any records that contain a non-digit byte in a multi-byte field.
I thought the following was the correct expression, but it only finds records in which all bytes are non-digits.
\D{1,}
\D = Non Digit
{1,} = at least 1 or more
Below is my sample data. I would like the regular expression to find all of the records that are not all numeric. However, when I use the regular expression \D{1,} it only finds the 2 records in which all bytes are non-digits (i.e. “ “ and “A “).
“ 111229”
“2 111229”
“20091229”
“200912c9”
“201#1229”
“20101229”
“20110229”
“20111*29”
“20111029”
“20111229”
“20B11229”
“A “
“A0111229”
Please note I have also tried \D{1,}+ and \D{1,}? and they also do not return my desired results.
Any assistance would be greatly appreciated.

You don't show the code you are using but I surmise you are using String.matches() which requires that the whole target must match the regular expression not just part of it. Instead you should create a Pattern and then a Matcher and use the Matcher.find() method. Check the Javadoc for Pattern and Matcher and look at the Java regex tutorial - http://docs.oracle.com/javase/tutorial/essential/regex/ .
P.S. You can re-use the Pattern object - you don't have to create it every time you need one.
P.P.S. Java regular expressions work with characters not bytes and characters are not not not bytes.
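
The difference between the two calls can be sketched like this, assuming the records arrive as Strings; the class and method names are made up. matches() succeeds only if the whole record is non-digits, while find() succeeds as soon as one non-digit appears anywhere.

```java
import java.util.regex.Pattern;

class NonDigitCheck {
    // Compiled once and reused for every record.
    static final Pattern NON_DIGIT = Pattern.compile("\\D");

    static boolean hasNonDigit(String record) {
        return NON_DIGIT.matcher(record).find();
    }

    public static void main(String[] args) {
        String[] records = { " 111229", "20091229", "200912c9", "201#1229", "A0111229" };
        for (String r : records) {
            System.out.println("\"" + r + "\" find=" + hasNonDigit(r)
                    + " matches=" + r.matches("\\D{1,}"));
        }
    }
}
```

For the sample data, every record except the purely numeric ones is reported by find(), including those where only a single character is a letter, space, or symbol.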

Problems with java regular expressions

Hi everybody,
Could someone please help me sort out an issue with Java regular expressions? I have been using regular expressions in Python for years and I cannot figure out how to do what I am trying to do in Java.
For example, I have this code in java:
import java.util.regex.*;
String text = "abc";
Pattern p = Pattern.compile("(a)b(c)");
Matcher m = p.matcher(text);
if (m.matches())
{
    int count = m.groupCount();
    System.out.println("Groups found " + String.valueOf(count) );
    for (int i = 0; i < count; i++)
        System.out.println("group " + String.valueOf(i) + " " + m.group(i));
}
My expectation is that group 0 would capture "abc", group 1 "a", and group 2 "c". Yet I get this:
Groups found 2
group 0 abc
group 1 a
I have tried other patterns and input text but the issue remains the same: no matter what, I cannot capture the last parenthesized expression in the pattern. I tried the same example with Jakarta Regexp 1.5 and that works without any problems; I get what I expect.
I am using Java 1.5.0 on Mac OS X 10.4.
Thanks to all who can help.

paulcw wrote:
If the group count is X, then there are X plus one groups to go through: 0 for the whole match, then 1 through X for the individual groups.
It does seem confusing that the designers chose to exclude the zero-group from group count, but the documentation is clear.
Matcher.groupCount():
Group zero denotes the entire pattern by convention. It is not included in this count.
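
In code, that means the loop bound has to be inclusive. A small sketch (the class and method names are made up) that collects group 0 through groupCount():

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class GroupLoop {
    // Returns group 0 (the whole match) followed by groups 1..groupCount().
    static List<String> groups(String regex, String text) {
        Matcher m = Pattern.compile(regex).matcher(text);
        List<String> result = new ArrayList<>();
        if (m.matches()) {
            // <= rather than <, because groupCount() does not count group 0
            for (int i = 0; i <= m.groupCount(); i++) {
                result.add(m.group(i));
            }
        }
        return result;
    }

    public static void main(String[] args) {
        System.out.println(groups("(a)b(c)", "abc")); // prints [abc, a, c]
    }
}
```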

Using regular expressions for validation in i18n

Can we use regular expressions to validate inputs in a Java application while taking care of i18n aspects too? Zip codes differ between locales. Can we use regular expressions to validate zip code inputs from different locales?

Hi,
For that, do I have to create individual patterns for matching the inputs from different locales, or will a single pattern do when validating phone numbers, zip codes, etc. from around the world? If different patterns are required, the programmer needs to know how the patterns differ between locales.
regards
sdas
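
One common arrangement is a separate pattern per country, looked up from the Locale. The sketch below illustrates that idea only; the three patterns are simplified assumptions, not authoritative postal rules, and the class and method names are made up.

```java
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;
import java.util.regex.Pattern;

class PostalCodeValidator {
    // Simplified, illustrative patterns keyed by ISO country code.
    static final Map<String, Pattern> BY_COUNTRY = new HashMap<>();
    static {
        BY_COUNTRY.put("US", Pattern.compile("\\d{5}(-\\d{4})?"));
        BY_COUNTRY.put("DE", Pattern.compile("\\d{5}"));
        BY_COUNTRY.put("CA", Pattern.compile("[A-Za-z]\\d[A-Za-z] ?\\d[A-Za-z]\\d"));
    }

    // Countries without a registered pattern are rejected in this sketch.
    static boolean isValid(Locale locale, String code) {
        Pattern p = BY_COUNTRY.get(locale.getCountry());
        return p != null && p.matcher(code).matches();
    }

    public static void main(String[] args) {
        System.out.println(isValid(Locale.US, "12345-6789"));
        System.out.println(isValid(Locale.GERMANY, "1234"));
        System.out.println(isValid(Locale.CANADA, "K1A 0B1"));
    }
}
```

Keeping the patterns in a map (or a properties file) means a new country can be supported without touching the validation code.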

Regular expression for BBcode list to html list

Hi,
We are migrating BBforum to Jive forum.
BBforum has data that contains BBcode strings. I found the following code after googling.
public static String bbcode(String text) {
    String html = text;
    Map<String, String> bbMap = new HashMap<String, String>();
    bbMap.put("(\r\n|\r|\n|\n\r)", " ");
    bbMap.put("\\[b\\](.+?)\\[b\\]", "$1");
    for (Map.Entry entry : bbMap.entrySet()) {
        html =
        html.replaceAll(entry.getKey().toString(), entry.getValue().toString());
    }
    return html;
}
I have BBcode in a format like
[list] [*]blue[*]red[*] green[list]
which I have to replace with <ul><li>blue</li><li>red</li>
Can anyone suggest a Java regular expression that does the replacement above?
Edited by: 875452 on Jul 31, 2011 8:03 AM

Moderator advice: Please read the announcement(s) at the top of the forum listings and the FAQ linked from every page. They are there for a purpose.
Then edit your post and format the code correctly.
Moderator action: Moved from Development Tools » General Questions
db
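
For the [list] question in this thread, a single replaceAll is awkward because the number of items varies. A possible sketch, assuming the closing tag is [/list] (the post shows [list] at both ends) and using made-up names: match each list, split its body on [*], and build the HTML by hand.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class BbListToHtml {
    // Assumption: lists are closed with [/list].
    static final Pattern LIST = Pattern.compile("\\[list\\](.*?)\\[/list\\]", Pattern.DOTALL);

    static String convertLists(String text) {
        Matcher m = LIST.matcher(text);
        StringBuffer out = new StringBuffer();
        while (m.find()) {
            StringBuilder ul = new StringBuilder("<ul>");
            // Everything between two [*] markers is one item.
            for (String item : m.group(1).split("\\[\\*\\]")) {
                String trimmed = item.trim();
                if (!trimmed.isEmpty()) {
                    ul.append("<li>").append(trimmed).append("</li>");
                }
            }
            ul.append("</ul>");
            // quoteReplacement keeps $ and \ in item text from being treated specially.
            m.appendReplacement(out, Matcher.quoteReplacement(ul.toString()));
        }
        m.appendTail(out);
        return out.toString();
    }

    public static void main(String[] args) {
        System.out.println(convertLists("[list] [*]blue[*]red[*] green[/list]"));
        // prints <ul><li>blue</li><li>red</li><li>green</li></ul>
    }
}
```

Nested lists are not handled by this sketch.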

Regular expression for recognizing all tables in a sql statement

Hi all
I need a regular expression for recognizing all the table names in a generic statement.
Unfortunately, I need a regular expression that also handles inner joins. I'm sorry, but this matter is new for me and I cannot find any useful help on the web.
Regards

If you insist, it should be something like:
"SELECT ([A-Z0-9_]+)[.][A-Z0-9_]+(,([A-Z0-9_]+)[.][A-Z0-9_]+)* FROM (([A-Z0-9_]+)[.][A-Z0-9_]+) INNER JOIN (([A-Z0-9_]+)[.][A-Z0-9_]+) ON .+" plus spaces etc. Yes, it works for this kind of statement only.
But an SQL parser is better, because you will at least need to remove duplicates from the names you find anyway.
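
A looser alternative, sketched under clear limitations: instead of matching the whole statement, pick up the identifier that follows each FROM or JOIN keyword and collect the names in a set, which takes care of the duplicates. This is not the pattern from the reply above; it ignores subqueries, comma-separated FROM lists, and schema-qualified names, and it can be fooled by FROM inside expressions. The names are illustrative.

```java
import java.util.LinkedHashSet;
import java.util.Locale;
import java.util.Set;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class SqlTableNames {
    // The identifier right after FROM or JOIN is taken to be a table name.
    static final Pattern TABLE_REF =
            Pattern.compile("\\b(?:FROM|JOIN)\\s+([A-Za-z0-9_]+)", Pattern.CASE_INSENSITIVE);

    static Set<String> tableNames(String sql) {
        Set<String> names = new LinkedHashSet<>();   // keeps first-seen order, drops duplicates
        Matcher m = TABLE_REF.matcher(sql);
        while (m.find()) {
            names.add(m.group(1).toUpperCase(Locale.ROOT));
        }
        return names;
    }

    public static void main(String[] args) {
        String sql = "SELECT o.id, c.name FROM orders o INNER JOIN customers c ON o.cid = c.id "
                + "LEFT JOIN orders o2 ON o2.id = o.id";
        System.out.println(tableNames(sql)); // prints [ORDERS, CUSTOMERS]
    }
}
```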

Java Regular Expressions in J2EE

Does anybody know when Java Regular Expressions will be available in J2EE? They are currently in the latest release of J2SE in the java.util.regex package.

They are in the Standard Edition, so it does not make sense that they will also be in Enterprise Edition some day. You need to have the standard JRE installed before you can use the J2EE classes anyway.
If you want to use the regular expressions, install version 1.4 (beta) of the J2SE and use the current version of J2EE on top of that.
Jesper
