Regular Expression Character Sets with Pattern and Matcher

Hi,
I am a little bit confused about a regular expressions I am writing, it works in other languages but not in Java.
The regular expressions is to match LaTeX commands from a file, and is as follows:
\\begin{command}([.|\n\r\s]*)\\end{command}
This does not work in Java but does in PHP, C, etc...
The part that is strange is the . character. If placed as .* it works but if placed as [.]* it doesnt. Does this mean that . cannot be placed in a character range in Java?
Any help very much appreciated.
Kind Regards
Paul Bain

In PHP it seems that the "." still works as a all character operator inside character classes.
The regular expression posted did not work, but it does if I do:
\\begin{command}((.|[\n\r\s])*)?\\end{command}
Basically what I'm trying to match is a block of LaTeX, so the \\begin{command} and \\end{command} in LaTeX, not regex, although the \\ is a single one in LaTeX. I basically want to match any block which starts with one of those and ends in the end command. so really the regular expression that counts is the bit in the middle, ((.|[\n\r\s])*)?
Am I right it saying that the "?" will prevent the engine matching the first and last \\bein and \\end in the following example:
\\begin{command}
some stuff
\\end{command}
\\begin{command}
some stuff
\\end{command}

Similar Messages

Problem with character set with servlets and Oracle 9i AS

I am deploying a servlet with Jserv on a Sun machine.
My servlet is working but some special characters in text fields like "i", "h", "`" (French language) are replaced by "?" when refreshing the HTML pages.
Is that a Jserv or Oracle 9i AS settings problem or machine settings problem ?
Thanks for you help
Lilian

The solution to this problem is described in MetaLink note #211920.1. But this note is published with LIMITED access as it involves using a hidden parameter.
You can get access to the note through Oracle Support only.
The problem itself is solved generically, if the source database is at least 10.1.0.3 and the target database is 10.2
-- Sergiusz

Pattern and Matcher of Regular Expressions

Hello All,
MTMISRVLGLIRDQAISTTFGANAVTDAFWVAFRIPNFLRRLFAEGSFATAFVPVFTEVK
ETRPHADLRELMARVSGTLGGMLLLITALGLIFTPQLAAVFSDGAATNPEKYGLLVDLLR
LTFPFLLFVSLTALAGGALNSFQRFAIPALTPVILNLCMIAGALWLAPRLEVPILALGWA
VLVAGALQLLFQLPALKGIDLLTLPRWGWNHPDVRKVLTLMIPTLFGSSIAQINLMLDTV
IAARLADGSQSWLSLADRFLELPLGVFGVALGTVILPALARHHVKTDRSAFSGALDWGFR
TTLLIAMPAMLGLLLLAEPLVATLFQYRQFTAFDTRMTAMSVYGLSFGLPAYAMLKVLLP
I need some help with the regular expressions in java.
I have encountered a problem on how to retrieve two strings with Pattern and Matcher.
I have written this code to match one substring"MTMISRVLGLIRDQ", but I want to match multiple substrings in a string.
Pattern findstring = Pattern.compile("MTMISRVLGLIRDQ");
Matcher m = findstring.matcher(S);
while (m.find())
outputStream.println("Selected Sequence \"" + m.group() +
"\" starting at index " + m.start() +
" and ending at index " m.end() ".");
Any help would be appreciated.

Double post: http://forum.java.sun.com/thread.jspa?threadID=726158&tstart=0

String.matches vs Pattern and Matcher object

Hi,
I was trying to match some regex using String.matches but for me it is not working (probably I am not using it the way it should be used).
Here is a simple example:
/* This does not work */
String patternStr = "a";
String inputStr = "abc";
if(inputStr.matches( "a" ))
System.out.println("String matched");
/* This works */
Pattern p = Pattern.compile( "a" );
Matcher m = p.matcher( "abc" );
boolean found = false;
while(m.find())
System.out.println("Matched using Pattern and Matcher");
found = true;
if(!found)
System.out.println("Not matching with Pattern and Matcher");
Am I not matches method of String class properly?
Please throw some lights on this.
Thank you.

String.matches looks at the whole string.
bsh % "abc".matches("a");
<false>
bsh % "abc".matches("a.*");
<true>

Pattern and Matching question

Hey,
I'm trying to use the pattern and matcher to replace all instances of a website
address in some html documents as I process them and post them. I'm
including a sample of some of the HTML below and the code I"m using to
process it. For some reason it doesn't replace the sites in the underlying
images and i can't figure out what I'm doing wrong. Please forgive all the
unused variables, those are relics of another way i may have to do this if i
can't get the pattern thing to work.
Josh
     public static void setParameters(File fileName)
          FileReader theReader = null;
          try
               System.out.println("beginning setparameters guide2)");
               File fileForProcessing=new File(fileName.getAbsolutePath());
               //wrap the file in a filereader and buffered reader for maximum processing
               theReader=new FileReader(fileForProcessing);
               BufferedReader bufferedReader=new BufferedReader(theReader);
               //fill in data into the tempquestion variable to be populated
               //Set the question and answer texts back to default
               questionText="";
               answerText="";
               //Define the question variable as a Stringbuffer so new data can be appended to it
               StringBuffer endQuestion=new StringBuffer();//Stringbuffer to store all the lines
               String tempQuestion="";
               //Define new file with the absolutepath and the filename for use in parsing out question/answer data
               tempQuestion=bufferedReader.readLine();//reads the nextline of the question
               String tempAlteredQuestion="";//for temporary alteration of the nextline
               //while there are more lines append the stringbuffer with the new data to complete the question buffer
               StringTokenizer tokenizer=new StringTokenizer(tempQuestion, " ");//tokenizer for reading individual words
               StringBuffer temporaryLine; //reinstantiate temporary line holder each iterration
               String newToken;   //newToken gets the very next token every iterration? changed to tokenizer moretokens loop
               String newTokenTemp;   //reset newTokenTemp to null each iterration
               String theEndOfIt; //string to hold everything after .com
               char[] characters; //character array to hold the characters that are used to hold the entire link
               char lastCharChecked;
               Pattern thePattern=Pattern.compile("src=\"https:////fakesite.com//ics", Pattern.LITERAL);
               Matcher theMatcher=thePattern.matcher(tempQuestion);
                    while(tempQuestion!=null) //every time the tempquestion reads a newline, make sure you aren't at the end
                         String theReplacedString=theMatcher.replaceAll("https:////fakesite.com//UserGuide/");
                         //          temporaryLine=new StringBuffer();
                         //add the temporary line after processed back into the end question.
                         endQuestion.append(theReplacedString);                              //temporaryLine.toString());
                         //reset the tempquestion to the newline that is going to be read
                         tempQuestion=bufferedReader.readLine();
                         if(tempQuestion!=null)
                              theMatcher.reset(tempQuestion);
                         /*newTokenTemp=null;
                         while(tokenizer.hasMoreTokens())
                              newToken=tokenizer.nextToken(); //get the next token from the line for processing
                              System.out.println("uhhhhhh");
                              if(newToken.length()>36) //if the token is long enough chop it off to compare
                                   newTokenTemp=newToken.substring(0, 36);
                              if(newTokenTemp.equals("src=\"https://fakesite.com"));//compare against the known image source
                                   theEndOfIt=new String(); //intialize theEndOfIt
                                   characters=new char[newToken.length()]; //set the arraylength to the length of the initial token
                                   characters=newToken.toCharArray(); //point the character array to the actual characters for newToken
                                   lastCharChecked='a'; // the last character that was compared
                                   int x=0; //setup the iterration variable and go from the length of the whole token back till you find the first /
                                   for(x=newToken.length()-1;x>0&&lastCharChecked!='/';x--)
                                        System.out.println(newToken);
                                        //set last char checked to the lsat iterration run
                                        lastCharChecked=characters[x];
                                        //set the end of it to the last char checked and the rest of the chars checked combined
                                        theEndOfIt=Character.toString(lastCharChecked)+theEndOfIt;
                                   //reset the initial newToken value to the cut temporary newToken root + userguide addin, + the end
                                   newToken=newTokenTemp+"//Userguide"+theEndOfIt;
                              //add in the space aftr the token to the temporary line and the new token, this is where it should be parsed back together
                              temporaryLine.append(newToken+" ");
                         //add the temporary line after processed back into the end question.
                         endQuestion.append(temporaryLine.toString());
                         //reset the tempquestion to the newline that is going to be read
                         tempQuestion=bufferedReader.readLine();
                         //reset tokenizer to the new temporary question
                         if(tempQuestion!=null)
                         tokenizer=new StringTokenizer(tempQuestion);
               //Set the answer to the stringbuffer after converting to string
               answerText=endQuestion.toString();
               //code to take the filename and replace _ with a space and put that in the question text
               char theSpace=' ';
               char theUnderline='_';
               questionText=(fileName.getName()).replace(theUnderline, theSpace);
          catch(FileNotFoundException exception)
               if(logger.isLoggable(Level.WARNING))
               logger.log(Level.WARNING,"The File was Not Found\n"+exception.getMessage()+"\n"+exception.getStackTrace(),exception);
          catch(IOException exception)
               if(logger.isLoggable(Level.SEVERE))
               logger.log(Level.SEVERE,exception.getMessage()+"\n"+exception.getStackTrace(),exception);
          finally
               try
                    if(theReader!=null)
                         theReader.close();
               catch(Exception e)
<SCRIPT language=JavaScript1.2 type=text/javascript></SCRIPT>
<H1><IMG class=img_whs1 height=63 src="https://fakesite.com/ics/header4.jpg" width=816 border=0 x-maintain-ratio="TRUE"></H1>
<H1>Associate Existing Customers</H1>
<P>blahalbalhblabhlab blabhalha blabahbablablabhlablhalhab.<SPAN style="FONT-WEIGHT: bold"><B><IMG class=img_whs2 height=18 alt="Submit a

If you use just / it misinterprets it and it ruins
your " " tags for a string. I don't think so. '/' is not a special character for Java regex, nor for Java String.
The reason i used
literal is to try to force it to directly match,
originally i thought that was the reason it wasn't
working.That will be no problem because it enforces '.' to be treated as a dot, not as a regex 'any character'.
Message was edited by:
hiwa

How to Use Pattern and Matcher class.

HI Guys,
I am just trying to use Pattern and Matcher classes for my requirement.
My requirement is :- It should allow the numbers from 1-7 followed by a comma(,) again followed by the numbers from
1-7. For example:- 1,2,3,4,5 or 3,6,1 or 7,1,3 something like that.
But it should not allow 0,8 and 9. And also it should not allow any Alphabets and special characters except comma(,).
I have written some thing like..
Pattern p = Pattern.compile("([1-7])+([\\,])?([1-7])?");
Is there any problem with this pattern ??
Please help out..
I am new to pattern matching concept..
Thanks and regards
Sudheer

ok guys, this is how my code looks like..
class PatternTest
     public static void main(String[] args)
          System.out.println("Hello World!");
          String input = args[0];
          Pattern p = Pattern.compile("([1-7]{1},?)+");
          Matcher m = p.matcher(input);
          if(m.find()) {
               System.out.println("Pattern Found");
          } else {
               System.out.println("Invalid pattern");
}if I enter 8,1,3 its accepting and saying Pattern Found..
Please correct me if I am wrong.
Actually this is the test code I am presenting here.. I original requirement is..I will be uploading an excel sheets containg 10 columns and n rows.
In one of my column, I need to test whether the data in that column is between 1-7 or not..If I get a value consisting of numbers other than 1-7..Then I should
display him the msg..
Thanks and regards
Sudheer

Searching and Matching - Difference between 'Match Pattern' and 'Match Geometric Pattern'?

I was wondering if someone can explain to me the difference between 'Match Pattern' and 'Match Geometric Pattern' VIs? I'm really not sure which best to use for my application. I'm trying to search/match small spherical particles in a grey video in order to track their speed (I'm doing this after subtracting two subsequent frames to get rid of background motion artifacts).
Which should I use?
Thank you!
Solved!
Go to Solution.

Hi TKassis,
1.You may find from this link for the difference between these two,
Pattern Match : http://zone.ni.com/reference/en-XX/help/370281P-01/imaqvision/imaq_match_pattern_3/
Geometric Match : http://zone.ni.com/reference/en-XX/help/370281P-01/imaqvision/imaq_match_geometric_pattern/.
2. I always prefer match pattern because of its execution speed, and incase of geometric pattern match it took lot of time to match your result. You may find in the attached figure for same image with these two algorithm execution time.
Sasi.
Certified LabVIEW Associate Developer
If you can DREAM it, You can DO it - Walt Disney

I just bought an airport express. i set it up and was able to be detected by my macbook, but i still cant connect wirelessly. the same goes when i tried my iphone. please help!

i just bought an airport express. i set it up and was able to be detected by my macbook, but i still cant connect wirelessly. the same goes when i tried my iphone. please help!

Is the status light green?
Your Airport device may be in Bridge Mode.
Airport Utility 6:
Open Airport Utility, select the Airport Express and click Edit. Navigate to the Network tab and change Router Mode to DHCP and NAT. Then click Update on the bottom right.
Airport Utility 5:
Open Airport Utility and click Manual Setup. Navigate to the Internet icon and change Connection Sharing to Share a Public IP Address. Then click Update on the bottom right.

Patterns and Matcher.find()

I would be really grateful for some assistance on this. I have been at this for three hours and can't get it correct.
I have a string such as this: "Hello this is a great  little string"
I want to use RegEx, Pattern and Matcher.find() to extract "IMAGINE_SPACIAL_VECTOR" ie. the text between ""
Thanks in advance for your help

import java.util.regex.Matcher;
import java.util.regex.Pattern;
public class Example {
     public static void main(String[] args) {
          String data = "Hello this is a great  little string";
          Matcher matcher = Pattern.compile("").matcher(data);
          while (matcher.find()) {
               System.out.println(matcher.group(1));
}Might not be the best way, but it works.
Kaj

Regular expressions, using pattern and matcher but not include the pattern

Hi all
i have a regular expression but in my matcher it is including the text that is in my regular expression.
ie
String str="-------------------stuff===========";
Pattern mainBody = Pattern.compile("-----(.*?)=====", Pattern.MULTILINE);this matches -------------------stuff=====
now i expect to get some - in there as i match from the start, but i dont want to have the = in the match. how do i do a match that excludes the matching expressions.

nevermind, figured it out
when i do myMatch.group(1) it gives me just my match
sorry for wasting time :)

Regular Expression wierdness - problem with $ character

If I use the following KM code in Beanshell Technology - it works correctly and replaces "C$_0MYREMOTETABLE RMTALIAS, MYLOCALTABLE LOCALIAS, " with "C$_0MYREMOTETABLE_000111 RMTALIAS, MYLOCALTABLE LOCALIAS, "
But when I try to use the same exact code in 'Undefined' technology - it does not match anything in the source string - and does not replace anything.
If I change the regular expression to not use the $ it still does not work.
But if I change the source string to remove the $ - then the regular expression works.
If I use the same code in Beanshell technology - it works fine - but then I can't use the value in a later 'Undefined' technology step.
Does anyone know if the java technology does something special with $ characters when ODI parses the KM code?
Does anyone know if there is a way to use the value from a Beanshell variable in a 'Undefined' technology step?
String newSourceTableList = "";
String sessionNum ="<%=odiRef.getSession("SESS_NO") %>";
String sourceTableList = "<%=odiRef.getSrcTablesList("", "[WORK_SCHEMA].[TABLE_NAME] [POP_TAB_ALIAS]" , ",", ",") %>";
String matchExpr = "(C\\$_\\S*)"; (should end with two backslashes followed by 'S*' - this editor mangles it)
String replaceExpr = "$0_"+sessionNum+ " ";
newSourceTableList = sourceTableList.replaceAll(matchExpr,replaceExpr);
---------------------------------------------------

Phases of substitution in ODI:
The way ODI works allows for three separate phases of substitution, and you can use them all. The three phases are:
- First Phase: <% %> You will see these appear in the knowledge moduiles etc and these are substituted on generation. (when you generate a scenario, or tell ODI to execute an interface directly) this phase is used to generate the column names, table names etc which are known from the metadata at that phase.
- Second Phase: <? ?> This phase is substituted when the scenario is instatntiuated as an excution - session generation. At this point, ODI has the additional information which allows it to generate the schema names, as it has resolved the Logical/Physical Schemas through the use of the Context (which is provided for the execution to take place. All the substitutions at this point are written to the execution log.
- Third Phase <@ @> This phase is substituted when the execution code is read from the session log for execution. You will note that anything substituted in this phase is NEVER written to the execution log. (see PASSWORDS as a prime example, you don't want those written to the logs, with the security risks associated with that!)
Anything in <@ @> is always interpreted for substitution by the java beanshell, it does not have to be a Java Beanshell step, it can be any kind of step, it will be interpreted at that run-time point.

Help with regular expression to find a pattern in clob

can someone help me writing a regular expression to query a clob that containts xml type data?
query to find multiple occurrences of a variable string (i.e <EMPID-XX> - XX can be any number). If <EMPID-01> appears twice in the clob i want the result as EMPID-01,2 and if EMPID-02 appears 4 times i want the result as EMPID-02,4.

with
ofx_clob as
(select q'~
<EMPID>1
< UNQID>123456
< TIMESTAMP>...
< ADDRINFO>
< TITLE>^@~*
< FIRST>ABCD
< MI>
< LAST>EFGH
< ADDR1>ADDR1
< ADDR2>^@~*
< CITY>CITY
<EMPID>2
< UNQID>123457
< TIMESTAMP>...
< ADDRINFO>
< TITLE>^@~*
< FIRST>ABCD
< MI>
< LAST>EFGH
< ADDR1>ADDR1
< ADDR2>^@~*
< CITY>CITY
<EMPID>1
< UNQID>123458
< TIMESTAMP>...
< ADDRINFO>
< TITLE>^@~*
< FIRST>ABCD
< MI>
< LAST>EFGH
< ADDR1>ADDR1
< ADDR2>^@~*
< CITY>CITY
~' ofx from dual
select '<EMPID>' || to_char(ids) || '(' || to_char(count(*)) || ')' multi_empid
from (select replace(regexp_substr(ofx,'<EMPID>\d*',1,level),'<EMPID>') ids
          from ofx_clob
        connect by level <= regexp_count(ofx,'<EMPID>')
group by ids having count(*) > 1
MULTI_EMPID
<EMPID>1(2)
with
ofx_clob as
(select q'~
<EMPID>1
< UNQID>123456
< TIMESTAMP>...
< ADDRINFO>
< TITLE>^@~*
< FIRST>ABCD
< MI>
< LAST>EFGH
< ADDR1>ADDR1
< ADDR2>^@~*
< CITY>CITY
<EMPID>2
< UNQID>123457
< TIMESTAMP>...
< ADDRINFO>
< TITLE>^@~*
< FIRST>ABCD
< MI>
< LAST>EFGH
< ADDR1>ADDR1
< ADDR2>^@~*
< CITY>CITY
<EMPID>1
< UNQID>123456
< TIMESTAMP>...
< ADDRINFO>
< TITLE>^@~*
< FIRST>ABCD
< MI>
< LAST>EFGH
< ADDR1>ADDR1
< ADDR2>^@~*
< CITY>CITY
<EMPID>2
< UNQID>123456
< TIMESTAMP>...
< ADDRINFO>
< TITLE>^@~*
< FIRST>ABCD
< MI>
< LAST>EFGH
< ADDR1>ADDR1
< ADDR2>^@~*
< CITY>CITY
<EMPID>1
< UNQID>123458
< TIMESTAMP>...
< ADDRINFO>
< TITLE>^@~*
< FIRST>ABCD
< MI>
< LAST>EFGH
< ADDR1>ADDR1
< ADDR2>^@~*
< CITY>CITY
~' ofx from dual
select '<EMPID>' || listagg(to_char(ids) || '(' || to_char(count(*)) || ')',',') within group (order by ids) multi_empid
from (select replace(regexp_substr(ofx,'<EMPID>\d*',1,level),'<EMPID>') ids
          from ofx_clob
        connect by level <= regexp_count(ofx,'<EMPID>')
group by ids having count(*) > 1
MULTI_EMPID
<EMPID>1(3),2(2)
Regards
Etbin
Message was edited by: Etbin
used listagg to report more than one multiple <EMPID>

Expdp with network_link and different character sets for source and target?

DB versions 10.2 and 10.1
Is there any limitation for implementing expdp with network_link but target and source have different character sets?
I tried with many combinations, but only had success when target and source have the same character sets. In other combinations there was invalid character set conversion (loose of some characters in VARCHAR2 and CLOB fields).
I didn't find anything on the internet, forums or Oracle documentation. There was only limitation for transportable tablespace mode export (Database Utilities).
Thanks

Hi DBA-One
This link
http://download-east.oracle.com/docs/cd/B19306_01/server.102/b14215/dp_overview.htm#CEGFCFFI
is from Database Utilities 10g Release 2 (10.2) (I read it many times)
There is only one thing about different character sets.
I quote:
Data Pump supports character set conversion for both direct path and external tables. Most of the restrictions that exist for character set conversions in the original Import utility do not apply to Data Pump. The one case in which character set conversions are not supported under the Data Pump is when using transportable tablespaces.
Parameter VERSION also doesn't play role here, because the behaviour (character set) is the same if I use for target and source only 10.2 or (10.2 and 10.1) respectivly.

Regular expression technique needed to eliminate " and replace with \"

I am trying to figure out the best way to fix data coming from a database table through the use of ColdFusion, in which there are some quotation marks that JavaScript Flash doesn't want in the array before adding the text content to a quiz.
At the moment, if there are any quotation marks in the elements of the array, even though each element is surrounded by quotation marks, I have to add a backslash to escape them.
I was wondering what would be the regular expression to use to do this, not for the quotation marks surrounding each element, but for those used inside of them.
For example,
["08 Working with Flash Forms_16",
"The syntax for the submit button was .",
"name="submit" type="submit"",
"name="submit"action="submit"",
"id="submit" type="submit"",
"id="submit"action="submit"",
"No, name="submit" type="submit" is correct.",
"true",
"false",
"false",
"false",
"897"] ,
is an example of an element created that has quotation marks inside of the quotation marks for each element.
I am debating about handling this either with XML by writing code to do that with ColdFusion instead of using the code I generated with ColdFusion to create this.
I am also debating about writing the regular expression in a way that I could use it from either ActionScript 2 or 3.0, going at it at the file level maybe.
At the moment I am opening the file and experimenting with writing a Find expression that uses regular expression to select what I want to change before changing it.

Hi,
In excel menu Tools->macro
Enter the macro name say SAPBEXonRefresh
click 'create', will go to visual basic editor
To display '#' as '', paste the following code
Sub SAPBEXonRefresh(queryID As String, resultArea As Range)
Dim c As Range
For Each c In resultArea.Cells
If c.Value = "#" Then c.Value = ""
Next c
End Sub
Close the editor and click on refresh again.
Thanks..
Shambhu

Regular expression on words with % wildcard

Hi,
I've got some processing working using regular expression where I need to process words e.g.
regexp_replace('word1 word2','(\w+)','myprefix{\1}') - results in - 'myprefixword1 myprefixword2'
However, if I'm presented with this; '%word0 word1% wo%d2 word3', then I need to treat % as special case and leave the word as is, so result here would be; - '%word0 word1% wo%d2 myprefixword3', is this achievable using regexp ?

And for those who don't know, I guess we should explain why we're having to expand single spaces to double spaces...
(I'll use the "¬" character to represent spaces to make it clearer to see)
If we have a string such as
word1¬word2¬word3and we want to identify the words in the string (without using any special regexp word identifier) then we are going to use the spaces to identify the start and end of words. To make life easy, we manually put a space at the start and end of the string so we can say that each word in the string will have a space before and after it regardless of where it is in the string...
¬word1¬word2¬word3¬However, when we specify what we want to search for we are going to say we want a space, followed by a number of characters (not spaces), followed by a space...
¬[^¬]*¬So, ideally, you'd expect it to look through the string and say
¬word1¬word2¬word3¬
\_____/... found word1
¬word1¬word2¬word3¬
      \_____/... found word2
¬word1¬word2¬word3¬
            \_____/... found word3
Unfortunately, there is a problem. Once the first word has been found the pointer for searching the rest of the string is located on the next character after the match i.e.
¬word1¬word2¬word3¬
       ^So it won't be able to pick out word2 and will only get to word3. Let's see it in action...
SQL> ed
Wrote file afiedt.buf
1 with t as (select ' word1 word2 word3 ' as txt from dual)
2 --
3 select regexp_replace(txt, ' [^ ]* ', 'xxxxx') as txt
4* from t
SQL> /
TXT
xxxxxword2xxxxx
SQL>In order to deal with this, if we replace the single spaces with double spaces (not required at the start and end) our string looks like...
¬word1¬¬word2¬¬word3¬So as it searches it finds word1 as a match and then the pointer in the string is located...
¬word1¬¬word2¬¬word3¬
       ^... so the next match for the pattern of space-characters-space is word2 and then the pointer is located...
¬word1¬¬word2¬¬word3¬
              ^... ready to find word 3. Example...
SQL> ed
Wrote file afiedt.buf
1 with t as (select ' word1 word2 word3 ' as txt from dual)
2 --
3 select regexp_replace(txt, ' [^ ]* ', 'xxxxx') as txt
4* from t
SQL> /
TXT
xxxxxxxxxxxxxxx
SQL>Hopefully that's a little clearer. You just have to remember the "pointer" principle and the fact that once a match is found it is located on the character after the match.
;)

Regular Expression Character Sets with Pattern and Matcher

Similar Messages

Maybe you are looking for