Context Navigation

← Previous Revision
Latest Revision
Next Revision →
Blame
Revision Log

qregexp.html@ 203

Last change on this file since 203 was 190, checked in by rudi, 14 years ago
reference documentation added
File size: 57.6 KB

Line
1	<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
2	<!-- /home/espenr/tmp/qt-3.3.8-espenr-2499/qt-x11-free-3.3.8/src/tools/qregexp.cpp:77 -->
3	<html>
4	<head>
5	<meta http-equiv="Content-Type" content="text/html; charset=ISO-8859-1">
6	<title>QRegExp Class</title>
7	<style type="text/css"><!--
8	fn { margin-left: 1cm; text-indent: -1cm; }
9	a:link { color: #004faf; text-decoration: none }
10	a:visited { color: #672967; text-decoration: none }
11	body { background: #ffffff; color: black; }
12	--></style>
13	</head>
14	<body>
15
16	<table border="0" cellpadding="0" cellspacing="0" width="100%">
17	<tr bgcolor="#E5E5E5">
18	<td valign=center>
19	<a href="index.html">
20	<font color="#004faf">Home</font></a>
21	\| <a href="classes.html">
22	<font color="#004faf">All Classes</font></a>
23	\| <a href="mainclasses.html">
24	<font color="#004faf">Main Classes</font></a>
25	\| <a href="annotated.html">
26	<font color="#004faf">Annotated</font></a>
27	\| <a href="groups.html">
28	<font color="#004faf">Grouped Classes</font></a>
29	\| <a href="functions.html">
30	<font color="#004faf">Functions</font></a>
31	</td>
32	<td align="right" valign="center"><img src="logo32.png" align="right" width="64" height="32" border="0"></td></tr></table><h1 align=center>QRegExp Class Reference</h1>
33
34	<p>The QRegExp class provides pattern matching using regular expressions.
35	<a href="#details">More...</a>
36	<p>All the functions in this class are <a href="threads.html#reentrant">reentrant</a> when Qt is built with thread support.</p>
37	<p><tt>#include <<a href="qregexp-h.html">qregexp.h</a>></tt>
38	<p><a href="qregexp-members.html">List of all member functions.</a>
39	<h2>Public Members</h2>
40	<ul>
41	<li class=fn>enum <a href="#CaretMode-enum"><b>CaretMode</b></a> { CaretAtZero, CaretAtOffset, CaretWontMatch }</li>
42	<li class=fn><a href="#QRegExp"><b>QRegExp</b></a> ()</li>
43	<li class=fn><a href="#QRegExp-2"><b>QRegExp</b></a> ( const QString & pattern, bool caseSensitive = TRUE, bool wildcard = FALSE )</li>
44	<li class=fn><a href="#QRegExp-3"><b>QRegExp</b></a> ( const QRegExp & rx )</li>
45	<li class=fn><a href="#~QRegExp"><b>~QRegExp</b></a> ()</li>
46	<li class=fn>QRegExp & <a href="#operator-eq"><b>operator=</b></a> ( const QRegExp & rx )</li>
47	<li class=fn>bool <a href="#operator-eq-eq"><b>operator==</b></a> ( const QRegExp & rx ) const</li>
48	<li class=fn>bool <a href="#operator!-eq"><b>operator!=</b></a> ( const QRegExp & rx ) const</li>
49	<li class=fn>bool <a href="#isEmpty"><b>isEmpty</b></a> () const</li>
50	<li class=fn>bool <a href="#isValid"><b>isValid</b></a> () const</li>
51	<li class=fn>QString <a href="#pattern"><b>pattern</b></a> () const</li>
52	<li class=fn>void <a href="#setPattern"><b>setPattern</b></a> ( const QString & pattern )</li>
53	<li class=fn>bool <a href="#caseSensitive"><b>caseSensitive</b></a> () const</li>
54	<li class=fn>void <a href="#setCaseSensitive"><b>setCaseSensitive</b></a> ( bool sensitive )</li>
55	<li class=fn>bool <a href="#wildcard"><b>wildcard</b></a> () const</li>
56	<li class=fn>void <a href="#setWildcard"><b>setWildcard</b></a> ( bool wildcard )</li>
57	<li class=fn>bool <a href="#minimal"><b>minimal</b></a> () const</li>
58	<li class=fn>void <a href="#setMinimal"><b>setMinimal</b></a> ( bool minimal )</li>
59	<li class=fn>bool <a href="#exactMatch"><b>exactMatch</b></a> ( const QString & str ) const</li>
60	<li class=fn>int match ( const QString & str, int index = 0, int * len = 0, bool indexIsStart = TRUE ) const  <em>(obsolete)</em></li>
61	<li class=fn>int <a href="#search"><b>search</b></a> ( const QString & str, int offset = 0, CaretMode caretMode = CaretAtZero ) const</li>
62	<li class=fn>int <a href="#searchRev"><b>searchRev</b></a> ( const QString & str, int offset = -1, CaretMode caretMode = CaretAtZero ) const</li>
63	<li class=fn>int <a href="#matchedLength"><b>matchedLength</b></a> () const</li>
64	<li class=fn>int <a href="#numCaptures"><b>numCaptures</b></a> () const</li>
65	<li class=fn>QStringList <a href="#capturedTexts"><b>capturedTexts</b></a> ()</li>
66	<li class=fn>QString <a href="#cap"><b>cap</b></a> ( int nth = 0 )</li>
67	<li class=fn>int <a href="#pos"><b>pos</b></a> ( int nth = 0 )</li>
68	<li class=fn>QString <a href="#errorString"><b>errorString</b></a> ()</li>
69	</ul>
70	<h2>Static Public Members</h2>
71	<ul>
72	<li class=fn>QString <a href="#escape"><b>escape</b></a> ( const QString & str )</li>
73	</ul>
74	<hr><a name="details"></a><h2>Detailed Description</h2>
75
76
77
78	The QRegExp class provides pattern matching using regular expressions.
79	<p>
80
81
82
83	<!-- index regular expression --><a name="regular-expression"></a>
84	<p> Regular expressions, or "regexps", provide a way to find patterns
85	within text. This is useful in many contexts, for example:
86	<p> <center><table cellpadding="4" cellspacing="2" border="0">
87	<tr bgcolor="#f0f0f0"> <td valign="top">Validation
88	<td valign="top">A regexp can be used to check whether a piece of text
89	meets some criteria, e.g. is an integer or contains no
90	whitespace.
91	<tr bgcolor="#d0d0d0"> <td valign="top">Searching
92	<td valign="top">Regexps provide a much more powerful means of searching
93	text than simple string matching does. For example we can
94	create a regexp which says "find one of the words 'mail',
95	'letter' or 'correspondence' but not any of the words
96	'email', 'mailman' 'mailer', 'letterbox' etc."
97	<tr bgcolor="#f0f0f0"> <td valign="top">Search and Replace
98	<td valign="top">A regexp can be used to replace a pattern with a piece of
99	text, for example replace all occurrences of '&' with
100	'&amp;' except where the '&' is already followed by 'amp;'.
101	<tr bgcolor="#d0d0d0"> <td valign="top">String Splitting
102	<td valign="top">A regexp can be used to identify where a string should be
103	split into its component fields, e.g. splitting tab-delimited
104	strings.
105	</table></center>
106	<p> We present a very brief introduction to regexps, a description of
107	Qt's regexp language, some code examples, and finally the function
108	documentation itself. QRegExp is modeled on Perl's regexp
109	language, and also fully supports Unicode. QRegExp can also be
110	used in the weaker 'wildcard' (globbing) mode which works in a
111	similar way to command shells. A good text on regexps is <em>Mastering Regular Expressions: Powerful Techniques for Perl and Other Tools</em> by Jeffrey E. Friedl, ISBN 1565922573.
112	<p> Experienced regexp users may prefer to skip the introduction and
113	go directly to the relevant information.
114	<p> In case of multi-threaded programming, note that QRegExp depends on
115	<a href="qthreadstorage.html">QThreadStorage</a> internally. For that reason, QRegExp should only be
116	used with threads started with <a href="qthread.html">QThread</a>, i.e. not with threads
117	started with platform-specific APIs.
118	<p> <!-- toc -->
119	<ul>
120	<li><a href="#1"> Introduction
121	</a>
122	<li><a href="#1-1"> Characters and Abbreviations for Sets of Characters
123	</a>
124	<li><a href="#1-2"> Sets of Characters
125	</a>
126	<li><a href="#1-3"> Quantifiers
127	</a>
128	<li><a href="#1-4"> Capturing Text
129	</a>
130	<li><a href="#1-5"> Assertions
131	</a>
132	<li><a href="#1-6"> Wildcard Matching (globbing)
133	</a>
134	<li><a href="#1-7"> Notes for Perl Users
135	</a>
136	<li><a href="#1-8"> Code Examples
137	</a>
138	</ul>
139	<!-- endtoc -->
140
141	<p> <h3> Introduction
142	</h3>
143	<a name="1"></a><p> Regexps are built up from expressions, quantifiers, and assertions.
144	The simplest form of expression is simply a character, e.g.
145	<b>x</b> or <b>5</b>. An expression can also be a set of
146	characters. For example, <b>[ABCD]</b>, will match an <b>A</b> or
147	a <b>B</b> or a <b>C</b> or a <b>D</b>. As a shorthand we could
148	write this as <b>[A-D]</b>. If we want to match any of the
149	captital letters in the English alphabet we can write
150	<b>[A-Z]</b>. A quantifier tells the regexp engine how many
151	occurrences of the expression we want, e.g. <b>x{1,1}</b> means
152	match an <b>x</b> which occurs at least once and at most once.
153	We'll look at assertions and more complex expressions later.
154	<p> Note that in general regexps cannot be used to check for balanced
155	brackets or tags. For example if you want to match an opening html
156	<tt><b></tt> and its closing <tt></b></tt> you can only use a regexp if you
157	know that these tags are not nested; the html fragment, <tt><b>bold <b>bolder</b></b></tt> will not match as expected. If you know the
158	maximum level of nesting it is possible to create a regexp that
159	will match correctly, but for an unknown level of nesting, regexps
160	will fail.
161	<p> We'll start by writing a regexp to match integers in the range 0
162	to 99. We will require at least one digit so we will start with
163	<b>[0-9]{1,1}</b> which means match a digit exactly once. This
164	regexp alone will match integers in the range 0 to 9. To match one
165	or two digits we can increase the maximum number of occurrences so
166	the regexp becomes <b>[0-9]{1,2}</b> meaning match a digit at
167	least once and at most twice. However, this regexp as it stands
168	will not match correctly. This regexp will match one or two digits
169	<em>within</em> a string. To ensure that we match against the whole
170	string we must use the anchor assertions. We need <b>^</b> (caret)
171	which when it is the first character in the regexp means that the
172	regexp must match from the beginning of the string. And we also
173	need <b>$</b> (dollar) which when it is the last character in the
174	regexp means that the regexp must match until the end of the
175	string. So now our regexp is <b>^[0-9]{1,2}$</b>. Note that
176	assertions, such as <b>^</b> and <b>$</b>, do not match any
177	characters.
178	<p> If you've seen regexps elsewhere they may have looked different from
179	the ones above. This is because some sets of characters and some
180	quantifiers are so common that they have special symbols to
181	represent them. <b>[0-9]</b> can be replaced with the symbol
182	<b>\d</b>. The quantifier to match exactly one occurrence,
183	<b>{1,1}</b>, can be replaced with the expression itself. This means
184	that <b>x{1,1}</b> is exactly the same as <b>x</b> alone. So our 0
185	to 99 matcher could be written <b>^\d{1,2}$</b>. Another way of
186	writing it would be <b>^\d\d{0,1}$</b>, i.e. from the start of the
187	string match a digit followed by zero or one digits. In practice
188	most people would write it <b>^\d\d?$</b>. The <b>?</b> is a
189	shorthand for the quantifier <b>{0,1}</b>, i.e. a minimum of no
190	occurrences a maximum of one occurrence. This is used to make an
191	expression optional. The regexp <b>^\d\d?$</b> means "from the
192	beginning of the string match one digit followed by zero or one
193	digits and then the end of the string".
194	<p> Our second example is matching the words 'mail', 'letter' or
195	'correspondence' but without matching 'email', 'mailman',
196	'mailer', 'letterbox' etc. We'll start by just matching 'mail'. In
197	full the regexp is, <b>m{1,1}a{1,1}i{1,1}l{1,1}</b>, but since
198	each expression itself is automatically quantified by <b>{1,1}</b>
199	we can simply write this as <b>mail</b>; an 'm' followed by an 'a'
200	followed by an 'i' followed by an 'l'. The symbol '\|' (bar) is
201	used for <em>alternation</em>, so our regexp now becomes
202	<b>mail\|letter\|correspondence</b> which means match 'mail' <em>or</em>
203	'letter' <em>or</em> 'correspondence'. Whilst this regexp will find the
204	words we want it will also find words we don't want such as
205	'email'. We will start by putting our regexp in parentheses,
206	<b>(mail\|letter\|correspondence)</b>. Parentheses have two effects,
207	firstly they group expressions together and secondly they identify
208	parts of the regexp that we wish to <a href="#capturing-text">capture</a>. Our regexp still matches any of the three words but now
209	they are grouped together as a unit. This is useful for building
210	up more complex regexps. It is also useful because it allows us to
211	examine which of the words actually matched. We need to use
212	another assertion, this time <b>\b</b> "word boundary":
213	<b>\b(mail\|letter\|correspondence)\b</b>. This regexp means "match
214	a word boundary followed by the expression in parentheses followed
215	by another word boundary". The <b>\b</b> assertion matches at a <em>position</em> in the regexp not a <em>character</em> in the regexp. A word
216	boundary is any non-word character such as a space a newline or
217	the beginning or end of the string.
218	<p> For our third example we want to replace ampersands with the HTML
219	entity '&amp;'. The regexp to match is simple: <b>&</b>, i.e.
220	match one ampersand. Unfortunately this will mess up our text if
221	some of the ampersands have already been turned into HTML
222	entities. So what we really want to say is replace an ampersand
223	providing it is not followed by 'amp;'. For this we need the
224	negative lookahead assertion and our regexp becomes:
225	<b>&(?!amp;)</b>. The negative lookahead assertion is introduced
226	with '(?!' and finishes at the ')'. It means that the text it
227	contains, 'amp;' in our example, must <em>not</em> follow the expression
228	that preceeds it.
229	<p> Regexps provide a rich language that can be used in a variety of
230	ways. For example suppose we want to count all the occurrences of
231	'Eric' and 'Eirik' in a string. Two valid regexps to match these
232	are <b>\b(Eric\|Eirik)\b</b> and <b>\bEi?ri[ck]\b</b>. We need
233	the word boundary '\b' so we don't get 'Ericsson' etc. The second
234	regexp actually matches more than we want, 'Eric', 'Erik', 'Eiric'
235	and 'Eirik'.
236	<p> We will implement some the examples above in the
237	<a href="#code-examples">code examples</a> section.
238	<p> <a name="characters-and-abbreviations-for-sets-of-characters"></a>
239	<h3> Characters and Abbreviations for Sets of Characters
240	</h3>
241	<a name="1-1"></a><p> <center><table cellpadding="4" cellspacing="2" border="0">
242	<tr bgcolor="#a2c511"> <th valign="top">Element <th valign="top">Meaning
243	<tr bgcolor="#f0f0f0"> <td valign="top"><b>c</b>
244	<td valign="top">Any character represents itself unless it has a special
245	regexp meaning. Thus <b>c</b> matches the character <em>c</em>.
246	<tr bgcolor="#d0d0d0"> <td valign="top"><b>\c</b>
247	<td valign="top">A character that follows a backslash matches the character
248	itself except where mentioned below. For example if you
249	wished to match a literal caret at the beginning of a string
250	you would write <b>\^</b>.
251	<tr bgcolor="#f0f0f0"> <td valign="top"><b>\a</b>
252	<td valign="top">This matches the ASCII bell character (BEL, 0x07).
253	<tr bgcolor="#d0d0d0"> <td valign="top"><b>\f</b>
254	<td valign="top">This matches the ASCII form feed character (FF, 0x0C).
255	<tr bgcolor="#f0f0f0"> <td valign="top"><b>\n</b>
256	<td valign="top">This matches the ASCII line feed character (LF, 0x0A, Unix newline).
257	<tr bgcolor="#d0d0d0"> <td valign="top"><b>\r</b>
258	<td valign="top">This matches the ASCII carriage return character (CR, 0x0D).
259	<tr bgcolor="#f0f0f0"> <td valign="top"><b>\t</b>
260	<td valign="top">This matches the ASCII horizontal tab character (HT, 0x09).
261	<tr bgcolor="#d0d0d0"> <td valign="top"><b>\v</b>
262	<td valign="top">This matches the ASCII vertical tab character (VT, 0x0B).
263	<tr bgcolor="#f0f0f0"> <td valign="top"><b>\xhhhh</b>
264	<td valign="top">This matches the Unicode character corresponding to the
265	hexadecimal number hhhh (between 0x0000 and 0xFFFF). \0ooo
266	(i.e., \zero ooo) matches the ASCII/Latin-1 character
267	corresponding to the octal number ooo (between 0 and 0377).
268	<tr bgcolor="#d0d0d0"> <td valign="top"><b>. (dot)</b>
269	<td valign="top">This matches any character (including newline).
270	<tr bgcolor="#f0f0f0"> <td valign="top"><b>\d</b>
271	<td valign="top">This matches a digit (<a href="qchar.html#isDigit">QChar::isDigit</a>()).
272	<tr bgcolor="#d0d0d0"> <td valign="top"><b>\D</b>
273	<td valign="top">This matches a non-digit.
274	<tr bgcolor="#f0f0f0"> <td valign="top"><b>\s</b>
275	<td valign="top">This matches a whitespace (<a href="qchar.html#isSpace">QChar::isSpace</a>()).
276	<tr bgcolor="#d0d0d0"> <td valign="top"><b>\S</b>
277	<td valign="top">This matches a non-whitespace.
278	<tr bgcolor="#f0f0f0"> <td valign="top"><b>\w</b>
279	<td valign="top">This matches a word character (<a href="qchar.html#isLetterOrNumber">QChar::isLetterOrNumber</a>() or '_').
280	<tr bgcolor="#d0d0d0"> <td valign="top"><b>\W</b>
281	<td valign="top">This matches a non-word character.
282	<tr bgcolor="#f0f0f0"> <td valign="top"><b>\n</b>
283	<td valign="top">The n-th <a href="#capturing-text">backreference</a>,
284	e.g. \1, \2, etc.
285	</table></center>
286	<p> <em>Note that the C++ compiler transforms backslashes in strings so to include a <b>\</b> in a regexp you will need to enter it twice, i.e. <b>\\</b>.</em>
287	<p> <a name="sets-of-characters"></a>
288	<h3> Sets of Characters
289	</h3>
290	<a name="1-2"></a><p> Square brackets are used to match any character in the set of
291	characters contained within the square brackets. All the character
292	set abbreviations described above can be used within square
293	brackets. Apart from the character set abbreviations and the
294	following two exceptions no characters have special meanings in
295	square brackets.
296	<p> <center><table cellpadding="4" cellspacing="2" border="0">
297	<tr bgcolor="#d0d0d0"> <td valign="top"><b>^</b>
298	<td valign="top">The caret negates the character set if it occurs as the
299	first character, i.e. immediately after the opening square
300	bracket. For example, <b>[abc]</b> matches 'a' or 'b' or 'c',
301	but <b>[^abc]</b> matches anything <em>except</em> 'a' or 'b' or
302	'c'.
303	<tr bgcolor="#f0f0f0"> <td valign="top"><b>-</b>
304	<td valign="top">The dash is used to indicate a range of characters, for
305	example <b>[W-Z]</b> matches 'W' or 'X' or 'Y' or 'Z'.
306	</table></center>
307	<p> Using the predefined character set abbreviations is more portable
308	than using character ranges across platforms and languages. For
309	example, <b>[0-9]</b> matches a digit in Western alphabets but
310	<b>\d</b> matches a digit in <em>any</em> alphabet.
311	<p> Note that in most regexp literature sets of characters are called
312	"character classes".
313	<p> <a name="quantifiers"></a>
314	<h3> Quantifiers
315	</h3>
316	<a name="1-3"></a><p> By default an expression is automatically quantified by
317	<b>{1,1}</b>, i.e. it should occur exactly once. In the following
318	list <b><em>E</em></b> stands for any expression. An expression is a
319	character or an abbreviation for a set of characters or a set of
320	characters in square brackets or any parenthesised expression.
321	<p> <center><table cellpadding="4" cellspacing="2" border="0">
322	<tr bgcolor="#d0d0d0"> <td valign="top"><b><em>E</em>?</b>
323	<td valign="top">Matches zero or one occurrence of <em>E</em>. This quantifier
324	means "the previous expression is optional" since it will
325	match whether or not the expression occurs in the string. It
326	is the same as <b><em>E</em>{0,1}</b>. For example <b>dents?</b>
327	will match 'dent' and 'dents'.
328	<tr bgcolor="#f0f0f0"> <td valign="top"><b><em>E</em>+</b>
329	<td valign="top">Matches one or more occurrences of <em>E</em>. This is the same
330	as <b><em>E</em>{1,MAXINT}</b>. For example, <b>0+</b> will match
331	'0', '00', '000', etc.
332	<tr bgcolor="#d0d0d0"> <td valign="top"><b><em>E</em>*</b>
333	<td valign="top">Matches zero or more occurrences of <em>E</em>. This is the same
334	as <b><em>E</em>{0,MAXINT}</b>. The <b>*</b> quantifier is often
335	used by a mistake. Since it matches <em>zero</em> or more
336	occurrences it will match no occurrences at all. For example
337	if we want to match strings that end in whitespace and use
338	the regexp <b>\s*$</b> we would get a match on every string.
339	This is because we have said find zero or more whitespace
340	followed by the end of string, so even strings that don't end
341	in whitespace will match. The regexp we want in this case is
342	<b>\s+$</b> to match strings that have at least one
343	whitespace at the end.
344	<tr bgcolor="#f0f0f0"> <td valign="top"><b><em>E</em>{n}</b>
345	<td valign="top">Matches exactly <em>n</em> occurrences of the expression. This
346	is the same as repeating the expression <em>n</em> times. For
347	example, <b>x{5}</b> is the same as <b>xxxxx</b>. It is also
348	the same as <b><em>E</em>{n,n}</b>, e.g. <b>x{5,5}</b>.
349	<tr bgcolor="#d0d0d0"> <td valign="top"><b><em>E</em>{n,}</b>
350	<td valign="top">Matches at least <em>n</em> occurrences of the expression. This
351	is the same as <b><em>E</em>{n,MAXINT}</b>.
352	<tr bgcolor="#f0f0f0"> <td valign="top"><b><em>E</em>{,m}</b>
353	<td valign="top">Matches at most <em>m</em> occurrences of the expression. This
354	is the same as <b><em>E</em>{0,m}</b>.
355	<tr bgcolor="#d0d0d0"> <td valign="top"><b><em>E</em>{n,m}</b>
356	<td valign="top">Matches at least <em>n</em> occurrences of the expression and at
357	most <em>m</em> occurrences of the expression.
358	</table></center>
359	<p> (MAXINT is implementation dependent but will not be smaller than
360	1024.)
361	<p> If we wish to apply a quantifier to more than just the preceding
362	character we can use parentheses to group characters together in
363	an expression. For example, <b>tag+</b> matches a 't' followed by
364	an 'a' followed by at least one 'g', whereas <b>(tag)+</b> matches
365	at least one occurrence of 'tag'.
366	<p> Note that quantifiers are "greedy". They will match as much text
367	as they can. For example, <b>0+</b> will match as many zeros as it
368	can from the first zero it finds, e.g. '2.<u>000</u>5'.
369	Quantifiers can be made non-greedy, see <a href="#setMinimal">setMinimal</a>().
370	<p> <a name="capturing-text"></a>
371	<h3> Capturing Text
372	</h3>
373	<a name="1-4"></a><p> Parentheses allow us to group elements together so that we can
374	quantify and capture them. For example if we have the expression
375	<b>mail\|letter\|correspondence</b> that matches a string we know
376	that <em>one</em> of the words matched but not which one. Using
377	parentheses allows us to "capture" whatever is matched within
378	their bounds, so if we used <b>(mail\|letter\|correspondence)</b>
379	and matched this regexp against the string "I sent you some email"
380	we can use the <a href="#cap">cap</a>() or <a href="#capturedTexts">capturedTexts</a>() functions to extract the
381	matched characters, in this case 'mail'.
382	<p> We can use captured text within the regexp itself. To refer to the
383	captured text we use <em>backreferences</em> which are indexed from 1,
384	the same as for cap(). For example we could search for duplicate
385	words in a string using <b>\b(\w+)\W+\1\b</b> which means match a
386	word boundary followed by one or more word characters followed by
387	one or more non-word characters followed by the same text as the
388	first parenthesised expression followed by a word boundary.
389	<p> If we want to use parentheses purely for grouping and not for
390	capturing we can use the non-capturing syntax, e.g.
391	<b>(?:green\|blue)</b>. Non-capturing parentheses begin '(?:' and
392	end ')'. In this example we match either 'green' or 'blue' but we
393	do not capture the match so we only know whether or not we matched
394	but not which color we actually found. Using non-capturing
395	parentheses is more efficient than using capturing parentheses
396	since the regexp engine has to do less book-keeping.
397	<p> Both capturing and non-capturing parentheses may be nested.
398	<p> <a name="assertions"></a>
399	<h3> Assertions
400	</h3>
401	<a name="1-5"></a><p> Assertions make some statement about the text at the point where
402	they occur in the regexp but they do not match any characters. In
403	the following list <b><em>E</em></b> stands for any expression.
404	<p> <center><table cellpadding="4" cellspacing="2" border="0">
405	<tr bgcolor="#f0f0f0"> <td valign="top"><b>^</b>
406	<td valign="top">The caret signifies the beginning of the string. If you
407	wish to match a literal <tt>^</tt> you must escape it by
408	writing <b>\^</b>. For example, <b>^#include</b> will only
409	match strings which <em>begin</em> with the characters '#include'.
410	(When the caret is the first character of a character set it
411	has a special meaning, see <a href="#sets-of-characters">Sets of
412	Characters</a>.)
413	<tr bgcolor="#d0d0d0"> <td valign="top"><b>$</b>
414	<td valign="top">The dollar signifies the end of the string. For example
415	<b>\d\s*$</b> will match strings which end with a digit
416	optionally followed by whitespace. If you wish to match a
417	literal <tt>$</tt> you must escape it by writing
418	<b>\$</b>.
419	<tr bgcolor="#f0f0f0"> <td valign="top"><b>\b</b>
420	<td valign="top">A word boundary. For example the regexp
421	<b>\bOK\b</b> means match immediately after a word
422	boundary (e.g. start of string or whitespace) the letter 'O'
423	then the letter 'K' immediately before another word boundary
424	(e.g. end of string or whitespace). But note that the
425	assertion does not actually match any whitespace so if we
426	write <b>(\bOK\b)</b> and we have a match it will only
427	contain 'OK' even if the string is "Its <u>OK</u> now".
428	<tr bgcolor="#d0d0d0"> <td valign="top"><b>\B</b>
429	<td valign="top">A non-word boundary. This assertion is true wherever
430	<b>\b</b> is false. For example if we searched for
431	<b>\Bon\B</b> in "Left on" the match would fail (space
432	and end of string aren't non-word boundaries), but it would
433	match in "t<u>on</u>ne".
434	<tr bgcolor="#f0f0f0"> <td valign="top"><b>(?=<em>E</em>)</b>
435	<td valign="top">Positive lookahead. This assertion is true if the
436	expression matches at this point in the regexp. For example,
437	<b>const(?=\s+char)</b> matches 'const' whenever it is
438	followed by 'char', as in 'static <u>const</u> char *'.
439	(Compare with <b>const\s+char</b>, which matches 'static
440	<u>const char</u> *'.)
441	<tr bgcolor="#d0d0d0"> <td valign="top"><b>(?!<em>E</em>)</b>
442	<td valign="top">Negative lookahead. This assertion is true if the
443	expression does not match at this point in the regexp. For
444	example, <b>const(?!\s+char)</b> matches 'const' <em>except</em>
445	when it is followed by 'char'.
446	</table></center>
447	<p> <a name="wildcard-matching"></a>
448	<h3> Wildcard Matching (globbing)
449	</h3>
450	<a name="1-6"></a><p> Most command shells such as <em>bash</em> or <em>cmd.exe</em> support "file
451	globbing", the ability to identify a group of files by using
452	wildcards. The <a href="#setWildcard">setWildcard</a>() function is used to switch between
453	regexp and wildcard mode. Wildcard matching is much simpler than
454	full regexps and has only four features:
455	<p> <center><table cellpadding="4" cellspacing="2" border="0">
456	<tr bgcolor="#f0f0f0"> <td valign="top"><b>c</b>
457	<td valign="top">Any character represents itself apart from those mentioned
458	below. Thus <b>c</b> matches the character <em>c</em>.
459	<tr bgcolor="#d0d0d0"> <td valign="top"><b>?</b>
460	<td valign="top">This matches any single character. It is the same as
461	<b>.</b> in full regexps.
462	<tr bgcolor="#f0f0f0"> <td valign="top"><b>*</b>
463	<td valign="top">This matches zero or more of any characters. It is the
464	same as <b>.*</b> in full regexps.
465	<tr bgcolor="#d0d0d0"> <td valign="top"><b>[...]</b>
466	<td valign="top">Sets of characters can be represented in square brackets,
467	similar to full regexps. Within the character class, like
468	outside, backslash has no special meaning.
469	</table></center>
470	<p> For example if we are in wildcard mode and have strings which
471	contain filenames we could identify HTML files with <b>*.html</b>.
472	This will match zero or more characters followed by a dot followed
473	by 'h', 't', 'm' and 'l'.
474	<p> <a name="perl-users"></a>
475	<h3> Notes for Perl Users
476	</h3>
477	<a name="1-7"></a><p> Most of the character class abbreviations supported by Perl are
478	supported by QRegExp, see <a href="#characters-and-abbreviations-for-sets-of-characters">characters
479	and abbreviations for sets of characters</a>.
480	<p> In QRegExp, apart from within character classes, <tt>^</tt> always
481	signifies the start of the string, so carets must always be
482	escaped unless used for that purpose. In Perl the meaning of caret
483	varies automagically depending on where it occurs so escaping it
484	is rarely necessary. The same applies to <tt>$</tt> which in
485	QRegExp always signifies the end of the string.
486	<p> QRegExp's quantifiers are the same as Perl's greedy quantifiers.
487	Non-greedy matching cannot be applied to individual quantifiers,
488	but can be applied to all the quantifiers in the pattern. For
489	example, to match the Perl regexp <b>ro+?m</b> requires:
490	<pre>
491	QRegExp rx( "ro+m" );
492	rx.<a href="#setMinimal">setMinimal</a>( TRUE );
493	</pre>
494
495	<p> The equivalent of Perl's <tt>/i</tt> option is
496	<a href="#setCaseSensitive">setCaseSensitive</a>(FALSE).
497	<p> Perl's <tt>/g</tt> option can be emulated using a <a href="#cap_in_a_loop">loop</a>.
498	<p> In QRegExp <b>.</b> matches any character, therefore all QRegExp
499	regexps have the equivalent of Perl's <tt>/s</tt> option. QRegExp
500	does not have an equivalent to Perl's <tt>/m</tt> option, but this
501	can be emulated in various ways for example by splitting the input
502	into lines or by looping with a regexp that searches for newlines.
503	<p> Because QRegExp is string oriented there are no \A, \Z or \z
504	assertions. The \G assertion is not supported but can be emulated
505	in a loop.
506	<p> Perl's $& is <a href="#cap">cap</a>(0) or <a href="#capturedTexts">capturedTexts</a>()[0]. There are no QRegExp
507	equivalents for $`, $' or $+. Perl's capturing variables, $1, $2,
508	... correspond to cap(1) or capturedTexts()[1], cap(2) or
509	capturedTexts()[2], etc.
510	<p> To substitute a pattern use <a href="qstring.html#replace">QString::replace</a>().
511	<p> Perl's extended <tt>/x</tt> syntax is not supported, nor are
512	directives, e.g. (?i), or regexp comments, e.g. (?#comment). On
513	the other hand, C++'s rules for literal strings can be used to
514	achieve the same:
515	<pre>
516	QRegExp mark( "\\b" // word boundary
517	"[Mm]ark" // the word we want to match
518	);
519	</pre>
520
521	<p> Both zero-width positive and zero-width negative lookahead
522	assertions (?=pattern) and (?!pattern) are supported with the same
523	syntax as Perl. Perl's lookbehind assertions, "independent"
524	subexpressions and conditional expressions are not supported.
525	<p> Non-capturing parentheses are also supported, with the same
526	(?:pattern) syntax.
527	<p> See <a href="qstringlist.html#split">QStringList::split</a>() and <a href="qstringlist.html#join">QStringList::join</a>() for equivalents
528	to Perl's split and join functions.
529	<p> Note: because C++ transforms \'s they must be written <em>twice</em> in
530	code, e.g. <b>\b</b> must be written <b>\\b</b>.
531	<p> <a name="code-examples"></a>
532	<h3> Code Examples
533	</h3>
534	<a name="1-8"></a><p> <pre>
535	QRegExp rx( "^\\d\\d?$" ); // match integers 0 to 99
536	rx.<a href="#search">search</a>( "123" ); // returns -1 (no match)
537	rx.<a href="#search">search</a>( "-6" ); // returns -1 (no match)
538	rx.<a href="#search">search</a>( "6" ); // returns 0 (matched as position 0)
539	</pre>
540
541	<p> The third string matches '<u>6</u>'. This is a simple validation
542	regexp for integers in the range 0 to 99.
543	<p> <pre>
544	QRegExp rx( "^\\S+$" ); // match strings without whitespace
545	rx.<a href="#search">search</a>( "Hello world" ); // returns -1 (no match)
546	rx.<a href="#search">search</a>( "This_is-OK" ); // returns 0 (matched at position 0)
547	</pre>
548
549	<p> The second string matches '<u>This_is-OK</u>'. We've used the
550	character set abbreviation '\S' (non-whitespace) and the anchors
551	to match strings which contain no whitespace.
552	<p> In the following example we match strings containing 'mail' or
553	'letter' or 'correspondence' but only match whole words i.e. not
554	'email'
555	<p> <pre>
556	QRegExp rx( "\\b(mail\|letter\|correspondence)\\b" );
557	rx.<a href="#search">search</a>( "I sent you an email" ); // returns -1 (no match)
558	rx.<a href="#search">search</a>( "Please write the letter" ); // returns 17
559	</pre>
560
561	<p> The second string matches "Please write the <u>letter</u>". The
562	word 'letter' is also captured (because of the parentheses). We
563	can see what text we've captured like this:
564	<p> <pre>
565	<a href="qstring.html">QString</a> captured = rx.cap( 1 ); // captured == "letter"
566	</pre>
567
568	<p> This will capture the text from the first set of capturing
569	parentheses (counting capturing left parentheses from left to
570	right). The parentheses are counted from 1 since <a href="#cap">cap</a>( 0 ) is the
571	whole matched regexp (equivalent to '&' in most regexp engines).
572	<p> <pre>
573	QRegExp rx( "&(?!amp;)" ); // match ampersands but not &amp;
574	<a href="qstring.html">QString</a> line1 = "This & that";
575	line1.<a href="qstring.html#replace">replace</a>( rx, "&amp;" );
576	// line1 == "This &amp; that"
577	<a href="qstring.html">QString</a> line2 = "His &amp; hers & theirs";
578	line2.<a href="qstring.html#replace">replace</a>( rx, "&amp;" );
579	// line2 == "His &amp; hers &amp; theirs"
580	</pre>
581
582	<p> Here we've passed the QRegExp to <a href="qstring.html">QString</a>'s replace() function to
583	replace the matched text with new text.
584	<p> <pre>
585	<a href="qstring.html">QString</a> str = "One Eric another Eirik, and an Ericsson."
586	" How many Eiriks, Eric?";
587	QRegExp rx( "\\b(Eric\|Eirik)\\b" ); // match Eric or Eirik
588	int pos = 0; // where we are in the string
589	int count = 0; // how many Eric and Eirik's we've counted
590	while ( pos >= 0 ) {
591	pos = rx.<a href="#search">search</a>( str, pos );
592	if ( pos >= 0 ) {
593	pos++; // move along in str
594	count++; // count our Eric or Eirik
595	}
596	}
597	</pre>
598
599	<p> We've used the <a href="#search">search</a>() function to repeatedly match the regexp in
600	the string. Note that instead of moving forward by one character
601	at a time <tt>pos++</tt> we could have written <tt>pos += rx.matchedLength()</tt> to skip over the already matched string. The
602	count will equal 3, matching 'One <u>Eric</u> another
603	<u>Eirik</u>, and an Ericsson. How many Eiriks, <u>Eric</u>?'; it
604	doesn't match 'Ericsson' or 'Eiriks' because they are not bounded
605	by non-word boundaries.
606	<p> One common use of regexps is to split lines of delimited data into
607	their component fields.
608	<p> <pre>
609	str = "Trolltech AS\twww.trolltech.com\tNorway";
610	<a href="qstring.html">QString</a> company, web, country;
611	rx.setPattern( "^([^\t]+)\t([^\t]+)\t([^\t]+)$" );
612	if ( rx.search( str ) != -1 ) {
613	company = rx.cap( 1 );
614	web = rx.cap( 2 );
615	country = rx.cap( 3 );
616	}
617	</pre>
618
619	<p> In this example our input lines have the format company name, web
620	address and country. Unfortunately the regexp is rather long and
621	not very versatile -- the code will break if we add any more
622	fields. A simpler and better solution is to look for the
623	separator, '\t' in this case, and take the surrounding text. The
624	<a href="qstringlist.html">QStringList</a> split() function can take a separator string or regexp
625	as an argument and split a string accordingly.
626	<p> <pre>
627	<a href="qstringlist.html">QStringList</a> field = QStringList::<a href="qstringlist.html#split">split</a>( "\t", str );
628	</pre>
629
630	<p> Here field[0] is the company, field[1] the web address and so on.
631	<p> To imitate the matching of a shell we can use wildcard mode.
632	<p> <pre>
633	QRegExp rx( ".html" ); // invalid regexp: doesn't quantify anything
634	rx.<a href="#setWildcard">setWildcard</a>( TRUE ); // now it's a valid wildcard regexp
635	rx.<a href="#exactMatch">exactMatch</a>( "index.html" ); // returns TRUE
636	rx.<a href="#exactMatch">exactMatch</a>( "default.htm" ); // returns FALSE
637	rx.<a href="#exactMatch">exactMatch</a>( "readme.txt" ); // returns FALSE
638	</pre>
639
640	<p> Wildcard matching can be convenient because of its simplicity, but
641	any wildcard regexp can be defined using full regexps, e.g.
642	<b>.\.html$</b>. Notice that we can't match both <tt>.html</tt> and <tt>.htm</tt> files with a wildcard unless we use <b>.htm*</b> which will
643	also match 'test.html.bak'. A full regexp gives us the precision
644	we need, <b>.*\.html?$</b>.
645	<p> QRegExp can match case insensitively using <a href="#setCaseSensitive">setCaseSensitive</a>(), and
646	can use non-greedy matching, see <a href="#setMinimal">setMinimal</a>(). By default QRegExp
647	uses full regexps but this can be changed with <a href="#setWildcard">setWildcard</a>().
648	Searching can be forward with <a href="#search">search</a>() or backward with
649	<a href="#searchRev">searchRev</a>(). Captured text can be accessed using <a href="#capturedTexts">capturedTexts</a>()
650	which returns a string list of all captured strings, or using
651	<a href="#cap">cap</a>() which returns the captured string for the given index. The
652	<a href="#pos">pos</a>() function takes a match index and returns the position in the
653	string where the match was made (or -1 if there was no match).
654	<p> <p>See also <a href="qregexpvalidator.html">QRegExpValidator</a>, <a href="qstring.html">QString</a>, <a href="qstringlist.html">QStringList</a>, <a href="misc.html">Miscellaneous Classes</a>, <a href="shared.html">Implicitly and Explicitly Shared Classes</a>, and <a href="tools.html">Non-GUI Classes</a>.
655
656	<p> <a name="member-function-documentation"></a>
657
658	<hr><h2>Member Type Documentation</h2>
659	<h3 class=fn><a name="CaretMode-enum"></a>QRegExp::CaretMode</h3>
660
661	<p> The CaretMode enum defines the different meanings of the caret
662	(<b>^</b>) in a <a href="qregexp.html#regular-expression">regular expression</a>. The possible values are:
663	<ul>
664	<li><tt>QRegExp::CaretAtZero</tt> -
665	The caret corresponds to index 0 in the searched string.
666	<li><tt>QRegExp::CaretAtOffset</tt> -
667	The caret corresponds to the start offset of the search.
668	<li><tt>QRegExp::CaretWontMatch</tt> -
669	The caret never matches.
670	</ul>
671	<hr><h2>Member Function Documentation</h2>
672	<h3 class=fn><a name="QRegExp"></a>QRegExp::QRegExp ()
673	</h3>
674	Constructs an empty regexp.
675	<p> <p>See also <a href="#isValid">isValid</a>() and <a href="#errorString">errorString</a>().
676
677	<h3 class=fn><a name="QRegExp-2"></a>QRegExp::QRegExp ( const <a href="qstring.html">QString</a> & pattern, bool caseSensitive = TRUE, bool wildcard = FALSE )
678	</h3>
679	Constructs a <a href="qregexp.html#regular-expression">regular expression</a> object for the given <em>pattern</em>
680	string. The pattern must be given using wildcard notation if <em>wildcard</em> is TRUE (default is FALSE). The pattern is case
681	sensitive, unless <em>caseSensitive</em> is FALSE. Matching is greedy
682	(maximal), but can be changed by calling <a href="#setMinimal">setMinimal</a>().
683	<p> <p>See also <a href="#setPattern">setPattern</a>(), <a href="#setCaseSensitive">setCaseSensitive</a>(), <a href="#setWildcard">setWildcard</a>(), and <a href="#setMinimal">setMinimal</a>().
684
685	<h3 class=fn><a name="QRegExp-3"></a>QRegExp::QRegExp ( const <a href="qregexp.html">QRegExp</a> & rx )
686	</h3>
687	Constructs a <a href="qregexp.html#regular-expression">regular expression</a> as a copy of <em>rx</em>.
688	<p> <p>See also <a href="#operator-eq">operator=</a>().
689
690	<h3 class=fn><a name="~QRegExp"></a>QRegExp::~QRegExp ()
691	</h3>
692	Destroys the <a href="qregexp.html#regular-expression">regular expression</a> and cleans up its internal data.
693
694	<h3 class=fn><a href="qstring.html">QString</a> <a name="cap"></a>QRegExp::cap ( int nth = 0 )
695	</h3>
696	Returns the text captured by the <em>nth</em> subexpression. The entire
697	match has index 0 and the parenthesized subexpressions have
698	indices starting from 1 (excluding non-capturing parentheses).
699	<p> <pre>
700	QRegExp rxlen( "(\\d+)(?:\\s*)(cm\|inch)" );
701	int pos = rxlen.<a href="#search">search</a>( "Length: 189cm" );
702	if ( pos > -1 ) {
703	<a href="qstring.html">QString</a> value = rxlen.<a href="#cap">cap</a>( 1 ); // "189"
704	<a href="qstring.html">QString</a> unit = rxlen.<a href="#cap">cap</a>( 2 ); // "cm"
705	// ...
706	}
707	</pre>
708
709	<p> The order of elements matched by <a href="#cap">cap</a>() is as follows. The first
710	element, cap(0), is the entire matching string. Each subsequent
711	element corresponds to the next capturing open left parentheses.
712	Thus cap(1) is the text of the first capturing parentheses, cap(2)
713	is the text of the second, and so on.
714	<p> <a name="cap_in_a_loop"></a>
715	Some patterns may lead to a number of matches which cannot be
716	determined in advance, for example:
717	<p> <pre>
718	QRegExp rx( "(\\d+)" );
719	str = "Offsets: 12 14 99 231 7";
720	<a href="qstringlist.html">QStringList</a> list;
721	pos = 0;
722	while ( pos >= 0 ) {
723	pos = rx.<a href="#search">search</a>( str, pos );
724	if ( pos > -1 ) {
725	list += rx.<a href="#cap">cap</a>( 1 );
726	pos += rx.<a href="#matchedLength">matchedLength</a>();
727	}
728	}
729	// list contains "12", "14", "99", "231", "7"
730	</pre>
731
732	<p> <p>See also <a href="#capturedTexts">capturedTexts</a>(), <a href="#pos">pos</a>(), <a href="#exactMatch">exactMatch</a>(), <a href="#search">search</a>(), and <a href="#searchRev">searchRev</a>().
733
734	<p>Examples: <a href="archivesearch-example.html#x479">network/archivesearch/archivedialog.ui.h</a> and <a href="regexptester-example.html#x2485">regexptester/regexptester.cpp</a>.
735	<h3 class=fn><a href="qstringlist.html">QStringList</a> <a name="capturedTexts"></a>QRegExp::capturedTexts ()
736	</h3>
737	Returns a list of the captured text strings.
738	<p> The first string in the list is the entire matched string. Each
739	subsequent list element contains a string that matched a
740	(capturing) subexpression of the regexp.
741	<p> For example:
742	<pre>
743	QRegExp rx( "(\\d+)(\\s*)(cm\|inch(es)?)" );
744	int pos = rx.<a href="#search">search</a>( "Length: 36 inches" );
745	<a href="qstringlist.html">QStringList</a> list = rx.<a href="#capturedTexts">capturedTexts</a>();
746	// list is now ( "36 inches", "36", " ", "inches", "es" )
747	</pre>
748
749	<p> The above example also captures elements that may be present but
750	which we have no interest in. This problem can be solved by using
751	non-capturing parentheses:
752	<p> <pre>
753	QRegExp rx( "(\\d+)(?:\\s*)(cm\|inch(?:es)?)" );
754	int pos = rx.<a href="#search">search</a>( "Length: 36 inches" );
755	<a href="qstringlist.html">QStringList</a> list = rx.<a href="#capturedTexts">capturedTexts</a>();
756	// list is now ( "36 inches", "36", "inches" )
757	</pre>
758
759	<p> Note that if you want to iterate over the list, you should iterate
760	over a copy, e.g.
761	<pre>
762	<a href="qstringlist.html">QStringList</a> list = rx.capturedTexts();
763	QStringList::Iterator it = list.<a href="qvaluelist.html#begin">begin</a>();
764	while( it != list.<a href="qvaluelist.html#end">end</a>() ) {
765	myProcessing( *it );
766	++it;
767	}
768	</pre>
769
770	<p> Some regexps can match an indeterminate number of times. For
771	example if the input string is "Offsets: 12 14 99 231 7" and the
772	regexp, <tt>rx</tt>, is <b>(\d+)+</b>, we would hope to get a list of
773	all the numbers matched. However, after calling
774	<tt>rx.search(str)</tt>, <a href="#capturedTexts">capturedTexts</a>() will return the list ( "12",
775	"12" ), i.e. the entire match was "12" and the first subexpression
776	matched was "12". The correct approach is to use <a href="#cap">cap</a>() in a <a href="#cap_in_a_loop">loop</a>.
777	<p> The order of elements in the string list is as follows. The first
778	element is the entire matching string. Each subsequent element
779	corresponds to the next capturing open left parentheses. Thus
780	capturedTexts()[1] is the text of the first capturing parentheses,
781	capturedTexts()[2] is the text of the second and so on
782	(corresponding to $1, $2, etc., in some other regexp languages).
783	<p> <p>See also <a href="#cap">cap</a>(), <a href="#pos">pos</a>(), <a href="#exactMatch">exactMatch</a>(), <a href="#search">search</a>(), and <a href="#searchRev">searchRev</a>().
784
785	<h3 class=fn>bool <a name="caseSensitive"></a>QRegExp::caseSensitive () const
786	</h3>
787	Returns TRUE if case sensitivity is enabled; otherwise returns
788	FALSE. The default is TRUE.
789	<p> <p>See also <a href="#setCaseSensitive">setCaseSensitive</a>().
790
791	<h3 class=fn><a href="qstring.html">QString</a> <a name="errorString"></a>QRegExp::errorString ()
792	</h3>
793	Returns a text string that explains why a regexp pattern is
794	invalid the case being; otherwise returns "no error occurred".
795	<p> <p>See also <a href="#isValid">isValid</a>().
796
797	<p>Example: <a href="regexptester-example.html#x2486">regexptester/regexptester.cpp</a>.
798	<h3 class=fn><a href="qstring.html">QString</a> <a name="escape"></a>QRegExp::escape ( const <a href="qstring.html">QString</a> & str )<tt> [static]</tt>
799	</h3>
800	Returns the string <em>str</em> with every regexp special character
801	escaped with a backslash. The special characters are $, (, ), *, +,
802	., ?, [, \, ], ^, {, \| and }.
803	<p> Example:
804	<pre>
805	s1 = QRegExp::<a href="#escape">escape</a>( "bingo" ); // s1 == "bingo"
806	s2 = QRegExp::<a href="#escape">escape</a>( "f(x)" ); // s2 == "f\$x\$"
807	</pre>
808
809	<p> This function is useful to construct regexp patterns dynamically:
810	<p> <pre>
811	QRegExp rx( "(" + QRegExp::escape(name) +
812	"\|" + QRegExp::escape(alias) + ")" );
813	</pre>
814
815
816	<h3 class=fn>bool <a name="exactMatch"></a>QRegExp::exactMatch ( const <a href="qstring.html">QString</a> & str ) const
817	</h3>
818	Returns TRUE if <em>str</em> is matched exactly by this <a href="qregexp.html#regular-expression">regular expression</a>; otherwise returns FALSE. You can determine how much of
819	the string was matched by calling <a href="#matchedLength">matchedLength</a>().
820	<p> For a given regexp string, R, <a href="#exactMatch">exactMatch</a>("R") is the equivalent of
821	<a href="#search">search</a>("^R$") since exactMatch() effectively encloses the regexp
822	in the start of string and end of string anchors, except that it
823	sets matchedLength() differently.
824	<p> For example, if the regular expression is <b>blue</b>, then
825	exactMatch() returns TRUE only for input <tt>blue</tt>. For inputs <tt>bluebell</tt>, <tt>blutak</tt> and <tt>lightblue</tt>, exactMatch() returns FALSE
826	and matchedLength() will return 4, 3 and 0 respectively.
827	<p> Although const, this function sets matchedLength(),
828	<a href="#capturedTexts">capturedTexts</a>() and <a href="#pos">pos</a>().
829	<p> <p>See also <a href="#search">search</a>(), <a href="#searchRev">searchRev</a>(), and <a href="qregexpvalidator.html">QRegExpValidator</a>.
830
831	<h3 class=fn>bool <a name="isEmpty"></a>QRegExp::isEmpty () const
832	</h3>
833	Returns TRUE if the pattern string is empty; otherwise returns
834	FALSE.
835	<p> If you call <a href="#exactMatch">exactMatch</a>() with an empty pattern on an empty string
836	it will return TRUE; otherwise it returns FALSE since it operates
837	over the whole string. If you call <a href="#search">search</a>() with an empty pattern
838	on <em>any</em> string it will return the start offset (0 by default)
839	because the empty pattern matches the 'emptiness' at the start of
840	the string. In this case the length of the match returned by
841	<a href="#matchedLength">matchedLength</a>() will be 0.
842	<p> See <a href="qstring.html#isEmpty">QString::isEmpty</a>().
843
844	<h3 class=fn>bool <a name="isValid"></a>QRegExp::isValid () const
845	</h3>
846	Returns TRUE if the <a href="qregexp.html#regular-expression">regular expression</a> is valid; otherwise returns
847	FALSE. An invalid regular expression never matches.
848	<p> The pattern <b>[a-z</b> is an example of an invalid pattern, since
849	it lacks a closing square bracket.
850	<p> Note that the validity of a regexp may also depend on the setting
851	of the wildcard flag, for example <b>*.html</b> is a valid
852	wildcard regexp but an invalid full regexp.
853	<p> <p>See also <a href="#errorString">errorString</a>().
854
855	<p>Example: <a href="regexptester-example.html#x2487">regexptester/regexptester.cpp</a>.
856	<h3 class=fn>int <a name="match"></a>QRegExp::match ( const <a href="qstring.html">QString</a> & str, int index = 0, int * len = 0, bool indexIsStart = TRUE ) const
857	</h3> <b>This function is obsolete.</b> It is provided to keep old source working. We strongly advise against using it in new code.
858	<p> Attempts to match in <em>str</em>, starting from position <em>index</em>.
859	Returns the position of the match, or -1 if there was no match.
860	<p> The length of the match is stored in <em>*len</em>, unless <em>len</em> is a
861	null pointer.
862	<p> If <em>indexIsStart</em> is TRUE (the default), the position <em>index</em> in
863	the string will match the start of string anchor, <b>^</b>, in the
864	regexp, if present. Otherwise, position 0 in <em>str</em> will match.
865	<p> Use <a href="#search">search</a>() and <a href="#matchedLength">matchedLength</a>() instead of this function.
866	<p> <p>See also <a href="qstring.html#mid">QString::mid</a>() and <a href="qconststring.html">QConstString</a>.
867
868	<p>Example: <a href="qmag-example.html#x1791">qmag/qmag.cpp</a>.
869	<h3 class=fn>int <a name="matchedLength"></a>QRegExp::matchedLength () const
870	</h3>
871	Returns the length of the last matched string, or -1 if there was
872	no match.
873	<p> <p>See also <a href="#exactMatch">exactMatch</a>(), <a href="#search">search</a>(), and <a href="#searchRev">searchRev</a>().
874
875	<p>Examples: <a href="archivesearch-example.html#x480">network/archivesearch/archivedialog.ui.h</a> and <a href="regexptester-example.html#x2488">regexptester/regexptester.cpp</a>.
876	<h3 class=fn>bool <a name="minimal"></a>QRegExp::minimal () const
877	</h3>
878	Returns TRUE if minimal (non-greedy) matching is enabled;
879	otherwise returns FALSE.
880	<p> <p>See also <a href="#setMinimal">setMinimal</a>().
881
882	<h3 class=fn>int <a name="numCaptures"></a>QRegExp::numCaptures () const
883	</h3>
884	Returns the number of captures contained in the <a href="qregexp.html#regular-expression">regular expression</a>.
885
886	<p>Example: <a href="regexptester-example.html#x2489">regexptester/regexptester.cpp</a>.
887	<h3 class=fn>bool <a name="operator!-eq"></a>QRegExp::operator!= ( const <a href="qregexp.html">QRegExp</a> & rx ) const
888	</h3>
889
890	<p> Returns TRUE if this <a href="qregexp.html#regular-expression">regular expression</a> is not equal to <em>rx</em>;
891	otherwise returns FALSE.
892	<p> <p>See also <a href="#operator-eq-eq">operator==</a>().
893
894	<h3 class=fn><a href="qregexp.html">QRegExp</a> & <a name="operator-eq"></a>QRegExp::operator= ( const <a href="qregexp.html">QRegExp</a> & rx )
895	</h3>
896	Copies the <a href="qregexp.html#regular-expression">regular expression</a> <em>rx</em> and returns a reference to the
897	copy. The case sensitivity, wildcard and minimal matching options
898	are also copied.
899
900	<h3 class=fn>bool <a name="operator-eq-eq"></a>QRegExp::operator== ( const <a href="qregexp.html">QRegExp</a> & rx ) const
901	</h3>
902	Returns TRUE if this <a href="qregexp.html#regular-expression">regular expression</a> is equal to <em>rx</em>;
903	otherwise returns FALSE.
904	<p> Two QRegExp objects are equal if they have the same pattern
905	strings and the same settings for case sensitivity, wildcard and
906	minimal matching.
907
908	<h3 class=fn><a href="qstring.html">QString</a> <a name="pattern"></a>QRegExp::pattern () const
909	</h3>
910	Returns the pattern string of the <a href="qregexp.html#regular-expression">regular expression</a>. The pattern
911	has either regular expression syntax or wildcard syntax, depending
912	on <a href="#wildcard">wildcard</a>().
913	<p> <p>See also <a href="#setPattern">setPattern</a>().
914
915	<h3 class=fn>int <a name="pos"></a>QRegExp::pos ( int nth = 0 )
916	</h3>
917	Returns the position of the <em>nth</em> captured text in the searched
918	string. If <em>nth</em> is 0 (the default), <a href="#pos">pos</a>() returns the position
919	of the whole match.
920	<p> Example:
921	<pre>
922	QRegExp rx( "/([a-z]+)/([a-z]+)" );
923	rx.<a href="#search">search</a>( "Output /dev/null" ); // returns 7 (position of /dev/null)
924	rx.<a href="#pos">pos</a>( 0 ); // returns 7 (position of /dev/null)
925	rx.<a href="#pos">pos</a>( 1 ); // returns 8 (position of dev)
926	rx.<a href="#pos">pos</a>( 2 ); // returns 12 (position of null)
927	</pre>
928
929	<p> For zero-length matches, pos() always returns -1. (For example, if
930	<a href="#cap">cap</a>(4) would return an empty string, pos(4) returns -1.) This is
931	due to an implementation tradeoff.
932	<p> <p>See also <a href="#capturedTexts">capturedTexts</a>(), <a href="#exactMatch">exactMatch</a>(), <a href="#search">search</a>(), and <a href="#searchRev">searchRev</a>().
933
934	<h3 class=fn>int <a name="search"></a>QRegExp::search ( const <a href="qstring.html">QString</a> & str, int offset = 0, <a href="qregexp.html#CaretMode-enum">CaretMode</a> caretMode = CaretAtZero ) const
935	</h3>
936	Attempts to find a match in <em>str</em> from position <em>offset</em> (0 by
937	default). If <em>offset</em> is -1, the search starts at the last
938	character; if -2, at the next to last character; etc.
939	<p> Returns the position of the first match, or -1 if there was no
940	match.
941	<p> The <em>caretMode</em> parameter can be used to instruct whether <b>^</b>
942	should match at index 0 or at <em>offset</em>.
943	<p> You might prefer to use <a href="qstring.html#find">QString::find</a>(), <a href="qstring.html#contains">QString::contains</a>() or
944	even <a href="qstringlist.html#grep">QStringList::grep</a>(). To replace matches use
945	<a href="qstring.html#replace">QString::replace</a>().
946	<p> Example:
947	<pre>
948	<a href="qstring.html">QString</a> str = "offsets: 1.23 .50 71.00 6.00";
949	QRegExp rx( "\\d*\\.\\d+" ); // primitive floating point matching
950	int count = 0;
951	int pos = 0;
952	while ( (pos = rx.<a href="#search">search</a>(str, pos)) != -1 ) {
953	count++;
954	pos += rx.<a href="#matchedLength">matchedLength</a>();
955	}
956	// pos will be 9, 14, 18 and finally 24; count will end up as 4
957	</pre>
958
959	<p> Although const, this function sets <a href="#matchedLength">matchedLength</a>(),
960	<a href="#capturedTexts">capturedTexts</a>() and <a href="#pos">pos</a>().
961	<p> <p>See also <a href="#searchRev">searchRev</a>() and <a href="#exactMatch">exactMatch</a>().
962
963	<p>Examples: <a href="archivesearch-example.html#x481">network/archivesearch/archivedialog.ui.h</a> and <a href="regexptester-example.html#x2490">regexptester/regexptester.cpp</a>.
964	<h3 class=fn>int <a name="searchRev"></a>QRegExp::searchRev ( const <a href="qstring.html">QString</a> & str, int offset = -1, <a href="qregexp.html#CaretMode-enum">CaretMode</a> caretMode = CaretAtZero ) const
965	</h3>
966	Attempts to find a match backwards in <em>str</em> from position <em>offset</em>. If <em>offset</em> is -1 (the default), the search starts at the
967	last character; if -2, at the next to last character; etc.
968	<p> Returns the position of the first match, or -1 if there was no
969	match.
970	<p> The <em>caretMode</em> parameter can be used to instruct whether <b>^</b>
971	should match at index 0 or at <em>offset</em>.
972	<p> Although const, this function sets <a href="#matchedLength">matchedLength</a>(),
973	<a href="#capturedTexts">capturedTexts</a>() and <a href="#pos">pos</a>().
974	<p> <b>Warning:</b> Searching backwards is much slower than searching
975	forwards.
976	<p> <p>See also <a href="#search">search</a>() and <a href="#exactMatch">exactMatch</a>().
977
978	<h3 class=fn>void <a name="setCaseSensitive"></a>QRegExp::setCaseSensitive ( bool sensitive )
979	</h3>
980	Sets case sensitive matching to <em>sensitive</em>.
981	<p> If <em>sensitive</em> is TRUE, <b>\.txt$</b> matches <tt>readme.txt</tt> but
982	not <tt>README.TXT</tt>.
983	<p> <p>See also <a href="#caseSensitive">caseSensitive</a>().
984
985	<p>Example: <a href="regexptester-example.html#x2491">regexptester/regexptester.cpp</a>.
986	<h3 class=fn>void <a name="setMinimal"></a>QRegExp::setMinimal ( bool minimal )
987	</h3>
988	Enables or disables minimal matching. If <em>minimal</em> is FALSE,
989	matching is greedy (maximal) which is the default.
990	<p> For example, suppose we have the input string "We must be
991	<b>bold</b>, very <b>bold</b>!" and the pattern
992	<b><b>.*</b></b>. With the default greedy (maximal) matching,
993	the match is "We must be <u><b>bold</b>, very
994	<b>bold</b></u>!". But with minimal (non-greedy) matching the
995	first match is: "We must be <u><b>bold</b></u>, very
996	<b>bold</b>!" and the second match is "We must be <b>bold</b>,
997	very <u><b>bold</b></u>!". In practice we might use the pattern
998	<b><b>[^<]+</b></b> instead, although this will still fail for
999	nested tags.
1000	<p> <p>See also <a href="#minimal">minimal</a>().
1001
1002	<p>Examples: <a href="archivesearch-example.html#x482">network/archivesearch/archivedialog.ui.h</a> and <a href="regexptester-example.html#x2492">regexptester/regexptester.cpp</a>.
1003	<h3 class=fn>void <a name="setPattern"></a>QRegExp::setPattern ( const <a href="qstring.html">QString</a> & pattern )
1004	</h3>
1005	Sets the pattern string to <em>pattern</em>. The case sensitivity,
1006	wildcard and minimal matching options are not changed.
1007	<p> <p>See also <a href="#pattern">pattern</a>().
1008
1009	<h3 class=fn>void <a name="setWildcard"></a>QRegExp::setWildcard ( bool wildcard )
1010	</h3>
1011	Sets the wildcard mode for the <a href="qregexp.html#regular-expression">regular expression</a>. The default is
1012	FALSE.
1013	<p> Setting <em>wildcard</em> to TRUE enables simple shell-like wildcard
1014	matching. (See <a href="#wildcard-matching">wildcard matching
1015	(globbing)</a>.)
1016	<p> For example, <b>r*.txt</b> matches the string <tt>readme.txt</tt> in
1017	wildcard mode, but does not match <tt>readme</tt>.
1018	<p> <p>See also <a href="#wildcard">wildcard</a>().
1019
1020	<p>Example: <a href="regexptester-example.html#x2493">regexptester/regexptester.cpp</a>.
1021	<h3 class=fn>bool <a name="wildcard"></a>QRegExp::wildcard () const
1022	</h3>
1023	Returns TRUE if wildcard mode is enabled; otherwise returns FALSE.
1024	The default is FALSE.
1025	<p> <p>See also <a href="#setWildcard">setWildcard</a>().
1026
1027	<!-- eof -->
1028	<hr><p>
1029	This file is part of the <a href="index.html">Qt toolkit</a>.
1030	Copyright © 1995-2007
1031	<a href="http://www.trolltech.com/">Trolltech</a>. All Rights Reserved.<p><address><hr><div align=center>
1032	<table width=100% cellspacing=0 border=0><tr>
1033	<td>Copyright © 2007
1034	<a href="troll.html">Trolltech</a><td align=center><a href="trademarks.html">Trademarks</a>
1035	<td align=right><div align=right>Qt 3.3.8</div>
1036	</table></div></address></body>
1037	</html>

Note: See TracBrowser for help on using the repository browser.

Context Navigation

source: trunk/doc/html/qregexp.html@ 203

Download in other formats: