rEFind
Description
Uses a regular expression (RE) to search a string for a pattern. The search is case sensitive.
Categories
Related
Syntax
REFind(reg_expression, string [, start, returnsubexpressions ] )
Attributes
| Attribute | Description | Required | Default |
|---|---|---|---|
| reg_expression | Regular expression for which to search. Case sensitive. | ||
| returnsubexpressions | Optional. Boolean. Whether to return substrings of reg_expression, in arrays named len and pos: True: if the regular expression is found, the first array element contains the length and position, respectively, of the first match. If the regular expression contains parentheses that group subexpressions, each subsequent array element contains the length and position, respectively, of the first occurrence of each group. If the regular expression is not found, the arrays each contain one element with the value 0. False: the function returns the position in the string where the match begins. Default. | ||
| start | Optional. A positive integer, or a variable that contains one. Position in the string at which to start search. The default value is 1. | ||
| string | A string, or a variable that contains one, in which to search. |
Returns
Depends
on the value of the returnsubexpressionsparameter:
If returnsubexpressions= "False":
The position in the string where the match begins
0, if the regular expression is not matched in the string
If returnsubexpressions = "True": a structure
that contains two arrays, len and pos.
The array elements are as follows:
If the regular
expression is found in the string, the first element of the len and pos arrays
contains the length and position, respectively, of the first match
of the entire regular expression.
If the regular expression contains parentheses that group
subexpressions, each subsequent array element contains the length
and position, respectively, of the first occurrence of each group.
If the regular expression is not found in the string, the
first element of the len and pos arrays
contains 0.
Usage
This function finds the first occurrence of a regular expression in a string. To find the second and subsequent instances of the expression or of subexpressions in it, you call this function more than once, each time with a different start position. To determine the next start position, use the returnsubexpressionsparameter, and add the value returned in the first element of the length array to the value in the first element of the position array.
Example
<h3>REFind Example</h3>
This example shows the use of the REFind function with and without the
<i>returnsubexpressions</i> parameter set to True.
If you do not use the <i>returnsubexpressions</i> parameter,
REFind returns the position of the first occurrence of a regular
expression in a string starting from the specified position.
Returns 0 if no occurrences are found.</p>
REFind("a+c+", "abcaaccdd"):
<cfoutput>#REFind("a+c+", "abcaaccdd")#</cfoutput></p>
REFind("a+c*", "abcaaccdd"):
<cfoutput>#REFind("a+c*", "abcaaccdd")#</cfoutput></p>
REFind("[[:upper:]]", "abcaacCDD"):
<cfoutput>#REFind("[[:upper:]]", "abcaacCDD")#</cfoutput></p>
REFind("[\?&]rep = ", "report.cfm?rep = 1234&u = 5"):
<cfoutput>#REFind("[\?&]rep = ", "report.cfm?rep = 1234&u = 5")#
</cfoutput>
</p>
<!--- Set startPos to one; returnMatchedSubexpressions = TRUE --->
<hr size = "2" color = "#0000A0">
If you use the <i>returnssubexpression</i> parameter, REFind returns the
position and length of the first occurrence of a regular expression
in a string starting from the specified position. The position and
length variables are stored in a structure. To access position and length
information, use the keys <i>pos</i> and <i>len</i>, respectively.</p>
<cfset teststring = "The cat in the hat hat came back!">
The string in which the function is to search is:
<cfoutput><b>#teststring#</b></cfoutput>.</p>
The first call to REFind to search this string is:
<b>REFind("[A-Za-z]+",testString,1,"TRUE")</b></p>
This function returns a structure that contains two arrays: pos and len.</p>
To create this structure you can use a CFSET statement, for example: </p>
<CFSET st = REFind("[[:alpha:]]",testString,1,"TRUE")>
<cfset st = REFind("[[:alpha:]]",testString,1,"TRUE")>
<cfoutput>
The number of elements in each array: #ArrayLen(st.pos)#.
</cfoutput></p>
<b>The number of elements in the pos and len arrays is always one
if you do not use parentheses in the regular expression.</b></p>
The value of st.pos[1] is: <cfoutput>#st.pos[1]#.</cfoutput></p>
The value of st.len[1] is: <cfoutput>#st.len[1]#.</cfoutput></p>
<cfoutput>
Substring is <b>[#Mid(testString,st.pos[1],st.len[1])#]</B>
</cfoutput></p>
<hr size = "2" color = "#0000A0">
However, if you use parentheses in the regular expression, the first
element contains the position and length of the first instance
of the whole expression. The position and length of the first instance
of each parenthesized subexpression within is included in additional
array elements.</p>
For example:
<CFSET st1 = REFind("([[:alpha:]])[ ]+(\1)",testString,1,"TRUE")></p>
<cfset st1 = REFind("([[:alpha:]]+)[ ]+(\1)",testString,1,"TRUE")>
The number of elements in each array is <cfoutput>#ArrayLen(st1.pos)#
</cfoutput>.</p>
First whole expression match; position is
<cfoutput>#st1.pos[1]#;
length is #st1.len[1]#; whole expression match is
<B>[#Mid(testString,st1.pos[1],st1.len[1])#]</B>
</cfoutput></p>
Subsequent elements of the arrays provide the position and length of
the first instance of each parenthesized subexpression therein.</p>
<cfloop index = "i" from = "2" to = "#ArrayLen(st1.pos)#">
<cfoutput>Position is #st1.pos[i]#; Length is #st1.len[i]#;
Substring is <B>[#Mid(testString,st1.pos[i],st1.len[i])#]
</B></cfoutput></p>
</cfloop><br>