embedaddon/pcre/doc/pcre.txt - diff

Return to pcre.txt CVS log

Up to [ELWIX - Embedded LightWeight unIX -] / embedaddon / pcre / doc

Diff for /embedaddon/pcre/doc/pcre.txt between versions 1.1.1.4 and 1.1.1.5

FreeBSD-CVSweb <freebsd-cvsweb@FreeBSD.org>

version 1.1.1.4, 2013/07/22 08:25:56	version 1.1.1.5, 2014/06/15 19:46:04
Line 53 INTRODUCTION	Line 53 INTRODUCTION
5.12, including support for UTF-8/16/32 encoded strings and Unicode	5.12, including support for UTF-8/16/32 encoded strings and Unicode
general category properties. However, UTF-8/16/32 and Unicode support	general category properties. However, UTF-8/16/32 and Unicode support
has to be explicitly enabled; it is not the default. The Unicode tables	has to be explicitly enabled; it is not the default. The Unicode tables
correspond to Unicode release 6.2.0.	correspond to Unicode release 6.3.0.

In addition to the Perl-compatible matching function, PCRE contains an	In addition to the Perl-compatible matching function, PCRE contains an
alternative function that matches the same compiled patterns in a dif-	alternative function that matches the same compiled patterns in a dif-
Line 532 PCRE 32-BIT API BASIC FUNCTIONS	Line 532 PCRE 32-BIT API BASIC FUNCTIONS

pcre32 *pcre32_compile2(PCRE_SPTR32 pattern, int options,	pcre32 *pcre32_compile2(PCRE_SPTR32 pattern, int options,
int *errorcodeptr,	int *errorcodeptr,
const char *errptr, int erroffset,
const unsigned char *tableptr);	const unsigned char *tableptr);

pcre32_extra pcre32_study(const pcre32 code, int options,	pcre32_extra pcre32_study(const pcre32 code, int options,
Line 1458 THE ALTERNATIVE MATCHING ALGORITHM	Line 1457 THE ALTERNATIVE MATCHING ALGORITHM
at the fifth character of the subject. The algorithm does not automati-	at the fifth character of the subject. The algorithm does not automati-
cally move on to find matches that start at later positions.	cally move on to find matches that start at later positions.

	PCRE's "auto-possessification" optimization usually applies to charac-
	ter repeats at the end of a pattern (as well as internally). For exam-
	ple, the pattern "a\d+" is compiled as if it were "a\d++" because there
	is no point even considering the possibility of backtracking into the
	repeated digits. For DFA matching, this means that only one possible
	match is found. If you really do want multiple matches in such cases,
	either use an ungreedy repeat ("a\d+?") or set the PCRE_NO_AUTO_POSSESS
	option when compiling.

There are a number of features of PCRE regular expressions that are not	There are a number of features of PCRE regular expressions that are not
supported by the alternative matching algorithm. They are as follows:	supported by the alternative matching algorithm. They are as follows:

1. Because the algorithm finds all possible matches, the greedy or	1. Because the algorithm finds all possible matches, the greedy or
ungreedy nature of repetition quantifiers is not relevant. Greedy and	ungreedy nature of repetition quantifiers is not relevant. Greedy and
ungreedy quantifiers are treated in exactly the same way. However, pos-	ungreedy quantifiers are treated in exactly the same way. However, pos-
sessive quantifiers can make a difference when what follows could also	sessive quantifiers can make a difference when what follows could also
match what is quantified, for example in a pattern like this:	match what is quantified, for example in a pattern like this:

^a++\w!	^a++\w!

Removed from v.1.1.1.4
changed lines
	Added in v.1.1.1.5