Annotation of embedaddon/php/ext/pcre/pcrelib/NON-UNIX-USE, revision 1.1.1.1

1.1       misho       1: Compiling PCRE on non-Unix systems
                      2: ----------------------------------
                      3: 
                      4: This document contains the following sections:
                      5: 
                      6:   General
                      7:   Generic instructions for the PCRE C library
                      8:   The C++ wrapper functions
                      9:   Building for virtual Pascal
                     10:   Stack size in Windows environments
                     11:   Linking programs in Windows environments
                     12:   Comments about Win32 builds
                     13:   Building PCRE on Windows with CMake
                     14:   Use of relative paths with CMake on Windows
                     15:   Testing with RunTest.bat
                     16:   Building under Windows with BCC5.5
                     17:   Building PCRE on OpenVMS
                     18:   Building PCRE on Stratus OpenVOS
                     19: 
                     20: 
                     21: GENERAL
                     22: 
                     23: I (Philip Hazel) have no experience of Windows or VMS sytems and how their
                     24: libraries work. The items in the PCRE distribution and Makefile that relate to
                     25: anything other than Unix-like systems are untested by me.
                     26: 
                     27: There are some other comments and files (including some documentation in CHM
                     28: format) in the Contrib directory on the FTP site:
                     29: 
                     30:   ftp://ftp.csx.cam.ac.uk/pub/software/programming/pcre/Contrib
                     31: 
                     32: If you want to compile PCRE for a non-Unix system (especially for a system that
                     33: does not support "configure" and "make" files), note that the basic PCRE
                     34: library consists entirely of code written in Standard C, and so should compile
                     35: successfully on any system that has a Standard C compiler and library. The C++
                     36: wrapper functions are a separate issue (see below).
                     37: 
                     38: The PCRE distribution includes a "configure" file for use by the Configure/Make
                     39: build system, as found in many Unix-like environments. There is also support
                     40: support for CMake, which some users prefer, especially in Windows environments.
                     41: There are some instructions for CMake under Windows in the section entitled
                     42: "Building PCRE with CMake" below. CMake can also be used to build PCRE in
                     43: Unix-like systems.
                     44: 
                     45: 
                     46: GENERIC INSTRUCTIONS FOR THE PCRE C LIBRARY
                     47: 
                     48: The following are generic comments about building the PCRE C library "by hand".
                     49: 
                     50:  (1) Copy or rename the file config.h.generic as config.h, and edit the macro
                     51:      settings that it contains to whatever is appropriate for your environment.
                     52:      In particular, if you want to force a specific value for newline, you can
                     53:      define the NEWLINE macro. When you compile any of the PCRE modules, you
                     54:      must specify -DHAVE_CONFIG_H to your compiler so that config.h is included
                     55:      in the sources.
                     56: 
                     57:      An alternative approach is not to edit config.h, but to use -D on the
                     58:      compiler command line to make any changes that you need to the
                     59:      configuration options. In this case -DHAVE_CONFIG_H must not be set.
                     60: 
                     61:      NOTE: There have been occasions when the way in which certain parameters
                     62:      in config.h are used has changed between releases. (In the configure/make
                     63:      world, this is handled automatically.) When upgrading to a new release,
                     64:      you are strongly advised to review config.h.generic before re-using what
                     65:      you had previously.
                     66: 
                     67:  (2) Copy or rename the file pcre.h.generic as pcre.h.
                     68: 
                     69:  (3) EITHER:
                     70:        Copy or rename file pcre_chartables.c.dist as pcre_chartables.c.
                     71: 
                     72:      OR:
                     73:        Compile dftables.c as a stand-alone program (using -DHAVE_CONFIG_H if
                     74:        you have set up config.h), and then run it with the single argument
                     75:        "pcre_chartables.c". This generates a set of standard character tables
                     76:        and writes them to that file. The tables are generated using the default
                     77:        C locale for your system. If you want to use a locale that is specified
                     78:        by LC_xxx environment variables, add the -L option to the dftables
                     79:        command. You must use this method if you are building on a system that
                     80:        uses EBCDIC code.
                     81: 
                     82:      The tables in pcre_chartables.c are defaults. The caller of PCRE can
                     83:      specify alternative tables at run time.
                     84: 
                     85:  (4) Ensure that you have the following header files:
                     86: 
                     87:        pcre_internal.h
                     88:        ucp.h
                     89: 
                     90:  (5) Also ensure that you have the following file, which is #included as source
                     91:      when building a debugging version of PCRE, and is also used by pcretest.
                     92: 
                     93:        pcre_printint.src
                     94: 
                     95:  (6) Compile the following source files, setting -DHAVE_CONFIG_H as a compiler
                     96:      option if you have set up config.h with your configuration, or else use
                     97:      other -D settings to change the configuration as required.
                     98: 
                     99:        pcre_chartables.c
                    100:        pcre_compile.c
                    101:        pcre_config.c
                    102:        pcre_dfa_exec.c
                    103:        pcre_exec.c
                    104:        pcre_fullinfo.c
                    105:        pcre_get.c
                    106:        pcre_globals.c
                    107:        pcre_info.c
                    108:        pcre_maketables.c
                    109:        pcre_newline.c
                    110:        pcre_ord2utf8.c
                    111:        pcre_refcount.c
                    112:        pcre_study.c
                    113:        pcre_tables.c
                    114:        pcre_try_flipped.c
                    115:        pcre_ucd.c
                    116:        pcre_valid_utf8.c
                    117:        pcre_version.c
                    118:        pcre_xclass.c
                    119: 
                    120:      Make sure that you include -I. in the compiler command (or equivalent for
                    121:      an unusual compiler) so that all included PCRE header files are first
                    122:      sought in the current directory. Otherwise you run the risk of picking up
                    123:      a previously-installed file from somewhere else.
                    124: 
                    125:  (7) Now link all the compiled code into an object library in whichever form
                    126:      your system keeps such libraries. This is the basic PCRE C library. If
                    127:      your system has static and shared libraries, you may have to do this once
                    128:      for each type.
                    129: 
                    130:  (8) Similarly, if you want to build the POSIX wrapper functions, ensure that
                    131:      you have the pcreposix.h file and then compile pcreposix.c (remembering
                    132:      -DHAVE_CONFIG_H if necessary). Link the result (on its own) as the
                    133:      pcreposix library.
                    134: 
                    135:  (9) Compile the test program pcretest.c (again, don't forget -DHAVE_CONFIG_H).
                    136:      This needs the functions in the PCRE library when linking. It also needs
                    137:      the pcreposix wrapper functions unless you compile it with -DNOPOSIX. The
                    138:      pcretest.c program also needs the pcre_printint.src source file, which it
                    139:      #includes.
                    140: 
                    141: (10) Run pcretest on the testinput files in the testdata directory, and check
                    142:      that the output matches the corresponding testoutput files. Note that the
                    143:      supplied files are in Unix format, with just LF characters as line
                    144:      terminators. You may need to edit them to change this if your system uses
                    145:      a different convention. If you are using Windows, you probably should use
                    146:      the wintestinput3 file instead of testinput3 (and the corresponding output
                    147:      file). This is a locale test; wintestinput3 sets the locale to "french"
                    148:      rather than "fr_FR", and there some minor output differences.
                    149: 
                    150: (11) If you want to use the pcregrep command, compile and link pcregrep.c; it
                    151:      uses only the basic PCRE library (it does not need the pcreposix library).
                    152: 
                    153: 
                    154: THE C++ WRAPPER FUNCTIONS
                    155: 
                    156: The PCRE distribution also contains some C++ wrapper functions and tests,
                    157: contributed by Google Inc. On a system that can use "configure" and "make",
                    158: the functions are automatically built into a library called pcrecpp. It should
                    159: be straightforward to compile the .cc files manually on other systems. The
                    160: files called xxx_unittest.cc are test programs for each of the corresponding
                    161: xxx.cc files.
                    162: 
                    163: 
                    164: BUILDING FOR VIRTUAL PASCAL
                    165: 
                    166: A script for building PCRE using Borland's C++ compiler for use with VPASCAL
                    167: was contributed by Alexander Tokarev. Stefan Weber updated the script and added
                    168: additional files. The following files in the distribution are for building PCRE
                    169: for use with VP/Borland: makevp_c.txt, makevp_l.txt, makevp.bat, pcregexp.pas.
                    170: 
                    171: 
                    172: STACK SIZE IN WINDOWS ENVIRONMENTS
                    173: 
                    174: The default processor stack size of 1Mb in some Windows environments is too
                    175: small for matching patterns that need much recursion. In particular, test 2 may
                    176: fail because of this. Normally, running out of stack causes a crash, but there
                    177: have been cases where the test program has just died silently. See your linker
                    178: documentation for how to increase stack size if you experience problems. The
                    179: Linux default of 8Mb is a reasonable choice for the stack, though even that can
                    180: be too small for some pattern/subject combinations.
                    181: 
                    182: PCRE has a compile configuration option to disable the use of stack for
                    183: recursion so that heap is used instead. However, pattern matching is
                    184: significantly slower when this is done. There is more about stack usage in the
                    185: "pcrestack" documentation.
                    186: 
                    187: 
                    188: LINKING PROGRAMS IN WINDOWS ENVIRONMENTS
                    189: 
                    190: If you want to statically link a program against a PCRE library in the form of
                    191: a non-dll .a file, you must define PCRE_STATIC before including pcre.h or
                    192: pcrecpp.h, otherwise the pcre_malloc() and pcre_free() exported functions will
                    193: be declared __declspec(dllimport), with unwanted results.
                    194: 
                    195: 
                    196: CALLING CONVENTIONS IN WINDOWS ENVIRONMENTS
                    197: 
                    198: It is possible to compile programs to use different calling conventions using
                    199: MSVC. Search the web for "calling conventions" for more information. To make it
                    200: easier to change the calling convention for the exported functions in the
                    201: PCRE library, the macro PCRE_CALL_CONVENTION is present in all the external
                    202: definitions. It can be set externally when compiling (e.g. in CFLAGS). If it is
                    203: not set, it defaults to empty; the default calling convention is then used
                    204: (which is what is wanted most of the time).
                    205: 
                    206: 
                    207: COMMENTS ABOUT WIN32 BUILDS (see also "BUILDING PCRE WITH CMAKE" below)
                    208: 
                    209: There are two ways of building PCRE using the "configure, make, make install"
                    210: paradigm on Windows systems: using MinGW or using Cygwin. These are not at all
                    211: the same thing; they are completely different from each other. There is also
                    212: support for building using CMake, which some users find a more straightforward
                    213: way of building PCRE under Windows. However, the tests are not run
                    214: automatically when CMake is used.
                    215: 
                    216: The MinGW home page (http://www.mingw.org/) says this:
                    217: 
                    218:   MinGW: A collection of freely available and freely distributable Windows
                    219:   specific header files and import libraries combined with GNU toolsets that
                    220:   allow one to produce native Windows programs that do not rely on any
                    221:   3rd-party C runtime DLLs.
                    222: 
                    223: The Cygwin home page (http://www.cygwin.com/) says this:
                    224: 
                    225:   Cygwin is a Linux-like environment for Windows. It consists of two parts:
                    226: 
                    227:   . A DLL (cygwin1.dll) which acts as a Linux API emulation layer providing
                    228:     substantial Linux API functionality
                    229: 
                    230:   . A collection of tools which provide Linux look and feel.
                    231: 
                    232:   The Cygwin DLL currently works with all recent, commercially released x86 32
                    233:   bit and 64 bit versions of Windows, with the exception of Windows CE.
                    234: 
                    235: On both MinGW and Cygwin, PCRE should build correctly using:
                    236: 
                    237:   ./configure && make && make install
                    238: 
                    239: This should create two libraries called libpcre and libpcreposix, and, if you
                    240: have enabled building the C++ wrapper, a third one called libpcrecpp. These are
                    241: independent libraries: when you like with libpcreposix or libpcrecpp you must
                    242: also link with libpcre, which contains the basic functions. (Some earlier
                    243: releases of PCRE included the basic libpcre functions in libpcreposix. This no
                    244: longer happens.)
                    245: 
                    246: A user submitted a special-purpose patch that makes it easy to create
                    247: "pcre.dll" under mingw32 using the "msys" environment. It provides "pcre.dll"
                    248: as a special target. If you use this target, no other files are built, and in
                    249: particular, the pcretest and pcregrep programs are not built. An example of how
                    250: this might be used is:
                    251: 
                    252:   ./configure --enable-utf --disable-cpp CFLAGS="-03 -s"; make pcre.dll
                    253: 
                    254: Using Cygwin's compiler generates libraries and executables that depend on
                    255: cygwin1.dll. If a library that is generated this way is distributed,
                    256: cygwin1.dll has to be distributed as well. Since cygwin1.dll is under the GPL
                    257: licence, this forces not only PCRE to be under the GPL, but also the entire
                    258: application. A distributor who wants to keep their own code proprietary must
                    259: purchase an appropriate Cygwin licence.
                    260: 
                    261: MinGW has no such restrictions. The MinGW compiler generates a library or
                    262: executable that can run standalone on Windows without any third party dll or
                    263: licensing issues.
                    264: 
                    265: But there is more complication:
                    266: 
                    267: If a Cygwin user uses the -mno-cygwin Cygwin gcc flag, what that really does is
                    268: to tell Cygwin's gcc to use the MinGW gcc. Cygwin's gcc is only acting as a
                    269: front end to MinGW's gcc (if you install Cygwin's gcc, you get both Cygwin's
                    270: gcc and MinGW's gcc). So, a user can:
                    271: 
                    272: . Build native binaries by using MinGW or by getting Cygwin and using
                    273:   -mno-cygwin.
                    274: 
                    275: . Build binaries that depend on cygwin1.dll by using Cygwin with the normal
                    276:   compiler flags.
                    277: 
                    278: The test files that are supplied with PCRE are in Unix format, with LF
                    279: characters as line terminators. It may be necessary to change the line
                    280: terminators in order to get some of the tests to work.
                    281: 
                    282: 
                    283: BUILDING PCRE ON WINDOWS WITH CMAKE
                    284: 
                    285: CMake is an alternative configuration facility that can be used instead of the
                    286: traditional Unix "configure". CMake creates project files (make files, solution
                    287: files, etc.) tailored to numerous development environments, including Visual
                    288: Studio, Borland, Msys, MinGW, NMake, and Unix. The following instructions
                    289: were contributed by a PCRE user.
                    290: 
                    291: 1.  Install the latest CMake version available from http://www.cmake.org/, and
                    292:     ensure that cmake\bin is on your path.
                    293: 
                    294: 2.  Unzip (retaining folder structure) the PCRE source tree into a source
                    295:     directory such as C:\pcre.
                    296: 
                    297: 3.  Create a new, empty build directory, for example C:\pcre\build\
                    298: 
                    299: 4.  Run cmake-gui from the Shell envirornment of your build tool, for example,
                    300:     Msys for Msys/MinGW or Visual Studio Command Prompt for VC/VC++.
                    301: 
                    302: 5.  Enter C:\pcre\pcre-xx and C:\pcre\build for the source and build
                    303:     directories, respectively.
                    304: 
                    305: 6.  Hit the "Configure" button.
                    306: 
                    307: 7.  Select the particular IDE / build tool that you are using (Visual
                    308:     Studio, MSYS makefiles, MinGW makefiles, etc.)
                    309: 
                    310: 8.  The GUI will then list several configuration options. This is where
                    311:     you can enable UTF-8 support or other PCRE optional features.
                    312: 
                    313: 9.  Hit "Configure" again. The adjacent "Generate" button should now be
                    314:     active.
                    315: 
                    316: 10. Hit "Generate".
                    317: 
                    318: 11. The build directory should now contain a usable build system, be it a
                    319:     solution file for Visual Studio, makefiles for MinGW, etc. Exit from
                    320:     cmake-gui and use the generated build system with your compiler or IDE.
                    321: 
                    322: 
                    323: USE OF RELATIVE PATHS WITH CMAKE ON WINDOWS
                    324: 
                    325: A PCRE user comments as follows:
                    326: 
                    327: I thought that others may want to know the current state of
                    328: CMAKE_USE_RELATIVE_PATHS support on Windows.
                    329: 
                    330: Here it is:
                    331: -- AdditionalIncludeDirectories is only partially modified (only the
                    332: first path - see below)
                    333: -- Only some of the contained file paths are modified - shown below for
                    334: pcre.vcproj
                    335: -- It properly modifies
                    336: 
                    337: I am sure CMake people can fix that if they want to. Until then one will
                    338: need to replace existing absolute paths in project files with relative
                    339: paths manually (e.g. from VS) - relative to project file location. I did
                    340: just that before being told to try CMAKE_USE_RELATIVE_PATHS. Not a big
                    341: deal.
                    342: 
                    343: AdditionalIncludeDirectories="E:\builds\pcre\build;E:\builds\pcre\pcre-7.5;"
                    344: AdditionalIncludeDirectories=".;E:\builds\pcre\pcre-7.5;"
                    345: 
                    346: RelativePath="pcre.h">
                    347: RelativePath="pcre_chartables.c">
                    348: RelativePath="pcre_chartables.c.rule">
                    349: 
                    350: 
                    351: TESTING WITH RUNTEST.BAT
                    352: 
                    353: 1. Copy RunTest.bat into the directory where pcretest.exe has been created.
                    354: 
                    355: 2. Edit RunTest.bat and insert a line that indentifies the relative location of
                    356:    the pcre source, e.g.:
                    357: 
                    358:    set srcdir=..\pcre-7.4-RC3
                    359: 
                    360: 3. Run RunTest.bat from a command shell environment. Test outputs will
                    361:    automatically be compared to expected results, and discrepancies will
                    362:    identified in the console output.
                    363: 
                    364: 4. To test pcrecpp, run pcrecpp_unittest.exe, pcre_stringpiece_unittest.exe and
                    365:    pcre_scanner_unittest.exe.
                    366: 
                    367: 
                    368: BUILDING UNDER WINDOWS WITH BCC5.5
                    369: 
                    370: Michael Roy sent these comments about building PCRE under Windows with BCC5.5:
                    371: 
                    372:   Some of the core BCC libraries have a version of PCRE from 1998 built in,
                    373:   which can lead to pcre_exec() giving an erroneous PCRE_ERROR_NULL from a
                    374:   version mismatch. I'm including an easy workaround below, if you'd like to
                    375:   include it in the non-unix instructions:
                    376: 
                    377:   When linking a project with BCC5.5, pcre.lib must be included before any of
                    378:   the libraries cw32.lib, cw32i.lib, cw32mt.lib, and cw32mti.lib on the command
                    379:   line.
                    380: 
                    381: 
                    382: BUILDING UNDER WINDOWS CE WITH VISUAL STUDIO 200x
                    383: 
                    384: Vincent Richomme sent a zip archive of files to help with this process. They
                    385: can be found in the file "pcre-vsbuild.zip" in the Contrib directory of the FTP
                    386: site.
                    387: 
                    388: 
                    389: BUILDING PCRE ON OPENVMS
                    390: 
                    391: Dan Mooney sent the following comments about building PCRE on OpenVMS. They
                    392: relate to an older version of PCRE that used fewer source files, so the exact
                    393: commands will need changing. See the current list of source files above.
                    394: 
                    395: "It was quite easy to compile and link the library. I don't have a formal
                    396: make file but the attached file [reproduced below] contains the OpenVMS DCL
                    397: commands I used to build the library. I had to add #define
                    398: POSIX_MALLOC_THRESHOLD 10 to pcre.h since it was not defined anywhere.
                    399: 
                    400: The library was built on:
                    401: O/S: HP OpenVMS v7.3-1
                    402: Compiler: Compaq C v6.5-001-48BCD
                    403: Linker: vA13-01
                    404: 
                    405: The test results did not match 100% due to the issues you mention in your
                    406: documentation regarding isprint(), iscntrl(), isgraph() and ispunct(). I
                    407: modified some of the character tables temporarily and was able to get the
                    408: results to match. Tests using the fr locale did not match since I don't have
                    409: that locale loaded. The study size was always reported to be 3 less than the
                    410: value in the standard test output files."
                    411: 
                    412: =========================
                    413: $! This DCL procedure builds PCRE on OpenVMS
                    414: $!
                    415: $! I followed the instructions in the non-unix-use file in the distribution.
                    416: $!
                    417: $ COMPILE == "CC/LIST/NOMEMBER_ALIGNMENT/PREFIX_LIBRARY_ENTRIES=ALL_ENTRIES
                    418: $ COMPILE DFTABLES.C
                    419: $ LINK/EXE=DFTABLES.EXE DFTABLES.OBJ
                    420: $ RUN DFTABLES.EXE/OUTPUT=CHARTABLES.C
                    421: $ COMPILE MAKETABLES.C
                    422: $ COMPILE GET.C
                    423: $ COMPILE STUDY.C
                    424: $! I had to set POSIX_MALLOC_THRESHOLD to 10 in PCRE.H since the symbol
                    425: $! did not seem to be defined anywhere.
                    426: $! I edited pcre.h and added #DEFINE SUPPORT_UTF8 to enable UTF8 support.
                    427: $ COMPILE PCRE.C
                    428: $ LIB/CREATE PCRE MAKETABLES.OBJ, GET.OBJ, STUDY.OBJ, PCRE.OBJ
                    429: $! I had to set POSIX_MALLOC_THRESHOLD to 10 in PCRE.H since the symbol
                    430: $! did not seem to be defined anywhere.
                    431: $ COMPILE PCREPOSIX.C
                    432: $ LIB/CREATE PCREPOSIX PCREPOSIX.OBJ
                    433: $ COMPILE PCRETEST.C
                    434: $ LINK/EXE=PCRETEST.EXE PCRETEST.OBJ, PCRE/LIB, PCREPOSIX/LIB
                    435: $! C programs that want access to command line arguments must be
                    436: $! defined as a symbol
                    437: $ PCRETEST :== "$ SYS$ROADSUSERS:[DMOONEY.REGEXP]PCRETEST.EXE"
                    438: $! Arguments must be enclosed in quotes.
                    439: $ PCRETEST "-C"
                    440: $! Test results:
                    441: $!
                    442: $!   The test results did not match 100%. The functions isprint(), iscntrl(),
                    443: $!   isgraph() and ispunct() on OpenVMS must not produce the same results
                    444: $!   as the system that built the test output files provided with the
                    445: $!   distribution.
                    446: $!
                    447: $!   The study size did not match and was always 3 less on OpenVMS.
                    448: $!
                    449: $!   Locale could not be set to fr
                    450: $!
                    451: =========================
                    452: 
                    453: 
                    454: BUILDING PCRE ON STRATUS OPENVOS
                    455: 
                    456: These notes on the port of PCRE to VOS (lightly edited) were supplied by
                    457: Ashutosh Warikoo, whose email address has the local part awarikoo and the
                    458: domain nse.co.in. The port was for version 7.9 in August 2009.
                    459: 
                    460: 1.   Building PCRE
                    461: 
                    462: I built pcre on OpenVOS Release 17.0.1at using GNU Tools 3.4a without any
                    463: problems. I used the following packages to build PCRE:
                    464: 
                    465:   ftp://ftp.stratus.com/pub/vos/posix/ga/posix.save.evf.gz
                    466: 
                    467: Please read and follow the instructions that come with these packages. To start
                    468: the build of pcre, from the root of the package type:
                    469: 
                    470:   ./build.sh
                    471: 
                    472: 2. Installing PCRE
                    473: 
                    474: Once you have successfully built PCRE, login to the SysAdmin group, switch to
                    475: the root user, and type
                    476: 
                    477:   [ !create_dir (master_disk)>usr   --if needed ]
                    478:   [ !create_dir (master_disk)>usr>local   --if needed ]
                    479:     !gmake install
                    480: 
                    481: This installs PCRE and its man pages into /usr/local. You can add
                    482: (master_disk)>usr>local>bin to your command search paths, or if you are in
                    483: BASH, add /usr/local/bin to the PATH environment variable.
                    484: 
                    485: 4. Restrictions
                    486: 
                    487: This port requires readline library optionally. However during the build I
                    488: faced some yet unexplored errors while linking with readline. As it was an
                    489: optional component I chose to disable it.
                    490: 
                    491: 5. Known Problems
                    492: 
                    493: I ran a the test suite, but you will have to be your own judge of whether this
                    494: command, and this port, suits your purposes. If you find any problems that
                    495: appear to be related to the port itself, please let me know. Please see the
                    496: build.log file in the root of the package also.
                    497: 
                    498: 
                    499: =========================
                    500: Last Updated: 26 May 2010
                    501: ****

FreeBSD-CVSweb <freebsd-cvsweb@FreeBSD.org>