Annotation of embedaddon/lighttpd/doc/outdated/cml.txt, revision 1.1.1.1

1.1       misho       1: =========================
                      2: CML (Cache Meta Language)
                      3: =========================
                      4: 
                      5: ---------------
                      6: Module: mod_cml
                      7: ---------------
                      8: 
                      9: :Author: Jan Kneschke
                     10: :Date: $Date: 2004/11/03 22:26:05 $
                     11: :Revision: $Revision: 1.2 $
                     12: 
                     13: :abstract:
                     14:   CML is a Meta language to describe the dependencies of a page at one side and building a page from its fragments on the other side using LUA.
                     15: 
                     16: .. meta::
                     17:   :keywords: lighttpd, cml, lua
                     18: 
                     19: .. contents:: Table of Contents
                     20: 
                     21: Description
                     22: ===========
                     23: 
                     24: CML (Cache Meta Language) wants to solves several problems:
                     25: 
                     26:  * dynamic content needs caching to perform
                     27:  * checking if the content is dirty inside of the application is usually more expensive than sending out the cached data
                     28:  * a dynamic page is usually fragmented and the fragments have different livetimes
                     29:  * the different fragements can be cached independently
                     30: 
                     31: Cache Decision
                     32: --------------
                     33: 
                     34: A simple example should show how to a content caching the very simple way in PHP.
                     35: 
                     36: jan.kneschke.de has a very simple design:
                     37: 
                     38:  * the layout is taken from a template in templates/jk.tmpl
                     39:  * the menu is generated from a menu.csv file
                     40:  * the content is coming from files on the local directory named content-1, content-2 and so on
                     41: 
                     42: The page content is static as long non of the those tree items changes. A change in the layout
                     43: is affecting all pages, a change of menu.csv too, a change of content-x file only affects the
                     44: cached page itself.
                     45: 
                     46: If we model this in PHP we get: ::
                     47: 
                     48:   <?php
                     49: 
                     50:   ## ... fetch all content-* files into $content
                     51:   $cachefile = "/cache/dir/to/cached-content";
                     52: 
                     53:   function is_cachable($content, $cachefile) {
                     54:     if (!file_exists($cachefile)) {
                     55:       return 0;
                     56:     } else {
                     57:       $cachemtime = filemtime($cachefile);
                     58:     }
                     59: 
                     60:     foreach($content as $k => $v) {
                     61:       if (isset($v["file"]) &&
                     62:           filemtime($v["file"]) > $cachemtime) {
                     63:         return 0;
                     64:       }
                     65:     }
                     66: 
                     67:     if (filemtime("/menu/menu.csv") > $cachemtime) {
                     68:       return 0;
                     69:     }
                     70:     if (filemtime("/templates/jk.tmpl") > $cachemtime) {
                     71:       return 0;
                     72:     }
                     73:   }
                     74: 
                     75:   if (is_cachable(...), $cachefile) {
                     76:     readfile($cachefile);
                     77:     exit();
                     78:   } else {
                     79:     # generate content and write it to $cachefile
                     80:   }
                     81:   ?>
                     82: 
                     83: Quite simple. No magic involved. If the one of the files is new than the cached
                     84: content, the content is dirty and has to be regenerated.
                     85: 
                     86: Now let take a look at the numbers:
                     87: 
                     88:  * 150 req/s for a Cache-Hit
                     89:  * 100 req/s for a Cache-Miss
                     90: 
                     91: As you can see the increase is not as good as it could be. The main reason as the overhead
                     92: of the PHP interpreter to start up (a byte-code cache has been used here).
                     93: 
                     94: Moving these decisions out of the PHP script into a server module will remove the need
                     95: to start PHP for a cache-hit.
                     96: 
                     97: To transform this example into a CML you need 'index.cml' in the list of indexfiles
                     98: and the following index.cml file: ::
                     99: 
                    100:   output_contenttype = "text/html"
                    101: 
                    102:   b = request["DOCUMENT_ROOT"]
                    103:   cwd = request["CWD"]
                    104: 
                    105:   output_include = { b .. "_cache.html" }
                    106: 
                    107:   trigger_handler = "index.php"
                    108: 
                    109:   if file_mtime(b .. "../lib/php/menu.csv") > file_mtime(cwd .. "_cache.html") or
                    110:      file_mtime(b .. "templates/jk.tmpl")   > file_mtime(cwd .. "_cache.html") or
                    111:      file_mtime(b .. "content.html")        > file_mtime(cwd .. "_cache.html") then
                    112:      return CACHE_MISS
                    113:   else
                    114:      return CACHE_HIT
                    115:   end
                    116: 
                    117: Numbers again:
                    118: 
                    119:  * 4900 req/s for Cache-Hit
                    120:  *  100 req/s for Cache-Miss
                    121: 
                    122: Content Assembling
                    123: ------------------
                    124: 
                    125: Sometimes the different fragment are already generated externally. You have to cat them together: ::
                    126: 
                    127:   <?php
                    128:   readfile("head.html");
                    129:   readfile("menu.html");
                    130:   readfile("spacer.html");
                    131:   readfile("db-content.html");
                    132:   readfile("spacer2.html");
                    133:   readfile("news.html");
                    134:   readfile("footer.html");
                    135:   ?>
                    136: 
                    137: We we can do the same several times faster directly in the webserver.
                    138: 
                    139: Don't forget: Webserver are built to send out static content, that is what they can do best.
                    140: 
                    141: The index.cml for this looks like: ::
                    142: 
                    143:   output_contenttype = "text/html"
                    144: 
                    145:   cwd = request["CWD"]
                    146: 
                    147:   output_include = { cwd .. "head.html",
                    148:                      cwd .. "menu.html",
                    149:                      cwd .. "spacer.html",
                    150:                      cwd .. "db-content.html",
                    151:                      cwd .. "spacer2.html",
                    152:                      cwd .. "news.html",
                    153:                      cwd .. "footer.html" }
                    154: 
                    155:   return CACHE_HIT
                    156: 
                    157: Now we get about 10000 req/s instead of 600 req/s.
                    158: 
                    159: Power Magnet
                    160: ------------
                    161: 
                    162: Next to all the features about Cache Decisions CML can do more. Starting
                    163: with lighttpd 1.4.9 a power-magnet was added which attracts each request
                    164: and allows you to manipulate the request for your needs.
                    165: 
                    166: We want to display a maintainance page by putting a file in a specified
                    167: place:
                    168: 
                    169: We enable the power magnet: ::
                    170: 
                    171:   cml.power-magnet  = "/home/www/power-magnet.cml"
                    172: 
                    173: and create /home/www/power-magnet.cml with: ::
                    174: 
                    175:   dr = request["DOCUMENT_ROOT"]
                    176: 
                    177:   if file_isreg(dr .. 'maintainance.html') then
                    178:     output_include = { 'maintainance.html' }
                    179:     return CACHE_HIT
                    180:   end
                    181: 
                    182:   return CACHE_MISS
                    183: 
                    184: For each requested file the /home/www/power-magnet.cml is executed which
                    185: checks if maintainance.html exists in the docroot and displays it
                    186: instead of handling the usual request.
                    187: 
                    188: Another example, create thumbnail for requested image and serve it instead
                    189: of sending the big image: ::
                    190: 
                    191:   ## image-url is /album/baltic_winter_2005.jpg
                    192:   ## no params -> 640x480 is served
                    193:   ## /album/baltic_winter_2005.jpg/orig for full size
                    194:   ## /album/baltic_winter_2005.jpg/thumb for thumbnail
                    195: 
                    196:   dr = request["DOCUMENT_ROOT"]
                    197:   sn = request["SCRIPT_NAME"]
                    198: 
                    199:   ## to be continued :) ...
                    200: 
                    201:   trigger_handler = '/gen_image.php'
                    202: 
                    203:   return CACHE_MISS
                    204: 
                    205: 
                    206: Installation
                    207: ============
                    208: 
                    209: You need `lua <http://www.lua.org/>`_ and should install `libmemcache-1.3.x <http://people.freebsd.org/~seanc/libmemcache/>`_ and have to configure lighttpd with: ::
                    210: 
                    211:   ./configure ... --with-lua --with-memcache
                    212: 
                    213: To use the plugin you have to load it: ::
                    214: 
                    215:    server.modules = ( ..., "mod_cml", ... )
                    216: 
                    217: Options
                    218: =======
                    219: 
                    220: :cml.extension:
                    221:   the file extension that is bound to the cml-module
                    222: :cml.memcache-hosts:
                    223:   hosts for the memcache.* functions
                    224: :cml.memcache-namespace:
                    225:   (not used yet)
                    226: :cml.power-magnet:
                    227:   a cml file that is executed for each request
                    228: 
                    229: Language
                    230: ========
                    231: 
                    232: The language used for CML is provided by `LUA <http://www.lua.org/>`_.
                    233: 
                    234: Additionally to the functions provided by lua mod_cml provides: ::
                    235: 
                    236:   tables:
                    237: 
                    238:   request
                    239:     - REQUEST_URI
                    240:     - SCRIPT_NAME
                    241:     - SCRIPT_FILENAME
                    242:     - DOCUMENT_ROOT
                    243:     - PATH_INFO
                    244:     - CWD
                    245:     - BASEURI
                    246: 
                    247:   get
                    248:     - parameters from the query-string
                    249: 
                    250:   functions:
                    251:   string md5(string)
                    252:   number file_mtime(string)
                    253:   string memcache_get_string(string)
                    254:   number memcache_get_long(string)
                    255:   boolean memcache_exists(string)
                    256: 
                    257: 
                    258: What ever your script does, it has to return either CACHE_HIT or CACHE_MISS.
                    259: It case a error occures check the error-log, the user will get a error 500. If you don't like
                    260: the standard error-page use ``server.errorfile-prefix``.
                    261: 

FreeBSD-CVSweb <freebsd-cvsweb@FreeBSD.org>