Stories
Slash Boxes
Comments

News for nerds, stuff that matters

Extending and Embedding PHP

Posted by samzenpus on Mon Jul 31, 2006 02:44 PM
from the read-all-about-it dept.
Sebastian Bergmann writes "PHP is a widely-used general-purpose scripting language that is especially suited for Web development. The interpreter that executes programs written in the PHP programming language has been designed from the ground up to be easily embeddable (for instance into the Apache Web Server) and extendable. This extensibility is one of the reasons why PHP became the favourite "glue" of the Web: functionality from existing third-party libraries (database clients or image manipulation toolkits, for instance) can be made available through PHP with the ease of use you expect from a scripting language." Read the rest of Sebastian's review.
Extending and Embedding PHP
author Sara Golemon
pages 448
publisher SAMS
rating 9
reviewer Sebastian Bergmann
ISBN 067232704X
summary The new standard work on extending and embedding PHP.


"Extending and Embedding PHP" by Sara Golemon, a long-time contributor to the PHP project, remedies the fact that the internals of PHP are far from being as well documented as the rest of PHP. It brings writing extensions for PHP "to the masses", so to speak.

After a short introduction that makes the reader familiar with terms like PHP Extension, Zend Extension, Userland, and Internals that are used throughout the book, Chapter 1 ("The PHP Life Cycle") opens with an overview of how the PHP Interpreter works and what parts (TSRM, Zend Engine, SAPI, "PHP") it comprises.

Chapter 2 ("Variables from the Inside Out") shows how PHP handles and stores variables internally. The reader learns how to distinguish types, set and retrieve values, as well as how to work with symbol tables. It is in this chapter that the fundamental unit of data storage in PHP, the so-called zval (short for Zend Value) is discussed.

Chapter 3 ("Memory Management") builds upon the previous chapter and discusses more advanced operations on zvals, for instance creating and dealing with copies of a zval or the destruction of a zval when it is no longer needed. To this extent, the Zend Memory Manager is discussed as well as underlying principles such as Reference Counting and Copy-on-Write, for instance.

Chapter 4 ("Setting Up a Build Environment") guides the reader through setting up an environment, either on *NIX or on Microsoft Windows, for the development and debugging of PHP and PHP extensions.

After these first four chapters, the reader is ready to go about writing his or her first PHP extension. Chapter 5 ("Your First Extension") takes the reader through the steps necessary to write and build a simple working PHP extension. The following chapters build upon the knowledge gained here, so that the reader can ultimately implement or change any type of PHP feature.

Chapter 6 ("Returning Values") explains how to pass values (by value, by reference, and through their parameter stack using references) from internal (C-level) functions or methods to userland (PHP-level).

Chapter 7 ("Accepting Parameters") deals with the mechanisms involved in accepting parameters from userland calls to an internal function or method. This includes the discussion of the zend_parse_parameters() API which makes the parameters that are passed to the internal function or method as indirect zval references usable in your C-code. The handling of optional and arbitrary numbers of parameters is explained as well as the usage of type hinting and its arg_info API.

Chapter 8 ("Working with Arrays and Hash Tables") explains the Zend Engine's HashTable API, which is used to store any piece of data of any size, in detail. Its different data storage mechanisms supported are introduced and compared. To quote from the book, "A HashTable is a specialized form of a doubly linked list that adds the speed and efficiency of vectors in the form of lookup indices". Since these structures are used heavily throughout the Zend Engine and PHP and its extensions, a good understanding of this API is vital for any aspiring PHP extension developer.

Chapter 9 ("The Resource Data Type") introduces the reader to the first complex data type (excluding the Array data type that was discussed in the previous chapter, which is just a collection containing primitive data types like strings or numbers). A resource can be, for instance, a connection to a database. It allows the PHP extension developer to "connect abstract concepts like opaque pointers from third-party libraries to the easy-to-use userspace scripting language that makes PHP so powerful".

Chapters 10 ("PHP 4 Objects") and Chapter 11 ("PHP 5 Objects") delve into the last data type supported by the Zend Engine: objects. Sara Golemon dedicates one chapter each to the respective APIs of PHP 4 and PHP 5 because of the huge advancements that were introduced in PHP 5 and that totally changed the APIs.

After the previous chapter, all data types supported by the Zend Engine have been discussed and the book revisits a topic discussed earlier in the book: that of the PHP Interpreter's life cycle. Chapter 12 ("Startup, Shutdown, and a Few Places in Between") explains how to add state to a PHP extension by using thread-safe globals. Along the way, concepts such as internal and external (super) globals as well as thread safety are discussed.

Chapter 13 ("INI Settings") shows how a PHP extension can be made ready for runtime configuration through php.ini settings.

The next three chapters ("Accessing Streams", "Implementing Streams", and "Diverting the Stream") make the reader familiar with yet another important API of PHP: the Streams API. All file input/output in PHP userspace is processed through PHP's Streams Layer. This layer, that was introduced in PHP 4.3, is what makes working with files, compressed files, and remote files, for instance, seamlessly in PHP. The reader learns how to work with streams as well as how to expose streamable resources, whether remote network input/output or local data sources, using the Streams API, thus avoiding the need to reimplement all the tedious bits and pieces that are normally associated with this.

Chapter 17 ("Configuration and Linking") builds upon the tools and techniques introduced in Chapter 4 and adds the GNU autotools (autoconf, automake, and friends) to the reader's set of tools. These tools, if used correctly, allow the extension to be built in environments that the extension author does not know or has no access to.

Chapter 18 ("Extension Generators") takes a look at ext_skel (which comes with the source distribution of PHP) and PECL_Gen (which can be obtained, as the name suggests, from PECL, the PHP Extension Community Library). These two tools automate most of the steps described in the previous chapter and take a lot of tedious work out of the extension writer's hands.

Starting with simple embedding examples, the reader learns in Chapter 19 ("Setting Up a Host Environment") and Chapter 20 ("Advanced Embedding") how the PHP Interpreter can be embedded into almost any other application.

The book concludes with the "Zend API Reference", "PHP API Reference", "Extending and Embedding Cookbook", and "Additional Resources" appendixes. The first two are a great resource for both novice and experienced PHP extension writers (even for people working on PHP and the Zend Engine itself). The third features a collection of common use code snippets while the last one points the reader into the direction of PECL by suggesting a couple of existing extensions to look at and learn from.

Since the topic of this book is to extend the PHP Interpreter using extensions written in the C programming language (or to embed it into an application that is written in C), a good understanding of C syntax, its datatypes, and pointer management is important to get the most out of this book.

Being a contributor to the PHP project for about six years now, I have been looking forward to this book. True, there is always the source code of the PHP Interpreter as a source of information on how "things work". But although being the ultimate reference, reading the source code cannot replace a thoughtfully structured and well written guide that gets you started. If you are looking for such a guide, look no further: you will find it in this excellent book.

Although it deals with a very technical topic, "Extending and Embedding PHP" is readable and the many code examples are easy to follow. The reader profits from the knowledge of the author, who has been involved in the PHP project as a core developer for over four years now and is also the author and maintainer of a dozen PHP extensions that are available through PECL. The book covers both major versions of PHP that are currently used, PHP 4 and PHP 5, and it will continue to serve its purpose when PHP 6 comes out next year.

Sebastian Bergmann spends his free time with the development of Free Software, is a member of the PHP and Gentoo Linux development teams and author of a variety of PHP software projects such as PHPUnit."


You can purchase Extending and Embedding PHP from bn.com. Slashdot welcomes readers' book reviews -- to see your own review here, read the book review guidelines, then visit the submission page.
This discussion has been archived. No new comments can be posted.
Display Options Threshold:
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
  • php-embed (Score:5, Informative)

    by SIGALRM (784769) on Monday July 31 2006, @02:45PM (#15819816)
    (Last Journal: Friday August 27 2004, @01:39PM)
    The book sounds interesting. There's also an often-overlooked capability of PHP: the ability to use php-embed [php.net] to run embedded PHP within a C/C++ app. For example, our company created an HL7 accelerator--we chose PHP as the embedded language in our product--by which users can more easily create custom data transformations.

    The reason? PHP is easy to use, loosely-typed (which happened to be an advantage in this case), fast, and of course the license works. It was a great decision.

    PHP-embed is basically just a TSRMLS function wrapper. It's pretty straightforward; for example, zval integration is easy as pie, as I recall, something like:
    zval *zarray;
    MAKE_STD_ZVAL(zarray);
    ...

    if ( array_init(zarray) == FAILURE ) {
    // ... something wrong
    }

    add_assoc_string(zarray, str_name, str_val, 1);

    ZEND_SET_SYMBOL(&EG(symtab), tokenlevel, zarray);
    • Re:php-embed by creepynut (Score:2) Monday July 31 2006, @02:55PM
    • Re:php-embed by cerberusss (Score:2) Tuesday August 01 2006, @12:22AM
    • Re:php-embed by rycamor (Score:2) Monday July 31 2006, @03:02PM
      • 1 reply beneath your current threshold.
    • 1 reply beneath your current threshold.
  • by Anonymous Coward on Monday July 31 2006, @02:47PM (#15819831)
    "Extending and Embedding PHP"...

    Microsoft involved here? :)
  • Other People's Code (Score:3, Insightful)

    by neonprimetime (528653) on Monday July 31 2006, @02:57PM (#15819943)
    (http://twoturtlelovers.blogspot.com/ | Last Journal: Friday May 25, @03:01PM)
    But although being the ultimate reference, reading the source code cannot replace a thoughtfully structured and well written guide that gets you started.

    Agreed, especially when the source code you're reading isn't your own. I claim that 99% of programmers who are not me write totally obfuscated code. Damn them!
  • Sara Golemon (Score:2, Interesting)

    by larry bagina (561269) on Monday July 31 2006, @03:03PM (#15819994)
    (Last Journal: Friday October 19, @09:21PM)
    I knew her when she worked at berkeley (I think she works for yahoo! now). She really knows her shit.
  • 5 of first 7 comments trolling (Score:2, Interesting)

    by suggsjc (726146) on Monday July 31 2006, @03:04PM (#15819997)
    (http://www.millioninchange.com/)
    Ok, so 5 of the first 7 comments were trolling about how bad PHP is, insecure, buggy, etc (and I think they even managed to take a shot at Bush???)

    I've used PHP for some very small applications/sites. Can anyone give an unbiased (almost impossible I know) state of affairs for PHP? I know that it is a pretty common tool, has its strengths and weaknesses. However, is it really that bad or is bashing it just the current /. thing to do?
    • Re:5 of first 7 comments trolling by not already in use (Score:1) Monday July 31 2006, @03:17PM
    • Re:5 of first 7 comments trolling (Score:4, Insightful)

      by creimer (824291) on Monday July 31 2006, @03:18PM (#15820111)
      (http://www.creimer.ws/ | Last Journal: Friday January 26 2007, @12:40PM)
      However, is it really that bad or is bashing it just the current /. thing to do?

      I think PHP has replaced Java as the favorite "kick the dog" language on Slashdot. IMHO, PHP is no different than any other language. It takes work to write consistently clear code that other people can understand.
      [ Parent ]
      • Re:5 of first 7 comments trolling by Schraegstrichpunkt (Score:2) Monday July 31 2006, @04:36PM
        • Re:5 of first 7 comments trolling by creimer (Score:2) Monday July 31 2006, @04:48PM
          • Re:5 of first 7 comments trolling (Score:4, Insightful)

            by sfe_software (220870) * on Monday July 31 2006, @11:51PM (#15822969)
            (http://jm4n.com/)
            If people can't read your source code, it doesn't matter that the langauge itself sucks. Of course, it's easy for a lazy programmer to blame the language instead of cleaning up the source code to make it readable.

            I couldn't have said it better myself. I personally use PHP for many small applications. I also make sure to heavily comment my code, and I try not to obfuscate my code (it kills me that some people compete to see who can write the most obfuscated Perl, for example. Try interpreting or revising that code a year from now).

            Many times I've had to revisit code years after having written it, and when there are no comments, it is difficult to see what exactly I was thinking at the time -- in *any* language. Non-descriptive variable names, or attempting to put as much code in as few lines possible, are, IMHO, bad practices.

            Personally I see nothing inherently wrong with PHP. If I'm working with a web-based application, under Apache, using a MySQL database, PHP is the first thing that comes to mind. Image manipulation (now integrated) and HTTP features (headers, cookies, form data, file uploads, etc) make PHP an easy choice for many web applications. I've done all of this in Perl, and some in plain-old-C, but PHP makes these things so easy...

            Of course it's not for everything. I try to use whatever platform/language is most appropriate for the application at hand. Sometimes it's C or C++, perhaps it's Perl, whatever - I use whatever makes the most sense for what I'm hoping to accomplish. It just happens that, on my Linux server, PHP often stands out as the best choice. When writing Windows applications, I use a hybrid of VB6 and (in the form of a back-end DLL library) C/C++. On the server, PHP most often comes out as the clear choice. Ease of use, abundance of built-in functions/features, ease of database-to-web integration, and relative security all make PHP a good choice for many of my projects and ideas.

            Some have referred to PHP as "loose", and I admit sometimes it can be. There is no equivalent to Perl's "use strict", and it's easy to unintentionally leave an opportunity for a user to pass unexpected variables -- but as long as you are able to keep this in mind, it's not difficult to make a relatively secure PHP script. Just make sure any important variables are declared/set/validated at the start of the script. I admit, I do love Perl's "strict" module, since it leaves no question as to whether a variable's data is trustworthy... but PHP is a different language, with different features. You can't discount it as a viable language because of a single missing feature...
            [ Parent ]
        • Re:5 of first 7 comments trolling by jdbartlett (Score:2) Monday July 31 2006, @09:06PM
        • 1 reply beneath your current threshold.
      • Re:5 of first 7 comments trolling by jdbartlett (Score:2) Monday July 31 2006, @09:08PM
    • Re:5 of first 7 comments trolling by Bob of Dole (Score:2) Monday July 31 2006, @03:28PM
      • Re:5 of first 7 comments trolling by self assembled struc (Score:3) Monday July 31 2006, @04:01PM
        • 1 reply beneath your current threshold.
      • Re:5 of first 7 comments trolling by Touqen (Score:2) Monday July 31 2006, @04:02PM
      • Re:5 of first 7 comments trolling (Score:4, Informative)

        by KidSock (150684) on Monday July 31 2006, @04:31PM (#15820759)
        For those who may be curious, the proper way to actually prevent SQL injections is to wrap anything coming in with a function that calls stripslashes() and mysql_escape_string() (or equivalent function for another db). For example, the function I use looks like the following (this also adds quotes around anything that is not numeric):

        [sorry for the poor formatting, ./ is highly broken when it comes to posting code] // Quote variable to make safe
        function quote_smart($value)
        { // Stripslashes
                if (get_magic_quotes_gpc()) {
                        $value = stripslashes($value);
                } // Quote if not integer
                if (!is_numeric($value)) {
                        $value = "'" . mysql_escape_string($value) . "'";
                }

                return $value;
        }

        Now you call this through sprintf like:

        $res=mysql_query(sprintf("select data from users where userid=%s", quote_smart($_GET['u']));

        Now this is perfectly safe from SQL injection. Anyone who has done real web programming knows all about this and knows that you need to deal with this sort of thing regardless of what language you're using.

        Also, whenever you emit data that will appear in HTML you also need to wrap it. This time you just use the builtin htmlentities() function like:

        echo "<input name=\"u\" type=\"text\" value=\"" . htmlentities($user) . "\">\n";

        This prevents cross site scripting. Again, no different from any other language.

        PS: IMHO if someone goes out of their way to claim something "sucks" they probably don't know what they're talking about. Try the other languages and read the documentation so that you can evaluate which is best for your project.
        [ Parent ]
      • register_globals? by Mateo_LeFou (Score:2) Monday July 31 2006, @08:06PM
        • rethinking by Mateo_LeFou (Score:2) Monday July 31 2006, @08:11PM
      • Re:5 of first 7 comments trolling by julesh (Score:2) Tuesday August 01 2006, @01:33PM
    • Re:5 of first 7 comments trolling by Bogtha (Score:2) Monday July 31 2006, @03:40PM
    • Re:5 of first 7 comments trolling by nicklott (Score:3) Monday July 31 2006, @03:42PM
    • Re:5 of first 7 comments trolling by mobby_6kl (Score:1) Monday July 31 2006, @03:55PM
    • Re:5 of first 7 comments trolling by Anonymous Coward (Score:1) Monday July 31 2006, @04:20PM
      • I second by jdbartlett (Score:2) Monday July 31 2006, @10:00PM
    • Re:5 of first 7 comments trolling by ChronoFish (Score:1) Monday July 31 2006, @04:32PM
    • Re:5 of first 7 comments trolling by imroy (Score:2) Monday July 31 2006, @04:41PM
    • Re:5 of first 7 comments trolling by lewp (Score:2) Monday July 31 2006, @05:08PM
    • Ruby zealots are envious by Anonymous Coward (Score:2) Monday July 31 2006, @06:13PM
      • Try Ruby by jdbartlett (Score:2) Monday July 31 2006, @10:31PM
    • Re:5 of first 7 comments trolling by b17bmbr (Score:2) Tuesday August 01 2006, @01:51AM
    • Baroque doesn't begin to describe it by Just Some Guy (Score:2) Tuesday August 01 2006, @10:49AM
    • Re:Why I don't like PHP by dr. greenthumb (Score:2) Monday July 31 2006, @06:49PM
    • 7 replies beneath your current threshold.
  • by Bogtha (906264) on Monday July 31 2006, @03:29PM (#15820221)

    If you're interested in this, you'll probably be interested to know about Ian Bicking's work on embedding PHP in Python web applications [ianbicking.org] via PHP's FastCGI support. It's only in the experimental stages, but it's very promising, especially for developers like me who develop with Python but need to support legacy PHP code.

  • by jaimz22 (932159) on Monday July 31 2006, @03:31PM (#15820237)
    i've got this book, it's very well written and easy to follow. i recommend it.
  • Save $18.50 by buying the book here! (Score:1, Informative)

    by Anonymous Coward on Monday July 31 2006, @03:43PM (#15820371)
    Save yourself $18.50 by buying the book here: Extending and Embedding PHP [amazon.com]. And if you use the "secret" A9.com discount [amazon.com], you can save an extra 1.57%! That's a total savings of $18.99, or 38.58%!
  • PHP is not just for the web (Score:5, Interesting)

    by KidSock (150684) on Monday July 31 2006, @04:11PM (#15820598)
    Yeah, so PHP stands for "Personal Home Pages" but that's is an historical misnomer now. PHP has a CLI binary that can be used to run scripts on the commandline. Obligatory "hello world" follows:

        !#/usr/bin/php
        echo "Hello, world!";

    Now consider that PHP ships standard on virtually every Linux distro and comes with a large assortment of libraries. You can write LDAP scripts, do IMAP, generate images, the list is loooong. It amazes me that PHP isn't used more in corporate envirments. PHP is easy to use, arrays are surprisingly useful, and you can do a little OO (which is just the right amount IMO). And something that a lot of people take for granted is that the documentation on php.net is great. Everything is on one place unlike other languages (e.g. Python) where you just get redirected to every little sourceforge scribble and wiki there is.

    I'm a C person. I'll continue to use C for heavy lifting but you also need a good scripting language. I just wrote a Zend extension to interface with some of my C work and it exceeded all of my expectations.

    If you're looking for the lastest hot new "technology" then Ruby is a good buzzword. Otherwise, if you're just looking to get work done, so you can go home and play with your kids, PHP is a workhorse.

    PS: I don't know spit about this book but the tutorial on writing extensions on the Zend website was pretty good. Good enough for me anyway.
  • A lot of PHP bashing going on; I'd just like to chip in my 2 cents on the language (and demonstrate a mild interest in the book). I was big on programming when I was younger - by 14 I had written an adventure game in Basic and I invented a DOS-based graphical application that is eerily similar to Flash (two stickmen and some props on the screen with keyframes and interpolation tracking). Needless to say I was well advanced of my classmates throughout highschool. I also wrote a Chess AI (who hasn't?) in C. But that was the end of it - about 10 years ago now. I longed for programming but Real Life(tm) got in the way and other career paths curbed my free time. Needless to say I had lost a lot of skills and I don't even know what OOP stands for, but getting into the blogging world and creating a custom website to house it resulted in me having to learn some sort of web-based programming. I have to say that PHP was beautifully easy to (re-)learn and I was back in the programming seat with a big grin on my face with just a few weeks of self-learning (by looking at examples and open source, no books). I'm praising PHP as a very easy to learn, easy to use starting point for all my would-be programming friends.
  • Bash PHP For Fun and Profit (Score:4, Insightful)

    by KidSock (150684) on Monday July 31 2006, @07:55PM (#15821954)
    I think that there is a contingent of web programmers that are bored and upset that PHP is still the premier method for scripting websites. They want something new and fresh to work with. I can appreciate that. When you use the same language for a long time, it starts to look "old". This is exacerbated when they inherit sloppy code and are forced to decipher and fix some other dummy's spaghetti. So they declare the language "dead" in hope of creating enough spin and FUD that something new will take over. Something new that will create work and give them more job opportunities. The same something new that they invested a lot of time into learning.

    To the PHP bashers - you might succeed in selling something new but after the next guy inherits your spaghetti code they will start bashing *you*.

    Don't be fooled people. Every language has it's corners. I spend 90% of my time doing C but I just spent a month doing a standard LAMP site and I just don't see what these guys are hee'n and haw'n about. PHP is just as useful today as it was on 1998 so I'm willing to bet it will be around for a long time still. Don't be influenced by some bored guy saying "it sucks" and "I hate it". That's just not intelligent criticism. Try different things and make up your own mind.

    PHP has a huge install base and has served us very well for many years. Let's not forget that. The PHP bashers pushing Python and Ruby should be ashamed of themselves. Post some useful information about how Python or Ruby solves a problem you think PHP has. And no cryptic one liners thank you. Get a spine and post some useful comments.
  • You know, other languages have libraries and modules, too...
    PHP isn't the first language with MySQL bindings. It's not the first with GD bindings. And so on.
    And PHP wasn't really designed to be easy to properly embed in other applications either. It was designed to have the code embedded in HTML, granted. And it was designed to run in the web server. But I doubt it was designed to be put to all kinds of use. Who is embedding PHP except for web servers?

    If you want a language that was really designed for embedding, consider looking at lua. From what I know of it, it was really designed to be used as scripting language within other applications. From spreadsheets to games.
    I think my brother told me he was developing on a scientific instrument (most likely for fraunhofer, probably for the development of solar power related stuff), and that you could script it in lua.
    Then there is enigma, a fun game (port of Oxyd), it uses lua for scripting the levels.

    PHP has other uses. Like writing bad code. SCNR. But I do consider PHP a legacy language.
  • Re:I'd rather see a review (Score:1, Insightful)

    by Anonymous Coward on Monday July 31 2006, @03:17PM (#15820098)
    What makes you think such a book would be PHP specific?
    [ Parent ]
  • by jahknow (827266) on Monday July 31 2006, @09:14PM (#15822299)
    (Last Journal: Wednesday December 28 2005, @09:41PM)
    You have no chance to survive make your time.
    [ Parent ]
    • Mod parent up by Freedom451 (Score:1) Monday July 31 2006, @10:49PM
  • by headbone (914314) on Saturday August 12 2006, @11:27AM (#15894595)
    Oh you poor thing. It must be awful to be tied to your computer desk and forced at gunpoint to read slashdot. Too bad you couldn't just go away.
    [ Parent ]
  • 7 replies beneath your current threshold.