Catch up on stories from the past week (and beyond) at the Slashdot story archive

 



Forgot your password?
typodupeerror
×
Image

Book Review: R Graphs Cookbook 64

RickJWagner writes "Once upon a time, I thought communication was one of my strong suits. Alas, a few years into my programming career I realized I'm more of the head-down codeslinging type, not one of the schmoozing managerial types. So when I have a point to make, I really like to have my data ready to do the talking for me. In that capacity, this book is a very good weapon to have in my arsenal." Read on for the rest of Rick's review.
R Graphs Cookbook
author Hrishi Mittal
pages 272
publisher Packt Publishing
rating 8/10
reviewer RickJWagner
ISBN 1849513066
summary An invaluable reference book for expert R users
Right away, you should realize this is not a book that teaches R. R (an excellent open source statistical language) is a great tool for any technician. I've used it to analyze logs, find performance bottlenecks, and make sense of mountains of nearly unrecognizable data. But this book doesn't teach R, it teaches R graphing.

It turns out R has excellent graphing capabilities. You can draw scatter plots, line plots, pie graphs, bar charts, histograms, box and whisker plots, heat maps, contour maps and 'regular' maps. These are all good for demonstrating data in different ways, and the book lightly explains which graph will help you illustrate which point.

If you're getting a little interested, you'll also want to know that all this graphing can be scripted and scheduled. So you can get data-driven reports on a schedule, easily accomplished once you know how to write the graphing scripts (which are then scheduled using cron or a similar facility). One small caveat: To prepare your data for presentation, I think it's usually necessary to partner R with another language that's better for text extracting and manipulation. I prefer Python for this task, you might like another language.

The book is exceptionally easy to read and work with. This doesn't mean it's simplistic, though. Anyone who's tangled with R's graphing without a good example will testify that figuring out the various functions and arguments necessary to wrangle a descriptive graph can be really difficult. This book gives you the kind of graphs you need, with the bells and whistles you're going to want, in a series of snippets you can run immediately.

The book is written in Packt's "Recipe" format. In a nutshell, this means that it's a series of how-to sections worded in a templated form. There are headings for sections that inform you what you're going to accomplish, how it's done, and why it worked. You quickly realize it's a repetitive format, but it serves to make the book an excellent resource for quick reference.

Another really nice feature of the book is the downloadable source code and matching data. Knowing the data is half the battle, really. The specific formulas given are certainly useful, but without knowing how the underlying data is formatted you really wouldn't get nearly the practical value. For that reason, I urge anyone using this book to be sure they examine the underlying data for at least the first few formulas. After that, it'll be automatic, you'll know you want to look at that data when you're trying to master some graph type. Then when you go to make your own data ready for graphing, you reach for that secondary language like Python, extract the fields you want in a way similar to your example data set, and presto-- you've got the graph you want.

The book starts out with a first chapter that introduces the kinds of graphs you'll be able to produce and situations where each type is most useful. The next chapters, up until the final one, are in-depth sections on each of the graph types. Maps are treated to a different chapter than pie graphs, for instance. The final chapter covers putting final touches on your graphs, including saving them in different formats (PDF, PNG, JPEG, etc.) and niceties like adding scientific notations, mathematical symbols, etc.

The book states that the target audience is experienced R programmers. I really don't think that's necessary, though. There is an obligatory R installation section, and I think that a reasonably competent programmer with Google at his disposal could get off the ground (for graphing purposes) with this book and a little bumbling. If you already know R, then you needn't worry at all, there is nothing here that will look foreign to you.

If I could change one thing about the book, I'd want a comprehensive index of all the functions and arguments that augment the basic core functions that produce the example graphs. These functions and arguments tweak the basic function in ways that make them much more appealing than what the basic function alone can provide. But the book isn't able to show each and every combination with each graphing function, so it's up to the reader to figure out how to pick some of the options from one recipe and apply it to another. It's not difficult to do, but having an index to help you find the options you want would make this process easier.

You can purchase R Graphs Cookbook from amazon.com. Slashdot welcomes readers' book reviews -- to see your own review here, read the book review guidelines, then visit the submission page.

*

This discussion has been archived. No new comments can be posted.

Book Review: R Graphs Cookbook

Comments Filter:
  • Or you can use Excel (Score:4, Informative)

    by AdamInParadise ( 257888 ) on Monday April 18, 2011 @02:46PM (#35858644) Homepage

    Or any other spreadsheet program.

    Now of course I admit that Excel is probably not as flexible as R. However, unless your job is to produce stunning, tailor-made graphs, a spreadsheet application will deliver results a lot faster.

  • Far, far too basic. (Score:4, Informative)

    by dondelelcaro ( 81997 ) <don@donarmstrong.com> on Monday April 18, 2011 @03:14PM (#35858982) Homepage Journal

    Just from examining the few preview pages on amazon.com, this book appears to be far too basic for anyone who has actually done any serious work with R. I personally would forgo this entire book, and spend the time wandering through the R Graph Gallery [addictedtor.free.fr] which has far more examples with source code and underlying data. It's also rather odd that this book doesn't cover ggplot, grid graphics, lattice, or any of the more commonly used tools in advanced R graphics.

    Perhaps this book could be useful as your first foray into graphing with R... but I'm unconvinced it even covers that well.

  • by Beryllium Sphere(tm) ( 193358 ) on Monday April 18, 2011 @03:14PM (#35858986) Journal

    People who know more about statistics than I do severely criticize Excel, e.g. http://www.stat.uiowa.edu/~jcryer/JSMTalk2001.pdf [uiowa.edu]

  • by Anonymous Coward on Monday April 18, 2011 @03:29PM (#35859194)

    Wrong row limit. Sure you can _have_ 1M+ rows but you can still only graph 32K of them at a time.

  • by plopez ( 54068 ) on Monday April 18, 2011 @03:34PM (#35859250) Journal

    I like R because:
    1) It can handle the large (million or more) ata sets I need to crunch and compare

    2) Seriously, the latest versions of Excel seem to choke on larger datasets. The "Oh no! Excel is bogging down and getting ready to crash!" sensation is far too frequent. R is much more stable

    3) You can do nice graphics in R you can't do in Excel. See http://addictedtor.free.fr/graphiques/ [addictedtor.free.fr]

    4) There is a huge number of pre-rolled *serious* statistical libraries already written, and open sourced (including GPL'd) for it. FFT, geospatial stats, multivariate linear and non-linear statistical modeling, time series analysis, linear algebra, and more. Including OOP. I jam ust exploring how R does OOP now.

    5) The scripting language is in the Lisp family. It works the way I think.

    6) You can compile and link in your own packages in Fortran (pick your flavor 77, 88, 95, '03, or '08), C, C++, etc. If it links, you can link it.

    Sweet. Also more stable than Matlab (and cheaper), and more user friendly than SAS.

Work without a vision is slavery, Vision without work is a pipe dream, But vision with work is the hope of the world.

Working...