Practical File System Design with the Be File System 258
| Practical File System Design with the Be File System | |
| author | Dominic Giampaolo | 
| pages | 227 pp | 
| publisher | MORGAN KAUFMANN PUBLISHERS, INC. | 
| rating | 8.5 of 10 | 
| reviewer | erikharrison | 
| ISBN | 1558604979 | 
| summary | Discusses implemeting a file system, using the Be file system as example | 
Table of Contents
- Chapter 1 Introduction to the BeOS and BFS
- Chapter 2 What Is a File System?
- Chapter 3 Other File Systems
- Chapter 4 The Data Structures of BFS
- Chapter 5 Attributes, Indexing, and Queries
- Chapter 6 Allocation Policies
- Chapter 7 Journaling
- Chapter 8 The Disk Block Cache
- Chapter 9 File System Performance
- Chapter 10 The Vnode Layer
- Chapter 11 User-Level API
- Chapter 12 Testing
- Appendix A File System Construction Kit
First thing to note is that Giampaolo is not a great writer, nor is he a bad one. He does not have the gift that some tech writers have of making both an interesting technical document and a fun read. His style is very straightforward - introduce idea, explicate idea, summarize idea. On the other hand, he knows his topic inside and out, and has an obvious enthusiasm for the material, and a real talent for saying things simply without dumbing it down, and his occasional dry wit makes the book a surprisingly easy read.
Giampaolo is doing two things - discussing designing filesystems in general and documenting the Be filesystem. He does both well. BeFS has some advanced features - arbitrary metadata, attribute queries, and indexing. The desire to support these features influences the overall design of the system, but Giampaolo shows how changes to that design change implementation details. The result is a good overview of how a file system works, the trade-offs in optimizing for a particular usage pattern, and how to design one yourself.
The book can be roughly divided into three sections: the first is an overview of how filesystems work and some of the concepts that you encounter - extents, inodes, B-trees, superblocks, and the other standard pieces of a filesystem. Included in this early section is a good high-level overview of the design of five other file systems: BSD FFS, Linux's ext2, Macintosh HFS, Irix XFS, and Windows NT's NTFS. The coverage here strikes a proper balance between too much and too little information. Giampaolo prefers to show rather than to tell, and these filesystem overviews make the connection between design, performance, and features perfectly clear, and provide a solid background to talk about a specific implementation in detail - namely BeFS.
The second section is the bulk of the book - how to implement a filesystem from the ground up, leaning heavily on the BeFS implementation for examples. This is the most straightforward part of the book. Giampaolo covers a single issue in design and implementation in a "Here's the problem, here's and overview of possible solutions and their drawbacks, here's how I did it, now lets summarize" manner. Again, Giampaolo's style makes this an easy if somewhat dry read. As a filesystem and kernel ignoramus, I would have appreciated a slightly more detailed coverage of how all of the various data structures get to disk - how are they serialized, whether endianess is an issue, etc. The BeOS was pretty portable, running at one time or another on the AT&T Hobbit processor, PowerPC, and x86 - I would have liked to have seen portability issues discussed, however, BeFS wasn't written until after the move from the Hobbit to PowerPC, and the book was written prior to the move to x86, so the lack of coverage is reasonable.
Even considering the plain Jane style of this middle section, there are a few gems. The coverage of journaling is excellent, and while I've long understood journaling from a 10,000 foot perspective, this really made me understand the underlying concepts, combined with simple code snippets that helped understand implementation. The Allocation Policies chapter showed in clear terms that disk access is a major bottleneck, and filesystems have become very sophisticated in their optimizations.
The third section of the book deals with some of the more indirect concerns in implementing a file system; specifically, interacting with the kernel, designing a user level API and the major role of testing in filesystem development. This is the one place Giampaolo's writing shines. He really is a good teacher, and this section affords him the chance to talk about the broader perspective of OS design, and even recount a few war stories. For example, in terms of parentage, the BeOS has BSD and classic MacOS as its father and mother. In a few places, such as the Storage Kit API covered in chapter 11, this heritage shows some signs of less-than-seamless integration, and this offers Giampalo a chance to wax philosophical on the nature of OS design, company politics, and the pressure of shipping dates.
In short, the book lives up to it's title. The author is a pragmatist, and offers a clear roadmap for those who have a need to work with low level filesystem implementation. His emphasis on testing, careful optimization, and data structure protection not only helps to show the pitfalls of filesystem work, but also offers a Swiss army knife of techniques to dodge them. The book concludes with a short appendix which covers a file system construction kit, allowing a would-be implementor to begin work on his own filesystem safely without worrying about killing his hard disk. All in all, a solid read.
Here's a link to Practical File System Design with the Be File System as a PDF; you can also look for a used copy at Barnes & Noble. Slashdot welcomes readers' book reviews -- to see your own review here, carefully read the book review guidelines, then visit the submission page.
Mirror with PDF (Score:5, Informative)
Re:Mirror with PDF (Score:5, Informative)
Earthlink wasn't so charitable [insidemacgames.com] to a Halo fan who put up a movie previewing Halo and saw it downloaded 100,000 times. Earthlink charged him $30,000 for the 4500 gigabytes downloaded.
The moral here is that if you're going to put up anything which might be downloaded quite a lot (or if you're expecting a slashdotting), make sure your host doesn't charge through the nose for extra bandwidth. Or, if the file is over 5MB in size (and under 1GB), make a freecache.org link [freecache.org] and let others mirror it for you automatically.
Re:Mirror with PDF (Score:2)
Re:Mirror with PDF (Score:5, Informative)
Re:Mirror with PDF (Score:2)
Future is relational databases (Score:2, Insightful)
Re:Future is relational databases (Score:4, Insightful)
Re:Future is relational databases (Score:2)
Re:Future is relational databases (Score:3, Funny)
Re:Future is relational databases (Score:2)
Re:Future is relational databases (Score:2)
A DBMS can have an API like anything else, no need for shells.
Re:Future is relational databases (Score:3, Funny)
Re:Future is relational databases (Score:5, Informative)
Re:Future is relational databases (Score:3, Informative)
Re:Future is relational databases (Score:2)
Perhaps in the near future will remove the filesystem as a layer of abstraction, but not yet.
Re:Future is relational databases (Score:2, Informative)
Oracle also has an iFS, short for Internet File System. It's actually a filesystem inside the Oracle RDBMS, and can be connected to through HTTP, FTP, IMAP4, SMTP, SMB/NTFS, NFS, AFP, etc..
So Oracle has an relational database that can be used as a file system.
Re:Future is relational databases (Score:2)
Re:Future is relational databases (Score:2)
Re:Future is relational databases (Score:2, Informative)
Amazon.com uses it.
Re:Future is relational databases (Score:2)
If your current problem is that you have important data analysis that cannot be completed with your current Oracle/whatever relational DB due to an algorithm that could lower it's order a few exponents with an OO design, then yeah. There are all kinds of
Re:Future is relational databases (Score:2)
Re:Future is relational databases (Score:2)
Only if inertia and ignorance continue to hinder us.
No, this is a physical issue. The relational says nothing about the physical level, thus leaving the implementor total freedom to achieve the best performance possible. This is typically better than what is possible navigationally.
Not at all. What makes you think so?
Re:Future is relational databases (Score:2)
The amount of just I am sure I am right and all others are wrong is mind numbing. I can see how a file system based on a relationl database could work. I am just not sure that it is the ultimate solution.
As to it still requireing a file system. Yes even if they database uses a raw partiton it will need to keep track of free and used blocks. To me this is a filesystem.
As to OO databases already being considered useless. So far you ar
Re:Future is relational databases (Score:2)
Not soon. There are currently several problems:
First, OS research is stagnated. It tends to focus on low-level stuff like distribution or microkernels, or GUI stuff like Sun's. Anything else just doesn't happen, because it is considered that the POSIX model can't be overthrown, be it in its implementations like the GNU system or in its corruptions like MS Windows.
That said, there is the GNU Hurd. I have a suspicion it is meant by Stallman to be a stepping stone from POS
Re:Future is relational databases (Score:2)
OSs, ISA, and GUIs are pretty much stagnate.
I was pondering this relational database as a file system. But I was wondering how do you map files and directories to tables and tuples?
Yea Hurd is... Well not moving fast. I know Unix is good but is it the best that we can do? I wonder if something like Plan9 or openmosix is an interesting direction to look in. Lets face it networks of computers is the dominate system and is likeily to g
Re:Future is relational databases (Score:2)
OK, IANAFE (filesystem expert).
It wouldn't need a full-blown filesystem. That "filesystem" would be extremely fast and simple for storing and retrieving data. The database itself would be the real filesystem in terms of abstraction.
From the book:
The two fundamental concepts of any file system are
Re:Future is relational databases (Score:5, Interesting)
Clearly the BeOS designers agree with you.
It's also interesting that the author spends quite a while discussing how difficult it is to do well (particularly performance-wise) and how they almost left it out (IIRC) and/or had to limit its scope.
Clearly the BeOS developers think you're wrong.
And personally, I'll believe people who have actually tried to implement the technology in question over people who say others should do so.
Re:Future is relational databases (Score:3, Insightful)
No, they could be the beginnings of an implementation of the relational model.
If they did they would have gone full ahead instead of taking half measures.
No one ever
Re:Future is relational databases (Score:5, Informative)
Re:Future is relational databases (Score:2)
It was slow and crashed because it was a POS implimentation.
Seriously though, what would the user experiance on a system built on a database file system be like? Mostly I see this as useful for data that requires/benefits from lots of metadata, like MP3 files where I want rich tagging. But would the actual file contents be stored off in another table (the 'MP3FRAME' table for example, which contains all the frames of all MP3 files st
Re:Future is relational databases (Score:3, Informative)
Know you are joking, but for the benefit of others:
A DB isn't fast, it is just stored at a DBMS. Now a RDBMS can be faster than anything else, because of data independence, that is, the separation of the logical and physical levels of any given database.
What does that mean? OneLook gave me so many meanings, I gave up.
Anything you'd like. Hiera
Re: (Score:2)
Re:Future is relational databases (Score:3, Insightful)
There are some misunderstandings here.
First, both IBM DB2 and MS SQL Server are based on the SQL standard -- actually IBM DB2 defined the standard --, and SQL is in frontal violation of serveral fundamental characteristics of the relational model. It does, however, incorporates some relational ideas and is generall
Re:Future is relational databases (Score:2)
Re:Future is relational databases (Score:2)
Am not.
That's not what we need. What we need is a full relational database as the core of the OS, with a relational-enabled language as the primary systems programming language.
Re:Future is relational databases (Score:2)
The text isn't "backwards looking". It is a discussion of how one man implented a file system, and how filesystems are generally implemented. Dominic himself comments that his book is a textbook of sorts (or the textbook he wishes he's had in Graduate School, where he did FS work) and that BeFS is an attempt to implement already existing functionality of the BeOS (where they were using a true database as a file system) using tried and true methods. In fact he bemoans that he couldn't do any
Re:Future is relational databases (Score:2)
Without taking in account 30 years of research and development of the relational model.
I can understand it may sound strange, but it has been that long since hierarchies became obsolete as a way of organising -- not necessarily of presenting -- data.
No, that's a relationship, not a relation. The relational model isn't abo
Re:Future is relational databases (Score:2)
BeFS has all of the capability that a relational database has in a hierarchical arrangement with POSIX properties.
i don't see any need for anything more powerful than that.
what is it about relational databases that makes it so critically important? why isn't BeFS good enough for you? (as a side note, have you read the book?)
Re:Future is relational databases (Score:2)
Because the relational model is the only real data model in existence, giving us the power and performance we need by being simple, expressive, and by separating the logical from the physical level.
That is part the problem. More powerful views of files should be possible.
No.
Re:Future is relational databases WHY??? (Score:2)
When I go for a file, a DVD or one of the swords I know where it will be and what it will contain. Everything has a place and everything in it's place.
If you pull all that apart, and interconnect it then it doesn't have it's place any
Re:Future is relational databases WHY??? (Score:2)
It is just you. Or you're not quite truthful.
There is no interconnection. In fact, the relational model does away with pointers (and relationships) and substitutes relations instead.
The point of the relational model is not doing away with hierarchies as ways of presenting and organising data at the user or physical levels, but as ways
Re:Future is relational databases (Score:2)
Not quite... they were more doing something SQL-like.
It's a nice thought. (Score:5, Interesting)
But look at where he is now... (Score:2)
The reason he is at Apple is that he was brought in because of his work on the BeOS FS, so a lot of the ideas and inner workings (AFAIK) are being put into OSX.
I would hunt some links out but I can't be arsed, but El Reg did a very good interview with Dominic a couple of years ago that was quite a good read.
Without being too cynical (Score:5, Insightful)
But for most people this is seriously dull subject matter. Oscar Wilde would stuggle to get a chuckle out of this stuff.
Re:Without being too cynical (Score:2)
Mirrors: (Score:3, Insightful)
http:/beos.spb.ru/program/105/practical-file-sy
http://users.aber.ac.uk/mmb9/data/practical-fil
http://www.funtech.org/downloads/Temp/practical
Re:Mirrors: (Score:4, Informative)
Link 1 [beos.spb.ru]
Link 2 [aber.ac.uk]
Link 3 [funtech.org]
Really folks, is it so hard to throw a <a></a> tag around a url?
Re:Mirrors: (Score:2)
Re:Mirrors: (Score:2)
Tests... (Score:2)
If that's a link to
Silly submitter... (Score:5, Funny)
the BeOS rising to join the ranks of OSs that won't die
An OS isn't dead or dying until Netcraft confirms it.
Re:Silly submitter... (Score:3, Informative)
Re:Silly submitter... (Score:2)
<snippet>
Before Be, Inc. sold its assets to Palm, Inc., we managed to close a deal allowing us to distribute the PE version and had started negotiations over the future of the Pro version. Koch Media was ready to reissue the copies of the Pro Version that they hadn't managed to resell, to make it a part of a new distribution first called BeOS NG (New Generation), now renamed to "Zeta".
</snippet
Re:Silly submitter... (Score:2)
Re:Silly submitter... (Score:2)
Re:Silly submitter... (Score:2)
Also, it would help to be aware of how long it takes to copy a 17 M file from one disk to another under the OS.
Just what I always wanted (Score:5, Funny)
Holy Crap. I'm a Be-diot. (Score:4, Funny)
Why, oh why can't someone invent a "Parallel Universe Chunnel" so I can get myself a laptop BeBox. tiBooks come close, but Apple crack is still crack!
Ah well. At least I have something else (the
You can do this (Score:2)
ironic fact (Score:5, Informative)
Re:ironic fact (Score:3, Interesting)
Re:ironic fact (Score:2, Interesting)
Re:ironic fact (Score:2)
You mean HFS+ (HFS Plus)? That is the name of the file system.
Re:ironic fact (Score:2, Interesting)
Re:ironic fact (Score:2)
Re:ironic fact (Score:2)
HFSX is an extension to HFS Plus to allow additional features that are incompatible with HFS Plus. The only such feature currently defined is case-sensitive filenames.
Re:ironic fact (Score:2)
I was pointing out that the name he was using are not the normally used names (or correct) for those file systems and trying to make sure of what exactly we was talking about.
Re:Straight up: your sig is gay beyond belief (Score:2, Funny)
Thanks For The Reminder (Score:2, Funny)
When's the last time you had a PDF book as bathroom reading?
BitTorrent (Score:5, Informative)
I'll seed it for an hour or so.
Interesting (Score:3, Insightful)
Other possibility (Score:3, Informative)
Check its Programmer Reference Manuals if you can find these.
BeFS lives (Score:5, Interesting)
The BFS replacement has been one of the fastest progressing parts of the OpenBeOS project. Dominic Giampaolo has actually commented to the team lead of OpenBFS, and complemented the team on the good work they accomplished.
Re:BeFS lives (Score:2)
Sky Operating System (Score:3, Informative)
At Apple Now (Score:2, Interesting)
Be saw the potential of metadata on the filesystem level before everybody else did.
Alive? (Score:3, Interesting)
That being said, I remember using the last BeOS Personal Edition (the one that ran on top of FAT32).
It was incredibly cool. There's definitely something to be said for an OS which boots almost instantaneously.
The inability to print was somewhat of a drawback...
Explicate? (Score:4, Funny)
Isn't it ironic (dontcha think?) that 'explicate' appears in a sentence about straightforward writing? How about explain???
HERE IS ANOTHER GOOD MIRROR!!!! (Score:3, Interesting)
You are all welcome!
http://www.realcoders.org [realcoders.org]
Have Fun Everyone!
-Danny
joecamel (at) realcoders (dot) org
just remember... (Score:5, Interesting)
All that complexity comes at a price. UNIX was a reaction against putting so many features into the kernel, and, in my opinion, the UNIX arguments against putting those kinds of features into the kernel are as valid today as they were 30 years ago.
Unfortunately, the book gives very little historical perspective. It seems to simply assume that "more features" translates into "more advanced". From a quick perusal, division of functionality between kernel and user space seems to be not covered. File versioning at the file system level, another important feature, does not seem to be covered. Historically important file systems and functionality, like those found on VMS, IBM mainframes, and database-based file systems are hardly covered at all.
This book may give you a good idea of what kind of thinking went into the design of the BeOS file system, but it doesn't even come close to a book on file system design in general. And even as a book on the BeOS file system design, it tells you as much about what the creator of the file system didn't think about as what he did think about.
Funny Be IRC quotes (Score:3, Funny)
This first one is particularly applicable as it pertains to the "uncorruptable" BeOS filesystem.
<dEad{Ni}> but you have more problems with win95 than i have ever imagined anyone having
<Tolen{Ni}> nah...you should see some of the people on my dorm floor...
<Tolen{Ni}> one guy had to fdisk like 5 times last semester
<Magaera{Ni}> hehe
<Magaera{Ni}> You CAN'T corrupt the BeOS file system
<Magaera{Ni}> Even by kicking out the power cord
<Gunfighter{Ni}> you can't play Q2 on it either
<Magaera{Ni}> potty stop - brb
<Gunfighter{Ni}> overkill.. yellow card
<Magaera{Ni}> what, you'd rather say i was going to "the little programmer's room" or something??
<Deathwish{Ni}> I got take a BeOS
<Magaera{Ni}> "BeOS combines the best features of all the major operating systems: the ease-of-use of the Macintosh, the power and flexibility of Linux, and Minesweeper from Windows."
Need more than one filesystem (Score:5, Informative)
No. You need four:
1. Ext2 or Ext3 on primary hard drive
2. IS09660 on CDs
3. minix for initrd
4. FAT for floppies/USB devices
Re:Need more than one filesystem (Score:4, Insightful)
Some are good for a specific purpose, like ISO9660. It's got error correction in it, so it's good for write-once removable media.
Others are extremely widespread, like FAT. That's good for interoperability.
Others support advanced permissions like ACLs. That's good for system security.
Re:Need more than one filesystem (Score:2)
And let's not forget UDF for our DVDs.
Re:Need more than one filesystem (Score:2)
Huh ?!? My initrd is an ext2 fs  ...
Re:Crud more file Systems? (Score:4, Insightful)
Aren't there enough filesystems? All you need is one!
Yes, and 640K is enough for anyone, and everything worth inventing has already been invented.
But seriously, what's wrong with creating new things? A new filesystem just might be better than the current ones. Stranger things have happened. Much stranger things, in fact. Especially in Computer Science. Remember hierarchical databases? No? Well, they used to be all the rage back in the old days, before relational databases took over. These days, the idea of using any other database model seems ludicrous. However, even those might be superseded at some point in the future.
It never hurts to keep an open mind you know...
Re: Relational databases are the only answer (Score:2)
Re:Crud more file Systems? (Score:5, Funny)
Re:Crud more file Systems? (Score:4, Funny)
isn't love a journaling filesystem? My girlfriend can remember every little issue I would like to delete.
You think that's bad? Wait until you upgrade to the wife filesystem. It won't even let you delete entries for mowing the lawn, etc. from your crontab! Talk about rights management...
Re:Crud more file Systems? (Score:5, Funny)
Re:BeOS won't die? (Score:2)
Re:BeOS won't die? (Score:2)
Bruce
Re:BeOS won't die? (Score:2)
Bruce
Re:BeOS won't die? (Score:2)
Re:BeOS won't die? (Score:2)
Like a warm hug. (Score:2, Insightful)
Re:I love BeOS (Score:2)
So, uh, what program do you use?
Sorry, that was a cheap shot...
Re:I love BeOS (Score:3)
I think the main issue that Be ran into is that their vision was a little confuzzled. Anyone else who followed Be remember when they decided they wanted to drop the desktop market and move into set-top boxes? Perhaps if they'd managed to bring some of the big audio/video software companies on board and
Re:BeOS Books (Score:2)
i used BeOS before I ever even touched linux, and it gave me quite a headstart on BASH programming and many of the concepts behind linux (plain-text programming and config files, etc).
I also loved the API for it. It was amazingly simple to write GUI applications in C++. On the same level as Cocoa, h