1 | =head1 NAME
|
---|
2 |
|
---|
3 | perltoot - Tom's object-oriented tutorial for perl
|
---|
4 |
|
---|
5 | =head1 DESCRIPTION
|
---|
6 |
|
---|
7 | Object-oriented programming is a big seller these days. Some managers
|
---|
8 | would rather have objects than sliced bread. Why is that? What's so
|
---|
9 | special about an object? Just what I<is> an object anyway?
|
---|
10 |
|
---|
11 | An object is nothing but a way of tucking away complex behaviours into
|
---|
12 | a neat little easy-to-use bundle. (This is what professors call
|
---|
13 | abstraction.) Smart people who have nothing to do but sit around for
|
---|
14 | weeks on end figuring out really hard problems make these nifty
|
---|
15 | objects that even regular people can use. (This is what professors call
|
---|
16 | software reuse.) Users (well, programmers) can play with this little
|
---|
17 | bundle all they want, but they aren't to open it up and mess with the
|
---|
18 | insides. Just like an expensive piece of hardware, the contract says
|
---|
19 | that you void the warranty if you muck with the cover. So don't do that.
|
---|
20 |
|
---|
21 | The heart of objects is the class, a protected little private namespace
|
---|
22 | full of data and functions. A class is a set of related routines that
|
---|
23 | addresses some problem area. You can think of it as a user-defined type.
|
---|
24 | The Perl package mechanism, also used for more traditional modules,
|
---|
25 | is used for class modules as well. Objects "live" in a class, meaning
|
---|
26 | that they belong to some package.
|
---|
27 |
|
---|
28 | More often than not, the class provides the user with little bundles.
|
---|
29 | These bundles are objects. They know whose class they belong to,
|
---|
30 | and how to behave. Users ask the class to do something, like "give
|
---|
31 | me an object." Or they can ask one of these objects to do something.
|
---|
32 | Asking a class to do something for you is calling a I<class method>.
|
---|
33 | Asking an object to do something for you is calling an I<object method>.
|
---|
34 | Asking either a class (usually) or an object (sometimes) to give you
|
---|
35 | back an object is calling a I<constructor>, which is just a
|
---|
36 | kind of method.
|
---|
37 |
|
---|
38 | That's all well and good, but how is an object different from any other
|
---|
39 | Perl data type? Just what is an object I<really>; that is, what's its
|
---|
40 | fundamental type? The answer to the first question is easy. An object
|
---|
41 | is different from any other data type in Perl in one and only one way:
|
---|
42 | you may dereference it using not merely string or numeric subscripts
|
---|
43 | as with simple arrays and hashes, but with named subroutine calls.
|
---|
44 | In a word, with I<methods>.
|
---|
45 |
|
---|
46 | The answer to the second question is that it's a reference, and not just
|
---|
47 | any reference, mind you, but one whose referent has been I<bless>()ed
|
---|
48 | into a particular class (read: package). What kind of reference? Well,
|
---|
49 | the answer to that one is a bit less concrete. That's because in Perl
|
---|
50 | the designer of the class can employ any sort of reference they'd like
|
---|
51 | as the underlying intrinsic data type. It could be a scalar, an array,
|
---|
52 | or a hash reference. It could even be a code reference. But because
|
---|
53 | of its inherent flexibility, an object is usually a hash reference.
|
---|
54 |
|
---|
55 | =head1 Creating a Class
|
---|
56 |
|
---|
57 | Before you create a class, you need to decide what to name it. That's
|
---|
58 | because the class (package) name governs the name of the file used to
|
---|
59 | house it, just as with regular modules. Then, that class (package)
|
---|
60 | should provide one or more ways to generate objects. Finally, it should
|
---|
61 | provide mechanisms to allow users of its objects to indirectly manipulate
|
---|
62 | these objects from a distance.
|
---|
63 |
|
---|
64 | For example, let's make a simple Person class module. It gets stored in
|
---|
65 | the file Person.pm. If it were called a Happy::Person class, it would
|
---|
66 | be stored in the file Happy/Person.pm, and its package would become
|
---|
67 | Happy::Person instead of just Person. (On a personal computer not
|
---|
68 | running Unix or Plan 9, but something like Mac OS or VMS, the directory
|
---|
69 | separator may be different, but the principle is the same.) Do not assume
|
---|
70 | any formal relationship between modules based on their directory names.
|
---|
71 | This is merely a grouping convenience, and has no effect on inheritance,
|
---|
72 | variable accessibility, or anything else.
|
---|
73 |
|
---|
74 | For this module we aren't going to use Exporter, because we're
|
---|
75 | a well-behaved class module that doesn't export anything at all.
|
---|
76 | In order to manufacture objects, a class needs to have a I<constructor
|
---|
77 | method>. A constructor gives you back not just a regular data type,
|
---|
78 | but a brand-new object in that class. This magic is taken care of by
|
---|
79 | the bless() function, whose sole purpose is to enable its referent to
|
---|
80 | be used as an object. Remember: being an object really means nothing
|
---|
81 | more than that methods may now be called against it.
|
---|
82 |
|
---|
83 | While a constructor may be named anything you'd like, most Perl
|
---|
84 | programmers seem to like to call theirs new(). However, new() is not
|
---|
85 | a reserved word, and a class is under no obligation to supply such.
|
---|
86 | Some programmers have also been known to use a function with
|
---|
87 | the same name as the class as the constructor.
|
---|
88 |
|
---|
89 | =head2 Object Representation
|
---|
90 |
|
---|
91 | By far the most common mechanism used in Perl to represent a Pascal
|
---|
92 | record, a C struct, or a C++ class is an anonymous hash. That's because a
|
---|
93 | hash has an arbitrary number of data fields, each conveniently accessed by
|
---|
94 | an arbitrary name of your own devising.
|
---|
95 |
|
---|
96 | If you were just doing a simple
|
---|
97 | struct-like emulation, you would likely go about it something like this:
|
---|
98 |
|
---|
99 | $rec = {
|
---|
100 | name => "Jason",
|
---|
101 | age => 23,
|
---|
102 | peers => [ "Norbert", "Rhys", "Phineas"],
|
---|
103 | };
|
---|
104 |
|
---|
105 | If you felt like it, you could add a bit of visual distinction
|
---|
106 | by up-casing the hash keys:
|
---|
107 |
|
---|
108 | $rec = {
|
---|
109 | NAME => "Jason",
|
---|
110 | AGE => 23,
|
---|
111 | PEERS => [ "Norbert", "Rhys", "Phineas"],
|
---|
112 | };
|
---|
113 |
|
---|
114 | And so you could get at C<< $rec->{NAME} >> to find "Jason", or
|
---|
115 | C<< @{ $rec->{PEERS} } >> to get at "Norbert", "Rhys", and "Phineas".
|
---|
116 | (Have you ever noticed how many 23-year-old programmers seem to
|
---|
117 | be named "Jason" these days? :-)
|
---|
118 |
|
---|
119 | This same model is often used for classes, although it is not considered
|
---|
120 | the pinnacle of programming propriety for folks from outside the
|
---|
121 | class to come waltzing into an object, brazenly accessing its data
|
---|
122 | members directly. Generally speaking, an object should be considered
|
---|
123 | an opaque cookie that you use I<object methods> to access. Visually,
|
---|
124 | methods look like you're dereffing a reference using a function name
|
---|
125 | instead of brackets or braces.
|
---|
126 |
|
---|
127 | =head2 Class Interface
|
---|
128 |
|
---|
129 | Some languages provide a formal syntactic interface to a class's methods,
|
---|
130 | but Perl does not. It relies on you to read the documentation of each
|
---|
131 | class. If you try to call an undefined method on an object, Perl won't
|
---|
132 | complain, but the program will trigger an exception while it's running.
|
---|
133 | Likewise, if you call a method expecting a prime number as its argument
|
---|
134 | with a non-prime one instead, you can't expect the compiler to catch this.
|
---|
135 | (Well, you can expect it all you like, but it's not going to happen.)
|
---|
136 |
|
---|
137 | Let's suppose you have a well-educated user of your Person class,
|
---|
138 | someone who has read the docs that explain the prescribed
|
---|
139 | interface. Here's how they might use the Person class:
|
---|
140 |
|
---|
141 | use Person;
|
---|
142 |
|
---|
143 | $him = Person->new();
|
---|
144 | $him->name("Jason");
|
---|
145 | $him->age(23);
|
---|
146 | $him->peers( "Norbert", "Rhys", "Phineas" );
|
---|
147 |
|
---|
148 | push @All_Recs, $him; # save object in array for later
|
---|
149 |
|
---|
150 | printf "%s is %d years old.\n", $him->name, $him->age;
|
---|
151 | print "His peers are: ", join(", ", $him->peers), "\n";
|
---|
152 |
|
---|
153 | printf "Last rec's name is %s\n", $All_Recs[-1]->name;
|
---|
154 |
|
---|
155 | As you can see, the user of the class doesn't know (or at least, has no
|
---|
156 | business paying attention to the fact) that the object has one particular
|
---|
157 | implementation or another. The interface to the class and its objects
|
---|
158 | is exclusively via methods, and that's all the user of the class should
|
---|
159 | ever play with.
|
---|
160 |
|
---|
161 | =head2 Constructors and Instance Methods
|
---|
162 |
|
---|
163 | Still, I<someone> has to know what's in the object. And that someone is
|
---|
164 | the class. It implements methods that the programmer uses to access
|
---|
165 | the object. Here's how to implement the Person class using the standard
|
---|
166 | hash-ref-as-an-object idiom. We'll make a class method called new() to
|
---|
167 | act as the constructor, and three object methods called name(), age(), and
|
---|
168 | peers() to get at per-object data hidden away in our anonymous hash.
|
---|
169 |
|
---|
170 | package Person;
|
---|
171 | use strict;
|
---|
172 |
|
---|
173 | ##################################################
|
---|
174 | ## the object constructor (simplistic version) ##
|
---|
175 | ##################################################
|
---|
176 | sub new {
|
---|
177 | my $self = {};
|
---|
178 | $self->{NAME} = undef;
|
---|
179 | $self->{AGE} = undef;
|
---|
180 | $self->{PEERS} = [];
|
---|
181 | bless($self); # but see below
|
---|
182 | return $self;
|
---|
183 | }
|
---|
184 |
|
---|
185 | ##############################################
|
---|
186 | ## methods to access per-object data ##
|
---|
187 | ## ##
|
---|
188 | ## With args, they set the value. Without ##
|
---|
189 | ## any, they only retrieve it/them. ##
|
---|
190 | ##############################################
|
---|
191 |
|
---|
192 | sub name {
|
---|
193 | my $self = shift;
|
---|
194 | if (@_) { $self->{NAME} = shift }
|
---|
195 | return $self->{NAME};
|
---|
196 | }
|
---|
197 |
|
---|
198 | sub age {
|
---|
199 | my $self = shift;
|
---|
200 | if (@_) { $self->{AGE} = shift }
|
---|
201 | return $self->{AGE};
|
---|
202 | }
|
---|
203 |
|
---|
204 | sub peers {
|
---|
205 | my $self = shift;
|
---|
206 | if (@_) { @{ $self->{PEERS} } = @_ }
|
---|
207 | return @{ $self->{PEERS} };
|
---|
208 | }
|
---|
209 |
|
---|
210 | 1; # so the require or use succeeds
|
---|
211 |
|
---|
212 | We've created three methods to access an object's data, name(), age(),
|
---|
213 | and peers(). These are all substantially similar. If called with an
|
---|
214 | argument, they set the appropriate field; otherwise they return the
|
---|
215 | value held by that field, meaning the value of that hash key.
|
---|
216 |
|
---|
217 | =head2 Planning for the Future: Better Constructors
|
---|
218 |
|
---|
219 | Even though at this point you may not even know what it means, someday
|
---|
220 | you're going to worry about inheritance. (You can safely ignore this
|
---|
221 | for now and worry about it later if you'd like.) To ensure that this
|
---|
222 | all works out smoothly, you must use the double-argument form of bless().
|
---|
223 | The second argument is the class into which the referent will be blessed.
|
---|
224 | By not assuming our own class as the default second argument and instead
|
---|
225 | using the class passed into us, we make our constructor inheritable.
|
---|
226 |
|
---|
227 | sub new {
|
---|
228 | my $class = shift;
|
---|
229 | my $self = {};
|
---|
230 | $self->{NAME} = undef;
|
---|
231 | $self->{AGE} = undef;
|
---|
232 | $self->{PEERS} = [];
|
---|
233 | bless ($self, $class);
|
---|
234 | return $self;
|
---|
235 | }
|
---|
236 |
|
---|
237 | That's about all there is for constructors. These methods bring objects
|
---|
238 | to life, returning neat little opaque bundles to the user to be used in
|
---|
239 | subsequent method calls.
|
---|
240 |
|
---|
241 | =head2 Destructors
|
---|
242 |
|
---|
243 | Every story has a beginning and an end. The beginning of the object's
|
---|
244 | story is its constructor, explicitly called when the object comes into
|
---|
245 | existence. But the ending of its story is the I<destructor>, a method
|
---|
246 | implicitly called when an object leaves this life. Any per-object
|
---|
247 | clean-up code is placed in the destructor, which must (in Perl) be called
|
---|
248 | DESTROY.
|
---|
249 |
|
---|
250 | If constructors can have arbitrary names, then why not destructors?
|
---|
251 | Because while a constructor is explicitly called, a destructor is not.
|
---|
252 | Destruction happens automatically via Perl's garbage collection (GC)
|
---|
253 | system, which is a quick but somewhat lazy reference-based GC system.
|
---|
254 | To know what to call, Perl insists that the destructor be named DESTROY.
|
---|
255 | Perl's notion of the right time to call a destructor is not well-defined
|
---|
256 | currently, which is why your destructors should not rely on when they are
|
---|
257 | called.
|
---|
258 |
|
---|
259 | Why is DESTROY in all caps? Perl on occasion uses purely uppercase
|
---|
260 | function names as a convention to indicate that the function will
|
---|
261 | be automatically called by Perl in some way. Others that are called
|
---|
262 | implicitly include BEGIN, END, AUTOLOAD, plus all methods used by
|
---|
263 | tied objects, described in L<perltie>.
|
---|
264 |
|
---|
265 | In really good object-oriented programming languages, the user doesn't
|
---|
266 | care when the destructor is called. It just happens when it's supposed
|
---|
267 | to. In low-level languages without any GC at all, there's no way to
|
---|
268 | depend on this happening at the right time, so the programmer must
|
---|
269 | explicitly call the destructor to clean up memory and state, crossing
|
---|
270 | their fingers that it's the right time to do so. Unlike C++, an
|
---|
271 | object destructor is nearly never needed in Perl, and even when it is,
|
---|
272 | explicit invocation is uncalled for. In the case of our Person class,
|
---|
273 | we don't need a destructor because Perl takes care of simple matters
|
---|
274 | like memory deallocation.
|
---|
275 |
|
---|
276 | The only situation where Perl's reference-based GC won't work is
|
---|
277 | when there's a circularity in the data structure, such as:
|
---|
278 |
|
---|
279 | $this->{WHATEVER} = $this;
|
---|
280 |
|
---|
281 | In that case, you must delete the self-reference manually if you expect
|
---|
282 | your program not to leak memory. While admittedly error-prone, this is
|
---|
283 | the best we can do right now. Nonetheless, rest assured that when your
|
---|
284 | program is finished, its objects' destructors are all duly called.
|
---|
285 | So you are guaranteed that an object I<eventually> gets properly
|
---|
286 | destroyed, except in the unique case of a program that never exits.
|
---|
287 | (If you're running Perl embedded in another application, this full GC
|
---|
288 | pass happens a bit more frequently--whenever a thread shuts down.)
|
---|
289 |
|
---|
290 | =head2 Other Object Methods
|
---|
291 |
|
---|
292 | The methods we've talked about so far have either been constructors or
|
---|
293 | else simple "data methods", interfaces to data stored in the object.
|
---|
294 | These are a bit like an object's data members in the C++ world, except
|
---|
295 | that strangers don't access them as data. Instead, they should only
|
---|
296 | access the object's data indirectly via its methods. This is an
|
---|
297 | important rule: in Perl, access to an object's data should I<only>
|
---|
298 | be made through methods.
|
---|
299 |
|
---|
300 | Perl doesn't impose restrictions on who gets to use which methods.
|
---|
301 | The public-versus-private distinction is by convention, not syntax.
|
---|
302 | (Well, unless you use the Alias module described below in
|
---|
303 | L<Data Members as Variables>.) Occasionally you'll see method names beginning or ending
|
---|
304 | with an underscore or two. This marking is a convention indicating
|
---|
305 | that the methods are private to that class alone and sometimes to its
|
---|
306 | closest acquaintances, its immediate subclasses. But this distinction
|
---|
307 | is not enforced by Perl itself. It's up to the programmer to behave.
|
---|
308 |
|
---|
309 | There's no reason to limit methods to those that simply access data.
|
---|
310 | Methods can do anything at all. The key point is that they're invoked
|
---|
311 | against an object or a class. Let's say we'd like object methods that
|
---|
312 | do more than fetch or set one particular field.
|
---|
313 |
|
---|
314 | sub exclaim {
|
---|
315 | my $self = shift;
|
---|
316 | return sprintf "Hi, I'm %s, age %d, working with %s",
|
---|
317 | $self->{NAME}, $self->{AGE}, join(", ", @{$self->{PEERS}});
|
---|
318 | }
|
---|
319 |
|
---|
320 | Or maybe even one like this:
|
---|
321 |
|
---|
322 | sub happy_birthday {
|
---|
323 | my $self = shift;
|
---|
324 | return ++$self->{AGE};
|
---|
325 | }
|
---|
326 |
|
---|
327 | Some might argue that one should go at these this way:
|
---|
328 |
|
---|
329 | sub exclaim {
|
---|
330 | my $self = shift;
|
---|
331 | return sprintf "Hi, I'm %s, age %d, working with %s",
|
---|
332 | $self->name, $self->age, join(", ", $self->peers);
|
---|
333 | }
|
---|
334 |
|
---|
335 | sub happy_birthday {
|
---|
336 | my $self = shift;
|
---|
337 | return $self->age( $self->age() + 1 );
|
---|
338 | }
|
---|
339 |
|
---|
340 | But since these methods are all executing in the class itself, this
|
---|
341 | may not be critical. There are tradeoffs to be made. Using direct
|
---|
342 | hash access is faster (about an order of magnitude faster, in fact), and
|
---|
343 | it's more convenient when you want to interpolate in strings. But using
|
---|
344 | methods (the external interface) internally shields not just the users of
|
---|
345 | your class but even you yourself from changes in your data representation.
|
---|
346 |
|
---|
347 | =head1 Class Data
|
---|
348 |
|
---|
349 | What about "class data", data items common to each object in a class?
|
---|
350 | What would you want that for? Well, in your Person class, you might
|
---|
351 | like to keep track of the total people alive. How do you implement that?
|
---|
352 |
|
---|
353 | You I<could> make it a global variable called $Person::Census. But about
|
---|
354 | only reason you'd do that would be if you I<wanted> people to be able to
|
---|
355 | get at your class data directly. They could just say $Person::Census
|
---|
356 | and play around with it. Maybe this is ok in your design scheme.
|
---|
357 | You might even conceivably want to make it an exported variable. To be
|
---|
358 | exportable, a variable must be a (package) global. If this were a
|
---|
359 | traditional module rather than an object-oriented one, you might do that.
|
---|
360 |
|
---|
361 | While this approach is expected in most traditional modules, it's
|
---|
362 | generally considered rather poor form in most object modules. In an
|
---|
363 | object module, you should set up a protective veil to separate interface
|
---|
364 | from implementation. So provide a class method to access class data
|
---|
365 | just as you provide object methods to access object data.
|
---|
366 |
|
---|
367 | So, you I<could> still keep $Census as a package global and rely upon
|
---|
368 | others to honor the contract of the module and therefore not play around
|
---|
369 | with its implementation. You could even be supertricky and make $Census a
|
---|
370 | tied object as described in L<perltie>, thereby intercepting all accesses.
|
---|
371 |
|
---|
372 | But more often than not, you just want to make your class data a
|
---|
373 | file-scoped lexical. To do so, simply put this at the top of the file:
|
---|
374 |
|
---|
375 | my $Census = 0;
|
---|
376 |
|
---|
377 | Even though the scope of a my() normally expires when the block in which
|
---|
378 | it was declared is done (in this case the whole file being required or
|
---|
379 | used), Perl's deep binding of lexical variables guarantees that the
|
---|
380 | variable will not be deallocated, remaining accessible to functions
|
---|
381 | declared within that scope. This doesn't work with global variables
|
---|
382 | given temporary values via local(), though.
|
---|
383 |
|
---|
384 | Irrespective of whether you leave $Census a package global or make
|
---|
385 | it instead a file-scoped lexical, you should make these
|
---|
386 | changes to your Person::new() constructor:
|
---|
387 |
|
---|
388 | sub new {
|
---|
389 | my $class = shift;
|
---|
390 | my $self = {};
|
---|
391 | $Census++;
|
---|
392 | $self->{NAME} = undef;
|
---|
393 | $self->{AGE} = undef;
|
---|
394 | $self->{PEERS} = [];
|
---|
395 | bless ($self, $class);
|
---|
396 | return $self;
|
---|
397 | }
|
---|
398 |
|
---|
399 | sub population {
|
---|
400 | return $Census;
|
---|
401 | }
|
---|
402 |
|
---|
403 | Now that we've done this, we certainly do need a destructor so that
|
---|
404 | when Person is destroyed, the $Census goes down. Here's how
|
---|
405 | this could be done:
|
---|
406 |
|
---|
407 | sub DESTROY { --$Census }
|
---|
408 |
|
---|
409 | Notice how there's no memory to deallocate in the destructor? That's
|
---|
410 | something that Perl takes care of for you all by itself.
|
---|
411 |
|
---|
412 | Alternatively, you could use the Class::Data::Inheritable module from
|
---|
413 | CPAN.
|
---|
414 |
|
---|
415 |
|
---|
416 | =head2 Accessing Class Data
|
---|
417 |
|
---|
418 | It turns out that this is not really a good way to go about handling
|
---|
419 | class data. A good scalable rule is that I<you must never reference class
|
---|
420 | data directly from an object method>. Otherwise you aren't building a
|
---|
421 | scalable, inheritable class. The object must be the rendezvous point
|
---|
422 | for all operations, especially from an object method. The globals
|
---|
423 | (class data) would in some sense be in the "wrong" package in your
|
---|
424 | derived classes. In Perl, methods execute in the context of the class
|
---|
425 | they were defined in, I<not> that of the object that triggered them.
|
---|
426 | Therefore, namespace visibility of package globals in methods is unrelated
|
---|
427 | to inheritance.
|
---|
428 |
|
---|
429 | Got that? Maybe not. Ok, let's say that some other class "borrowed"
|
---|
430 | (well, inherited) the DESTROY method as it was defined above. When those
|
---|
431 | objects are destroyed, the original $Census variable will be altered,
|
---|
432 | not the one in the new class's package namespace. Perhaps this is what
|
---|
433 | you want, but probably it isn't.
|
---|
434 |
|
---|
435 | Here's how to fix this. We'll store a reference to the data in the
|
---|
436 | value accessed by the hash key "_CENSUS". Why the underscore? Well,
|
---|
437 | mostly because an initial underscore already conveys strong feelings
|
---|
438 | of magicalness to a C programmer. It's really just a mnemonic device
|
---|
439 | to remind ourselves that this field is special and not to be used as
|
---|
440 | a public data member in the same way that NAME, AGE, and PEERS are.
|
---|
441 | (Because we've been developing this code under the strict pragma, prior
|
---|
442 | to perl version 5.004 we'll have to quote the field name.)
|
---|
443 |
|
---|
444 | sub new {
|
---|
445 | my $class = shift;
|
---|
446 | my $self = {};
|
---|
447 | $self->{NAME} = undef;
|
---|
448 | $self->{AGE} = undef;
|
---|
449 | $self->{PEERS} = [];
|
---|
450 | # "private" data
|
---|
451 | $self->{"_CENSUS"} = \$Census;
|
---|
452 | bless ($self, $class);
|
---|
453 | ++ ${ $self->{"_CENSUS"} };
|
---|
454 | return $self;
|
---|
455 | }
|
---|
456 |
|
---|
457 | sub population {
|
---|
458 | my $self = shift;
|
---|
459 | if (ref $self) {
|
---|
460 | return ${ $self->{"_CENSUS"} };
|
---|
461 | } else {
|
---|
462 | return $Census;
|
---|
463 | }
|
---|
464 | }
|
---|
465 |
|
---|
466 | sub DESTROY {
|
---|
467 | my $self = shift;
|
---|
468 | -- ${ $self->{"_CENSUS"} };
|
---|
469 | }
|
---|
470 |
|
---|
471 | =head2 Debugging Methods
|
---|
472 |
|
---|
473 | It's common for a class to have a debugging mechanism. For example,
|
---|
474 | you might want to see when objects are created or destroyed. To do that,
|
---|
475 | add a debugging variable as a file-scoped lexical. For this, we'll pull
|
---|
476 | in the standard Carp module to emit our warnings and fatal messages.
|
---|
477 | That way messages will come out with the caller's filename and
|
---|
478 | line number instead of our own; if we wanted them to be from our own
|
---|
479 | perspective, we'd just use die() and warn() directly instead of croak()
|
---|
480 | and carp() respectively.
|
---|
481 |
|
---|
482 | use Carp;
|
---|
483 | my $Debugging = 0;
|
---|
484 |
|
---|
485 | Now add a new class method to access the variable.
|
---|
486 |
|
---|
487 | sub debug {
|
---|
488 | my $class = shift;
|
---|
489 | if (ref $class) { confess "Class method called as object method" }
|
---|
490 | unless (@_ == 1) { confess "usage: CLASSNAME->debug(level)" }
|
---|
491 | $Debugging = shift;
|
---|
492 | }
|
---|
493 |
|
---|
494 | Now fix up DESTROY to murmur a bit as the moribund object expires:
|
---|
495 |
|
---|
496 | sub DESTROY {
|
---|
497 | my $self = shift;
|
---|
498 | if ($Debugging) { carp "Destroying $self " . $self->name }
|
---|
499 | -- ${ $self->{"_CENSUS"} };
|
---|
500 | }
|
---|
501 |
|
---|
502 | One could conceivably make a per-object debug state. That
|
---|
503 | way you could call both of these:
|
---|
504 |
|
---|
505 | Person->debug(1); # entire class
|
---|
506 | $him->debug(1); # just this object
|
---|
507 |
|
---|
508 | To do so, we need our debugging method to be a "bimodal" one, one that
|
---|
509 | works on both classes I<and> objects. Therefore, adjust the debug()
|
---|
510 | and DESTROY methods as follows:
|
---|
511 |
|
---|
512 | sub debug {
|
---|
513 | my $self = shift;
|
---|
514 | confess "usage: thing->debug(level)" unless @_ == 1;
|
---|
515 | my $level = shift;
|
---|
516 | if (ref($self)) {
|
---|
517 | $self->{"_DEBUG"} = $level; # just myself
|
---|
518 | } else {
|
---|
519 | $Debugging = $level; # whole class
|
---|
520 | }
|
---|
521 | }
|
---|
522 |
|
---|
523 | sub DESTROY {
|
---|
524 | my $self = shift;
|
---|
525 | if ($Debugging || $self->{"_DEBUG"}) {
|
---|
526 | carp "Destroying $self " . $self->name;
|
---|
527 | }
|
---|
528 | -- ${ $self->{"_CENSUS"} };
|
---|
529 | }
|
---|
530 |
|
---|
531 | What happens if a derived class (which we'll call Employee) inherits
|
---|
532 | methods from this Person base class? Then C<< Employee->debug() >>, when called
|
---|
533 | as a class method, manipulates $Person::Debugging not $Employee::Debugging.
|
---|
534 |
|
---|
535 | =head2 Class Destructors
|
---|
536 |
|
---|
537 | The object destructor handles the death of each distinct object. But sometimes
|
---|
538 | you want a bit of cleanup when the entire class is shut down, which
|
---|
539 | currently only happens when the program exits. To make such a
|
---|
540 | I<class destructor>, create a function in that class's package named
|
---|
541 | END. This works just like the END function in traditional modules,
|
---|
542 | meaning that it gets called whenever your program exits unless it execs
|
---|
543 | or dies of an uncaught signal. For example,
|
---|
544 |
|
---|
545 | sub END {
|
---|
546 | if ($Debugging) {
|
---|
547 | print "All persons are going away now.\n";
|
---|
548 | }
|
---|
549 | }
|
---|
550 |
|
---|
551 | When the program exits, all the class destructors (END functions) are
|
---|
552 | be called in the opposite order that they were loaded in (LIFO order).
|
---|
553 |
|
---|
554 | =head2 Documenting the Interface
|
---|
555 |
|
---|
556 | And there you have it: we've just shown you the I<implementation> of this
|
---|
557 | Person class. Its I<interface> would be its documentation. Usually this
|
---|
558 | means putting it in pod ("plain old documentation") format right there
|
---|
559 | in the same file. In our Person example, we would place the following
|
---|
560 | docs anywhere in the Person.pm file. Even though it looks mostly like
|
---|
561 | code, it's not. It's embedded documentation such as would be used by
|
---|
562 | the pod2man, pod2html, or pod2text programs. The Perl compiler ignores
|
---|
563 | pods entirely, just as the translators ignore code. Here's an example of
|
---|
564 | some pods describing the informal interface:
|
---|
565 |
|
---|
566 | =head1 NAME
|
---|
567 |
|
---|
568 | Person - class to implement people
|
---|
569 |
|
---|
570 | =head1 SYNOPSIS
|
---|
571 |
|
---|
572 | use Person;
|
---|
573 |
|
---|
574 | #################
|
---|
575 | # class methods #
|
---|
576 | #################
|
---|
577 | $ob = Person->new;
|
---|
578 | $count = Person->population;
|
---|
579 |
|
---|
580 | #######################
|
---|
581 | # object data methods #
|
---|
582 | #######################
|
---|
583 |
|
---|
584 | ### get versions ###
|
---|
585 | $who = $ob->name;
|
---|
586 | $years = $ob->age;
|
---|
587 | @pals = $ob->peers;
|
---|
588 |
|
---|
589 | ### set versions ###
|
---|
590 | $ob->name("Jason");
|
---|
591 | $ob->age(23);
|
---|
592 | $ob->peers( "Norbert", "Rhys", "Phineas" );
|
---|
593 |
|
---|
594 | ########################
|
---|
595 | # other object methods #
|
---|
596 | ########################
|
---|
597 |
|
---|
598 | $phrase = $ob->exclaim;
|
---|
599 | $ob->happy_birthday;
|
---|
600 |
|
---|
601 | =head1 DESCRIPTION
|
---|
602 |
|
---|
603 | The Person class implements dah dee dah dee dah....
|
---|
604 |
|
---|
605 | That's all there is to the matter of interface versus implementation.
|
---|
606 | A programmer who opens up the module and plays around with all the private
|
---|
607 | little shiny bits that were safely locked up behind the interface contract
|
---|
608 | has voided the warranty, and you shouldn't worry about their fate.
|
---|
609 |
|
---|
610 | =head1 Aggregation
|
---|
611 |
|
---|
612 | Suppose you later want to change the class to implement better names.
|
---|
613 | Perhaps you'd like to support both given names (called Christian names,
|
---|
614 | irrespective of one's religion) and family names (called surnames), plus
|
---|
615 | nicknames and titles. If users of your Person class have been properly
|
---|
616 | accessing it through its documented interface, then you can easily change
|
---|
617 | the underlying implementation. If they haven't, then they lose and
|
---|
618 | it's their fault for breaking the contract and voiding their warranty.
|
---|
619 |
|
---|
620 | To do this, we'll make another class, this one called Fullname. What's
|
---|
621 | the Fullname class look like? To answer that question, you have to
|
---|
622 | first figure out how you want to use it. How about we use it this way:
|
---|
623 |
|
---|
624 | $him = Person->new();
|
---|
625 | $him->fullname->title("St");
|
---|
626 | $him->fullname->christian("Thomas");
|
---|
627 | $him->fullname->surname("Aquinas");
|
---|
628 | $him->fullname->nickname("Tommy");
|
---|
629 | printf "His normal name is %s\n", $him->name;
|
---|
630 | printf "But his real name is %s\n", $him->fullname->as_string;
|
---|
631 |
|
---|
632 | Ok. To do this, we'll change Person::new() so that it supports
|
---|
633 | a full name field this way:
|
---|
634 |
|
---|
635 | sub new {
|
---|
636 | my $class = shift;
|
---|
637 | my $self = {};
|
---|
638 | $self->{FULLNAME} = Fullname->new();
|
---|
639 | $self->{AGE} = undef;
|
---|
640 | $self->{PEERS} = [];
|
---|
641 | $self->{"_CENSUS"} = \$Census;
|
---|
642 | bless ($self, $class);
|
---|
643 | ++ ${ $self->{"_CENSUS"} };
|
---|
644 | return $self;
|
---|
645 | }
|
---|
646 |
|
---|
647 | sub fullname {
|
---|
648 | my $self = shift;
|
---|
649 | return $self->{FULLNAME};
|
---|
650 | }
|
---|
651 |
|
---|
652 | Then to support old code, define Person::name() this way:
|
---|
653 |
|
---|
654 | sub name {
|
---|
655 | my $self = shift;
|
---|
656 | return $self->{FULLNAME}->nickname(@_)
|
---|
657 | || $self->{FULLNAME}->christian(@_);
|
---|
658 | }
|
---|
659 |
|
---|
660 | Here's the Fullname class. We'll use the same technique
|
---|
661 | of using a hash reference to hold data fields, and methods
|
---|
662 | by the appropriate name to access them:
|
---|
663 |
|
---|
664 | package Fullname;
|
---|
665 | use strict;
|
---|
666 |
|
---|
667 | sub new {
|
---|
668 | my $class = shift;
|
---|
669 | my $self = {
|
---|
670 | TITLE => undef,
|
---|
671 | CHRISTIAN => undef,
|
---|
672 | SURNAME => undef,
|
---|
673 | NICK => undef,
|
---|
674 | };
|
---|
675 | bless ($self, $class);
|
---|
676 | return $self;
|
---|
677 | }
|
---|
678 |
|
---|
679 | sub christian {
|
---|
680 | my $self = shift;
|
---|
681 | if (@_) { $self->{CHRISTIAN} = shift }
|
---|
682 | return $self->{CHRISTIAN};
|
---|
683 | }
|
---|
684 |
|
---|
685 | sub surname {
|
---|
686 | my $self = shift;
|
---|
687 | if (@_) { $self->{SURNAME} = shift }
|
---|
688 | return $self->{SURNAME};
|
---|
689 | }
|
---|
690 |
|
---|
691 | sub nickname {
|
---|
692 | my $self = shift;
|
---|
693 | if (@_) { $self->{NICK} = shift }
|
---|
694 | return $self->{NICK};
|
---|
695 | }
|
---|
696 |
|
---|
697 | sub title {
|
---|
698 | my $self = shift;
|
---|
699 | if (@_) { $self->{TITLE} = shift }
|
---|
700 | return $self->{TITLE};
|
---|
701 | }
|
---|
702 |
|
---|
703 | sub as_string {
|
---|
704 | my $self = shift;
|
---|
705 | my $name = join(" ", @$self{'CHRISTIAN', 'SURNAME'});
|
---|
706 | if ($self->{TITLE}) {
|
---|
707 | $name = $self->{TITLE} . " " . $name;
|
---|
708 | }
|
---|
709 | return $name;
|
---|
710 | }
|
---|
711 |
|
---|
712 | 1;
|
---|
713 |
|
---|
714 | Finally, here's the test program:
|
---|
715 |
|
---|
716 | #!/usr/bin/perl -w
|
---|
717 | use strict;
|
---|
718 | use Person;
|
---|
719 | sub END { show_census() }
|
---|
720 |
|
---|
721 | sub show_census () {
|
---|
722 | printf "Current population: %d\n", Person->population;
|
---|
723 | }
|
---|
724 |
|
---|
725 | Person->debug(1);
|
---|
726 |
|
---|
727 | show_census();
|
---|
728 |
|
---|
729 | my $him = Person->new();
|
---|
730 |
|
---|
731 | $him->fullname->christian("Thomas");
|
---|
732 | $him->fullname->surname("Aquinas");
|
---|
733 | $him->fullname->nickname("Tommy");
|
---|
734 | $him->fullname->title("St");
|
---|
735 | $him->age(1);
|
---|
736 |
|
---|
737 | printf "%s is really %s.\n", $him->name, $him->fullname->as_string;
|
---|
738 | printf "%s's age: %d.\n", $him->name, $him->age;
|
---|
739 | $him->happy_birthday;
|
---|
740 | printf "%s's age: %d.\n", $him->name, $him->age;
|
---|
741 |
|
---|
742 | show_census();
|
---|
743 |
|
---|
744 | =head1 Inheritance
|
---|
745 |
|
---|
746 | Object-oriented programming systems all support some notion of
|
---|
747 | inheritance. Inheritance means allowing one class to piggy-back on
|
---|
748 | top of another one so you don't have to write the same code again and
|
---|
749 | again. It's about software reuse, and therefore related to Laziness,
|
---|
750 | the principal virtue of a programmer. (The import/export mechanisms in
|
---|
751 | traditional modules are also a form of code reuse, but a simpler one than
|
---|
752 | the true inheritance that you find in object modules.)
|
---|
753 |
|
---|
754 | Sometimes the syntax of inheritance is built into the core of the
|
---|
755 | language, and sometimes it's not. Perl has no special syntax for
|
---|
756 | specifying the class (or classes) to inherit from. Instead, it's all
|
---|
757 | strictly in the semantics. Each package can have a variable called @ISA,
|
---|
758 | which governs (method) inheritance. If you try to call a method on an
|
---|
759 | object or class, and that method is not found in that object's package,
|
---|
760 | Perl then looks to @ISA for other packages to go looking through in
|
---|
761 | search of the missing method.
|
---|
762 |
|
---|
763 | Like the special per-package variables recognized by Exporter (such as
|
---|
764 | @EXPORT, @EXPORT_OK, @EXPORT_FAIL, %EXPORT_TAGS, and $VERSION), the @ISA
|
---|
765 | array I<must> be a package-scoped global and not a file-scoped lexical
|
---|
766 | created via my(). Most classes have just one item in their @ISA array.
|
---|
767 | In this case, we have what's called "single inheritance", or SI for short.
|
---|
768 |
|
---|
769 | Consider this class:
|
---|
770 |
|
---|
771 | package Employee;
|
---|
772 | use Person;
|
---|
773 | @ISA = ("Person");
|
---|
774 | 1;
|
---|
775 |
|
---|
776 | Not a lot to it, eh? All it's doing so far is loading in another
|
---|
777 | class and stating that this one will inherit methods from that
|
---|
778 | other class if need be. We have given it none of its own methods.
|
---|
779 | We rely upon an Employee to behave just like a Person.
|
---|
780 |
|
---|
781 | Setting up an empty class like this is called the "empty subclass test";
|
---|
782 | that is, making a derived class that does nothing but inherit from a
|
---|
783 | base class. If the original base class has been designed properly,
|
---|
784 | then the new derived class can be used as a drop-in replacement for the
|
---|
785 | old one. This means you should be able to write a program like this:
|
---|
786 |
|
---|
787 | use Employee;
|
---|
788 | my $empl = Employee->new();
|
---|
789 | $empl->name("Jason");
|
---|
790 | $empl->age(23);
|
---|
791 | printf "%s is age %d.\n", $empl->name, $empl->age;
|
---|
792 |
|
---|
793 | By proper design, we mean always using the two-argument form of bless(),
|
---|
794 | avoiding direct access of global data, and not exporting anything. If you
|
---|
795 | look back at the Person::new() function we defined above, we were careful
|
---|
796 | to do that. There's a bit of package data used in the constructor,
|
---|
797 | but the reference to this is stored on the object itself and all other
|
---|
798 | methods access package data via that reference, so we should be ok.
|
---|
799 |
|
---|
800 | What do we mean by the Person::new() function -- isn't that actually
|
---|
801 | a method? Well, in principle, yes. A method is just a function that
|
---|
802 | expects as its first argument a class name (package) or object
|
---|
803 | (blessed reference). Person::new() is the function that both the
|
---|
804 | C<< Person->new() >> method and the C<< Employee->new() >> method end
|
---|
805 | up calling. Understand that while a method call looks a lot like a
|
---|
806 | function call, they aren't really quite the same, and if you treat them
|
---|
807 | as the same, you'll very soon be left with nothing but broken programs.
|
---|
808 | First, the actual underlying calling conventions are different: method
|
---|
809 | calls get an extra argument. Second, function calls don't do inheritance,
|
---|
810 | but methods do.
|
---|
811 |
|
---|
812 | Method Call Resulting Function Call
|
---|
813 | ----------- ------------------------
|
---|
814 | Person->new() Person::new("Person")
|
---|
815 | Employee->new() Person::new("Employee")
|
---|
816 |
|
---|
817 | So don't use function calls when you mean to call a method.
|
---|
818 |
|
---|
819 | If an employee is just a Person, that's not all too very interesting.
|
---|
820 | So let's add some other methods. We'll give our employee
|
---|
821 | data fields to access their salary, their employee ID, and their
|
---|
822 | start date.
|
---|
823 |
|
---|
824 | If you're getting a little tired of creating all these nearly identical
|
---|
825 | methods just to get at the object's data, do not despair. Later,
|
---|
826 | we'll describe several different convenience mechanisms for shortening
|
---|
827 | this up. Meanwhile, here's the straight-forward way:
|
---|
828 |
|
---|
829 | sub salary {
|
---|
830 | my $self = shift;
|
---|
831 | if (@_) { $self->{SALARY} = shift }
|
---|
832 | return $self->{SALARY};
|
---|
833 | }
|
---|
834 |
|
---|
835 | sub id_number {
|
---|
836 | my $self = shift;
|
---|
837 | if (@_) { $self->{ID} = shift }
|
---|
838 | return $self->{ID};
|
---|
839 | }
|
---|
840 |
|
---|
841 | sub start_date {
|
---|
842 | my $self = shift;
|
---|
843 | if (@_) { $self->{START_DATE} = shift }
|
---|
844 | return $self->{START_DATE};
|
---|
845 | }
|
---|
846 |
|
---|
847 | =head2 Overridden Methods
|
---|
848 |
|
---|
849 | What happens when both a derived class and its base class have the same
|
---|
850 | method defined? Well, then you get the derived class's version of that
|
---|
851 | method. For example, let's say that we want the peers() method called on
|
---|
852 | an employee to act a bit differently. Instead of just returning the list
|
---|
853 | of peer names, let's return slightly different strings. So doing this:
|
---|
854 |
|
---|
855 | $empl->peers("Peter", "Paul", "Mary");
|
---|
856 | printf "His peers are: %s\n", join(", ", $empl->peers);
|
---|
857 |
|
---|
858 | will produce:
|
---|
859 |
|
---|
860 | His peers are: PEON=PETER, PEON=PAUL, PEON=MARY
|
---|
861 |
|
---|
862 | To do this, merely add this definition into the Employee.pm file:
|
---|
863 |
|
---|
864 | sub peers {
|
---|
865 | my $self = shift;
|
---|
866 | if (@_) { @{ $self->{PEERS} } = @_ }
|
---|
867 | return map { "PEON=\U$_" } @{ $self->{PEERS} };
|
---|
868 | }
|
---|
869 |
|
---|
870 | There, we've just demonstrated the high-falutin' concept known in certain
|
---|
871 | circles as I<polymorphism>. We've taken on the form and behaviour of
|
---|
872 | an existing object, and then we've altered it to suit our own purposes.
|
---|
873 | This is a form of Laziness. (Getting polymorphed is also what happens
|
---|
874 | when the wizard decides you'd look better as a frog.)
|
---|
875 |
|
---|
876 | Every now and then you'll want to have a method call trigger both its
|
---|
877 | derived class (also known as "subclass") version as well as its base class
|
---|
878 | (also known as "superclass") version. In practice, constructors and
|
---|
879 | destructors are likely to want to do this, and it probably also makes
|
---|
880 | sense in the debug() method we showed previously.
|
---|
881 |
|
---|
882 | To do this, add this to Employee.pm:
|
---|
883 |
|
---|
884 | use Carp;
|
---|
885 | my $Debugging = 0;
|
---|
886 |
|
---|
887 | sub debug {
|
---|
888 | my $self = shift;
|
---|
889 | confess "usage: thing->debug(level)" unless @_ == 1;
|
---|
890 | my $level = shift;
|
---|
891 | if (ref($self)) {
|
---|
892 | $self->{"_DEBUG"} = $level;
|
---|
893 | } else {
|
---|
894 | $Debugging = $level; # whole class
|
---|
895 | }
|
---|
896 | Person::debug($self, $Debugging); # don't really do this
|
---|
897 | }
|
---|
898 |
|
---|
899 | As you see, we turn around and call the Person package's debug() function.
|
---|
900 | But this is far too fragile for good design. What if Person doesn't
|
---|
901 | have a debug() function, but is inheriting I<its> debug() method
|
---|
902 | from elsewhere? It would have been slightly better to say
|
---|
903 |
|
---|
904 | Person->debug($Debugging);
|
---|
905 |
|
---|
906 | But even that's got too much hard-coded. It's somewhat better to say
|
---|
907 |
|
---|
908 | $self->Person::debug($Debugging);
|
---|
909 |
|
---|
910 | Which is a funny way to say to start looking for a debug() method up
|
---|
911 | in Person. This strategy is more often seen on overridden object methods
|
---|
912 | than on overridden class methods.
|
---|
913 |
|
---|
914 | There is still something a bit off here. We've hard-coded our
|
---|
915 | superclass's name. This in particular is bad if you change which classes
|
---|
916 | you inherit from, or add others. Fortunately, the pseudoclass SUPER
|
---|
917 | comes to the rescue here.
|
---|
918 |
|
---|
919 | $self->SUPER::debug($Debugging);
|
---|
920 |
|
---|
921 | This way it starts looking in my class's @ISA. This only makes sense
|
---|
922 | from I<within> a method call, though. Don't try to access anything
|
---|
923 | in SUPER:: from anywhere else, because it doesn't exist outside
|
---|
924 | an overridden method call. Note that C<SUPER> refers to the superclass of
|
---|
925 | the current package, I<not> to the superclass of C<$self>.
|
---|
926 |
|
---|
927 | Things are getting a bit complicated here. Have we done anything
|
---|
928 | we shouldn't? As before, one way to test whether we're designing
|
---|
929 | a decent class is via the empty subclass test. Since we already have
|
---|
930 | an Employee class that we're trying to check, we'd better get a new
|
---|
931 | empty subclass that can derive from Employee. Here's one:
|
---|
932 |
|
---|
933 | package Boss;
|
---|
934 | use Employee; # :-)
|
---|
935 | @ISA = qw(Employee);
|
---|
936 |
|
---|
937 | And here's the test program:
|
---|
938 |
|
---|
939 | #!/usr/bin/perl -w
|
---|
940 | use strict;
|
---|
941 | use Boss;
|
---|
942 | Boss->debug(1);
|
---|
943 |
|
---|
944 | my $boss = Boss->new();
|
---|
945 |
|
---|
946 | $boss->fullname->title("Don");
|
---|
947 | $boss->fullname->surname("Pichon Alvarez");
|
---|
948 | $boss->fullname->christian("Federico Jesus");
|
---|
949 | $boss->fullname->nickname("Fred");
|
---|
950 |
|
---|
951 | $boss->age(47);
|
---|
952 | $boss->peers("Frank", "Felipe", "Faust");
|
---|
953 |
|
---|
954 | printf "%s is age %d.\n", $boss->fullname->as_string, $boss->age;
|
---|
955 | printf "His peers are: %s\n", join(", ", $boss->peers);
|
---|
956 |
|
---|
957 | Running it, we see that we're still ok. If you'd like to dump out your
|
---|
958 | object in a nice format, somewhat like the way the 'x' command works in
|
---|
959 | the debugger, you could use the Data::Dumper module from CPAN this way:
|
---|
960 |
|
---|
961 | use Data::Dumper;
|
---|
962 | print "Here's the boss:\n";
|
---|
963 | print Dumper($boss);
|
---|
964 |
|
---|
965 | Which shows us something like this:
|
---|
966 |
|
---|
967 | Here's the boss:
|
---|
968 | $VAR1 = bless( {
|
---|
969 | _CENSUS => \1,
|
---|
970 | FULLNAME => bless( {
|
---|
971 | TITLE => 'Don',
|
---|
972 | SURNAME => 'Pichon Alvarez',
|
---|
973 | NICK => 'Fred',
|
---|
974 | CHRISTIAN => 'Federico Jesus'
|
---|
975 | }, 'Fullname' ),
|
---|
976 | AGE => 47,
|
---|
977 | PEERS => [
|
---|
978 | 'Frank',
|
---|
979 | 'Felipe',
|
---|
980 | 'Faust'
|
---|
981 | ]
|
---|
982 | }, 'Boss' );
|
---|
983 |
|
---|
984 | Hm.... something's missing there. What about the salary, start date,
|
---|
985 | and ID fields? Well, we never set them to anything, even undef, so they
|
---|
986 | don't show up in the hash's keys. The Employee class has no new() method
|
---|
987 | of its own, and the new() method in Person doesn't know about Employees.
|
---|
988 | (Nor should it: proper OO design dictates that a subclass be allowed to
|
---|
989 | know about its immediate superclass, but never vice-versa.) So let's
|
---|
990 | fix up Employee::new() this way:
|
---|
991 |
|
---|
992 | sub new {
|
---|
993 | my $class = shift;
|
---|
994 | my $self = $class->SUPER::new();
|
---|
995 | $self->{SALARY} = undef;
|
---|
996 | $self->{ID} = undef;
|
---|
997 | $self->{START_DATE} = undef;
|
---|
998 | bless ($self, $class); # reconsecrate
|
---|
999 | return $self;
|
---|
1000 | }
|
---|
1001 |
|
---|
1002 | Now if you dump out an Employee or Boss object, you'll find
|
---|
1003 | that new fields show up there now.
|
---|
1004 |
|
---|
1005 | =head2 Multiple Inheritance
|
---|
1006 |
|
---|
1007 | Ok, at the risk of confusing beginners and annoying OO gurus, it's
|
---|
1008 | time to confess that Perl's object system includes that controversial
|
---|
1009 | notion known as multiple inheritance, or MI for short. All this means
|
---|
1010 | is that rather than having just one parent class who in turn might
|
---|
1011 | itself have a parent class, etc., that you can directly inherit from
|
---|
1012 | two or more parents. It's true that some uses of MI can get you into
|
---|
1013 | trouble, although hopefully not quite so much trouble with Perl as with
|
---|
1014 | dubiously-OO languages like C++.
|
---|
1015 |
|
---|
1016 | The way it works is actually pretty simple: just put more than one package
|
---|
1017 | name in your @ISA array. When it comes time for Perl to go finding
|
---|
1018 | methods for your object, it looks at each of these packages in order.
|
---|
1019 | Well, kinda. It's actually a fully recursive, depth-first order.
|
---|
1020 | Consider a bunch of @ISA arrays like this:
|
---|
1021 |
|
---|
1022 | @First::ISA = qw( Alpha );
|
---|
1023 | @Second::ISA = qw( Beta );
|
---|
1024 | @Third::ISA = qw( First Second );
|
---|
1025 |
|
---|
1026 | If you have an object of class Third:
|
---|
1027 |
|
---|
1028 | my $ob = Third->new();
|
---|
1029 | $ob->spin();
|
---|
1030 |
|
---|
1031 | How do we find a spin() method (or a new() method for that matter)?
|
---|
1032 | Because the search is depth-first, classes will be looked up
|
---|
1033 | in the following order: Third, First, Alpha, Second, and Beta.
|
---|
1034 |
|
---|
1035 | In practice, few class modules have been seen that actually
|
---|
1036 | make use of MI. One nearly always chooses simple containership of
|
---|
1037 | one class within another over MI. That's why our Person
|
---|
1038 | object I<contained> a Fullname object. That doesn't mean
|
---|
1039 | it I<was> one.
|
---|
1040 |
|
---|
1041 | However, there is one particular area where MI in Perl is rampant:
|
---|
1042 | borrowing another class's class methods. This is rather common,
|
---|
1043 | especially with some bundled "objectless" classes,
|
---|
1044 | like Exporter, DynaLoader, AutoLoader, and SelfLoader. These classes
|
---|
1045 | do not provide constructors; they exist only so you may inherit their
|
---|
1046 | class methods. (It's not entirely clear why inheritance was done
|
---|
1047 | here rather than traditional module importation.)
|
---|
1048 |
|
---|
1049 | For example, here is the POSIX module's @ISA:
|
---|
1050 |
|
---|
1051 | package POSIX;
|
---|
1052 | @ISA = qw(Exporter DynaLoader);
|
---|
1053 |
|
---|
1054 | The POSIX module isn't really an object module, but then,
|
---|
1055 | neither are Exporter or DynaLoader. They're just lending their
|
---|
1056 | classes' behaviours to POSIX.
|
---|
1057 |
|
---|
1058 | Why don't people use MI for object methods much? One reason is that
|
---|
1059 | it can have complicated side-effects. For one thing, your inheritance
|
---|
1060 | graph (no longer a tree) might converge back to the same base class.
|
---|
1061 | Although Perl guards against recursive inheritance, merely having parents
|
---|
1062 | who are related to each other via a common ancestor, incestuous though
|
---|
1063 | it sounds, is not forbidden. What if in our Third class shown above we
|
---|
1064 | wanted its new() method to also call both overridden constructors in its
|
---|
1065 | two parent classes? The SUPER notation would only find the first one.
|
---|
1066 | Also, what about if the Alpha and Beta classes both had a common ancestor,
|
---|
1067 | like Nought? If you kept climbing up the inheritance tree calling
|
---|
1068 | overridden methods, you'd end up calling Nought::new() twice,
|
---|
1069 | which might well be a bad idea.
|
---|
1070 |
|
---|
1071 | =head2 UNIVERSAL: The Root of All Objects
|
---|
1072 |
|
---|
1073 | Wouldn't it be convenient if all objects were rooted at some ultimate
|
---|
1074 | base class? That way you could give every object common methods without
|
---|
1075 | having to go and add it to each and every @ISA. Well, it turns out that
|
---|
1076 | you can. You don't see it, but Perl tacitly and irrevocably assumes
|
---|
1077 | that there's an extra element at the end of @ISA: the class UNIVERSAL.
|
---|
1078 | In version 5.003, there were no predefined methods there, but you could put
|
---|
1079 | whatever you felt like into it.
|
---|
1080 |
|
---|
1081 | However, as of version 5.004 (or some subversive releases, like 5.003_08),
|
---|
1082 | UNIVERSAL has some methods in it already. These are builtin to your Perl
|
---|
1083 | binary, so they don't take any extra time to load. Predefined methods
|
---|
1084 | include isa(), can(), and VERSION(). isa() tells you whether an object or
|
---|
1085 | class "is" another one without having to traverse the hierarchy yourself:
|
---|
1086 |
|
---|
1087 | $has_io = $fd->isa("IO::Handle");
|
---|
1088 | $itza_handle = IO::Socket->isa("IO::Handle");
|
---|
1089 |
|
---|
1090 | The can() method, called against that object or class, reports back
|
---|
1091 | whether its string argument is a callable method name in that class.
|
---|
1092 | In fact, it gives you back a function reference to that method:
|
---|
1093 |
|
---|
1094 | $his_print_method = $obj->can('as_string');
|
---|
1095 |
|
---|
1096 | Finally, the VERSION method checks whether the class (or the object's
|
---|
1097 | class) has a package global called $VERSION that's high enough, as in:
|
---|
1098 |
|
---|
1099 | Some_Module->VERSION(3.0);
|
---|
1100 | $his_vers = $ob->VERSION();
|
---|
1101 |
|
---|
1102 | However, we don't usually call VERSION ourselves. (Remember that an all
|
---|
1103 | uppercase function name is a Perl convention that indicates that the
|
---|
1104 | function will be automatically used by Perl in some way.) In this case,
|
---|
1105 | it happens when you say
|
---|
1106 |
|
---|
1107 | use Some_Module 3.0;
|
---|
1108 |
|
---|
1109 | If you wanted to add version checking to your Person class explained
|
---|
1110 | above, just add this to Person.pm:
|
---|
1111 |
|
---|
1112 | our $VERSION = '1.1';
|
---|
1113 |
|
---|
1114 | and then in Employee.pm you can say
|
---|
1115 |
|
---|
1116 | use Person 1.1;
|
---|
1117 |
|
---|
1118 | And it would make sure that you have at least that version number or
|
---|
1119 | higher available. This is not the same as loading in that exact version
|
---|
1120 | number. No mechanism currently exists for concurrent installation of
|
---|
1121 | multiple versions of a module. Lamentably.
|
---|
1122 |
|
---|
1123 | =head1 Alternate Object Representations
|
---|
1124 |
|
---|
1125 | Nothing requires objects to be implemented as hash references. An object
|
---|
1126 | can be any sort of reference so long as its referent has been suitably
|
---|
1127 | blessed. That means scalar, array, and code references are also fair
|
---|
1128 | game.
|
---|
1129 |
|
---|
1130 | A scalar would work if the object has only one datum to hold. An array
|
---|
1131 | would work for most cases, but makes inheritance a bit dodgy because
|
---|
1132 | you have to invent new indices for the derived classes.
|
---|
1133 |
|
---|
1134 | =head2 Arrays as Objects
|
---|
1135 |
|
---|
1136 | If the user of your class honors the contract and sticks to the advertised
|
---|
1137 | interface, then you can change its underlying interface if you feel
|
---|
1138 | like it. Here's another implementation that conforms to the same
|
---|
1139 | interface specification. This time we'll use an array reference
|
---|
1140 | instead of a hash reference to represent the object.
|
---|
1141 |
|
---|
1142 | package Person;
|
---|
1143 | use strict;
|
---|
1144 |
|
---|
1145 | my($NAME, $AGE, $PEERS) = ( 0 .. 2 );
|
---|
1146 |
|
---|
1147 | ############################################
|
---|
1148 | ## the object constructor (array version) ##
|
---|
1149 | ############################################
|
---|
1150 | sub new {
|
---|
1151 | my $self = [];
|
---|
1152 | $self->[$NAME] = undef; # this is unnecessary
|
---|
1153 | $self->[$AGE] = undef; # as is this
|
---|
1154 | $self->[$PEERS] = []; # but this isn't, really
|
---|
1155 | bless($self);
|
---|
1156 | return $self;
|
---|
1157 | }
|
---|
1158 |
|
---|
1159 | sub name {
|
---|
1160 | my $self = shift;
|
---|
1161 | if (@_) { $self->[$NAME] = shift }
|
---|
1162 | return $self->[$NAME];
|
---|
1163 | }
|
---|
1164 |
|
---|
1165 | sub age {
|
---|
1166 | my $self = shift;
|
---|
1167 | if (@_) { $self->[$AGE] = shift }
|
---|
1168 | return $self->[$AGE];
|
---|
1169 | }
|
---|
1170 |
|
---|
1171 | sub peers {
|
---|
1172 | my $self = shift;
|
---|
1173 | if (@_) { @{ $self->[$PEERS] } = @_ }
|
---|
1174 | return @{ $self->[$PEERS] };
|
---|
1175 | }
|
---|
1176 |
|
---|
1177 | 1; # so the require or use succeeds
|
---|
1178 |
|
---|
1179 | You might guess that the array access would be a lot faster than the
|
---|
1180 | hash access, but they're actually comparable. The array is a I<little>
|
---|
1181 | bit faster, but not more than ten or fifteen percent, even when you
|
---|
1182 | replace the variables above like $AGE with literal numbers, like 1.
|
---|
1183 | A bigger difference between the two approaches can be found in memory use.
|
---|
1184 | A hash representation takes up more memory than an array representation
|
---|
1185 | because you have to allocate memory for the keys as well as for the values.
|
---|
1186 | However, it really isn't that bad, especially since as of version 5.004,
|
---|
1187 | memory is only allocated once for a given hash key, no matter how many
|
---|
1188 | hashes have that key. It's expected that sometime in the future, even
|
---|
1189 | these differences will fade into obscurity as more efficient underlying
|
---|
1190 | representations are devised.
|
---|
1191 |
|
---|
1192 | Still, the tiny edge in speed (and somewhat larger one in memory)
|
---|
1193 | is enough to make some programmers choose an array representation
|
---|
1194 | for simple classes. There's still a little problem with
|
---|
1195 | scalability, though, because later in life when you feel
|
---|
1196 | like creating subclasses, you'll find that hashes just work
|
---|
1197 | out better.
|
---|
1198 |
|
---|
1199 | =head2 Closures as Objects
|
---|
1200 |
|
---|
1201 | Using a code reference to represent an object offers some fascinating
|
---|
1202 | possibilities. We can create a new anonymous function (closure) who
|
---|
1203 | alone in all the world can see the object's data. This is because we
|
---|
1204 | put the data into an anonymous hash that's lexically visible only to
|
---|
1205 | the closure we create, bless, and return as the object. This object's
|
---|
1206 | methods turn around and call the closure as a regular subroutine call,
|
---|
1207 | passing it the field we want to affect. (Yes,
|
---|
1208 | the double-function call is slow, but if you wanted fast, you wouldn't
|
---|
1209 | be using objects at all, eh? :-)
|
---|
1210 |
|
---|
1211 | Use would be similar to before:
|
---|
1212 |
|
---|
1213 | use Person;
|
---|
1214 | $him = Person->new();
|
---|
1215 | $him->name("Jason");
|
---|
1216 | $him->age(23);
|
---|
1217 | $him->peers( [ "Norbert", "Rhys", "Phineas" ] );
|
---|
1218 | printf "%s is %d years old.\n", $him->name, $him->age;
|
---|
1219 | print "His peers are: ", join(", ", @{$him->peers}), "\n";
|
---|
1220 |
|
---|
1221 | but the implementation would be radically, perhaps even sublimely
|
---|
1222 | different:
|
---|
1223 |
|
---|
1224 | package Person;
|
---|
1225 |
|
---|
1226 | sub new {
|
---|
1227 | my $class = shift;
|
---|
1228 | my $self = {
|
---|
1229 | NAME => undef,
|
---|
1230 | AGE => undef,
|
---|
1231 | PEERS => [],
|
---|
1232 | };
|
---|
1233 | my $closure = sub {
|
---|
1234 | my $field = shift;
|
---|
1235 | if (@_) { $self->{$field} = shift }
|
---|
1236 | return $self->{$field};
|
---|
1237 | };
|
---|
1238 | bless($closure, $class);
|
---|
1239 | return $closure;
|
---|
1240 | }
|
---|
1241 |
|
---|
1242 | sub name { &{ $_[0] }("NAME", @_[ 1 .. $#_ ] ) }
|
---|
1243 | sub age { &{ $_[0] }("AGE", @_[ 1 .. $#_ ] ) }
|
---|
1244 | sub peers { &{ $_[0] }("PEERS", @_[ 1 .. $#_ ] ) }
|
---|
1245 |
|
---|
1246 | 1;
|
---|
1247 |
|
---|
1248 | Because this object is hidden behind a code reference, it's probably a bit
|
---|
1249 | mysterious to those whose background is more firmly rooted in standard
|
---|
1250 | procedural or object-based programming languages than in functional
|
---|
1251 | programming languages whence closures derive. The object
|
---|
1252 | created and returned by the new() method is itself not a data reference
|
---|
1253 | as we've seen before. It's an anonymous code reference that has within
|
---|
1254 | it access to a specific version (lexical binding and instantiation)
|
---|
1255 | of the object's data, which are stored in the private variable $self.
|
---|
1256 | Although this is the same function each time, it contains a different
|
---|
1257 | version of $self.
|
---|
1258 |
|
---|
1259 | When a method like C<$him-E<gt>name("Jason")> is called, its implicit
|
---|
1260 | zeroth argument is the invoking object--just as it is with all method
|
---|
1261 | calls. But in this case, it's our code reference (something like a
|
---|
1262 | function pointer in C++, but with deep binding of lexical variables).
|
---|
1263 | There's not a lot to be done with a code reference beyond calling it, so
|
---|
1264 | that's just what we do when we say C<&{$_[0]}>. This is just a regular
|
---|
1265 | function call, not a method call. The initial argument is the string
|
---|
1266 | "NAME", and any remaining arguments are whatever had been passed to the
|
---|
1267 | method itself.
|
---|
1268 |
|
---|
1269 | Once we're executing inside the closure that had been created in new(),
|
---|
1270 | the $self hash reference suddenly becomes visible. The closure grabs
|
---|
1271 | its first argument ("NAME" in this case because that's what the name()
|
---|
1272 | method passed it), and uses that string to subscript into the private
|
---|
1273 | hash hidden in its unique version of $self.
|
---|
1274 |
|
---|
1275 | Nothing under the sun will allow anyone outside the executing method to
|
---|
1276 | be able to get at this hidden data. Well, nearly nothing. You I<could>
|
---|
1277 | single step through the program using the debugger and find out the
|
---|
1278 | pieces while you're in the method, but everyone else is out of luck.
|
---|
1279 |
|
---|
1280 | There, if that doesn't excite the Scheme folks, then I just don't know
|
---|
1281 | what will. Translation of this technique into C++, Java, or any other
|
---|
1282 | braindead-static language is left as a futile exercise for aficionados
|
---|
1283 | of those camps.
|
---|
1284 |
|
---|
1285 | You could even add a bit of nosiness via the caller() function and
|
---|
1286 | make the closure refuse to operate unless called via its own package.
|
---|
1287 | This would no doubt satisfy certain fastidious concerns of programming
|
---|
1288 | police and related puritans.
|
---|
1289 |
|
---|
1290 | If you were wondering when Hubris, the third principle virtue of a
|
---|
1291 | programmer, would come into play, here you have it. (More seriously,
|
---|
1292 | Hubris is just the pride in craftsmanship that comes from having written
|
---|
1293 | a sound bit of well-designed code.)
|
---|
1294 |
|
---|
1295 | =head1 AUTOLOAD: Proxy Methods
|
---|
1296 |
|
---|
1297 | Autoloading is a way to intercept calls to undefined methods. An autoload
|
---|
1298 | routine may choose to create a new function on the fly, either loaded
|
---|
1299 | from disk or perhaps just eval()ed right there. This define-on-the-fly
|
---|
1300 | strategy is why it's called autoloading.
|
---|
1301 |
|
---|
1302 | But that's only one possible approach. Another one is to just
|
---|
1303 | have the autoloaded method itself directly provide the
|
---|
1304 | requested service. When used in this way, you may think
|
---|
1305 | of autoloaded methods as "proxy" methods.
|
---|
1306 |
|
---|
1307 | When Perl tries to call an undefined function in a particular package
|
---|
1308 | and that function is not defined, it looks for a function in
|
---|
1309 | that same package called AUTOLOAD. If one exists, it's called
|
---|
1310 | with the same arguments as the original function would have had.
|
---|
1311 | The fully-qualified name of the function is stored in that package's
|
---|
1312 | global variable $AUTOLOAD. Once called, the function can do anything
|
---|
1313 | it would like, including defining a new function by the right name, and
|
---|
1314 | then doing a really fancy kind of C<goto> right to it, erasing itself
|
---|
1315 | from the call stack.
|
---|
1316 |
|
---|
1317 | What does this have to do with objects? After all, we keep talking about
|
---|
1318 | functions, not methods. Well, since a method is just a function with
|
---|
1319 | an extra argument and some fancier semantics about where it's found,
|
---|
1320 | we can use autoloading for methods, too. Perl doesn't start looking
|
---|
1321 | for an AUTOLOAD method until it has exhausted the recursive hunt up
|
---|
1322 | through @ISA, though. Some programmers have even been known to define
|
---|
1323 | a UNIVERSAL::AUTOLOAD method to trap unresolved method calls to any
|
---|
1324 | kind of object.
|
---|
1325 |
|
---|
1326 | =head2 Autoloaded Data Methods
|
---|
1327 |
|
---|
1328 | You probably began to get a little suspicious about the duplicated
|
---|
1329 | code way back earlier when we first showed you the Person class, and
|
---|
1330 | then later the Employee class. Each method used to access the
|
---|
1331 | hash fields looked virtually identical. This should have tickled
|
---|
1332 | that great programming virtue, Impatience, but for the time,
|
---|
1333 | we let Laziness win out, and so did nothing. Proxy methods can cure
|
---|
1334 | this.
|
---|
1335 |
|
---|
1336 | Instead of writing a new function every time we want a new data field,
|
---|
1337 | we'll use the autoload mechanism to generate (actually, mimic) methods on
|
---|
1338 | the fly. To verify that we're accessing a valid member, we will check
|
---|
1339 | against an C<_permitted> (pronounced "under-permitted") field, which
|
---|
1340 | is a reference to a file-scoped lexical (like a C file static) hash of permitted fields in this record
|
---|
1341 | called %fields. Why the underscore? For the same reason as the _CENSUS
|
---|
1342 | field we once used: as a marker that means "for internal use only".
|
---|
1343 |
|
---|
1344 | Here's what the module initialization code and class
|
---|
1345 | constructor will look like when taking this approach:
|
---|
1346 |
|
---|
1347 | package Person;
|
---|
1348 | use Carp;
|
---|
1349 | our $AUTOLOAD; # it's a package global
|
---|
1350 |
|
---|
1351 | my %fields = (
|
---|
1352 | name => undef,
|
---|
1353 | age => undef,
|
---|
1354 | peers => undef,
|
---|
1355 | );
|
---|
1356 |
|
---|
1357 | sub new {
|
---|
1358 | my $class = shift;
|
---|
1359 | my $self = {
|
---|
1360 | _permitted => \%fields,
|
---|
1361 | %fields,
|
---|
1362 | };
|
---|
1363 | bless $self, $class;
|
---|
1364 | return $self;
|
---|
1365 | }
|
---|
1366 |
|
---|
1367 | If we wanted our record to have default values, we could fill those in
|
---|
1368 | where current we have C<undef> in the %fields hash.
|
---|
1369 |
|
---|
1370 | Notice how we saved a reference to our class data on the object itself?
|
---|
1371 | Remember that it's important to access class data through the object
|
---|
1372 | itself instead of having any method reference %fields directly, or else
|
---|
1373 | you won't have a decent inheritance.
|
---|
1374 |
|
---|
1375 | The real magic, though, is going to reside in our proxy method, which
|
---|
1376 | will handle all calls to undefined methods for objects of class Person
|
---|
1377 | (or subclasses of Person). It has to be called AUTOLOAD. Again, it's
|
---|
1378 | all caps because it's called for us implicitly by Perl itself, not by
|
---|
1379 | a user directly.
|
---|
1380 |
|
---|
1381 | sub AUTOLOAD {
|
---|
1382 | my $self = shift;
|
---|
1383 | my $type = ref($self)
|
---|
1384 | or croak "$self is not an object";
|
---|
1385 |
|
---|
1386 | my $name = $AUTOLOAD;
|
---|
1387 | $name =~ s/.*://; # strip fully-qualified portion
|
---|
1388 |
|
---|
1389 | unless (exists $self->{_permitted}->{$name} ) {
|
---|
1390 | croak "Can't access `$name' field in class $type";
|
---|
1391 | }
|
---|
1392 |
|
---|
1393 | if (@_) {
|
---|
1394 | return $self->{$name} = shift;
|
---|
1395 | } else {
|
---|
1396 | return $self->{$name};
|
---|
1397 | }
|
---|
1398 | }
|
---|
1399 |
|
---|
1400 | Pretty nifty, eh? All we have to do to add new data fields
|
---|
1401 | is modify %fields. No new functions need be written.
|
---|
1402 |
|
---|
1403 | I could have avoided the C<_permitted> field entirely, but I
|
---|
1404 | wanted to demonstrate how to store a reference to class data on the
|
---|
1405 | object so you wouldn't have to access that class data
|
---|
1406 | directly from an object method.
|
---|
1407 |
|
---|
1408 | =head2 Inherited Autoloaded Data Methods
|
---|
1409 |
|
---|
1410 | But what about inheritance? Can we define our Employee
|
---|
1411 | class similarly? Yes, so long as we're careful enough.
|
---|
1412 |
|
---|
1413 | Here's how to be careful:
|
---|
1414 |
|
---|
1415 | package Employee;
|
---|
1416 | use Person;
|
---|
1417 | use strict;
|
---|
1418 | our @ISA = qw(Person);
|
---|
1419 |
|
---|
1420 | my %fields = (
|
---|
1421 | id => undef,
|
---|
1422 | salary => undef,
|
---|
1423 | );
|
---|
1424 |
|
---|
1425 | sub new {
|
---|
1426 | my $class = shift;
|
---|
1427 | my $self = $class->SUPER::new();
|
---|
1428 | my($element);
|
---|
1429 | foreach $element (keys %fields) {
|
---|
1430 | $self->{_permitted}->{$element} = $fields{$element};
|
---|
1431 | }
|
---|
1432 | @{$self}{keys %fields} = values %fields;
|
---|
1433 | return $self;
|
---|
1434 | }
|
---|
1435 |
|
---|
1436 | Once we've done this, we don't even need to have an
|
---|
1437 | AUTOLOAD function in the Employee package, because
|
---|
1438 | we'll grab Person's version of that via inheritance,
|
---|
1439 | and it will all work out just fine.
|
---|
1440 |
|
---|
1441 | =head1 Metaclassical Tools
|
---|
1442 |
|
---|
1443 | Even though proxy methods can provide a more convenient approach to making
|
---|
1444 | more struct-like classes than tediously coding up data methods as
|
---|
1445 | functions, it still leaves a bit to be desired. For one thing, it means
|
---|
1446 | you have to handle bogus calls that you don't mean to trap via your proxy.
|
---|
1447 | It also means you have to be quite careful when dealing with inheritance,
|
---|
1448 | as detailed above.
|
---|
1449 |
|
---|
1450 | Perl programmers have responded to this by creating several different
|
---|
1451 | class construction classes. These metaclasses are classes
|
---|
1452 | that create other classes. A couple worth looking at are
|
---|
1453 | Class::Struct and Alias. These and other related metaclasses can be
|
---|
1454 | found in the modules directory on CPAN.
|
---|
1455 |
|
---|
1456 | =head2 Class::Struct
|
---|
1457 |
|
---|
1458 | One of the older ones is Class::Struct. In fact, its syntax and
|
---|
1459 | interface were sketched out long before perl5 even solidified into a
|
---|
1460 | real thing. What it does is provide you a way to "declare" a class
|
---|
1461 | as having objects whose fields are of a specific type. The function
|
---|
1462 | that does this is called, not surprisingly enough, struct(). Because
|
---|
1463 | structures or records are not base types in Perl, each time you want to
|
---|
1464 | create a class to provide a record-like data object, you yourself have
|
---|
1465 | to define a new() method, plus separate data-access methods for each of
|
---|
1466 | that record's fields. You'll quickly become bored with this process.
|
---|
1467 | The Class::Struct::struct() function alleviates this tedium.
|
---|
1468 |
|
---|
1469 | Here's a simple example of using it:
|
---|
1470 |
|
---|
1471 | use Class::Struct qw(struct);
|
---|
1472 | use Jobbie; # user-defined; see below
|
---|
1473 |
|
---|
1474 | struct 'Fred' => {
|
---|
1475 | one => '$',
|
---|
1476 | many => '@',
|
---|
1477 | profession => 'Jobbie', # does not call Jobbie->new()
|
---|
1478 | };
|
---|
1479 |
|
---|
1480 | $ob = Fred->new(profession => Jobbie->new());
|
---|
1481 | $ob->one("hmmmm");
|
---|
1482 |
|
---|
1483 | $ob->many(0, "here");
|
---|
1484 | $ob->many(1, "you");
|
---|
1485 | $ob->many(2, "go");
|
---|
1486 | print "Just set: ", $ob->many(2), "\n";
|
---|
1487 |
|
---|
1488 | $ob->profession->salary(10_000);
|
---|
1489 |
|
---|
1490 | You can declare types in the struct to be basic Perl types, or
|
---|
1491 | user-defined types (classes). User types will be initialized by calling
|
---|
1492 | that class's new() method.
|
---|
1493 |
|
---|
1494 | Take care that the C<Jobbie> object is not created automatically by the
|
---|
1495 | C<Fred> class's new() method, so you should specify a C<Jobbie> object
|
---|
1496 | when you create an instance of C<Fred>.
|
---|
1497 |
|
---|
1498 | Here's a real-world example of using struct generation. Let's say you
|
---|
1499 | wanted to override Perl's idea of gethostbyname() and gethostbyaddr() so
|
---|
1500 | that they would return objects that acted like C structures. We don't
|
---|
1501 | care about high-falutin' OO gunk. All we want is for these objects to
|
---|
1502 | act like structs in the C sense.
|
---|
1503 |
|
---|
1504 | use Socket;
|
---|
1505 | use Net::hostent;
|
---|
1506 | $h = gethostbyname("perl.com"); # object return
|
---|
1507 | printf "perl.com's real name is %s, address %s\n",
|
---|
1508 | $h->name, inet_ntoa($h->addr);
|
---|
1509 |
|
---|
1510 | Here's how to do this using the Class::Struct module.
|
---|
1511 | The crux is going to be this call:
|
---|
1512 |
|
---|
1513 | struct 'Net::hostent' => [ # note bracket
|
---|
1514 | name => '$',
|
---|
1515 | aliases => '@',
|
---|
1516 | addrtype => '$',
|
---|
1517 | 'length' => '$',
|
---|
1518 | addr_list => '@',
|
---|
1519 | ];
|
---|
1520 |
|
---|
1521 | Which creates object methods of those names and types.
|
---|
1522 | It even creates a new() method for us.
|
---|
1523 |
|
---|
1524 | We could also have implemented our object this way:
|
---|
1525 |
|
---|
1526 | struct 'Net::hostent' => { # note brace
|
---|
1527 | name => '$',
|
---|
1528 | aliases => '@',
|
---|
1529 | addrtype => '$',
|
---|
1530 | 'length' => '$',
|
---|
1531 | addr_list => '@',
|
---|
1532 | };
|
---|
1533 |
|
---|
1534 | and then Class::Struct would have used an anonymous hash as the object
|
---|
1535 | type, instead of an anonymous array. The array is faster and smaller,
|
---|
1536 | but the hash works out better if you eventually want to do inheritance.
|
---|
1537 | Since for this struct-like object we aren't planning on inheritance,
|
---|
1538 | this time we'll opt for better speed and size over better flexibility.
|
---|
1539 |
|
---|
1540 | Here's the whole implementation:
|
---|
1541 |
|
---|
1542 | package Net::hostent;
|
---|
1543 | use strict;
|
---|
1544 |
|
---|
1545 | BEGIN {
|
---|
1546 | use Exporter ();
|
---|
1547 | our @EXPORT = qw(gethostbyname gethostbyaddr gethost);
|
---|
1548 | our @EXPORT_OK = qw(
|
---|
1549 | $h_name @h_aliases
|
---|
1550 | $h_addrtype $h_length
|
---|
1551 | @h_addr_list $h_addr
|
---|
1552 | );
|
---|
1553 | our %EXPORT_TAGS = ( FIELDS => [ @EXPORT_OK, @EXPORT ] );
|
---|
1554 | }
|
---|
1555 | our @EXPORT_OK;
|
---|
1556 |
|
---|
1557 | # Class::Struct forbids use of @ISA
|
---|
1558 | sub import { goto &Exporter::import }
|
---|
1559 |
|
---|
1560 | use Class::Struct qw(struct);
|
---|
1561 | struct 'Net::hostent' => [
|
---|
1562 | name => '$',
|
---|
1563 | aliases => '@',
|
---|
1564 | addrtype => '$',
|
---|
1565 | 'length' => '$',
|
---|
1566 | addr_list => '@',
|
---|
1567 | ];
|
---|
1568 |
|
---|
1569 | sub addr { shift->addr_list->[0] }
|
---|
1570 |
|
---|
1571 | sub populate (@) {
|
---|
1572 | return unless @_;
|
---|
1573 | my $hob = new(); # Class::Struct made this!
|
---|
1574 | $h_name = $hob->[0] = $_[0];
|
---|
1575 | @h_aliases = @{ $hob->[1] } = split ' ', $_[1];
|
---|
1576 | $h_addrtype = $hob->[2] = $_[2];
|
---|
1577 | $h_length = $hob->[3] = $_[3];
|
---|
1578 | $h_addr = $_[4];
|
---|
1579 | @h_addr_list = @{ $hob->[4] } = @_[ (4 .. $#_) ];
|
---|
1580 | return $hob;
|
---|
1581 | }
|
---|
1582 |
|
---|
1583 | sub gethostbyname ($) { populate(CORE::gethostbyname(shift)) }
|
---|
1584 |
|
---|
1585 | sub gethostbyaddr ($;$) {
|
---|
1586 | my ($addr, $addrtype);
|
---|
1587 | $addr = shift;
|
---|
1588 | require Socket unless @_;
|
---|
1589 | $addrtype = @_ ? shift : Socket::AF_INET();
|
---|
1590 | populate(CORE::gethostbyaddr($addr, $addrtype))
|
---|
1591 | }
|
---|
1592 |
|
---|
1593 | sub gethost($) {
|
---|
1594 | if ($_[0] =~ /^\d+(?:\.\d+(?:\.\d+(?:\.\d+)?)?)?$/) {
|
---|
1595 | require Socket;
|
---|
1596 | &gethostbyaddr(Socket::inet_aton(shift));
|
---|
1597 | } else {
|
---|
1598 | &gethostbyname;
|
---|
1599 | }
|
---|
1600 | }
|
---|
1601 |
|
---|
1602 | 1;
|
---|
1603 |
|
---|
1604 | We've snuck in quite a fair bit of other concepts besides just dynamic
|
---|
1605 | class creation, like overriding core functions, import/export bits,
|
---|
1606 | function prototyping, short-cut function call via C<&whatever>, and
|
---|
1607 | function replacement with C<goto &whatever>. These all mostly make
|
---|
1608 | sense from the perspective of a traditional module, but as you can see,
|
---|
1609 | we can also use them in an object module.
|
---|
1610 |
|
---|
1611 | You can look at other object-based, struct-like overrides of core
|
---|
1612 | functions in the 5.004 release of Perl in File::stat, Net::hostent,
|
---|
1613 | Net::netent, Net::protoent, Net::servent, Time::gmtime, Time::localtime,
|
---|
1614 | User::grent, and User::pwent. These modules have a final component
|
---|
1615 | that's all lowercase, by convention reserved for compiler pragmas,
|
---|
1616 | because they affect the compilation and change a builtin function.
|
---|
1617 | They also have the type names that a C programmer would most expect.
|
---|
1618 |
|
---|
1619 | =head2 Data Members as Variables
|
---|
1620 |
|
---|
1621 | If you're used to C++ objects, then you're accustomed to being able to
|
---|
1622 | get at an object's data members as simple variables from within a method.
|
---|
1623 | The Alias module provides for this, as well as a good bit more, such
|
---|
1624 | as the possibility of private methods that the object can call but folks
|
---|
1625 | outside the class cannot.
|
---|
1626 |
|
---|
1627 | Here's an example of creating a Person using the Alias module.
|
---|
1628 | When you update these magical instance variables, you automatically
|
---|
1629 | update value fields in the hash. Convenient, eh?
|
---|
1630 |
|
---|
1631 | package Person;
|
---|
1632 |
|
---|
1633 | # this is the same as before...
|
---|
1634 | sub new {
|
---|
1635 | my $class = shift;
|
---|
1636 | my $self = {
|
---|
1637 | NAME => undef,
|
---|
1638 | AGE => undef,
|
---|
1639 | PEERS => [],
|
---|
1640 | };
|
---|
1641 | bless($self, $class);
|
---|
1642 | return $self;
|
---|
1643 | }
|
---|
1644 |
|
---|
1645 | use Alias qw(attr);
|
---|
1646 | our ($NAME, $AGE, $PEERS);
|
---|
1647 |
|
---|
1648 | sub name {
|
---|
1649 | my $self = attr shift;
|
---|
1650 | if (@_) { $NAME = shift; }
|
---|
1651 | return $NAME;
|
---|
1652 | }
|
---|
1653 |
|
---|
1654 | sub age {
|
---|
1655 | my $self = attr shift;
|
---|
1656 | if (@_) { $AGE = shift; }
|
---|
1657 | return $AGE;
|
---|
1658 | }
|
---|
1659 |
|
---|
1660 | sub peers {
|
---|
1661 | my $self = attr shift;
|
---|
1662 | if (@_) { @PEERS = @_; }
|
---|
1663 | return @PEERS;
|
---|
1664 | }
|
---|
1665 |
|
---|
1666 | sub exclaim {
|
---|
1667 | my $self = attr shift;
|
---|
1668 | return sprintf "Hi, I'm %s, age %d, working with %s",
|
---|
1669 | $NAME, $AGE, join(", ", @PEERS);
|
---|
1670 | }
|
---|
1671 |
|
---|
1672 | sub happy_birthday {
|
---|
1673 | my $self = attr shift;
|
---|
1674 | return ++$AGE;
|
---|
1675 | }
|
---|
1676 |
|
---|
1677 | The need for the C<our> declaration is because what Alias does
|
---|
1678 | is play with package globals with the same name as the fields. To use
|
---|
1679 | globals while C<use strict> is in effect, you have to predeclare them.
|
---|
1680 | These package variables are localized to the block enclosing the attr()
|
---|
1681 | call just as if you'd used a local() on them. However, that means that
|
---|
1682 | they're still considered global variables with temporary values, just
|
---|
1683 | as with any other local().
|
---|
1684 |
|
---|
1685 | It would be nice to combine Alias with
|
---|
1686 | something like Class::Struct or Class::MethodMaker.
|
---|
1687 |
|
---|
1688 | =head1 NOTES
|
---|
1689 |
|
---|
1690 | =head2 Object Terminology
|
---|
1691 |
|
---|
1692 | In the various OO literature, it seems that a lot of different words
|
---|
1693 | are used to describe only a few different concepts. If you're not
|
---|
1694 | already an object programmer, then you don't need to worry about all
|
---|
1695 | these fancy words. But if you are, then you might like to know how to
|
---|
1696 | get at the same concepts in Perl.
|
---|
1697 |
|
---|
1698 | For example, it's common to call an object an I<instance> of a class
|
---|
1699 | and to call those objects' methods I<instance methods>. Data fields
|
---|
1700 | peculiar to each object are often called I<instance data> or I<object
|
---|
1701 | attributes>, and data fields common to all members of that class are
|
---|
1702 | I<class data>, I<class attributes>, or I<static data members>.
|
---|
1703 |
|
---|
1704 | Also, I<base class>, I<generic class>, and I<superclass> all describe
|
---|
1705 | the same notion, whereas I<derived class>, I<specific class>, and
|
---|
1706 | I<subclass> describe the other related one.
|
---|
1707 |
|
---|
1708 | C++ programmers have I<static methods> and I<virtual methods>,
|
---|
1709 | but Perl only has I<class methods> and I<object methods>.
|
---|
1710 | Actually, Perl only has methods. Whether a method gets used
|
---|
1711 | as a class or object method is by usage only. You could accidentally
|
---|
1712 | call a class method (one expecting a string argument) on an
|
---|
1713 | object (one expecting a reference), or vice versa.
|
---|
1714 |
|
---|
1715 | From the C++ perspective, all methods in Perl are virtual.
|
---|
1716 | This, by the way, is why they are never checked for function
|
---|
1717 | prototypes in the argument list as regular builtin and user-defined
|
---|
1718 | functions can be.
|
---|
1719 |
|
---|
1720 | Because a class is itself something of an object, Perl's classes can be
|
---|
1721 | taken as describing both a "class as meta-object" (also called I<object
|
---|
1722 | factory>) philosophy and the "class as type definition" (I<declaring>
|
---|
1723 | behaviour, not I<defining> mechanism) idea. C++ supports the latter
|
---|
1724 | notion, but not the former.
|
---|
1725 |
|
---|
1726 | =head1 SEE ALSO
|
---|
1727 |
|
---|
1728 | The following manpages will doubtless provide more
|
---|
1729 | background for this one:
|
---|
1730 | L<perlmod>,
|
---|
1731 | L<perlref>,
|
---|
1732 | L<perlobj>,
|
---|
1733 | L<perlbot>,
|
---|
1734 | L<perltie>,
|
---|
1735 | and
|
---|
1736 | L<overload>.
|
---|
1737 |
|
---|
1738 | L<perlboot> is a kinder, gentler introduction to object-oriented
|
---|
1739 | programming.
|
---|
1740 |
|
---|
1741 | L<perltooc> provides more detail on class data.
|
---|
1742 |
|
---|
1743 | Some modules which might prove interesting are Class::Accessor,
|
---|
1744 | Class::Class, Class::Contract, Class::Data::Inheritable,
|
---|
1745 | Class::MethodMaker and Tie::SecureHash
|
---|
1746 |
|
---|
1747 |
|
---|
1748 | =head1 AUTHOR AND COPYRIGHT
|
---|
1749 |
|
---|
1750 | Copyright (c) 1997, 1998 Tom Christiansen
|
---|
1751 | All rights reserved.
|
---|
1752 |
|
---|
1753 | This documentation is free; you can redistribute it and/or modify it
|
---|
1754 | under the same terms as Perl itself.
|
---|
1755 |
|
---|
1756 | Irrespective of its distribution, all code examples in this file
|
---|
1757 | are hereby placed into the public domain. You are permitted and
|
---|
1758 | encouraged to use this code in your own programs for fun
|
---|
1759 | or for profit as you see fit. A simple comment in the code giving
|
---|
1760 | credit would be courteous but is not required.
|
---|
1761 |
|
---|
1762 | =head1 COPYRIGHT
|
---|
1763 |
|
---|
1764 | =head2 Acknowledgments
|
---|
1765 |
|
---|
1766 | Thanks to
|
---|
1767 | Larry Wall,
|
---|
1768 | Roderick Schertler,
|
---|
1769 | Gurusamy Sarathy,
|
---|
1770 | Dean Roehrich,
|
---|
1771 | Raphael Manfredi,
|
---|
1772 | Brent Halsey,
|
---|
1773 | Greg Bacon,
|
---|
1774 | Brad Appleton,
|
---|
1775 | and many others for their helpful comments.
|
---|