source: vendor/python/2.5/Doc/lib/xmldom.tex

Last change on this file was 3225, checked in by bird, 18 years ago

Python 2.5

File size: 36.7 KB
Line 
1\section{\module{xml.dom} ---
2 The Document Object Model API}
3
4\declaremodule{standard}{xml.dom}
5\modulesynopsis{Document Object Model API for Python.}
6\sectionauthor{Paul Prescod}{paul@prescod.net}
7\sectionauthor{Martin v. L\"owis}{martin@v.loewis.de}
8
9\versionadded{2.0}
10
11The Document Object Model, or ``DOM,'' is a cross-language API from
12the World Wide Web Consortium (W3C) for accessing and modifying XML
13documents. A DOM implementation presents an XML document as a tree
14structure, or allows client code to build such a structure from
15scratch. It then gives access to the structure through a set of
16objects which provided well-known interfaces.
17
18The DOM is extremely useful for random-access applications. SAX only
19allows you a view of one bit of the document at a time. If you are
20looking at one SAX element, you have no access to another. If you are
21looking at a text node, you have no access to a containing element.
22When you write a SAX application, you need to keep track of your
23program's position in the document somewhere in your own code. SAX
24does not do it for you. Also, if you need to look ahead in the XML
25document, you are just out of luck.
26
27Some applications are simply impossible in an event driven model with
28no access to a tree. Of course you could build some sort of tree
29yourself in SAX events, but the DOM allows you to avoid writing that
30code. The DOM is a standard tree representation for XML data.
31
32%What if your needs are somewhere between SAX and the DOM? Perhaps
33%you cannot afford to load the entire tree in memory but you find the
34%SAX model somewhat cumbersome and low-level. There is also a module
35%called xml.dom.pulldom that allows you to build trees of only the
36%parts of a document that you need structured access to. It also has
37%features that allow you to find your way around the DOM.
38% See http://www.prescod.net/python/pulldom
39
40The Document Object Model is being defined by the W3C in stages, or
41``levels'' in their terminology. The Python mapping of the API is
42substantially based on the DOM Level~2 recommendation. The mapping of
43the Level~3 specification, currently only available in draft form, is
44being developed by the \ulink{Python XML Special Interest
45Group}{http://www.python.org/sigs/xml-sig/} as part of the
46\ulink{PyXML package}{http://pyxml.sourceforge.net/}. Refer to the
47documentation bundled with that package for information on the current
48state of DOM Level~3 support.
49
50DOM applications typically start by parsing some XML into a DOM. How
51this is accomplished is not covered at all by DOM Level~1, and Level~2
52provides only limited improvements: There is a
53\class{DOMImplementation} object class which provides access to
54\class{Document} creation methods, but no way to access an XML
55reader/parser/Document builder in an implementation-independent way.
56There is also no well-defined way to access these methods without an
57existing \class{Document} object. In Python, each DOM implementation
58will provide a function \function{getDOMImplementation()}. DOM Level~3
59adds a Load/Store specification, which defines an interface to the
60reader, but this is not yet available in the Python standard library.
61
62Once you have a DOM document object, you can access the parts of your
63XML document through its properties and methods. These properties are
64defined in the DOM specification; this portion of the reference manual
65describes the interpretation of the specification in Python.
66
67The specification provided by the W3C defines the DOM API for Java,
68ECMAScript, and OMG IDL. The Python mapping defined here is based in
69large part on the IDL version of the specification, but strict
70compliance is not required (though implementations are free to support
71the strict mapping from IDL). See section \ref{dom-conformance},
72``Conformance,'' for a detailed discussion of mapping requirements.
73
74
75\begin{seealso}
76 \seetitle[http://www.w3.org/TR/DOM-Level-2-Core/]{Document Object
77 Model (DOM) Level~2 Specification}
78 {The W3C recommendation upon which the Python DOM API is
79 based.}
80 \seetitle[http://www.w3.org/TR/REC-DOM-Level-1/]{Document Object
81 Model (DOM) Level~1 Specification}
82 {The W3C recommendation for the
83 DOM supported by \module{xml.dom.minidom}.}
84 \seetitle[http://pyxml.sourceforge.net]{PyXML}{Users that require a
85 full-featured implementation of DOM should use the PyXML
86 package.}
87 \seetitle[http://www.omg.org/docs/formal/02-11-05.pdf]{Python
88 Language Mapping Specification}
89 {This specifies the mapping from OMG IDL to Python.}
90\end{seealso}
91
92\subsection{Module Contents}
93
94The \module{xml.dom} contains the following functions:
95
96\begin{funcdesc}{registerDOMImplementation}{name, factory}
97Register the \var{factory} function with the name \var{name}. The
98factory function should return an object which implements the
99\class{DOMImplementation} interface. The factory function can return
100the same object every time, or a new one for each call, as appropriate
101for the specific implementation (e.g. if that implementation supports
102some customization).
103\end{funcdesc}
104
105\begin{funcdesc}{getDOMImplementation}{\optional{name\optional{, features}}}
106Return a suitable DOM implementation. The \var{name} is either
107well-known, the module name of a DOM implementation, or
108\code{None}. If it is not \code{None}, imports the corresponding
109module and returns a \class{DOMImplementation} object if the import
110succeeds. If no name is given, and if the environment variable
111\envvar{PYTHON_DOM} is set, this variable is used to find the
112implementation.
113
114If name is not given, this examines the available implementations to
115find one with the required feature set. If no implementation can be
116found, raise an \exception{ImportError}. The features list must be a
117sequence of \code{(\var{feature}, \var{version})} pairs which are
118passed to the \method{hasFeature()} method on available
119\class{DOMImplementation} objects.
120\end{funcdesc}
121
122
123Some convenience constants are also provided:
124
125\begin{datadesc}{EMPTY_NAMESPACE}
126 The value used to indicate that no namespace is associated with a
127 node in the DOM. This is typically found as the
128 \member{namespaceURI} of a node, or used as the \var{namespaceURI}
129 parameter to a namespaces-specific method.
130 \versionadded{2.2}
131\end{datadesc}
132
133\begin{datadesc}{XML_NAMESPACE}
134 The namespace URI associated with the reserved prefix \code{xml}, as
135 defined by
136 \citetitle[http://www.w3.org/TR/REC-xml-names/]{Namespaces in XML}
137 (section~4).
138 \versionadded{2.2}
139\end{datadesc}
140
141\begin{datadesc}{XMLNS_NAMESPACE}
142 The namespace URI for namespace declarations, as defined by
143 \citetitle[http://www.w3.org/TR/DOM-Level-2-Core/core.html]{Document
144 Object Model (DOM) Level~2 Core Specification} (section~1.1.8).
145 \versionadded{2.2}
146\end{datadesc}
147
148\begin{datadesc}{XHTML_NAMESPACE}
149 The URI of the XHTML namespace as defined by
150 \citetitle[http://www.w3.org/TR/xhtml1/]{XHTML 1.0: The Extensible
151 HyperText Markup Language} (section~3.1.1).
152 \versionadded{2.2}
153\end{datadesc}
154
155
156% Should the Node documentation go here?
157
158In addition, \module{xml.dom} contains a base \class{Node} class and
159the DOM exception classes. The \class{Node} class provided by this
160module does not implement any of the methods or attributes defined by
161the DOM specification; concrete DOM implementations must provide
162those. The \class{Node} class provided as part of this module does
163provide the constants used for the \member{nodeType} attribute on
164concrete \class{Node} objects; they are located within the class
165rather than at the module level to conform with the DOM
166specifications.
167
168
169\subsection{Objects in the DOM \label{dom-objects}}
170
171The definitive documentation for the DOM is the DOM specification from
172the W3C.
173
174Note that DOM attributes may also be manipulated as nodes instead of
175as simple strings. It is fairly rare that you must do this, however,
176so this usage is not yet documented.
177
178
179\begin{tableiii}{l|l|l}{class}{Interface}{Section}{Purpose}
180 \lineiii{DOMImplementation}{\ref{dom-implementation-objects}}
181 {Interface to the underlying implementation.}
182 \lineiii{Node}{\ref{dom-node-objects}}
183 {Base interface for most objects in a document.}
184 \lineiii{NodeList}{\ref{dom-nodelist-objects}}
185 {Interface for a sequence of nodes.}
186 \lineiii{DocumentType}{\ref{dom-documenttype-objects}}
187 {Information about the declarations needed to process a document.}
188 \lineiii{Document}{\ref{dom-document-objects}}
189 {Object which represents an entire document.}
190 \lineiii{Element}{\ref{dom-element-objects}}
191 {Element nodes in the document hierarchy.}
192 \lineiii{Attr}{\ref{dom-attr-objects}}
193 {Attribute value nodes on element nodes.}
194 \lineiii{Comment}{\ref{dom-comment-objects}}
195 {Representation of comments in the source document.}
196 \lineiii{Text}{\ref{dom-text-objects}}
197 {Nodes containing textual content from the document.}
198 \lineiii{ProcessingInstruction}{\ref{dom-pi-objects}}
199 {Processing instruction representation.}
200\end{tableiii}
201
202An additional section describes the exceptions defined for working
203with the DOM in Python.
204
205
206\subsubsection{DOMImplementation Objects
207 \label{dom-implementation-objects}}
208
209The \class{DOMImplementation} interface provides a way for
210applications to determine the availability of particular features in
211the DOM they are using. DOM Level~2 added the ability to create new
212\class{Document} and \class{DocumentType} objects using the
213\class{DOMImplementation} as well.
214
215\begin{methoddesc}[DOMImplementation]{hasFeature}{feature, version}
216Return true if the feature identified by the pair of strings
217\var{feature} and \var{version} is implemented.
218\end{methoddesc}
219
220\begin{methoddesc}[DOMImplementation]{createDocument}{namespaceUri, qualifiedName, doctype}
221Return a new \class{Document} object (the root of the DOM), with a
222child \class{Element} object having the given \var{namespaceUri} and
223\var{qualifiedName}. The \var{doctype} must be a \class{DocumentType}
224object created by \method{createDocumentType()}, or \code{None}.
225In the Python DOM API, the first two arguments can also be \code{None}
226in order to indicate that no \class{Element} child is to be created.
227\end{methoddesc}
228
229\begin{methoddesc}[DOMImplementation]{createDocumentType}{qualifiedName, publicId, systemId}
230Return a new \class{DocumentType} object that encapsulates the given
231\var{qualifiedName}, \var{publicId}, and \var{systemId} strings,
232representing the information contained in an XML document type
233declaration.
234\end{methoddesc}
235
236
237\subsubsection{Node Objects \label{dom-node-objects}}
238
239All of the components of an XML document are subclasses of
240\class{Node}.
241
242\begin{memberdesc}[Node]{nodeType}
243An integer representing the node type. Symbolic constants for the
244types are on the \class{Node} object:
245\constant{ELEMENT_NODE}, \constant{ATTRIBUTE_NODE},
246\constant{TEXT_NODE}, \constant{CDATA_SECTION_NODE},
247\constant{ENTITY_NODE}, \constant{PROCESSING_INSTRUCTION_NODE},
248\constant{COMMENT_NODE}, \constant{DOCUMENT_NODE},
249\constant{DOCUMENT_TYPE_NODE}, \constant{NOTATION_NODE}.
250This is a read-only attribute.
251\end{memberdesc}
252
253\begin{memberdesc}[Node]{parentNode}
254The parent of the current node, or \code{None} for the document node.
255The value is always a \class{Node} object or \code{None}. For
256\class{Element} nodes, this will be the parent element, except for the
257root element, in which case it will be the \class{Document} object.
258For \class{Attr} nodes, this is always \code{None}.
259This is a read-only attribute.
260\end{memberdesc}
261
262\begin{memberdesc}[Node]{attributes}
263A \class{NamedNodeMap} of attribute objects. Only elements have
264actual values for this; others provide \code{None} for this attribute.
265This is a read-only attribute.
266\end{memberdesc}
267
268\begin{memberdesc}[Node]{previousSibling}
269The node that immediately precedes this one with the same parent. For
270instance the element with an end-tag that comes just before the
271\var{self} element's start-tag. Of course, XML documents are made
272up of more than just elements so the previous sibling could be text, a
273comment, or something else. If this node is the first child of the
274parent, this attribute will be \code{None}.
275This is a read-only attribute.
276\end{memberdesc}
277
278\begin{memberdesc}[Node]{nextSibling}
279The node that immediately follows this one with the same parent. See
280also \member{previousSibling}. If this is the last child of the
281parent, this attribute will be \code{None}.
282This is a read-only attribute.
283\end{memberdesc}
284
285\begin{memberdesc}[Node]{childNodes}
286A list of nodes contained within this node.
287This is a read-only attribute.
288\end{memberdesc}
289
290\begin{memberdesc}[Node]{firstChild}
291The first child of the node, if there are any, or \code{None}.
292This is a read-only attribute.
293\end{memberdesc}
294
295\begin{memberdesc}[Node]{lastChild}
296The last child of the node, if there are any, or \code{None}.
297This is a read-only attribute.
298\end{memberdesc}
299
300\begin{memberdesc}[Node]{localName}
301The part of the \member{tagName} following the colon if there is one,
302else the entire \member{tagName}. The value is a string.
303\end{memberdesc}
304
305\begin{memberdesc}[Node]{prefix}
306The part of the \member{tagName} preceding the colon if there is one,
307else the empty string. The value is a string, or \code{None}
308\end{memberdesc}
309
310\begin{memberdesc}[Node]{namespaceURI}
311The namespace associated with the element name. This will be a
312string or \code{None}. This is a read-only attribute.
313\end{memberdesc}
314
315\begin{memberdesc}[Node]{nodeName}
316This has a different meaning for each node type; see the DOM
317specification for details. You can always get the information you
318would get here from another property such as the \member{tagName}
319property for elements or the \member{name} property for attributes.
320For all node types, the value of this attribute will be either a
321string or \code{None}. This is a read-only attribute.
322\end{memberdesc}
323
324\begin{memberdesc}[Node]{nodeValue}
325This has a different meaning for each node type; see the DOM
326specification for details. The situation is similar to that with
327\member{nodeName}. The value is a string or \code{None}.
328\end{memberdesc}
329
330\begin{methoddesc}[Node]{hasAttributes}{}
331Returns true if the node has any attributes.
332\end{methoddesc}
333
334\begin{methoddesc}[Node]{hasChildNodes}{}
335Returns true if the node has any child nodes.
336\end{methoddesc}
337
338\begin{methoddesc}[Node]{isSameNode}{other}
339Returns true if \var{other} refers to the same node as this node.
340This is especially useful for DOM implementations which use any sort
341of proxy architecture (because more than one object can refer to the
342same node).
343
344\begin{notice}
345 This is based on a proposed DOM Level~3 API which is still in the
346 ``working draft'' stage, but this particular interface appears
347 uncontroversial. Changes from the W3C will not necessarily affect
348 this method in the Python DOM interface (though any new W3C API for
349 this would also be supported).
350\end{notice}
351\end{methoddesc}
352
353\begin{methoddesc}[Node]{appendChild}{newChild}
354Add a new child node to this node at the end of the list of children,
355returning \var{newChild}.
356\end{methoddesc}
357
358\begin{methoddesc}[Node]{insertBefore}{newChild, refChild}
359Insert a new child node before an existing child. It must be the case
360that \var{refChild} is a child of this node; if not,
361\exception{ValueError} is raised. \var{newChild} is returned. If
362\var{refChild} is \code{None}, it inserts \var{newChild} at the end of
363the children's list.
364\end{methoddesc}
365
366\begin{methoddesc}[Node]{removeChild}{oldChild}
367Remove a child node. \var{oldChild} must be a child of this node; if
368not, \exception{ValueError} is raised. \var{oldChild} is returned on
369success. If \var{oldChild} will not be used further, its
370\method{unlink()} method should be called.
371\end{methoddesc}
372
373\begin{methoddesc}[Node]{replaceChild}{newChild, oldChild}
374Replace an existing node with a new node. It must be the case that
375\var{oldChild} is a child of this node; if not,
376\exception{ValueError} is raised.
377\end{methoddesc}
378
379\begin{methoddesc}[Node]{normalize}{}
380Join adjacent text nodes so that all stretches of text are stored as
381single \class{Text} instances. This simplifies processing text from a
382DOM tree for many applications.
383\versionadded{2.1}
384\end{methoddesc}
385
386\begin{methoddesc}[Node]{cloneNode}{deep}
387Clone this node. Setting \var{deep} means to clone all child nodes as
388well. This returns the clone.
389\end{methoddesc}
390
391
392\subsubsection{NodeList Objects \label{dom-nodelist-objects}}
393
394A \class{NodeList} represents a sequence of nodes. These objects are
395used in two ways in the DOM Core recommendation: the
396\class{Element} objects provides one as its list of child nodes, and
397the \method{getElementsByTagName()} and
398\method{getElementsByTagNameNS()} methods of \class{Node} return
399objects with this interface to represent query results.
400
401The DOM Level~2 recommendation defines one method and one attribute
402for these objects:
403
404\begin{methoddesc}[NodeList]{item}{i}
405 Return the \var{i}'th item from the sequence, if there is one, or
406 \code{None}. The index \var{i} is not allowed to be less then zero
407 or greater than or equal to the length of the sequence.
408\end{methoddesc}
409
410\begin{memberdesc}[NodeList]{length}
411 The number of nodes in the sequence.
412\end{memberdesc}
413
414In addition, the Python DOM interface requires that some additional
415support is provided to allow \class{NodeList} objects to be used as
416Python sequences. All \class{NodeList} implementations must include
417support for \method{__len__()} and \method{__getitem__()}; this allows
418iteration over the \class{NodeList} in \keyword{for} statements and
419proper support for the \function{len()} built-in function.
420
421If a DOM implementation supports modification of the document, the
422\class{NodeList} implementation must also support the
423\method{__setitem__()} and \method{__delitem__()} methods.
424
425
426\subsubsection{DocumentType Objects \label{dom-documenttype-objects}}
427
428Information about the notations and entities declared by a document
429(including the external subset if the parser uses it and can provide
430the information) is available from a \class{DocumentType} object. The
431\class{DocumentType} for a document is available from the
432\class{Document} object's \member{doctype} attribute; if there is no
433\code{DOCTYPE} declaration for the document, the document's
434\member{doctype} attribute will be set to \code{None} instead of an
435instance of this interface.
436
437\class{DocumentType} is a specialization of \class{Node}, and adds the
438following attributes:
439
440\begin{memberdesc}[DocumentType]{publicId}
441 The public identifier for the external subset of the document type
442 definition. This will be a string or \code{None}.
443\end{memberdesc}
444
445\begin{memberdesc}[DocumentType]{systemId}
446 The system identifier for the external subset of the document type
447 definition. This will be a URI as a string, or \code{None}.
448\end{memberdesc}
449
450\begin{memberdesc}[DocumentType]{internalSubset}
451 A string giving the complete internal subset from the document.
452 This does not include the brackets which enclose the subset. If the
453 document has no internal subset, this should be \code{None}.
454\end{memberdesc}
455
456\begin{memberdesc}[DocumentType]{name}
457 The name of the root element as given in the \code{DOCTYPE}
458 declaration, if present.
459\end{memberdesc}
460
461\begin{memberdesc}[DocumentType]{entities}
462 This is a \class{NamedNodeMap} giving the definitions of external
463 entities. For entity names defined more than once, only the first
464 definition is provided (others are ignored as required by the XML
465 recommendation). This may be \code{None} if the information is not
466 provided by the parser, or if no entities are defined.
467\end{memberdesc}
468
469\begin{memberdesc}[DocumentType]{notations}
470 This is a \class{NamedNodeMap} giving the definitions of notations.
471 For notation names defined more than once, only the first definition
472 is provided (others are ignored as required by the XML
473 recommendation). This may be \code{None} if the information is not
474 provided by the parser, or if no notations are defined.
475\end{memberdesc}
476
477
478\subsubsection{Document Objects \label{dom-document-objects}}
479
480A \class{Document} represents an entire XML document, including its
481constituent elements, attributes, processing instructions, comments
482etc. Remeber that it inherits properties from \class{Node}.
483
484\begin{memberdesc}[Document]{documentElement}
485The one and only root element of the document.
486\end{memberdesc}
487
488\begin{methoddesc}[Document]{createElement}{tagName}
489Create and return a new element node. The element is not inserted
490into the document when it is created. You need to explicitly insert
491it with one of the other methods such as \method{insertBefore()} or
492\method{appendChild()}.
493\end{methoddesc}
494
495\begin{methoddesc}[Document]{createElementNS}{namespaceURI, tagName}
496Create and return a new element with a namespace. The
497\var{tagName} may have a prefix. The element is not inserted into the
498document when it is created. You need to explicitly insert it with
499one of the other methods such as \method{insertBefore()} or
500\method{appendChild()}.
501\end{methoddesc}
502
503\begin{methoddesc}[Document]{createTextNode}{data}
504Create and return a text node containing the data passed as a
505parameter. As with the other creation methods, this one does not
506insert the node into the tree.
507\end{methoddesc}
508
509\begin{methoddesc}[Document]{createComment}{data}
510Create and return a comment node containing the data passed as a
511parameter. As with the other creation methods, this one does not
512insert the node into the tree.
513\end{methoddesc}
514
515\begin{methoddesc}[Document]{createProcessingInstruction}{target, data}
516Create and return a processing instruction node containing the
517\var{target} and \var{data} passed as parameters. As with the other
518creation methods, this one does not insert the node into the tree.
519\end{methoddesc}
520
521\begin{methoddesc}[Document]{createAttribute}{name}
522Create and return an attribute node. This method does not associate
523the attribute node with any particular element. You must use
524\method{setAttributeNode()} on the appropriate \class{Element} object
525to use the newly created attribute instance.
526\end{methoddesc}
527
528\begin{methoddesc}[Document]{createAttributeNS}{namespaceURI, qualifiedName}
529Create and return an attribute node with a namespace. The
530\var{tagName} may have a prefix. This method does not associate the
531attribute node with any particular element. You must use
532\method{setAttributeNode()} on the appropriate \class{Element} object
533to use the newly created attribute instance.
534\end{methoddesc}
535
536\begin{methoddesc}[Document]{getElementsByTagName}{tagName}
537Search for all descendants (direct children, children's children,
538etc.) with a particular element type name.
539\end{methoddesc}
540
541\begin{methoddesc}[Document]{getElementsByTagNameNS}{namespaceURI, localName}
542Search for all descendants (direct children, children's children,
543etc.) with a particular namespace URI and localname. The localname is
544the part of the namespace after the prefix.
545\end{methoddesc}
546
547
548\subsubsection{Element Objects \label{dom-element-objects}}
549
550\class{Element} is a subclass of \class{Node}, so inherits all the
551attributes of that class.
552
553\begin{memberdesc}[Element]{tagName}
554The element type name. In a namespace-using document it may have
555colons in it. The value is a string.
556\end{memberdesc}
557
558\begin{methoddesc}[Element]{getElementsByTagName}{tagName}
559Same as equivalent method in the \class{Document} class.
560\end{methoddesc}
561
562\begin{methoddesc}[Element]{getElementsByTagNameNS}{tagName}
563Same as equivalent method in the \class{Document} class.
564\end{methoddesc}
565
566\begin{methoddesc}[Element]{hasAttribute}{name}
567Returns true if the element has an attribute named by \var{name}.
568\end{methoddesc}
569
570\begin{methoddesc}[Element]{hasAttributeNS}{namespaceURI, localName}
571Returns true if the element has an attribute named by
572\var{namespaceURI} and \var{localName}.
573\end{methoddesc}
574
575\begin{methoddesc}[Element]{getAttribute}{name}
576Return the value of the attribute named by \var{name} as a
577string. If no such attribute exists, an empty string is returned,
578as if the attribute had no value.
579\end{methoddesc}
580
581\begin{methoddesc}[Element]{getAttributeNode}{attrname}
582Return the \class{Attr} node for the attribute named by
583\var{attrname}.
584\end{methoddesc}
585
586\begin{methoddesc}[Element]{getAttributeNS}{namespaceURI, localName}
587Return the value of the attribute named by \var{namespaceURI} and
588\var{localName} as a string. If no such attribute exists, an empty
589string is returned, as if the attribute had no value.
590\end{methoddesc}
591
592\begin{methoddesc}[Element]{getAttributeNodeNS}{namespaceURI, localName}
593Return an attribute value as a node, given a \var{namespaceURI} and
594\var{localName}.
595\end{methoddesc}
596
597\begin{methoddesc}[Element]{removeAttribute}{name}
598Remove an attribute by name. No exception is raised if there is no
599matching attribute.
600\end{methoddesc}
601
602\begin{methoddesc}[Element]{removeAttributeNode}{oldAttr}
603Remove and return \var{oldAttr} from the attribute list, if present.
604If \var{oldAttr} is not present, \exception{NotFoundErr} is raised.
605\end{methoddesc}
606
607\begin{methoddesc}[Element]{removeAttributeNS}{namespaceURI, localName}
608Remove an attribute by name. Note that it uses a localName, not a
609qname. No exception is raised if there is no matching attribute.
610\end{methoddesc}
611
612\begin{methoddesc}[Element]{setAttribute}{name, value}
613Set an attribute value from a string.
614\end{methoddesc}
615
616\begin{methoddesc}[Element]{setAttributeNode}{newAttr}
617Add a new attribute node to the element, replacing an existing
618attribute if necessary if the \member{name} attribute matches. If a
619replacement occurs, the old attribute node will be returned. If
620\var{newAttr} is already in use, \exception{InuseAttributeErr} will be
621raised.
622\end{methoddesc}
623
624\begin{methoddesc}[Element]{setAttributeNodeNS}{newAttr}
625Add a new attribute node to the element, replacing an existing
626attribute if necessary if the \member{namespaceURI} and
627\member{localName} attributes match. If a replacement occurs, the old
628attribute node will be returned. If \var{newAttr} is already in use,
629\exception{InuseAttributeErr} will be raised.
630\end{methoddesc}
631
632\begin{methoddesc}[Element]{setAttributeNS}{namespaceURI, qname, value}
633Set an attribute value from a string, given a \var{namespaceURI} and a
634\var{qname}. Note that a qname is the whole attribute name. This is
635different than above.
636\end{methoddesc}
637
638
639\subsubsection{Attr Objects \label{dom-attr-objects}}
640
641\class{Attr} inherits from \class{Node}, so inherits all its
642attributes.
643
644\begin{memberdesc}[Attr]{name}
645The attribute name. In a namespace-using document it may have colons
646in it.
647\end{memberdesc}
648
649\begin{memberdesc}[Attr]{localName}
650The part of the name following the colon if there is one, else the
651entire name. This is a read-only attribute.
652\end{memberdesc}
653
654\begin{memberdesc}[Attr]{prefix}
655The part of the name preceding the colon if there is one, else the
656empty string.
657\end{memberdesc}
658
659
660\subsubsection{NamedNodeMap Objects \label{dom-attributelist-objects}}
661
662\class{NamedNodeMap} does \emph{not} inherit from \class{Node}.
663
664\begin{memberdesc}[NamedNodeMap]{length}
665The length of the attribute list.
666\end{memberdesc}
667
668\begin{methoddesc}[NamedNodeMap]{item}{index}
669Return an attribute with a particular index. The order you get the
670attributes in is arbitrary but will be consistent for the life of a
671DOM. Each item is an attribute node. Get its value with the
672\member{value} attribute.
673\end{methoddesc}
674
675There are also experimental methods that give this class more mapping
676behavior. You can use them or you can use the standardized
677\method{getAttribute*()} family of methods on the \class{Element}
678objects.
679
680
681\subsubsection{Comment Objects \label{dom-comment-objects}}
682
683\class{Comment} represents a comment in the XML document. It is a
684subclass of \class{Node}, but cannot have child nodes.
685
686\begin{memberdesc}[Comment]{data}
687The content of the comment as a string. The attribute contains all
688characters between the leading \code{<!-}\code{-} and trailing
689\code{-}\code{->}, but does not include them.
690\end{memberdesc}
691
692
693\subsubsection{Text and CDATASection Objects \label{dom-text-objects}}
694
695The \class{Text} interface represents text in the XML document. If
696the parser and DOM implementation support the DOM's XML extension,
697portions of the text enclosed in CDATA marked sections are stored in
698\class{CDATASection} objects. These two interfaces are identical, but
699provide different values for the \member{nodeType} attribute.
700
701These interfaces extend the \class{Node} interface. They cannot have
702child nodes.
703
704\begin{memberdesc}[Text]{data}
705The content of the text node as a string.
706\end{memberdesc}
707
708\begin{notice}
709 The use of a \class{CDATASection} node does not indicate that the
710 node represents a complete CDATA marked section, only that the
711 content of the node was part of a CDATA section. A single CDATA
712 section may be represented by more than one node in the document
713 tree. There is no way to determine whether two adjacent
714 \class{CDATASection} nodes represent different CDATA marked
715 sections.
716\end{notice}
717
718
719\subsubsection{ProcessingInstruction Objects \label{dom-pi-objects}}
720
721Represents a processing instruction in the XML document; this inherits
722from the \class{Node} interface and cannot have child nodes.
723
724\begin{memberdesc}[ProcessingInstruction]{target}
725The content of the processing instruction up to the first whitespace
726character. This is a read-only attribute.
727\end{memberdesc}
728
729\begin{memberdesc}[ProcessingInstruction]{data}
730The content of the processing instruction following the first
731whitespace character.
732\end{memberdesc}
733
734
735\subsubsection{Exceptions \label{dom-exceptions}}
736
737\versionadded{2.1}
738
739The DOM Level~2 recommendation defines a single exception,
740\exception{DOMException}, and a number of constants that allow
741applications to determine what sort of error occurred.
742\exception{DOMException} instances carry a \member{code} attribute
743that provides the appropriate value for the specific exception.
744
745The Python DOM interface provides the constants, but also expands the
746set of exceptions so that a specific exception exists for each of the
747exception codes defined by the DOM. The implementations must raise
748the appropriate specific exception, each of which carries the
749appropriate value for the \member{code} attribute.
750
751\begin{excdesc}{DOMException}
752 Base exception class used for all specific DOM exceptions. This
753 exception class cannot be directly instantiated.
754\end{excdesc}
755
756\begin{excdesc}{DomstringSizeErr}
757 Raised when a specified range of text does not fit into a string.
758 This is not known to be used in the Python DOM implementations, but
759 may be received from DOM implementations not written in Python.
760\end{excdesc}
761
762\begin{excdesc}{HierarchyRequestErr}
763 Raised when an attempt is made to insert a node where the node type
764 is not allowed.
765\end{excdesc}
766
767\begin{excdesc}{IndexSizeErr}
768 Raised when an index or size parameter to a method is negative or
769 exceeds the allowed values.
770\end{excdesc}
771
772\begin{excdesc}{InuseAttributeErr}
773 Raised when an attempt is made to insert an \class{Attr} node that
774 is already present elsewhere in the document.
775\end{excdesc}
776
777\begin{excdesc}{InvalidAccessErr}
778 Raised if a parameter or an operation is not supported on the
779 underlying object.
780\end{excdesc}
781
782\begin{excdesc}{InvalidCharacterErr}
783 This exception is raised when a string parameter contains a
784 character that is not permitted in the context it's being used in by
785 the XML 1.0 recommendation. For example, attempting to create an
786 \class{Element} node with a space in the element type name will
787 cause this error to be raised.
788\end{excdesc}
789
790\begin{excdesc}{InvalidModificationErr}
791 Raised when an attempt is made to modify the type of a node.
792\end{excdesc}
793
794\begin{excdesc}{InvalidStateErr}
795 Raised when an attempt is made to use an object that is not defined or is no
796 longer usable.
797\end{excdesc}
798
799\begin{excdesc}{NamespaceErr}
800 If an attempt is made to change any object in a way that is not
801 permitted with regard to the
802 \citetitle[http://www.w3.org/TR/REC-xml-names/]{Namespaces in XML}
803 recommendation, this exception is raised.
804\end{excdesc}
805
806\begin{excdesc}{NotFoundErr}
807 Exception when a node does not exist in the referenced context. For
808 example, \method{NamedNodeMap.removeNamedItem()} will raise this if
809 the node passed in does not exist in the map.
810\end{excdesc}
811
812\begin{excdesc}{NotSupportedErr}
813 Raised when the implementation does not support the requested type
814 of object or operation.
815\end{excdesc}
816
817\begin{excdesc}{NoDataAllowedErr}
818 This is raised if data is specified for a node which does not
819 support data.
820 % XXX a better explanation is needed!
821\end{excdesc}
822
823\begin{excdesc}{NoModificationAllowedErr}
824 Raised on attempts to modify an object where modifications are not
825 allowed (such as for read-only nodes).
826\end{excdesc}
827
828\begin{excdesc}{SyntaxErr}
829 Raised when an invalid or illegal string is specified.
830 % XXX how is this different from InvalidCharacterErr ???
831\end{excdesc}
832
833\begin{excdesc}{WrongDocumentErr}
834 Raised when a node is inserted in a different document than it
835 currently belongs to, and the implementation does not support
836 migrating the node from one document to the other.
837\end{excdesc}
838
839The exception codes defined in the DOM recommendation map to the
840exceptions described above according to this table:
841
842\begin{tableii}{l|l}{constant}{Constant}{Exception}
843 \lineii{DOMSTRING_SIZE_ERR}{\exception{DomstringSizeErr}}
844 \lineii{HIERARCHY_REQUEST_ERR}{\exception{HierarchyRequestErr}}
845 \lineii{INDEX_SIZE_ERR}{\exception{IndexSizeErr}}
846 \lineii{INUSE_ATTRIBUTE_ERR}{\exception{InuseAttributeErr}}
847 \lineii{INVALID_ACCESS_ERR}{\exception{InvalidAccessErr}}
848 \lineii{INVALID_CHARACTER_ERR}{\exception{InvalidCharacterErr}}
849 \lineii{INVALID_MODIFICATION_ERR}{\exception{InvalidModificationErr}}
850 \lineii{INVALID_STATE_ERR}{\exception{InvalidStateErr}}
851 \lineii{NAMESPACE_ERR}{\exception{NamespaceErr}}
852 \lineii{NOT_FOUND_ERR}{\exception{NotFoundErr}}
853 \lineii{NOT_SUPPORTED_ERR}{\exception{NotSupportedErr}}
854 \lineii{NO_DATA_ALLOWED_ERR}{\exception{NoDataAllowedErr}}
855 \lineii{NO_MODIFICATION_ALLOWED_ERR}{\exception{NoModificationAllowedErr}}
856 \lineii{SYNTAX_ERR}{\exception{SyntaxErr}}
857 \lineii{WRONG_DOCUMENT_ERR}{\exception{WrongDocumentErr}}
858\end{tableii}
859
860
861\subsection{Conformance \label{dom-conformance}}
862
863This section describes the conformance requirements and relationships
864between the Python DOM API, the W3C DOM recommendations, and the OMG
865IDL mapping for Python.
866
867
868\subsubsection{Type Mapping \label{dom-type-mapping}}
869
870The primitive IDL types used in the DOM specification are mapped to
871Python types according to the following table.
872
873\begin{tableii}{l|l}{code}{IDL Type}{Python Type}
874 \lineii{boolean}{\code{IntegerType} (with a value of \code{0} or \code{1})}
875 \lineii{int}{\code{IntegerType}}
876 \lineii{long int}{\code{IntegerType}}
877 \lineii{unsigned int}{\code{IntegerType}}
878\end{tableii}
879
880Additionally, the \class{DOMString} defined in the recommendation is
881mapped to a Python string or Unicode string. Applications should
882be able to handle Unicode whenever a string is returned from the DOM.
883
884The IDL \keyword{null} value is mapped to \code{None}, which may be
885accepted or provided by the implementation whenever \keyword{null} is
886allowed by the API.
887
888
889\subsubsection{Accessor Methods \label{dom-accessor-methods}}
890
891The mapping from OMG IDL to Python defines accessor functions for IDL
892\keyword{attribute} declarations in much the way the Java mapping
893does. Mapping the IDL declarations
894
895\begin{verbatim}
896readonly attribute string someValue;
897 attribute string anotherValue;
898\end{verbatim}
899
900yields three accessor functions: a ``get'' method for
901\member{someValue} (\method{_get_someValue()}), and ``get'' and
902``set'' methods for
903\member{anotherValue} (\method{_get_anotherValue()} and
904\method{_set_anotherValue()}). The mapping, in particular, does not
905require that the IDL attributes are accessible as normal Python
906attributes: \code{\var{object}.someValue} is \emph{not} required to
907work, and may raise an \exception{AttributeError}.
908
909The Python DOM API, however, \emph{does} require that normal attribute
910access work. This means that the typical surrogates generated by
911Python IDL compilers are not likely to work, and wrapper objects may
912be needed on the client if the DOM objects are accessed via CORBA.
913While this does require some additional consideration for CORBA DOM
914clients, the implementers with experience using DOM over CORBA from
915Python do not consider this a problem. Attributes that are declared
916\keyword{readonly} may not restrict write access in all DOM
917implementations.
918
919In the Python DOM API, accessor functions are not required. If provided,
920they should take the form defined by the Python IDL mapping, but
921these methods are considered unnecessary since the attributes are
922accessible directly from Python. ``Set'' accessors should never be
923provided for \keyword{readonly} attributes.
924
925The IDL definitions do not fully embody the requirements of the W3C DOM
926API, such as the notion of certain objects, such as the return value of
927\method{getElementsByTagName()}, being ``live''. The Python DOM API
928does not require implementations to enforce such requirements.
Note: See TracBrowser for help on using the repository browser.