1 | \input texinfo
|
---|
2 | @c Copyright 1991, 1992, 1993, 1994, 1995, 1996, 1997, 1998, 1999, 2000,
|
---|
3 | @c 2001, 2002, 2003
|
---|
4 | @c Free Software Foundation, Inc.
|
---|
5 | @setfilename internals.info
|
---|
6 | @node Top
|
---|
7 | @top Assembler Internals
|
---|
8 | @raisesections
|
---|
9 | @cindex internals
|
---|
10 |
|
---|
11 | This chapter describes the internals of the assembler. It is incomplete, but
|
---|
12 | it may help a bit.
|
---|
13 |
|
---|
14 | This chapter is not updated regularly, and it may be out of date.
|
---|
15 |
|
---|
16 | @menu
|
---|
17 | * GAS versions:: GAS versions
|
---|
18 | * Data types:: Data types
|
---|
19 | * GAS processing:: What GAS does when it runs
|
---|
20 | * Porting GAS:: Porting GAS
|
---|
21 | * Relaxation:: Relaxation
|
---|
22 | * Broken words:: Broken words
|
---|
23 | * Internal functions:: Internal functions
|
---|
24 | * Test suite:: Test suite
|
---|
25 | @end menu
|
---|
26 |
|
---|
27 | @node GAS versions
|
---|
28 | @section GAS versions
|
---|
29 |
|
---|
30 | GAS has acquired layers of code over time. The original GAS only supported the
|
---|
31 | a.out object file format, with three sections. Support for multiple sections
|
---|
32 | has been added in two different ways.
|
---|
33 |
|
---|
34 | The preferred approach is to use the version of GAS created when the symbol
|
---|
35 | @code{BFD_ASSEMBLER} is defined. The other versions of GAS are documented for
|
---|
36 | historical purposes, and to help anybody who has to debug code written for
|
---|
37 | them.
|
---|
38 |
|
---|
39 | The type @code{segT} is used to represent a section in code which must work
|
---|
40 | with all versions of GAS.
|
---|
41 |
|
---|
42 | @menu
|
---|
43 | * Original GAS:: Original GAS version
|
---|
44 | * MANY_SEGMENTS:: MANY_SEGMENTS gas version
|
---|
45 | * BFD_ASSEMBLER:: BFD_ASSEMBLER gas version
|
---|
46 | @end menu
|
---|
47 |
|
---|
48 | @node Original GAS
|
---|
49 | @subsection Original GAS
|
---|
50 |
|
---|
51 | The original GAS only supported the a.out object file format with three
|
---|
52 | sections: @samp{.text}, @samp{.data}, and @samp{.bss}. This is the version of
|
---|
53 | GAS that is compiled if neither @code{BFD_ASSEMBLER} nor @code{MANY_SEGMENTS}
|
---|
54 | is defined. This version of GAS is still used for the m68k-aout target, and
|
---|
55 | perhaps others.
|
---|
56 |
|
---|
57 | This version of GAS should not be used for any new development.
|
---|
58 |
|
---|
59 | There is still code that is specific to this version of GAS, notably in
|
---|
60 | @file{write.c}. There is no way for this code to loop through all the
|
---|
61 | sections; it simply looks at global variables like @code{text_frag_root} and
|
---|
62 | @code{data_frag_root}.
|
---|
63 |
|
---|
64 | The type @code{segT} is an enum.
|
---|
65 |
|
---|
66 | @node MANY_SEGMENTS
|
---|
67 | @subsection MANY_SEGMENTS gas version
|
---|
68 | @cindex MANY_SEGMENTS
|
---|
69 |
|
---|
70 | The @code{MANY_SEGMENTS} version of gas is only used for COFF. It uses the BFD
|
---|
71 | library, but it writes out all the data itself using @code{bfd_write}. This
|
---|
72 | version of gas supports up to 40 normal sections. The section names are stored
|
---|
73 | in the @code{seg_name} array. Other information is stored in the
|
---|
74 | @code{segment_info} array.
|
---|
75 |
|
---|
76 | The type @code{segT} is an enum. Code that wants to examine all the sections
|
---|
77 | can use a @code{segT} variable as loop index from @code{SEG_E0} up to but not
|
---|
78 | including @code{SEG_UNKNOWN}.
|
---|
79 |
|
---|
80 | Most of the code specific to this version of GAS is in the file
|
---|
81 | @file{config/obj-coff.c}, in the portion of that file that is compiled when
|
---|
82 | @code{BFD_ASSEMBLER} is not defined.
|
---|
83 |
|
---|
84 | This version of GAS is still used for several COFF targets.
|
---|
85 |
|
---|
86 | @node BFD_ASSEMBLER
|
---|
87 | @subsection BFD_ASSEMBLER gas version
|
---|
88 | @cindex BFD_ASSEMBLER
|
---|
89 |
|
---|
90 | The preferred version of GAS is the @code{BFD_ASSEMBLER} version. In this
|
---|
91 | version of GAS, the output file is a normal BFD, and the BFD routines are used
|
---|
92 | to generate the output.
|
---|
93 |
|
---|
94 | @code{BFD_ASSEMBLER} will automatically be used for certain targets, including
|
---|
95 | those that use the ELF, ECOFF, and SOM object file formats, and also all Alpha,
|
---|
96 | MIPS, PowerPC, and SPARC targets. You can force the use of
|
---|
97 | @code{BFD_ASSEMBLER} for other targets with the configure option
|
---|
98 | @samp{--enable-bfd-assembler}; however, it has not been tested for many
|
---|
99 | targets, and can not be assumed to work.
|
---|
100 |
|
---|
101 | @node Data types
|
---|
102 | @section Data types
|
---|
103 | @cindex internals, data types
|
---|
104 |
|
---|
105 | This section describes some fundamental GAS data types.
|
---|
106 |
|
---|
107 | @menu
|
---|
108 | * Symbols:: The symbolS structure
|
---|
109 | * Expressions:: The expressionS structure
|
---|
110 | * Fixups:: The fixS structure
|
---|
111 | * Frags:: The fragS structure
|
---|
112 | @end menu
|
---|
113 |
|
---|
114 | @node Symbols
|
---|
115 | @subsection Symbols
|
---|
116 | @cindex internals, symbols
|
---|
117 | @cindex symbols, internal
|
---|
118 | @cindex symbolS structure
|
---|
119 |
|
---|
120 | The definition for the symbol structure, @code{symbolS}, is located in
|
---|
121 | @file{struc-symbol.h}.
|
---|
122 |
|
---|
123 | In general, the fields of this structure may not be referred to directly.
|
---|
124 | Instead, you must use one of the accessor functions defined in @file{symbol.h}.
|
---|
125 | These accessor functions should work for any GAS version.
|
---|
126 |
|
---|
127 | Symbol structures contain the following fields:
|
---|
128 |
|
---|
129 | @table @code
|
---|
130 | @item sy_value
|
---|
131 | This is an @code{expressionS} that describes the value of the symbol. It might
|
---|
132 | refer to one or more other symbols; if so, its true value may not be known
|
---|
133 | until @code{resolve_symbol_value} is called with @var{finalize_syms} non-zero
|
---|
134 | in @code{write_object_file}.
|
---|
135 |
|
---|
136 | The expression is often simply a constant. Before @code{resolve_symbol_value}
|
---|
137 | is called with @var{finalize_syms} set, the value is the offset from the frag
|
---|
138 | (@pxref{Frags}). Afterward, the frag address has been added in.
|
---|
139 |
|
---|
140 | @item sy_resolved
|
---|
141 | This field is non-zero if the symbol's value has been completely resolved. It
|
---|
142 | is used during the final pass over the symbol table.
|
---|
143 |
|
---|
144 | @item sy_resolving
|
---|
145 | This field is used to detect loops while resolving the symbol's value.
|
---|
146 |
|
---|
147 | @item sy_used_in_reloc
|
---|
148 | This field is non-zero if the symbol is used by a relocation entry. If a local
|
---|
149 | symbol is used in a relocation entry, it must be possible to redirect those
|
---|
150 | relocations to other symbols, or this symbol cannot be removed from the final
|
---|
151 | symbol list.
|
---|
152 |
|
---|
153 | @item sy_next
|
---|
154 | @itemx sy_previous
|
---|
155 | These pointers to other @code{symbolS} structures describe a singly or doubly
|
---|
156 | linked list. (If @code{SYMBOLS_NEED_BACKPOINTERS} is not defined, the
|
---|
157 | @code{sy_previous} field will be omitted; @code{SYMBOLS_NEED_BACKPOINTERS} is
|
---|
158 | always defined if @code{BFD_ASSEMBLER}.) These fields should be accessed with
|
---|
159 | the @code{symbol_next} and @code{symbol_previous} macros.
|
---|
160 |
|
---|
161 | @item sy_frag
|
---|
162 | This points to the frag (@pxref{Frags}) that this symbol is attached to.
|
---|
163 |
|
---|
164 | @item sy_used
|
---|
165 | Whether the symbol is used as an operand or in an expression. Note: Not all of
|
---|
166 | the backends keep this information accurate; backends which use this bit are
|
---|
167 | responsible for setting it when a symbol is used in backend routines.
|
---|
168 |
|
---|
169 | @item sy_mri_common
|
---|
170 | Whether the symbol is an MRI common symbol created by the @code{COMMON}
|
---|
171 | pseudo-op when assembling in MRI mode.
|
---|
172 |
|
---|
173 | @item bsym
|
---|
174 | If @code{BFD_ASSEMBLER} is defined, this points to the BFD @code{asymbol} that
|
---|
175 | will be used in writing the object file.
|
---|
176 |
|
---|
177 | @item sy_name_offset
|
---|
178 | (Only used if @code{BFD_ASSEMBLER} is not defined.) This is the position of
|
---|
179 | the symbol's name in the string table of the object file. On some formats,
|
---|
180 | this will start at position 4, with position 0 reserved for unnamed symbols.
|
---|
181 | This field is not used until @code{write_object_file} is called.
|
---|
182 |
|
---|
183 | @item sy_symbol
|
---|
184 | (Only used if @code{BFD_ASSEMBLER} is not defined.) This is the
|
---|
185 | format-specific symbol structure, as it would be written into the object file.
|
---|
186 |
|
---|
187 | @item sy_number
|
---|
188 | (Only used if @code{BFD_ASSEMBLER} is not defined.) This is a 24-bit symbol
|
---|
189 | number, for use in constructing relocation table entries.
|
---|
190 |
|
---|
191 | @item sy_obj
|
---|
192 | This format-specific data is of type @code{OBJ_SYMFIELD_TYPE}. If no macro by
|
---|
193 | that name is defined in @file{obj-format.h}, this field is not defined.
|
---|
194 |
|
---|
195 | @item sy_tc
|
---|
196 | This processor-specific data is of type @code{TC_SYMFIELD_TYPE}. If no macro
|
---|
197 | by that name is defined in @file{targ-cpu.h}, this field is not defined.
|
---|
198 |
|
---|
199 | @end table
|
---|
200 |
|
---|
201 | Here is a description of the accessor functions. These should be used rather
|
---|
202 | than referring to the fields of @code{symbolS} directly.
|
---|
203 |
|
---|
204 | @table @code
|
---|
205 | @item S_SET_VALUE
|
---|
206 | @cindex S_SET_VALUE
|
---|
207 | Set the symbol's value.
|
---|
208 |
|
---|
209 | @item S_GET_VALUE
|
---|
210 | @cindex S_GET_VALUE
|
---|
211 | Get the symbol's value. This will cause @code{resolve_symbol_value} to be
|
---|
212 | called if necessary.
|
---|
213 |
|
---|
214 | @item S_SET_SEGMENT
|
---|
215 | @cindex S_SET_SEGMENT
|
---|
216 | Set the section of the symbol.
|
---|
217 |
|
---|
218 | @item S_GET_SEGMENT
|
---|
219 | @cindex S_GET_SEGMENT
|
---|
220 | Get the symbol's section.
|
---|
221 |
|
---|
222 | @item S_GET_NAME
|
---|
223 | @cindex S_GET_NAME
|
---|
224 | Get the name of the symbol.
|
---|
225 |
|
---|
226 | @item S_SET_NAME
|
---|
227 | @cindex S_SET_NAME
|
---|
228 | Set the name of the symbol.
|
---|
229 |
|
---|
230 | @item S_IS_EXTERNAL
|
---|
231 | @cindex S_IS_EXTERNAL
|
---|
232 | Return non-zero if the symbol is externally visible.
|
---|
233 |
|
---|
234 | @item S_IS_EXTERN
|
---|
235 | @cindex S_IS_EXTERN
|
---|
236 | A synonym for @code{S_IS_EXTERNAL}. Don't use it.
|
---|
237 |
|
---|
238 | @item S_IS_WEAK
|
---|
239 | @cindex S_IS_WEAK
|
---|
240 | Return non-zero if the symbol is weak.
|
---|
241 |
|
---|
242 | @item S_IS_COMMON
|
---|
243 | @cindex S_IS_COMMON
|
---|
244 | Return non-zero if this is a common symbol. Common symbols are sometimes
|
---|
245 | represented as undefined symbols with a value, in which case this function will
|
---|
246 | not be reliable.
|
---|
247 |
|
---|
248 | @item S_IS_DEFINED
|
---|
249 | @cindex S_IS_DEFINED
|
---|
250 | Return non-zero if this symbol is defined. This function is not reliable when
|
---|
251 | called on a common symbol.
|
---|
252 |
|
---|
253 | @item S_IS_DEBUG
|
---|
254 | @cindex S_IS_DEBUG
|
---|
255 | Return non-zero if this is a debugging symbol.
|
---|
256 |
|
---|
257 | @item S_IS_LOCAL
|
---|
258 | @cindex S_IS_LOCAL
|
---|
259 | Return non-zero if this is a local assembler symbol which should not be
|
---|
260 | included in the final symbol table. Note that this is not the opposite of
|
---|
261 | @code{S_IS_EXTERNAL}. The @samp{-L} assembler option affects the return value
|
---|
262 | of this function.
|
---|
263 |
|
---|
264 | @item S_SET_EXTERNAL
|
---|
265 | @cindex S_SET_EXTERNAL
|
---|
266 | Mark the symbol as externally visible.
|
---|
267 |
|
---|
268 | @item S_CLEAR_EXTERNAL
|
---|
269 | @cindex S_CLEAR_EXTERNAL
|
---|
270 | Mark the symbol as not externally visible.
|
---|
271 |
|
---|
272 | @item S_SET_WEAK
|
---|
273 | @cindex S_SET_WEAK
|
---|
274 | Mark the symbol as weak.
|
---|
275 |
|
---|
276 | @item S_GET_TYPE
|
---|
277 | @item S_GET_DESC
|
---|
278 | @item S_GET_OTHER
|
---|
279 | @cindex S_GET_TYPE
|
---|
280 | @cindex S_GET_DESC
|
---|
281 | @cindex S_GET_OTHER
|
---|
282 | Get the @code{type}, @code{desc}, and @code{other} fields of the symbol. These
|
---|
283 | are only defined for object file formats for which they make sense (primarily
|
---|
284 | a.out).
|
---|
285 |
|
---|
286 | @item S_SET_TYPE
|
---|
287 | @item S_SET_DESC
|
---|
288 | @item S_SET_OTHER
|
---|
289 | @cindex S_SET_TYPE
|
---|
290 | @cindex S_SET_DESC
|
---|
291 | @cindex S_SET_OTHER
|
---|
292 | Set the @code{type}, @code{desc}, and @code{other} fields of the symbol. These
|
---|
293 | are only defined for object file formats for which they make sense (primarily
|
---|
294 | a.out).
|
---|
295 |
|
---|
296 | @item S_GET_SIZE
|
---|
297 | @cindex S_GET_SIZE
|
---|
298 | Get the size of a symbol. This is only defined for object file formats for
|
---|
299 | which it makes sense (primarily ELF).
|
---|
300 |
|
---|
301 | @item S_SET_SIZE
|
---|
302 | @cindex S_SET_SIZE
|
---|
303 | Set the size of a symbol. This is only defined for object file formats for
|
---|
304 | which it makes sense (primarily ELF).
|
---|
305 |
|
---|
306 | @item symbol_get_value_expression
|
---|
307 | @cindex symbol_get_value_expression
|
---|
308 | Get a pointer to an @code{expressionS} structure which represents the value of
|
---|
309 | the symbol as an expression.
|
---|
310 |
|
---|
311 | @item symbol_set_value_expression
|
---|
312 | @cindex symbol_set_value_expression
|
---|
313 | Set the value of a symbol to an expression.
|
---|
314 |
|
---|
315 | @item symbol_set_frag
|
---|
316 | @cindex symbol_set_frag
|
---|
317 | Set the frag where a symbol is defined.
|
---|
318 |
|
---|
319 | @item symbol_get_frag
|
---|
320 | @cindex symbol_get_frag
|
---|
321 | Get the frag where a symbol is defined.
|
---|
322 |
|
---|
323 | @item symbol_mark_used
|
---|
324 | @cindex symbol_mark_used
|
---|
325 | Mark a symbol as having been used in an expression.
|
---|
326 |
|
---|
327 | @item symbol_clear_used
|
---|
328 | @cindex symbol_clear_used
|
---|
329 | Clear the mark indicating that a symbol was used in an expression.
|
---|
330 |
|
---|
331 | @item symbol_used_p
|
---|
332 | @cindex symbol_used_p
|
---|
333 | Return whether a symbol was used in an expression.
|
---|
334 |
|
---|
335 | @item symbol_mark_used_in_reloc
|
---|
336 | @cindex symbol_mark_used_in_reloc
|
---|
337 | Mark a symbol as having been used by a relocation.
|
---|
338 |
|
---|
339 | @item symbol_clear_used_in_reloc
|
---|
340 | @cindex symbol_clear_used_in_reloc
|
---|
341 | Clear the mark indicating that a symbol was used in a relocation.
|
---|
342 |
|
---|
343 | @item symbol_used_in_reloc_p
|
---|
344 | @cindex symbol_used_in_reloc_p
|
---|
345 | Return whether a symbol was used in a relocation.
|
---|
346 |
|
---|
347 | @item symbol_mark_mri_common
|
---|
348 | @cindex symbol_mark_mri_common
|
---|
349 | Mark a symbol as an MRI common symbol.
|
---|
350 |
|
---|
351 | @item symbol_clear_mri_common
|
---|
352 | @cindex symbol_clear_mri_common
|
---|
353 | Clear the mark indicating that a symbol is an MRI common symbol.
|
---|
354 |
|
---|
355 | @item symbol_mri_common_p
|
---|
356 | @cindex symbol_mri_common_p
|
---|
357 | Return whether a symbol is an MRI common symbol.
|
---|
358 |
|
---|
359 | @item symbol_mark_written
|
---|
360 | @cindex symbol_mark_written
|
---|
361 | Mark a symbol as having been written.
|
---|
362 |
|
---|
363 | @item symbol_clear_written
|
---|
364 | @cindex symbol_clear_written
|
---|
365 | Clear the mark indicating that a symbol was written.
|
---|
366 |
|
---|
367 | @item symbol_written_p
|
---|
368 | @cindex symbol_written_p
|
---|
369 | Return whether a symbol was written.
|
---|
370 |
|
---|
371 | @item symbol_mark_resolved
|
---|
372 | @cindex symbol_mark_resolved
|
---|
373 | Mark a symbol as having been resolved.
|
---|
374 |
|
---|
375 | @item symbol_resolved_p
|
---|
376 | @cindex symbol_resolved_p
|
---|
377 | Return whether a symbol has been resolved.
|
---|
378 |
|
---|
379 | @item symbol_section_p
|
---|
380 | @cindex symbol_section_p
|
---|
381 | Return whether a symbol is a section symbol.
|
---|
382 |
|
---|
383 | @item symbol_equated_p
|
---|
384 | @cindex symbol_equated_p
|
---|
385 | Return whether a symbol is equated to another symbol.
|
---|
386 |
|
---|
387 | @item symbol_constant_p
|
---|
388 | @cindex symbol_constant_p
|
---|
389 | Return whether a symbol has a constant value, including being an offset within
|
---|
390 | some frag.
|
---|
391 |
|
---|
392 | @item symbol_get_bfdsym
|
---|
393 | @cindex symbol_get_bfdsym
|
---|
394 | Return the BFD symbol associated with a symbol.
|
---|
395 |
|
---|
396 | @item symbol_set_bfdsym
|
---|
397 | @cindex symbol_set_bfdsym
|
---|
398 | Set the BFD symbol associated with a symbol.
|
---|
399 |
|
---|
400 | @item symbol_get_obj
|
---|
401 | @cindex symbol_get_obj
|
---|
402 | Return a pointer to the @code{OBJ_SYMFIELD_TYPE} field of a symbol.
|
---|
403 |
|
---|
404 | @item symbol_set_obj
|
---|
405 | @cindex symbol_set_obj
|
---|
406 | Set the @code{OBJ_SYMFIELD_TYPE} field of a symbol.
|
---|
407 |
|
---|
408 | @item symbol_get_tc
|
---|
409 | @cindex symbol_get_tc
|
---|
410 | Return a pointer to the @code{TC_SYMFIELD_TYPE} field of a symbol.
|
---|
411 |
|
---|
412 | @item symbol_set_tc
|
---|
413 | @cindex symbol_set_tc
|
---|
414 | Set the @code{TC_SYMFIELD_TYPE} field of a symbol.
|
---|
415 |
|
---|
416 | @end table
|
---|
417 |
|
---|
418 | When @code{BFD_ASSEMBLER} is defined, GAS attempts to store local
|
---|
419 | symbols--symbols which will not be written to the output file--using a
|
---|
420 | different structure, @code{struct local_symbol}. This structure can only
|
---|
421 | represent symbols whose value is an offset within a frag.
|
---|
422 |
|
---|
423 | Code outside of the symbol handler will always deal with @code{symbolS}
|
---|
424 | structures and use the accessor functions. The accessor functions correctly
|
---|
425 | deal with local symbols. @code{struct local_symbol} is much smaller than
|
---|
426 | @code{symbolS} (which also automatically creates a bfd @code{asymbol}
|
---|
427 | structure), so this saves space when assembling large files.
|
---|
428 |
|
---|
429 | The first field of @code{symbolS} is @code{bsym}, the pointer to the BFD
|
---|
430 | symbol. The first field of @code{struct local_symbol} is a pointer which is
|
---|
431 | always set to NULL. This is how the symbol accessor functions can distinguish
|
---|
432 | local symbols from ordinary symbols. The symbol accessor functions
|
---|
433 | automatically convert a local symbol into an ordinary symbol when necessary.
|
---|
434 |
|
---|
435 | @node Expressions
|
---|
436 | @subsection Expressions
|
---|
437 | @cindex internals, expressions
|
---|
438 | @cindex expressions, internal
|
---|
439 | @cindex expressionS structure
|
---|
440 |
|
---|
441 | Expressions are stored in an @code{expressionS} structure. The structure is
|
---|
442 | defined in @file{expr.h}.
|
---|
443 |
|
---|
444 | @cindex expression
|
---|
445 | The macro @code{expression} will create an @code{expressionS} structure based
|
---|
446 | on the text found at the global variable @code{input_line_pointer}.
|
---|
447 |
|
---|
448 | @cindex make_expr_symbol
|
---|
449 | @cindex expr_symbol_where
|
---|
450 | A single @code{expressionS} structure can represent a single operation.
|
---|
451 | Complex expressions are formed by creating @dfn{expression symbols} and
|
---|
452 | combining them in @code{expressionS} structures. An expression symbol is
|
---|
453 | created by calling @code{make_expr_symbol}. An expression symbol should
|
---|
454 | naturally never appear in a symbol table, and the implementation of
|
---|
455 | @code{S_IS_LOCAL} (@pxref{Symbols}) reflects that. The function
|
---|
456 | @code{expr_symbol_where} returns non-zero if a symbol is an expression symbol,
|
---|
457 | and also returns the file and line for the expression which caused it to be
|
---|
458 | created.
|
---|
459 |
|
---|
460 | The @code{expressionS} structure has two symbol fields, a number field, an
|
---|
461 | operator field, and a field indicating whether the number is unsigned.
|
---|
462 |
|
---|
463 | The operator field is of type @code{operatorT}, and describes how to interpret
|
---|
464 | the other fields; see the definition in @file{expr.h} for the possibilities.
|
---|
465 |
|
---|
466 | An @code{operatorT} value of @code{O_big} indicates either a floating point
|
---|
467 | number, stored in the global variable @code{generic_floating_point_number}, or
|
---|
468 | an integer too large to store in an @code{offsetT} type, stored in the global
|
---|
469 | array @code{generic_bignum}. This rather inflexible approach makes it
|
---|
470 | impossible to use floating point numbers or large expressions in complex
|
---|
471 | expressions.
|
---|
472 |
|
---|
473 | @node Fixups
|
---|
474 | @subsection Fixups
|
---|
475 | @cindex internals, fixups
|
---|
476 | @cindex fixups
|
---|
477 | @cindex fixS structure
|
---|
478 |
|
---|
479 | A @dfn{fixup} is basically anything which can not be resolved in the first
|
---|
480 | pass. Sometimes a fixup can be resolved by the end of the assembly; if not,
|
---|
481 | the fixup becomes a relocation entry in the object file.
|
---|
482 |
|
---|
483 | @cindex fix_new
|
---|
484 | @cindex fix_new_exp
|
---|
485 | A fixup is created by a call to @code{fix_new} or @code{fix_new_exp}. Both
|
---|
486 | take a frag (@pxref{Frags}), a position within the frag, a size, an indication
|
---|
487 | of whether the fixup is PC relative, and a type. In a @code{BFD_ASSEMBLER}
|
---|
488 | GAS, the type is nominally a @code{bfd_reloc_code_real_type}, but several
|
---|
489 | targets use other type codes to represent fixups that can not be described as
|
---|
490 | relocations.
|
---|
491 |
|
---|
492 | The @code{fixS} structure has a number of fields, several of which are obsolete
|
---|
493 | or are only used by a particular target. The important fields are:
|
---|
494 |
|
---|
495 | @table @code
|
---|
496 | @item fx_frag
|
---|
497 | The frag (@pxref{Frags}) this fixup is in.
|
---|
498 |
|
---|
499 | @item fx_where
|
---|
500 | The location within the frag where the fixup occurs.
|
---|
501 |
|
---|
502 | @item fx_addsy
|
---|
503 | The symbol this fixup is against. Typically, the value of this symbol is added
|
---|
504 | into the object contents. This may be NULL.
|
---|
505 |
|
---|
506 | @item fx_subsy
|
---|
507 | The value of this symbol is subtracted from the object contents. This is
|
---|
508 | normally NULL.
|
---|
509 |
|
---|
510 | @item fx_offset
|
---|
511 | A number which is added into the fixup.
|
---|
512 |
|
---|
513 | @item fx_addnumber
|
---|
514 | Some CPU backends use this field to convey information between
|
---|
515 | @code{md_apply_fix3} and @code{tc_gen_reloc}. The machine independent code does
|
---|
516 | not use it.
|
---|
517 |
|
---|
518 | @item fx_next
|
---|
519 | The next fixup in the section.
|
---|
520 |
|
---|
521 | @item fx_r_type
|
---|
522 | The type of the fixup. This field is only defined if @code{BFD_ASSEMBLER}, or
|
---|
523 | if the target defines @code{NEED_FX_R_TYPE}.
|
---|
524 |
|
---|
525 | @item fx_size
|
---|
526 | The size of the fixup. This is mostly used for error checking.
|
---|
527 |
|
---|
528 | @item fx_pcrel
|
---|
529 | Whether the fixup is PC relative.
|
---|
530 |
|
---|
531 | @item fx_done
|
---|
532 | Non-zero if the fixup has been applied, and no relocation entry needs to be
|
---|
533 | generated.
|
---|
534 |
|
---|
535 | @item fx_file
|
---|
536 | @itemx fx_line
|
---|
537 | The file and line where the fixup was created.
|
---|
538 |
|
---|
539 | @item tc_fix_data
|
---|
540 | This has the type @code{TC_FIX_TYPE}, and is only defined if the target defines
|
---|
541 | that macro.
|
---|
542 | @end table
|
---|
543 |
|
---|
544 | @node Frags
|
---|
545 | @subsection Frags
|
---|
546 | @cindex internals, frags
|
---|
547 | @cindex frags
|
---|
548 | @cindex fragS structure.
|
---|
549 |
|
---|
550 | The @code{fragS} structure is defined in @file{as.h}. Each frag represents a
|
---|
551 | portion of the final object file. As GAS reads the source file, it creates
|
---|
552 | frags to hold the data that it reads. At the end of the assembly the frags and
|
---|
553 | fixups are processed to produce the final contents.
|
---|
554 |
|
---|
555 | @table @code
|
---|
556 | @item fr_address
|
---|
557 | The address of the frag. This is not set until the assembler rescans the list
|
---|
558 | of all frags after the entire input file is parsed. The function
|
---|
559 | @code{relax_segment} fills in this field.
|
---|
560 |
|
---|
561 | @item fr_next
|
---|
562 | Pointer to the next frag in this (sub)section.
|
---|
563 |
|
---|
564 | @item fr_fix
|
---|
565 | Fixed number of characters we know we're going to emit to the output file. May
|
---|
566 | be zero.
|
---|
567 |
|
---|
568 | @item fr_var
|
---|
569 | Variable number of characters we may output, after the initial @code{fr_fix}
|
---|
570 | characters. May be zero.
|
---|
571 |
|
---|
572 | @item fr_offset
|
---|
573 | The interpretation of this field is controlled by @code{fr_type}. Generally,
|
---|
574 | if @code{fr_var} is non-zero, this is a repeat count: the @code{fr_var}
|
---|
575 | characters are output @code{fr_offset} times.
|
---|
576 |
|
---|
577 | @item line
|
---|
578 | Holds line number info when an assembler listing was requested.
|
---|
579 |
|
---|
580 | @item fr_type
|
---|
581 | Relaxation state. This field indicates the interpretation of @code{fr_offset},
|
---|
582 | @code{fr_symbol} and the variable-length tail of the frag, as well as the
|
---|
583 | treatment it gets in various phases of processing. It does not affect the
|
---|
584 | initial @code{fr_fix} characters; they are always supposed to be output
|
---|
585 | verbatim (fixups aside). See below for specific values this field can have.
|
---|
586 |
|
---|
587 | @item fr_subtype
|
---|
588 | Relaxation substate. If the macro @code{md_relax_frag} isn't defined, this is
|
---|
589 | assumed to be an index into @code{TC_GENERIC_RELAX_TABLE} for the generic
|
---|
590 | relaxation code to process (@pxref{Relaxation}). If @code{md_relax_frag} is
|
---|
591 | defined, this field is available for any use by the CPU-specific code.
|
---|
592 |
|
---|
593 | @item fr_symbol
|
---|
594 | This normally indicates the symbol to use when relaxing the frag according to
|
---|
595 | @code{fr_type}.
|
---|
596 |
|
---|
597 | @item fr_opcode
|
---|
598 | Points to the lowest-addressed byte of the opcode, for use in relaxation.
|
---|
599 |
|
---|
600 | @item tc_frag_data
|
---|
601 | Target specific fragment data of type TC_FRAG_TYPE.
|
---|
602 | Only present if @code{TC_FRAG_TYPE} is defined.
|
---|
603 |
|
---|
604 | @item fr_file
|
---|
605 | @itemx fr_line
|
---|
606 | The file and line where this frag was last modified.
|
---|
607 |
|
---|
608 | @item fr_literal
|
---|
609 | Declared as a one-character array, this last field grows arbitrarily large to
|
---|
610 | hold the actual contents of the frag.
|
---|
611 | @end table
|
---|
612 |
|
---|
613 | These are the possible relaxation states, provided in the enumeration type
|
---|
614 | @code{relax_stateT}, and the interpretations they represent for the other
|
---|
615 | fields:
|
---|
616 |
|
---|
617 | @table @code
|
---|
618 | @item rs_align
|
---|
619 | @itemx rs_align_code
|
---|
620 | The start of the following frag should be aligned on some boundary. In this
|
---|
621 | frag, @code{fr_offset} is the logarithm (base 2) of the alignment in bytes.
|
---|
622 | (For example, if alignment on an 8-byte boundary were desired, @code{fr_offset}
|
---|
623 | would have a value of 3.) The variable characters indicate the fill pattern to
|
---|
624 | be used. The @code{fr_subtype} field holds the maximum number of bytes to skip
|
---|
625 | when doing this alignment. If more bytes are needed, the alignment is not
|
---|
626 | done. An @code{fr_subtype} value of 0 means no maximum, which is the normal
|
---|
627 | case. Target backends can use @code{rs_align_code} to handle certain types of
|
---|
628 | alignment differently.
|
---|
629 |
|
---|
630 | @item rs_broken_word
|
---|
631 | This indicates that ``broken word'' processing should be done (@pxref{Broken
|
---|
632 | words}). If broken word processing is not necessary on the target machine,
|
---|
633 | this enumerator value will not be defined.
|
---|
634 |
|
---|
635 | @item rs_cfa
|
---|
636 | This state is used to implement exception frame optimizations. The
|
---|
637 | @code{fr_symbol} is an expression symbol for the subtraction which may be
|
---|
638 | relaxed. The @code{fr_opcode} field holds the frag for the preceding command
|
---|
639 | byte. The @code{fr_offset} field holds the offset within that frag. The
|
---|
640 | @code{fr_subtype} field is used during relaxation to hold the current size of
|
---|
641 | the frag.
|
---|
642 |
|
---|
643 | @item rs_fill
|
---|
644 | The variable characters are to be repeated @code{fr_offset} times. If
|
---|
645 | @code{fr_offset} is 0, this frag has a length of @code{fr_fix}. Most frags
|
---|
646 | have this type.
|
---|
647 |
|
---|
648 | @item rs_leb128
|
---|
649 | This state is used to implement the DWARF ``little endian base 128''
|
---|
650 | variable length number format. The @code{fr_symbol} is always an expression
|
---|
651 | symbol, as constant expressions are emitted directly. The @code{fr_offset}
|
---|
652 | field is used during relaxation to hold the previous size of the number so
|
---|
653 | that we can determine if the fragment changed size.
|
---|
654 |
|
---|
655 | @item rs_machine_dependent
|
---|
656 | Displacement relaxation is to be done on this frag. The target is indicated by
|
---|
657 | @code{fr_symbol} and @code{fr_offset}, and @code{fr_subtype} indicates the
|
---|
658 | particular machine-specific addressing mode desired. @xref{Relaxation}.
|
---|
659 |
|
---|
660 | @item rs_org
|
---|
661 | The start of the following frag should be pushed back to some specific offset
|
---|
662 | within the section. (Some assemblers use the value as an absolute address; GAS
|
---|
663 | does not handle final absolute addresses, but rather requires that the linker
|
---|
664 | set them.) The offset is given by @code{fr_symbol} and @code{fr_offset}; one
|
---|
665 | character from the variable-length tail is used as the fill character.
|
---|
666 | @end table
|
---|
667 |
|
---|
668 | @cindex frchainS structure
|
---|
669 | A chain of frags is built up for each subsection. The data structure
|
---|
670 | describing a chain is called a @code{frchainS}, and contains the following
|
---|
671 | fields:
|
---|
672 |
|
---|
673 | @table @code
|
---|
674 | @item frch_root
|
---|
675 | Points to the first frag in the chain. May be NULL if there are no frags in
|
---|
676 | this chain.
|
---|
677 | @item frch_last
|
---|
678 | Points to the last frag in the chain, or NULL if there are none.
|
---|
679 | @item frch_next
|
---|
680 | Next in the list of @code{frchainS} structures.
|
---|
681 | @item frch_seg
|
---|
682 | Indicates the section this frag chain belongs to.
|
---|
683 | @item frch_subseg
|
---|
684 | Subsection (subsegment) number of this frag chain.
|
---|
685 | @item fix_root, fix_tail
|
---|
686 | (Defined only if @code{BFD_ASSEMBLER} is defined). Point to first and last
|
---|
687 | @code{fixS} structures associated with this subsection.
|
---|
688 | @item frch_obstack
|
---|
689 | Not currently used. Intended to be used for frag allocation for this
|
---|
690 | subsection. This should reduce frag generation caused by switching sections.
|
---|
691 | @item frch_frag_now
|
---|
692 | The current frag for this subsegment.
|
---|
693 | @end table
|
---|
694 |
|
---|
695 | A @code{frchainS} corresponds to a subsection; each section has a list of
|
---|
696 | @code{frchainS} records associated with it. In most cases, only one subsection
|
---|
697 | of each section is used, so the list will only be one element long, but any
|
---|
698 | processing of frag chains should be prepared to deal with multiple chains per
|
---|
699 | section.
|
---|
700 |
|
---|
701 | After the input files have been completely processed, and no more frags are to
|
---|
702 | be generated, the frag chains are joined into one per section for further
|
---|
703 | processing. After this point, it is safe to operate on one chain per section.
|
---|
704 |
|
---|
705 | The assembler always has a current frag, named @code{frag_now}. More space is
|
---|
706 | allocated for the current frag using the @code{frag_more} function; this
|
---|
707 | returns a pointer to the amount of requested space. Relaxing is done using
|
---|
708 | variant frags allocated by @code{frag_var} or @code{frag_variant}
|
---|
709 | (@pxref{Relaxation}).
|
---|
710 |
|
---|
711 | @node GAS processing
|
---|
712 | @section What GAS does when it runs
|
---|
713 | @cindex internals, overview
|
---|
714 |
|
---|
715 | This is a quick look at what an assembler run looks like.
|
---|
716 |
|
---|
717 | @itemize @bullet
|
---|
718 | @item
|
---|
719 | The assembler initializes itself by calling various init routines.
|
---|
720 |
|
---|
721 | @item
|
---|
722 | For each source file, the @code{read_a_source_file} function reads in the file
|
---|
723 | and parses it. The global variable @code{input_line_pointer} points to the
|
---|
724 | current text; it is guaranteed to be correct up to the end of the line, but not
|
---|
725 | farther.
|
---|
726 |
|
---|
727 | @item
|
---|
728 | For each line, the assembler passes labels to the @code{colon} function, and
|
---|
729 | isolates the first word. If it looks like a pseudo-op, the word is looked up
|
---|
730 | in the pseudo-op hash table @code{po_hash} and dispatched to a pseudo-op
|
---|
731 | routine. Otherwise, the target dependent @code{md_assemble} routine is called
|
---|
732 | to parse the instruction.
|
---|
733 |
|
---|
734 | @item
|
---|
735 | When pseudo-ops or instructions output data, they add it to a frag, calling
|
---|
736 | @code{frag_more} to get space to store it in.
|
---|
737 |
|
---|
738 | @item
|
---|
739 | Pseudo-ops and instructions can also output fixups created by @code{fix_new} or
|
---|
740 | @code{fix_new_exp}.
|
---|
741 |
|
---|
742 | @item
|
---|
743 | For certain targets, instructions can create variant frags which are used to
|
---|
744 | store relaxation information (@pxref{Relaxation}).
|
---|
745 |
|
---|
746 | @item
|
---|
747 | When the input file is finished, the @code{write_object_file} routine is
|
---|
748 | called. It assigns addresses to all the frags (@code{relax_segment}), resolves
|
---|
749 | all the fixups (@code{fixup_segment}), resolves all the symbol values (using
|
---|
750 | @code{resolve_symbol_value}), and finally writes out the file (in the
|
---|
751 | @code{BFD_ASSEMBLER} case, this is done by simply calling @code{bfd_close}).
|
---|
752 | @end itemize
|
---|
753 |
|
---|
754 | @node Porting GAS
|
---|
755 | @section Porting GAS
|
---|
756 | @cindex porting
|
---|
757 |
|
---|
758 | Each GAS target specifies two main things: the CPU file and the object format
|
---|
759 | file. Two main switches in the @file{configure.in} file handle this. The
|
---|
760 | first switches on CPU type to set the shell variable @code{cpu_type}. The
|
---|
761 | second switches on the entire target to set the shell variable @code{fmt}.
|
---|
762 |
|
---|
763 | The configure script uses the value of @code{cpu_type} to select two files in
|
---|
764 | the @file{config} directory: @file{tc-@var{CPU}.c} and @file{tc-@var{CPU}.h}.
|
---|
765 | The configuration process will create a file named @file{targ-cpu.h} in the
|
---|
766 | build directory which includes @file{tc-@var{CPU}.h}.
|
---|
767 |
|
---|
768 | The configure script also uses the value of @code{fmt} to select two files:
|
---|
769 | @file{obj-@var{fmt}.c} and @file{obj-@var{fmt}.h}. The configuration process
|
---|
770 | will create a file named @file{obj-format.h} in the build directory which
|
---|
771 | includes @file{obj-@var{fmt}.h}.
|
---|
772 |
|
---|
773 | You can also set the emulation in the configure script by setting the @code{em}
|
---|
774 | variable. Normally the default value of @samp{generic} is fine. The
|
---|
775 | configuration process will create a file named @file{targ-env.h} in the build
|
---|
776 | directory which includes @file{te-@var{em}.h}.
|
---|
777 |
|
---|
778 | There is a special case for COFF. For historical reason, the GNU COFF
|
---|
779 | assembler doesn't follow the documented behavior on certain debug symbols for
|
---|
780 | the compatibility with other COFF assemblers. A port can define
|
---|
781 | @code{STRICTCOFF} in the configure script to make the GNU COFF assembler
|
---|
782 | to follow the documented behavior.
|
---|
783 |
|
---|
784 | Porting GAS to a new CPU requires writing the @file{tc-@var{CPU}} files.
|
---|
785 | Porting GAS to a new object file format requires writing the
|
---|
786 | @file{obj-@var{fmt}} files. There is sometimes some interaction between these
|
---|
787 | two files, but it is normally minimal.
|
---|
788 |
|
---|
789 | The best approach is, of course, to copy existing files. The documentation
|
---|
790 | below assumes that you are looking at existing files to see usage details.
|
---|
791 |
|
---|
792 | These interfaces have grown over time, and have never been carefully thought
|
---|
793 | out or designed. Nothing about the interfaces described here is cast in stone.
|
---|
794 | It is possible that they will change from one version of the assembler to the
|
---|
795 | next. Also, new macros are added all the time as they are needed.
|
---|
796 |
|
---|
797 | @menu
|
---|
798 | * CPU backend:: Writing a CPU backend
|
---|
799 | * Object format backend:: Writing an object format backend
|
---|
800 | * Emulations:: Writing emulation files
|
---|
801 | @end menu
|
---|
802 |
|
---|
803 | @node CPU backend
|
---|
804 | @subsection Writing a CPU backend
|
---|
805 | @cindex CPU backend
|
---|
806 | @cindex @file{tc-@var{CPU}}
|
---|
807 |
|
---|
808 | The CPU backend files are the heart of the assembler. They are the only parts
|
---|
809 | of the assembler which actually know anything about the instruction set of the
|
---|
810 | processor.
|
---|
811 |
|
---|
812 | You must define a reasonably small list of macros and functions in the CPU
|
---|
813 | backend files. You may define a large number of additional macros in the CPU
|
---|
814 | backend files, not all of which are documented here. You must, of course,
|
---|
815 | define macros in the @file{.h} file, which is included by every assembler
|
---|
816 | source file. You may define the functions as macros in the @file{.h} file, or
|
---|
817 | as functions in the @file{.c} file.
|
---|
818 |
|
---|
819 | @table @code
|
---|
820 | @item TC_@var{CPU}
|
---|
821 | @cindex TC_@var{CPU}
|
---|
822 | By convention, you should define this macro in the @file{.h} file. For
|
---|
823 | example, @file{tc-m68k.h} defines @code{TC_M68K}. You might have to use this
|
---|
824 | if it is necessary to add CPU specific code to the object format file.
|
---|
825 |
|
---|
826 | @item TARGET_FORMAT
|
---|
827 | This macro is the BFD target name to use when creating the output file. This
|
---|
828 | will normally depend upon the @code{OBJ_@var{FMT}} macro.
|
---|
829 |
|
---|
830 | @item TARGET_ARCH
|
---|
831 | This macro is the BFD architecture to pass to @code{bfd_set_arch_mach}.
|
---|
832 |
|
---|
833 | @item TARGET_MACH
|
---|
834 | This macro is the BFD machine number to pass to @code{bfd_set_arch_mach}. If
|
---|
835 | it is not defined, GAS will use 0.
|
---|
836 |
|
---|
837 | @item TARGET_BYTES_BIG_ENDIAN
|
---|
838 | You should define this macro to be non-zero if the target is big endian, and
|
---|
839 | zero if the target is little endian.
|
---|
840 |
|
---|
841 | @item md_shortopts
|
---|
842 | @itemx md_longopts
|
---|
843 | @itemx md_longopts_size
|
---|
844 | @itemx md_parse_option
|
---|
845 | @itemx md_show_usage
|
---|
846 | @itemx md_after_parse_args
|
---|
847 | @cindex md_shortopts
|
---|
848 | @cindex md_longopts
|
---|
849 | @cindex md_longopts_size
|
---|
850 | @cindex md_parse_option
|
---|
851 | @cindex md_show_usage
|
---|
852 | @cindex md_after_parse_args
|
---|
853 | GAS uses these variables and functions during option processing.
|
---|
854 | @code{md_shortopts} is a @code{const char *} which GAS adds to the machine
|
---|
855 | independent string passed to @code{getopt}. @code{md_longopts} is a
|
---|
856 | @code{struct option []} which GAS adds to the machine independent long options
|
---|
857 | passed to @code{getopt}; you may use @code{OPTION_MD_BASE}, defined in
|
---|
858 | @file{as.h}, as the start of a set of long option indices, if necessary.
|
---|
859 | @code{md_longopts_size} is a @code{size_t} holding the size @code{md_longopts}.
|
---|
860 | GAS will call @code{md_parse_option} whenever @code{getopt} returns an
|
---|
861 | unrecognized code, presumably indicating a special code value which appears in
|
---|
862 | @code{md_longopts}. GAS will call @code{md_show_usage} when a usage message is
|
---|
863 | printed; it should print a description of the machine specific options.
|
---|
864 | @code{md_after_pase_args}, if defined, is called after all options are
|
---|
865 | processed, to let the backend override settings done by the generic option
|
---|
866 | parsing.
|
---|
867 |
|
---|
868 | @item md_begin
|
---|
869 | @cindex md_begin
|
---|
870 | GAS will call this function at the start of the assembly, after the command
|
---|
871 | line arguments have been parsed and all the machine independent initializations
|
---|
872 | have been completed.
|
---|
873 |
|
---|
874 | @item md_cleanup
|
---|
875 | @cindex md_cleanup
|
---|
876 | If you define this macro, GAS will call it at the end of each input file.
|
---|
877 |
|
---|
878 | @item md_assemble
|
---|
879 | @cindex md_assemble
|
---|
880 | GAS will call this function for each input line which does not contain a
|
---|
881 | pseudo-op. The argument is a null terminated string. The function should
|
---|
882 | assemble the string as an instruction with operands. Normally
|
---|
883 | @code{md_assemble} will do this by calling @code{frag_more} and writing out
|
---|
884 | some bytes (@pxref{Frags}). @code{md_assemble} will call @code{fix_new} to
|
---|
885 | create fixups as needed (@pxref{Fixups}). Targets which need to do special
|
---|
886 | purpose relaxation will call @code{frag_var}.
|
---|
887 |
|
---|
888 | @item md_pseudo_table
|
---|
889 | @cindex md_pseudo_table
|
---|
890 | This is a const array of type @code{pseudo_typeS}. It is a mapping from
|
---|
891 | pseudo-op names to functions. You should use this table to implement
|
---|
892 | pseudo-ops which are specific to the CPU.
|
---|
893 |
|
---|
894 | @item tc_conditional_pseudoop
|
---|
895 | @cindex tc_conditional_pseudoop
|
---|
896 | If this macro is defined, GAS will call it with a @code{pseudo_typeS} argument.
|
---|
897 | It should return non-zero if the pseudo-op is a conditional which controls
|
---|
898 | whether code is assembled, such as @samp{.if}. GAS knows about the normal
|
---|
899 | conditional pseudo-ops, and you should normally not have to define this macro.
|
---|
900 |
|
---|
901 | @item comment_chars
|
---|
902 | @cindex comment_chars
|
---|
903 | This is a null terminated @code{const char} array of characters which start a
|
---|
904 | comment.
|
---|
905 |
|
---|
906 | @item tc_comment_chars
|
---|
907 | @cindex tc_comment_chars
|
---|
908 | If this macro is defined, GAS will use it instead of @code{comment_chars}.
|
---|
909 |
|
---|
910 | @item tc_symbol_chars
|
---|
911 | @cindex tc_symbol_chars
|
---|
912 | If this macro is defined, it is a pointer to a null terminated list of
|
---|
913 | characters which may appear in an operand. GAS already assumes that all
|
---|
914 | alphanumberic characters, and @samp{$}, @samp{.}, and @samp{_} may appear in an
|
---|
915 | operand (see @samp{symbol_chars} in @file{app.c}). This macro may be defined
|
---|
916 | to treat additional characters as appearing in an operand. This affects the
|
---|
917 | way in which GAS removes whitespace before passing the string to
|
---|
918 | @samp{md_assemble}.
|
---|
919 |
|
---|
920 | @item line_comment_chars
|
---|
921 | @cindex line_comment_chars
|
---|
922 | This is a null terminated @code{const char} array of characters which start a
|
---|
923 | comment when they appear at the start of a line.
|
---|
924 |
|
---|
925 | @item line_separator_chars
|
---|
926 | @cindex line_separator_chars
|
---|
927 | This is a null terminated @code{const char} array of characters which separate
|
---|
928 | lines (null and newline are such characters by default, and need not be
|
---|
929 | listed in this array). Note that line_separator_chars do not separate lines
|
---|
930 | if found in a comment, such as after a character in line_comment_chars or
|
---|
931 | comment_chars.
|
---|
932 |
|
---|
933 | @item EXP_CHARS
|
---|
934 | @cindex EXP_CHARS
|
---|
935 | This is a null terminated @code{const char} array of characters which may be
|
---|
936 | used as the exponent character in a floating point number. This is normally
|
---|
937 | @code{"eE"}.
|
---|
938 |
|
---|
939 | @item FLT_CHARS
|
---|
940 | @cindex FLT_CHARS
|
---|
941 | This is a null terminated @code{const char} array of characters which may be
|
---|
942 | used to indicate a floating point constant. A zero followed by one of these
|
---|
943 | characters is assumed to be followed by a floating point number; thus they
|
---|
944 | operate the way that @code{0x} is used to indicate a hexadecimal constant.
|
---|
945 | Usually this includes @samp{r} and @samp{f}.
|
---|
946 |
|
---|
947 | @item LEX_AT
|
---|
948 | @cindex LEX_AT
|
---|
949 | You may define this macro to the lexical type of the @kbd{@@} character. The
|
---|
950 | default is zero.
|
---|
951 |
|
---|
952 | Lexical types are a combination of @code{LEX_NAME} and @code{LEX_BEGIN_NAME},
|
---|
953 | both defined in @file{read.h}. @code{LEX_NAME} indicates that the character
|
---|
954 | may appear in a name. @code{LEX_BEGIN_NAME} indicates that the character may
|
---|
955 | appear at the beginning of a name.
|
---|
956 |
|
---|
957 | @item LEX_BR
|
---|
958 | @cindex LEX_BR
|
---|
959 | You may define this macro to the lexical type of the brace characters @kbd{@{},
|
---|
960 | @kbd{@}}, @kbd{[}, and @kbd{]}. The default value is zero.
|
---|
961 |
|
---|
962 | @item LEX_PCT
|
---|
963 | @cindex LEX_PCT
|
---|
964 | You may define this macro to the lexical type of the @kbd{%} character. The
|
---|
965 | default value is zero.
|
---|
966 |
|
---|
967 | @item LEX_QM
|
---|
968 | @cindex LEX_QM
|
---|
969 | You may define this macro to the lexical type of the @kbd{?} character. The
|
---|
970 | default value it zero.
|
---|
971 |
|
---|
972 | @item LEX_DOLLAR
|
---|
973 | @cindex LEX_DOLLAR
|
---|
974 | You may define this macro to the lexical type of the @kbd{$} character. The
|
---|
975 | default value is @code{LEX_NAME | LEX_BEGIN_NAME}.
|
---|
976 |
|
---|
977 | @item NUMBERS_WITH_SUFFIX
|
---|
978 | @cindex NUMBERS_WITH_SUFFIX
|
---|
979 | When this macro is defined to be non-zero, the parser allows the radix of a
|
---|
980 | constant to be indicated with a suffix. Valid suffixes are binary (B),
|
---|
981 | octal (Q), and hexadecimal (H). Case is not significant.
|
---|
982 |
|
---|
983 | @item SINGLE_QUOTE_STRINGS
|
---|
984 | @cindex SINGLE_QUOTE_STRINGS
|
---|
985 | If you define this macro, GAS will treat single quotes as string delimiters.
|
---|
986 | Normally only double quotes are accepted as string delimiters.
|
---|
987 |
|
---|
988 | @item NO_STRING_ESCAPES
|
---|
989 | @cindex NO_STRING_ESCAPES
|
---|
990 | If you define this macro, GAS will not permit escape sequences in a string.
|
---|
991 |
|
---|
992 | @item ONLY_STANDARD_ESCAPES
|
---|
993 | @cindex ONLY_STANDARD_ESCAPES
|
---|
994 | If you define this macro, GAS will warn about the use of nonstandard escape
|
---|
995 | sequences in a string.
|
---|
996 |
|
---|
997 | @item md_start_line_hook
|
---|
998 | @cindex md_start_line_hook
|
---|
999 | If you define this macro, GAS will call it at the start of each line.
|
---|
1000 |
|
---|
1001 | @item LABELS_WITHOUT_COLONS
|
---|
1002 | @cindex LABELS_WITHOUT_COLONS
|
---|
1003 | If you define this macro, GAS will assume that any text at the start of a line
|
---|
1004 | is a label, even if it does not have a colon.
|
---|
1005 |
|
---|
1006 | @item TC_START_LABEL
|
---|
1007 | @itemx TC_START_LABEL_WITHOUT_COLON
|
---|
1008 | @cindex TC_START_LABEL
|
---|
1009 | You may define this macro to control what GAS considers to be a label. The
|
---|
1010 | default definition is to accept any name followed by a colon character.
|
---|
1011 |
|
---|
1012 | @item TC_START_LABEL_WITHOUT_COLON
|
---|
1013 | @cindex TC_START_LABEL_WITHOUT_COLON
|
---|
1014 | Same as TC_START_LABEL, but should be used instead of TC_START_LABEL when
|
---|
1015 | LABELS_WITHOUT_COLONS is defined.
|
---|
1016 |
|
---|
1017 | @item NO_PSEUDO_DOT
|
---|
1018 | @cindex NO_PSEUDO_DOT
|
---|
1019 | If you define this macro, GAS will not require pseudo-ops to start with a
|
---|
1020 | @kbd{.} character.
|
---|
1021 |
|
---|
1022 | @item TC_EQUAL_IN_INSN
|
---|
1023 | @cindex TC_EQUAL_IN_INSN
|
---|
1024 | If you define this macro, it should return nonzero if the instruction is
|
---|
1025 | permitted to contain an @kbd{=} character. GAS will call it with two
|
---|
1026 | arguments, the character before the @kbd{=} character, and the value of
|
---|
1027 | @code{input_line_pointer} at that point. GAS uses this macro to decide if a
|
---|
1028 | @kbd{=} is an assignment or an instruction.
|
---|
1029 |
|
---|
1030 | @item TC_EOL_IN_INSN
|
---|
1031 | @cindex TC_EOL_IN_INSN
|
---|
1032 | If you define this macro, it should return nonzero if the current input line
|
---|
1033 | pointer should be treated as the end of a line.
|
---|
1034 |
|
---|
1035 | @item md_parse_name
|
---|
1036 | @cindex md_parse_name
|
---|
1037 | If this macro is defined, GAS will call it for any symbol found in an
|
---|
1038 | expression. You can define this to handle special symbols in a special way.
|
---|
1039 | If a symbol always has a certain value, you should normally enter it in the
|
---|
1040 | symbol table, perhaps using @code{reg_section}.
|
---|
1041 |
|
---|
1042 | @item md_undefined_symbol
|
---|
1043 | @cindex md_undefined_symbol
|
---|
1044 | GAS will call this function when a symbol table lookup fails, before it
|
---|
1045 | creates a new symbol. Typically this would be used to supply symbols whose
|
---|
1046 | name or value changes dynamically, possibly in a context sensitive way.
|
---|
1047 | Predefined symbols with fixed values, such as register names or condition
|
---|
1048 | codes, are typically entered directly into the symbol table when @code{md_begin}
|
---|
1049 | is called. One argument is passed, a @code{char *} for the symbol.
|
---|
1050 |
|
---|
1051 | @item md_operand
|
---|
1052 | @cindex md_operand
|
---|
1053 | GAS will call this function with one argument, an @code{expressionS}
|
---|
1054 | pointer, for any expression that can not be recognized. When the function
|
---|
1055 | is called, @code{input_line_pointer} will point to the start of the
|
---|
1056 | expression.
|
---|
1057 |
|
---|
1058 | @item tc_unrecognized_line
|
---|
1059 | @cindex tc_unrecognized_line
|
---|
1060 | If you define this macro, GAS will call it when it finds a line that it can not
|
---|
1061 | parse.
|
---|
1062 |
|
---|
1063 | @item md_do_align
|
---|
1064 | @cindex md_do_align
|
---|
1065 | You may define this macro to handle an alignment directive. GAS will call it
|
---|
1066 | when the directive is seen in the input file. For example, the i386 backend
|
---|
1067 | uses this to generate efficient nop instructions of varying lengths, depending
|
---|
1068 | upon the number of bytes that the alignment will skip.
|
---|
1069 |
|
---|
1070 | @item HANDLE_ALIGN
|
---|
1071 | @cindex HANDLE_ALIGN
|
---|
1072 | You may define this macro to do special handling for an alignment directive.
|
---|
1073 | GAS will call it at the end of the assembly.
|
---|
1074 |
|
---|
1075 | @item TC_IMPLICIT_LCOMM_ALIGNMENT (@var{size}, @var{p2var})
|
---|
1076 | @cindex TC_IMPLICIT_LCOMM_ALIGNMENT
|
---|
1077 | An @code{.lcomm} directive with no explicit alignment parameter will use this
|
---|
1078 | macro to set @var{p2var} to the alignment that a request for @var{size} bytes
|
---|
1079 | will have. The alignment is expressed as a power of two. If no alignment
|
---|
1080 | should take place, the macro definition should do nothing. Some targets define
|
---|
1081 | a @code{.bss} directive that is also affected by this macro. The default
|
---|
1082 | definition will set @var{p2var} to the truncated power of two of sizes up to
|
---|
1083 | eight bytes.
|
---|
1084 |
|
---|
1085 | @item md_flush_pending_output
|
---|
1086 | @cindex md_flush_pending_output
|
---|
1087 | If you define this macro, GAS will call it each time it skips any space because of a
|
---|
1088 | space filling or alignment or data allocation pseudo-op.
|
---|
1089 |
|
---|
1090 | @item TC_PARSE_CONS_EXPRESSION
|
---|
1091 | @cindex TC_PARSE_CONS_EXPRESSION
|
---|
1092 | You may define this macro to parse an expression used in a data allocation
|
---|
1093 | pseudo-op such as @code{.word}. You can use this to recognize relocation
|
---|
1094 | directives that may appear in such directives.
|
---|
1095 |
|
---|
1096 | @item BITFIELD_CONS_EXPRESSION
|
---|
1097 | @cindex BITFIELD_CONS_EXPRESSION
|
---|
1098 | If you define this macro, GAS will recognize bitfield instructions in data
|
---|
1099 | allocation pseudo-ops, as used on the i960.
|
---|
1100 |
|
---|
1101 | @item REPEAT_CONS_EXPRESSION
|
---|
1102 | @cindex REPEAT_CONS_EXPRESSION
|
---|
1103 | If you define this macro, GAS will recognize repeat counts in data allocation
|
---|
1104 | pseudo-ops, as used on the MIPS.
|
---|
1105 |
|
---|
1106 | @item md_cons_align
|
---|
1107 | @cindex md_cons_align
|
---|
1108 | You may define this macro to do any special alignment before a data allocation
|
---|
1109 | pseudo-op.
|
---|
1110 |
|
---|
1111 | @item TC_CONS_FIX_NEW
|
---|
1112 | @cindex TC_CONS_FIX_NEW
|
---|
1113 | You may define this macro to generate a fixup for a data allocation pseudo-op.
|
---|
1114 |
|
---|
1115 | @item TC_INIT_FIX_DATA (@var{fixp})
|
---|
1116 | @cindex TC_INIT_FIX_DATA
|
---|
1117 | A C statement to initialize the target specific fields of fixup @var{fixp}.
|
---|
1118 | These fields are defined with the @code{TC_FIX_TYPE} macro.
|
---|
1119 |
|
---|
1120 | @item TC_FIX_DATA_PRINT (@var{stream}, @var{fixp})
|
---|
1121 | @cindex TC_FIX_DATA_PRINT
|
---|
1122 | A C statement to output target specific debugging information for
|
---|
1123 | fixup @var{fixp} to @var{stream}. This macro is called by @code{print_fixup}.
|
---|
1124 |
|
---|
1125 | @item TC_FRAG_INIT (@var{fragp})
|
---|
1126 | @cindex TC_FRAG_INIT
|
---|
1127 | A C statement to initialize the target specific fields of frag @var{fragp}.
|
---|
1128 | These fields are defined with the @code{TC_FRAG_TYPE} macro.
|
---|
1129 |
|
---|
1130 | @item md_number_to_chars
|
---|
1131 | @cindex md_number_to_chars
|
---|
1132 | This should just call either @code{number_to_chars_bigendian} or
|
---|
1133 | @code{number_to_chars_littleendian}, whichever is appropriate. On targets like
|
---|
1134 | the MIPS which support options to change the endianness, which function to call
|
---|
1135 | is a runtime decision. On other targets, @code{md_number_to_chars} can be a
|
---|
1136 | simple macro.
|
---|
1137 |
|
---|
1138 | @item md_atof (@var{type},@var{litP},@var{sizeP})
|
---|
1139 | @cindex md_atof
|
---|
1140 | This function is called to convert an ASCII string into a floating point value
|
---|
1141 | in format used by the CPU. It takes three arguments. The first is @var{type}
|
---|
1142 | which is a byte describing the type of floating point number to be created.
|
---|
1143 | Possible values are @var{'f'} or @var{'s'} for single precision, @var{'d'} or
|
---|
1144 | @var{'r'} for double precision and @var{'x'} or @var{'p'} for extended
|
---|
1145 | precision. Either lower or upper case versions of these letters can be used.
|
---|
1146 |
|
---|
1147 | The second parameter is @var{litP} which is a pointer to a byte array where the
|
---|
1148 | converted value should be stored. The third argument is @var{sizeP}, which is
|
---|
1149 | a pointer to a integer that should be filled in with the number of
|
---|
1150 | @var{LITTLENUM}s emitted into the byte array. (@var{LITTLENUM} is defined in
|
---|
1151 | gas/bignum.h). The function should return NULL upon success or an error string
|
---|
1152 | upon failure.
|
---|
1153 |
|
---|
1154 | @item TC_LARGEST_EXPONENT_IS_NORMAL
|
---|
1155 | @cindex TC_LARGEST_EXPONENT_IS_NORMAL (@var{precision})
|
---|
1156 | This macro is used only by @file{atof-ieee.c}. It should evaluate to true
|
---|
1157 | if floats of the given precision use the largest exponent for normal numbers
|
---|
1158 | instead of NaNs and infinities. @var{precision} is @samp{F_PRECISION} for
|
---|
1159 | single precision, @samp{D_PRECISION} for double precision, or
|
---|
1160 | @samp{X_PRECISION} for extended double precision.
|
---|
1161 |
|
---|
1162 | The macro has a default definition which returns 0 for all cases.
|
---|
1163 |
|
---|
1164 | @item md_reloc_size
|
---|
1165 | @cindex md_reloc_size
|
---|
1166 | This variable is only used in the original version of gas (not
|
---|
1167 | @code{BFD_ASSEMBLER} and not @code{MANY_SEGMENTS}). It holds the size of a
|
---|
1168 | relocation entry.
|
---|
1169 |
|
---|
1170 | @item WORKING_DOT_WORD
|
---|
1171 | @itemx md_short_jump_size
|
---|
1172 | @itemx md_long_jump_size
|
---|
1173 | @itemx md_create_short_jump
|
---|
1174 | @itemx md_create_long_jump
|
---|
1175 | @itemx TC_CHECK_ADJUSTED_BROKEN_DOT_WORD
|
---|
1176 | @cindex WORKING_DOT_WORD
|
---|
1177 | @cindex md_short_jump_size
|
---|
1178 | @cindex md_long_jump_size
|
---|
1179 | @cindex md_create_short_jump
|
---|
1180 | @cindex md_create_long_jump
|
---|
1181 | @cindex TC_CHECK_ADJUSTED_BROKEN_DOT_WORD
|
---|
1182 | If @code{WORKING_DOT_WORD} is defined, GAS will not do broken word processing
|
---|
1183 | (@pxref{Broken words}). Otherwise, you should set @code{md_short_jump_size} to
|
---|
1184 | the size of a short jump (a jump that is just long enough to jump around a
|
---|
1185 | number of long jumps) and @code{md_long_jump_size} to the size of a long jump
|
---|
1186 | (a jump that can go anywhere in the function). You should define
|
---|
1187 | @code{md_create_short_jump} to create a short jump around a number of long
|
---|
1188 | jumps, and define @code{md_create_long_jump} to create a long jump.
|
---|
1189 | If defined, the macro TC_CHECK_ADJUSTED_BROKEN_DOT_WORD will be called for each
|
---|
1190 | adjusted word just before the word is output. The macro takes two arguments,
|
---|
1191 | an @code{addressT} with the adjusted word and a pointer to the current
|
---|
1192 | @code{struct broken_word}.
|
---|
1193 |
|
---|
1194 | @item md_estimate_size_before_relax
|
---|
1195 | @cindex md_estimate_size_before_relax
|
---|
1196 | This function returns an estimate of the size of a @code{rs_machine_dependent}
|
---|
1197 | frag before any relaxing is done. It may also create any necessary
|
---|
1198 | relocations.
|
---|
1199 |
|
---|
1200 | @item md_relax_frag
|
---|
1201 | @cindex md_relax_frag
|
---|
1202 | This macro may be defined to relax a frag. GAS will call this with the
|
---|
1203 | segment, the frag, and the change in size of all previous frags;
|
---|
1204 | @code{md_relax_frag} should return the change in size of the frag.
|
---|
1205 | @xref{Relaxation}.
|
---|
1206 |
|
---|
1207 | @item TC_GENERIC_RELAX_TABLE
|
---|
1208 | @cindex TC_GENERIC_RELAX_TABLE
|
---|
1209 | If you do not define @code{md_relax_frag}, you may define
|
---|
1210 | @code{TC_GENERIC_RELAX_TABLE} as a table of @code{relax_typeS} structures. The
|
---|
1211 | machine independent code knows how to use such a table to relax PC relative
|
---|
1212 | references. See @file{tc-m68k.c} for an example. @xref{Relaxation}.
|
---|
1213 |
|
---|
1214 | @item md_prepare_relax_scan
|
---|
1215 | @cindex md_prepare_relax_scan
|
---|
1216 | If defined, it is a C statement that is invoked prior to scanning
|
---|
1217 | the relax table.
|
---|
1218 |
|
---|
1219 | @item LINKER_RELAXING_SHRINKS_ONLY
|
---|
1220 | @cindex LINKER_RELAXING_SHRINKS_ONLY
|
---|
1221 | If you define this macro, and the global variable @samp{linkrelax} is set
|
---|
1222 | (because of a command line option, or unconditionally in @code{md_begin}), a
|
---|
1223 | @samp{.align} directive will cause extra space to be allocated. The linker can
|
---|
1224 | then discard this space when relaxing the section.
|
---|
1225 |
|
---|
1226 | @item TC_LINKRELAX_FIXUP (@var{segT})
|
---|
1227 | @cindex TC_LINKRELAX_FIXUP
|
---|
1228 | If defined, this macro allows control over whether fixups for a
|
---|
1229 | given section will be processed when the @var{linkrelax} variable is
|
---|
1230 | set. The macro is given the N_TYPE bits for the section in its
|
---|
1231 | @var{segT} argument. If the macro evaluates to a non-zero value
|
---|
1232 | then the fixups will be converted into relocs, otherwise they will
|
---|
1233 | be passed to @var{md_apply_fix3} as normal.
|
---|
1234 |
|
---|
1235 | @item md_convert_frag
|
---|
1236 | @cindex md_convert_frag
|
---|
1237 | GAS will call this for each rs_machine_dependent fragment.
|
---|
1238 | The instruction is completed using the data from the relaxation pass.
|
---|
1239 | It may also create any necessary relocations.
|
---|
1240 | @xref{Relaxation}.
|
---|
1241 |
|
---|
1242 | @item TC_FINALIZE_SYMS_BEFORE_SIZE_SEG
|
---|
1243 | @cindex TC_FINALIZE_SYMS_BEFORE_SIZE_SEG
|
---|
1244 | Specifies the value to be assigned to @code{finalize_syms} before the function
|
---|
1245 | @code{size_segs} is called. Since @code{size_segs} calls @code{cvt_frag_to_fill}
|
---|
1246 | which can call @code{md_convert_frag}, this constant governs whether the symbols
|
---|
1247 | accessed in @code{md_convert_frag} will be fully resolved. In particular it
|
---|
1248 | governs whether local symbols will have been resolved, and had their frag
|
---|
1249 | information removed. Depending upon the processing performed by
|
---|
1250 | @code{md_convert_frag} the frag information may or may not be necessary, as may
|
---|
1251 | the resolved values of the symbols. The default value is 1.
|
---|
1252 |
|
---|
1253 | @item TC_VALIDATE_FIX (@var{fixP}, @var{seg}, @var{skip})
|
---|
1254 | @cindex TC_VALIDATE_FIX
|
---|
1255 | This macro is evaluated for each fixup (when @var{linkrelax} is not set).
|
---|
1256 | It may be used to change the fixup in @code{struct fix *@var{fixP}} before
|
---|
1257 | the generic code sees it, or to fully process the fixup. In the latter case,
|
---|
1258 | a @code{goto @var{skip}} will bypass the generic code.
|
---|
1259 |
|
---|
1260 | @item md_apply_fix3 (@var{fixP}, @var{valP}, @var{seg})
|
---|
1261 | @cindex md_apply_fix3
|
---|
1262 | GAS will call this for each fixup that passes the @code{TC_VALIDATE_FIX} test
|
---|
1263 | when @var{linkrelax} is not set. It should store the correct value in the
|
---|
1264 | object file. @code{struct fix *@var{fixP}} is the fixup @code{md_apply_fix3}
|
---|
1265 | is operating on. @code{valueT *@var{valP}} is the value to store into the
|
---|
1266 | object files, or at least is the generic code's best guess. Specifically,
|
---|
1267 | *@var{valP} is the value of the fixup symbol, perhaps modified by
|
---|
1268 | @code{MD_APPLY_SYM_VALUE}, plus @code{@var{fixP}->fx_offset} (symbol addend),
|
---|
1269 | less @code{MD_PCREL_FROM_SECTION} for pc-relative fixups.
|
---|
1270 | @code{segT @var{seg}} is the section the fix is in.
|
---|
1271 | @code{fixup_segment} performs a generic overflow check on *@var{valP} after
|
---|
1272 | @code{md_apply_fix3} returns. If the overflow check is relevant for the target
|
---|
1273 | machine, then @code{md_apply_fix3} should modify *@var{valP}, typically to the
|
---|
1274 | value stored in the object file.
|
---|
1275 |
|
---|
1276 | @item TC_FORCE_RELOCATION (@var{fix})
|
---|
1277 | @cindex TC_FORCE_RELOCATION
|
---|
1278 | If this macro returns non-zero, it guarantees that a relocation will be emitted
|
---|
1279 | even when the value can be resolved locally, as @code{fixup_segment} tries to
|
---|
1280 | reduce the number of relocations emitted. For example, a fixup expression
|
---|
1281 | against an absolute symbol will normally not require a reloc. If undefined,
|
---|
1282 | a default of @w{@code{(S_FORCE_RELOC ((@var{fix})->fx_addsy))}} is used.
|
---|
1283 |
|
---|
1284 | @item TC_FORCE_RELOCATION_ABS (@var{fix})
|
---|
1285 | @cindex TC_FORCE_RELOCATION_ABS
|
---|
1286 | Like @code{TC_FORCE_RELOCATION}, but used only for fixup expressions against an
|
---|
1287 | absolute symbol. If undefined, @code{TC_FORCE_RELOCATION} will be used.
|
---|
1288 |
|
---|
1289 | @item TC_FORCE_RELOCATION_LOCAL (@var{fix})
|
---|
1290 | @cindex TC_FORCE_RELOCATION_LOCAL
|
---|
1291 | Like @code{TC_FORCE_RELOCATION}, but used only for fixup expressions against a
|
---|
1292 | symbol in the current section. If undefined, fixups that are not
|
---|
1293 | @code{fx_pcrel} or @code{fx_plt} or for which @code{TC_FORCE_RELOCATION}
|
---|
1294 | returns non-zero, will emit relocs.
|
---|
1295 |
|
---|
1296 | @item TC_FORCE_RELOCATION_SUB_SAME (@var{fix}, @var{seg})
|
---|
1297 | @cindex TC_FORCE_RELOCATION_SUB_SAME
|
---|
1298 | This macro controls resolution of fixup expressions involving the
|
---|
1299 | difference of two symbols in the same section. If this macro returns zero,
|
---|
1300 | the subtrahend will be resolved and @code{fx_subsy} set to @code{NULL} for
|
---|
1301 | @code{md_apply_fix3}. If undefined, the default of
|
---|
1302 | @w{@code{! SEG_NORMAL (@var{seg}) || TC_FORCE_RELOCATION (@var{fix})}} will
|
---|
1303 | be used.
|
---|
1304 |
|
---|
1305 | @item TC_FORCE_RELOCATION_SUB_ABS (@var{fix})
|
---|
1306 | @cindex TC_FORCE_RELOCATION_SUB_ABS
|
---|
1307 | Like @code{TC_FORCE_RELOCATION_SUB_SAME}, but used when the subtrahend is an
|
---|
1308 | absolute symbol. If the macro is undefined a default of @code{0} is used.
|
---|
1309 |
|
---|
1310 | @item TC_FORCE_RELOCATION_SUB_LOCAL (@var{fix})
|
---|
1311 | @cindex TC_FORCE_RELOCATION_SUB_LOCAL
|
---|
1312 | Like @code{TC_FORCE_RELOCATION_SUB_ABS}, but the subtrahend is a symbol in the
|
---|
1313 | same section as the fixup.
|
---|
1314 |
|
---|
1315 | @item TC_VALIDATE_FIX_SUB (@var{fix})
|
---|
1316 | @cindex TC_VALIDATE_FIX_SUB
|
---|
1317 | This macro is evaluated for any fixup with a @code{fx_subsy} that
|
---|
1318 | @code{fixup_segment} cannot reduce to a number. If the macro returns
|
---|
1319 | @code{false} an error will be reported.
|
---|
1320 |
|
---|
1321 | @item MD_APPLY_SYM_VALUE (@var{fix})
|
---|
1322 | @cindex MD_APPLY_SYM_VALUE
|
---|
1323 | This macro controls whether the symbol value becomes part of the value passed
|
---|
1324 | to @code{md_apply_fix3}. If the macro is undefined, or returns non-zero, the
|
---|
1325 | symbol value will be included. For ELF, a suitable definition might simply be
|
---|
1326 | @code{0}, because ELF relocations don't include the symbol value in the addend.
|
---|
1327 |
|
---|
1328 | @item S_FORCE_RELOC (@var{sym}, @var{strict})
|
---|
1329 | @cindex S_FORCE_RELOC
|
---|
1330 | This macro (or function, for @code{BFD_ASSEMBLER} gas) returns true for symbols
|
---|
1331 | that should not be reduced to section symbols or eliminated from expressions,
|
---|
1332 | because they may be overridden by the linker. ie. for symbols that are
|
---|
1333 | undefined or common, and when @var{strict} is set, weak, or global (for ELF
|
---|
1334 | assemblers that support ELF shared library linking semantics).
|
---|
1335 |
|
---|
1336 | @item EXTERN_FORCE_RELOC
|
---|
1337 | @cindex EXTERN_FORCE_RELOC
|
---|
1338 | This macro controls whether @code{S_FORCE_RELOC} returns true for global
|
---|
1339 | symbols. If undefined, the default is @code{true} for ELF assemblers, and
|
---|
1340 | @code{false} for non-ELF.
|
---|
1341 |
|
---|
1342 | @item tc_gen_reloc
|
---|
1343 | @cindex tc_gen_reloc
|
---|
1344 | A @code{BFD_ASSEMBLER} GAS will call this to generate a reloc. GAS will pass
|
---|
1345 | the resulting reloc to @code{bfd_install_relocation}. This currently works
|
---|
1346 | poorly, as @code{bfd_install_relocation} often does the wrong thing, and
|
---|
1347 | instances of @code{tc_gen_reloc} have been written to work around the problems,
|
---|
1348 | which in turns makes it difficult to fix @code{bfd_install_relocation}.
|
---|
1349 |
|
---|
1350 | @item RELOC_EXPANSION_POSSIBLE
|
---|
1351 | @cindex RELOC_EXPANSION_POSSIBLE
|
---|
1352 | If you define this macro, it means that @code{tc_gen_reloc} may return multiple
|
---|
1353 | relocation entries for a single fixup. In this case, the return value of
|
---|
1354 | @code{tc_gen_reloc} is a pointer to a null terminated array.
|
---|
1355 |
|
---|
1356 | @item MAX_RELOC_EXPANSION
|
---|
1357 | @cindex MAX_RELOC_EXPANSION
|
---|
1358 | You must define this if @code{RELOC_EXPANSION_POSSIBLE} is defined; it
|
---|
1359 | indicates the largest number of relocs which @code{tc_gen_reloc} may return for
|
---|
1360 | a single fixup.
|
---|
1361 |
|
---|
1362 | @item tc_fix_adjustable
|
---|
1363 | @cindex tc_fix_adjustable
|
---|
1364 | You may define this macro to indicate whether a fixup against a locally defined
|
---|
1365 | symbol should be adjusted to be against the section symbol. It should return a
|
---|
1366 | non-zero value if the adjustment is acceptable.
|
---|
1367 |
|
---|
1368 | @item MD_PCREL_FROM_SECTION (@var{fixp}, @var{section})
|
---|
1369 | @cindex MD_PCREL_FROM_SECTION
|
---|
1370 | If you define this macro, it should return the position from which the PC
|
---|
1371 | relative adjustment for a PC relative fixup should be made. On many
|
---|
1372 | processors, the base of a PC relative instruction is the next instruction,
|
---|
1373 | so this macro would return the length of an instruction, plus the address of
|
---|
1374 | the PC relative fixup. The latter can be calculated as
|
---|
1375 | @var{fixp}->fx_where + @var{fixp}->fx_frag->fr_address .
|
---|
1376 |
|
---|
1377 | @item md_pcrel_from
|
---|
1378 | @cindex md_pcrel_from
|
---|
1379 | This is the default value of @code{MD_PCREL_FROM_SECTION}. The difference is
|
---|
1380 | that @code{md_pcrel_from} does not take a section argument.
|
---|
1381 |
|
---|
1382 | @item tc_frob_label
|
---|
1383 | @cindex tc_frob_label
|
---|
1384 | If you define this macro, GAS will call it each time a label is defined.
|
---|
1385 |
|
---|
1386 | @item md_section_align
|
---|
1387 | @cindex md_section_align
|
---|
1388 | GAS will call this function for each section at the end of the assembly, to
|
---|
1389 | permit the CPU backend to adjust the alignment of a section. The function
|
---|
1390 | must take two arguments, a @code{segT} for the section and a @code{valueT}
|
---|
1391 | for the size of the section, and return a @code{valueT} for the rounded
|
---|
1392 | size.
|
---|
1393 |
|
---|
1394 | @item md_macro_start
|
---|
1395 | @cindex md_macro_start
|
---|
1396 | If defined, GAS will call this macro when it starts to include a macro
|
---|
1397 | expansion. @code{macro_nest} indicates the current macro nesting level, which
|
---|
1398 | includes the one being expanded.
|
---|
1399 |
|
---|
1400 | @item md_macro_info
|
---|
1401 | @cindex md_macro_info
|
---|
1402 | If defined, GAS will call this macro after the macro expansion has been
|
---|
1403 | included in the input and after parsing the macro arguments. The single
|
---|
1404 | argument is a pointer to the macro processing's internal representation of the
|
---|
1405 | macro (macro_entry *), which includes expansion of the formal arguments.
|
---|
1406 |
|
---|
1407 | @item md_macro_end
|
---|
1408 | @cindex md_macro_end
|
---|
1409 | Complement to md_macro_start. If defined, it is called when finished
|
---|
1410 | processing an inserted macro expansion, just before decrementing macro_nest.
|
---|
1411 |
|
---|
1412 | @item DOUBLEBAR_PARALLEL
|
---|
1413 | @cindex DOUBLEBAR_PARALLEL
|
---|
1414 | Affects the preprocessor so that lines containing '||' don't have their
|
---|
1415 | whitespace stripped following the double bar. This is useful for targets that
|
---|
1416 | implement parallel instructions.
|
---|
1417 |
|
---|
1418 | @item KEEP_WHITE_AROUND_COLON
|
---|
1419 | @cindex KEEP_WHITE_AROUND_COLON
|
---|
1420 | Normally, whitespace is compressed and removed when, in the presence of the
|
---|
1421 | colon, the adjoining tokens can be distinguished. This option affects the
|
---|
1422 | preprocessor so that whitespace around colons is preserved. This is useful
|
---|
1423 | when colons might be removed from the input after preprocessing but before
|
---|
1424 | assembling, so that adjoining tokens can still be distinguished if there is
|
---|
1425 | whitespace, or concatenated if there is not.
|
---|
1426 |
|
---|
1427 | @item tc_frob_section
|
---|
1428 | @cindex tc_frob_section
|
---|
1429 | If you define this macro, a @code{BFD_ASSEMBLER} GAS will call it for each
|
---|
1430 | section at the end of the assembly.
|
---|
1431 |
|
---|
1432 | @item tc_frob_file_before_adjust
|
---|
1433 | @cindex tc_frob_file_before_adjust
|
---|
1434 | If you define this macro, GAS will call it after the symbol values are
|
---|
1435 | resolved, but before the fixups have been changed from local symbols to section
|
---|
1436 | symbols.
|
---|
1437 |
|
---|
1438 | @item tc_frob_symbol
|
---|
1439 | @cindex tc_frob_symbol
|
---|
1440 | If you define this macro, GAS will call it for each symbol. You can indicate
|
---|
1441 | that the symbol should not be included in the object file by defining this
|
---|
1442 | macro to set its second argument to a non-zero value.
|
---|
1443 |
|
---|
1444 | @item tc_frob_file
|
---|
1445 | @cindex tc_frob_file
|
---|
1446 | If you define this macro, GAS will call it after the symbol table has been
|
---|
1447 | completed, but before the relocations have been generated.
|
---|
1448 |
|
---|
1449 | @item tc_frob_file_after_relocs
|
---|
1450 | If you define this macro, GAS will call it after the relocs have been
|
---|
1451 | generated.
|
---|
1452 |
|
---|
1453 | @item md_post_relax_hook
|
---|
1454 | If you define this macro, GAS will call it after relaxing and sizing the
|
---|
1455 | segments.
|
---|
1456 |
|
---|
1457 | @item LISTING_HEADER
|
---|
1458 | A string to use on the header line of a listing. The default value is simply
|
---|
1459 | @code{"GAS LISTING"}.
|
---|
1460 |
|
---|
1461 | @item LISTING_WORD_SIZE
|
---|
1462 | The number of bytes to put into a word in a listing. This affects the way the
|
---|
1463 | bytes are clumped together in the listing. For example, a value of 2 might
|
---|
1464 | print @samp{1234 5678} where a value of 1 would print @samp{12 34 56 78}. The
|
---|
1465 | default value is 4.
|
---|
1466 |
|
---|
1467 | @item LISTING_LHS_WIDTH
|
---|
1468 | The number of words of data to print on the first line of a listing for a
|
---|
1469 | particular source line, where each word is @code{LISTING_WORD_SIZE} bytes. The
|
---|
1470 | default value is 1.
|
---|
1471 |
|
---|
1472 | @item LISTING_LHS_WIDTH_SECOND
|
---|
1473 | Like @code{LISTING_LHS_WIDTH}, but applying to the second and subsequent line
|
---|
1474 | of the data printed for a particular source line. The default value is 1.
|
---|
1475 |
|
---|
1476 | @item LISTING_LHS_CONT_LINES
|
---|
1477 | The maximum number of continuation lines to print in a listing for a particular
|
---|
1478 | source line. The default value is 4.
|
---|
1479 |
|
---|
1480 | @item LISTING_RHS_WIDTH
|
---|
1481 | The maximum number of characters to print from one line of the input file. The
|
---|
1482 | default value is 100.
|
---|
1483 |
|
---|
1484 | @item TC_COFF_SECTION_DEFAULT_ATTRIBUTES
|
---|
1485 | @cindex TC_COFF_SECTION_DEFAULT_ATTRIBUTES
|
---|
1486 | The COFF @code{.section} directive will use the value of this macro to set
|
---|
1487 | a new section's attributes when a directive has no valid flags or when the
|
---|
1488 | flag is @code{w}. The default value of the macro is @code{SEC_LOAD | SEC_DATA}.
|
---|
1489 |
|
---|
1490 | @item DWARF2_FORMAT ()
|
---|
1491 | @cindex DWARF2_FORMAT
|
---|
1492 | If you define this, it should return one of @code{dwarf2_format_32bit},
|
---|
1493 | @code{dwarf2_format_64bit}, or @code{dwarf2_format_64bit_irix} to indicate
|
---|
1494 | the size of internal DWARF section offsets and the format of the DWARF initial
|
---|
1495 | length fields. When @code{dwarf2_format_32bit} is returned, the initial
|
---|
1496 | length field will be 4 bytes long and section offsets are 32 bits in size.
|
---|
1497 | For @code{dwarf2_format_64bit} and @code{dwarf2_format_64bit_irix}, section
|
---|
1498 | offsets are 64 bits in size, but the initial length field differs. An 8 byte
|
---|
1499 | initial length is indicated by @code{dwarf2_format_64bit_irix} and
|
---|
1500 | @code{dwarf2_format_64bit} indicates a 12 byte initial length field in
|
---|
1501 | which the first four bytes are 0xffffffff and the next 8 bytes are
|
---|
1502 | the section's length.
|
---|
1503 |
|
---|
1504 | If you don't define this, @code{dwarf2_format_32bit} will be used as
|
---|
1505 | the default.
|
---|
1506 |
|
---|
1507 | This define only affects @code{.debug_info} and @code{.debug_line}
|
---|
1508 | sections generated by the assembler. DWARF 2 sections generated by
|
---|
1509 | other tools will be unaffected by this setting.
|
---|
1510 |
|
---|
1511 | @item DWARF2_ADDR_SIZE (@var{bfd})
|
---|
1512 | @cindex DWARF2_ADDR_SIZE
|
---|
1513 | It should return the size of an address, as it should be represented in
|
---|
1514 | debugging info. If you don't define this macro, the default definition uses
|
---|
1515 | the number of bits per address, as defined in @var{bfd}, divided by 8.
|
---|
1516 |
|
---|
1517 | @end table
|
---|
1518 |
|
---|
1519 | @node Object format backend
|
---|
1520 | @subsection Writing an object format backend
|
---|
1521 | @cindex object format backend
|
---|
1522 | @cindex @file{obj-@var{fmt}}
|
---|
1523 |
|
---|
1524 | As with the CPU backend, the object format backend must define a few things,
|
---|
1525 | and may define some other things. The interface to the object format backend
|
---|
1526 | is generally simpler; most of the support for an object file format consists of
|
---|
1527 | defining a number of pseudo-ops.
|
---|
1528 |
|
---|
1529 | The object format @file{.h} file must include @file{targ-cpu.h}.
|
---|
1530 |
|
---|
1531 | This section will only define the @code{BFD_ASSEMBLER} version of GAS. It is
|
---|
1532 | impossible to support a new object file format using any other version anyhow,
|
---|
1533 | as the original GAS version only supports a.out, and the @code{MANY_SEGMENTS}
|
---|
1534 | GAS version only supports COFF.
|
---|
1535 |
|
---|
1536 | @table @code
|
---|
1537 | @item OBJ_@var{format}
|
---|
1538 | @cindex OBJ_@var{format}
|
---|
1539 | By convention, you should define this macro in the @file{.h} file. For
|
---|
1540 | example, @file{obj-elf.h} defines @code{OBJ_ELF}. You might have to use this
|
---|
1541 | if it is necessary to add object file format specific code to the CPU file.
|
---|
1542 |
|
---|
1543 | @item obj_begin
|
---|
1544 | If you define this macro, GAS will call it at the start of the assembly, after
|
---|
1545 | the command line arguments have been parsed and all the machine independent
|
---|
1546 | initializations have been completed.
|
---|
1547 |
|
---|
1548 | @item obj_app_file
|
---|
1549 | @cindex obj_app_file
|
---|
1550 | If you define this macro, GAS will invoke it when it sees a @code{.file}
|
---|
1551 | pseudo-op or a @samp{#} line as used by the C preprocessor.
|
---|
1552 |
|
---|
1553 | @item OBJ_COPY_SYMBOL_ATTRIBUTES
|
---|
1554 | @cindex OBJ_COPY_SYMBOL_ATTRIBUTES
|
---|
1555 | You should define this macro to copy object format specific information from
|
---|
1556 | one symbol to another. GAS will call it when one symbol is equated to
|
---|
1557 | another.
|
---|
1558 |
|
---|
1559 | @item obj_sec_sym_ok_for_reloc
|
---|
1560 | @cindex obj_sec_sym_ok_for_reloc
|
---|
1561 | You may define this macro to indicate that it is OK to use a section symbol in
|
---|
1562 | a relocation entry. If it is not, GAS will define a new symbol at the start
|
---|
1563 | of a section.
|
---|
1564 |
|
---|
1565 | @item EMIT_SECTION_SYMBOLS
|
---|
1566 | @cindex EMIT_SECTION_SYMBOLS
|
---|
1567 | You should define this macro with a zero value if you do not want to include
|
---|
1568 | section symbols in the output symbol table. The default value for this macro
|
---|
1569 | is one.
|
---|
1570 |
|
---|
1571 | @item obj_adjust_symtab
|
---|
1572 | @cindex obj_adjust_symtab
|
---|
1573 | If you define this macro, GAS will invoke it just before setting the symbol
|
---|
1574 | table of the output BFD. For example, the COFF support uses this macro to
|
---|
1575 | generate a @code{.file} symbol if none was generated previously.
|
---|
1576 |
|
---|
1577 | @item SEPARATE_STAB_SECTIONS
|
---|
1578 | @cindex SEPARATE_STAB_SECTIONS
|
---|
1579 | You may define this macro to a nonzero value to indicate that stabs should be
|
---|
1580 | placed in separate sections, as in ELF.
|
---|
1581 |
|
---|
1582 | @item INIT_STAB_SECTION
|
---|
1583 | @cindex INIT_STAB_SECTION
|
---|
1584 | You may define this macro to initialize the stabs section in the output file.
|
---|
1585 |
|
---|
1586 | @item OBJ_PROCESS_STAB
|
---|
1587 | @cindex OBJ_PROCESS_STAB
|
---|
1588 | You may define this macro to do specific processing on a stabs entry.
|
---|
1589 |
|
---|
1590 | @item obj_frob_section
|
---|
1591 | @cindex obj_frob_section
|
---|
1592 | If you define this macro, GAS will call it for each section at the end of the
|
---|
1593 | assembly.
|
---|
1594 |
|
---|
1595 | @item obj_frob_file_before_adjust
|
---|
1596 | @cindex obj_frob_file_before_adjust
|
---|
1597 | If you define this macro, GAS will call it after the symbol values are
|
---|
1598 | resolved, but before the fixups have been changed from local symbols to section
|
---|
1599 | symbols.
|
---|
1600 |
|
---|
1601 | @item obj_frob_symbol
|
---|
1602 | @cindex obj_frob_symbol
|
---|
1603 | If you define this macro, GAS will call it for each symbol. You can indicate
|
---|
1604 | that the symbol should not be included in the object file by defining this
|
---|
1605 | macro to set its second argument to a non-zero value.
|
---|
1606 |
|
---|
1607 | @item obj_frob_file
|
---|
1608 | @cindex obj_frob_file
|
---|
1609 | If you define this macro, GAS will call it after the symbol table has been
|
---|
1610 | completed, but before the relocations have been generated.
|
---|
1611 |
|
---|
1612 | @item obj_frob_file_after_relocs
|
---|
1613 | If you define this macro, GAS will call it after the relocs have been
|
---|
1614 | generated.
|
---|
1615 |
|
---|
1616 | @item SET_SECTION_RELOCS (@var{sec}, @var{relocs}, @var{n})
|
---|
1617 | @cindex SET_SECTION_RELOCS
|
---|
1618 | If you define this, it will be called after the relocations have been set for
|
---|
1619 | the section @var{sec}. The list of relocations is in @var{relocs}, and the
|
---|
1620 | number of relocations is in @var{n}. This is only used with
|
---|
1621 | @code{BFD_ASSEMBLER}.
|
---|
1622 | @end table
|
---|
1623 |
|
---|
1624 | @node Emulations
|
---|
1625 | @subsection Writing emulation files
|
---|
1626 |
|
---|
1627 | Normally you do not have to write an emulation file. You can just use
|
---|
1628 | @file{te-generic.h}.
|
---|
1629 |
|
---|
1630 | If you do write your own emulation file, it must include @file{obj-format.h}.
|
---|
1631 |
|
---|
1632 | An emulation file will often define @code{TE_@var{EM}}; this may then be used
|
---|
1633 | in other files to change the output.
|
---|
1634 |
|
---|
1635 | @node Relaxation
|
---|
1636 | @section Relaxation
|
---|
1637 | @cindex relaxation
|
---|
1638 |
|
---|
1639 | @dfn{Relaxation} is a generic term used when the size of some instruction or
|
---|
1640 | data depends upon the value of some symbol or other data.
|
---|
1641 |
|
---|
1642 | GAS knows to relax a particular type of PC relative relocation using a table.
|
---|
1643 | You can also define arbitrarily complex forms of relaxation yourself.
|
---|
1644 |
|
---|
1645 | @menu
|
---|
1646 | * Relaxing with a table:: Relaxing with a table
|
---|
1647 | * General relaxing:: General relaxing
|
---|
1648 | @end menu
|
---|
1649 |
|
---|
1650 | @node Relaxing with a table
|
---|
1651 | @subsection Relaxing with a table
|
---|
1652 |
|
---|
1653 | If you do not define @code{md_relax_frag}, and you do define
|
---|
1654 | @code{TC_GENERIC_RELAX_TABLE}, GAS will relax @code{rs_machine_dependent} frags
|
---|
1655 | based on the frag subtype and the displacement to some specified target
|
---|
1656 | address. The basic idea is that several machines have different addressing
|
---|
1657 | modes for instructions that can specify different ranges of values, with
|
---|
1658 | successive modes able to access wider ranges, including the entirety of the
|
---|
1659 | previous range. Smaller ranges are assumed to be more desirable (perhaps the
|
---|
1660 | instruction requires one word instead of two or three); if this is not the
|
---|
1661 | case, don't describe the smaller-range, inferior mode.
|
---|
1662 |
|
---|
1663 | The @code{fr_subtype} field of a frag is an index into a CPU-specific
|
---|
1664 | relaxation table. That table entry indicates the range of values that can be
|
---|
1665 | stored, the number of bytes that will have to be added to the frag to
|
---|
1666 | accommodate the addressing mode, and the index of the next entry to examine if
|
---|
1667 | the value to be stored is outside the range accessible by the current
|
---|
1668 | addressing mode. The @code{fr_symbol} field of the frag indicates what symbol
|
---|
1669 | is to be accessed; the @code{fr_offset} field is added in.
|
---|
1670 |
|
---|
1671 | If the @code{TC_PCREL_ADJUST} macro is defined, which currently should only happen
|
---|
1672 | for the NS32k family, the @code{TC_PCREL_ADJUST} macro is called on the frag to
|
---|
1673 | compute an adjustment to be made to the displacement.
|
---|
1674 |
|
---|
1675 | The value fitted by the relaxation code is always assumed to be a displacement
|
---|
1676 | from the current frag. (More specifically, from @code{fr_fix} bytes into the
|
---|
1677 | frag.)
|
---|
1678 | @ignore
|
---|
1679 | This seems kinda silly. What about fitting small absolute values? I suppose
|
---|
1680 | @code{md_assemble} is supposed to take care of that, but if the operand is a
|
---|
1681 | difference between symbols, it might not be able to, if the difference was not
|
---|
1682 | computable yet.
|
---|
1683 | @end ignore
|
---|
1684 |
|
---|
1685 | The end of the relaxation sequence is indicated by a ``next'' value of 0. This
|
---|
1686 | means that the first entry in the table can't be used.
|
---|
1687 |
|
---|
1688 | For some configurations, the linker can do relaxing within a section of an
|
---|
1689 | object file. If call instructions of various sizes exist, the linker can
|
---|
1690 | determine which should be used in each instance, when a symbol's value is
|
---|
1691 | resolved. In order for the linker to avoid wasting space and having to insert
|
---|
1692 | no-op instructions, it must be able to expand or shrink the section contents
|
---|
1693 | while still preserving intra-section references and meeting alignment
|
---|
1694 | requirements.
|
---|
1695 |
|
---|
1696 | For the i960 using b.out format, no expansion is done; instead, each
|
---|
1697 | @samp{.align} directive causes extra space to be allocated, enough that when
|
---|
1698 | the linker is relaxing a section and removing unneeded space, it can discard
|
---|
1699 | some or all of this extra padding and cause the following data to be correctly
|
---|
1700 | aligned.
|
---|
1701 |
|
---|
1702 | For the H8/300, I think the linker expands calls that can't reach, and doesn't
|
---|
1703 | worry about alignment issues; the cpu probably never needs any significant
|
---|
1704 | alignment beyond the instruction size.
|
---|
1705 |
|
---|
1706 | The relaxation table type contains these fields:
|
---|
1707 |
|
---|
1708 | @table @code
|
---|
1709 | @item long rlx_forward
|
---|
1710 | Forward reach, must be non-negative.
|
---|
1711 | @item long rlx_backward
|
---|
1712 | Backward reach, must be zero or negative.
|
---|
1713 | @item rlx_length
|
---|
1714 | Length in bytes of this addressing mode.
|
---|
1715 | @item rlx_more
|
---|
1716 | Index of the next-longer relax state, or zero if there is no next relax state.
|
---|
1717 | @end table
|
---|
1718 |
|
---|
1719 | The relaxation is done in @code{relax_segment} in @file{write.c}. The
|
---|
1720 | difference in the length fields between the original mode and the one finally
|
---|
1721 | chosen by the relaxing code is taken as the size by which the current frag will
|
---|
1722 | be increased in size. For example, if the initial relaxing mode has a length
|
---|
1723 | of 2 bytes, and because of the size of the displacement, it gets upgraded to a
|
---|
1724 | mode with a size of 6 bytes, it is assumed that the frag will grow by 4 bytes.
|
---|
1725 | (The initial two bytes should have been part of the fixed portion of the frag,
|
---|
1726 | since it is already known that they will be output.) This growth must be
|
---|
1727 | effected by @code{md_convert_frag}; it should increase the @code{fr_fix} field
|
---|
1728 | by the appropriate size, and fill in the appropriate bytes of the frag.
|
---|
1729 | (Enough space for the maximum growth should have been allocated in the call to
|
---|
1730 | frag_var as the second argument.)
|
---|
1731 |
|
---|
1732 | If relocation records are needed, they should be emitted by
|
---|
1733 | @code{md_estimate_size_before_relax}. This function should examine the target
|
---|
1734 | symbol of the supplied frag and correct the @code{fr_subtype} of the frag if
|
---|
1735 | needed. When this function is called, if the symbol has not yet been defined,
|
---|
1736 | it will not become defined later; however, its value may still change if the
|
---|
1737 | section it is in gets relaxed.
|
---|
1738 |
|
---|
1739 | Usually, if the symbol is in the same section as the frag (given by the
|
---|
1740 | @var{sec} argument), the narrowest likely relaxation mode is stored in
|
---|
1741 | @code{fr_subtype}, and that's that.
|
---|
1742 |
|
---|
1743 | If the symbol is undefined, or in a different section (and therefore moveable
|
---|
1744 | to an arbitrarily large distance), the largest available relaxation mode is
|
---|
1745 | specified, @code{fix_new} is called to produce the relocation record,
|
---|
1746 | @code{fr_fix} is increased to include the relocated field (remember, this
|
---|
1747 | storage was allocated when @code{frag_var} was called), and @code{frag_wane} is
|
---|
1748 | called to convert the frag to an @code{rs_fill} frag with no variant part.
|
---|
1749 | Sometimes changing addressing modes may also require rewriting the instruction.
|
---|
1750 | It can be accessed via @code{fr_opcode} or @code{fr_fix}.
|
---|
1751 |
|
---|
1752 | If you generate frags separately for the basic insn opcode and any relaxable
|
---|
1753 | operands, do not call @code{fix_new} thinking you can emit fixups for the
|
---|
1754 | opcode field from the relaxable frag. It is not guaranteed to be the same frag.
|
---|
1755 | If you need to emit fixups for the opcode field from inspection of the
|
---|
1756 | relaxable frag, then you need to generate a common frag for both the basic
|
---|
1757 | opcode and relaxable fields, or you need to provide the frag for the opcode to
|
---|
1758 | pass to @code{fix_new}. The latter can be done for example by defining
|
---|
1759 | @code{TC_FRAG_TYPE} to include a pointer to it and defining @code{TC_FRAG_INIT}
|
---|
1760 | to set the pointer.
|
---|
1761 |
|
---|
1762 | Sometimes @code{fr_var} is increased instead, and @code{frag_wane} is not
|
---|
1763 | called. I'm not sure, but I think this is to keep @code{fr_fix} referring to
|
---|
1764 | an earlier byte, and @code{fr_subtype} set to @code{rs_machine_dependent} so
|
---|
1765 | that @code{md_convert_frag} will get called.
|
---|
1766 |
|
---|
1767 | @node General relaxing
|
---|
1768 | @subsection General relaxing
|
---|
1769 |
|
---|
1770 | If using a simple table is not suitable, you may implement arbitrarily complex
|
---|
1771 | relaxation semantics yourself. For example, the MIPS backend uses this to emit
|
---|
1772 | different instruction sequences depending upon the size of the symbol being
|
---|
1773 | accessed.
|
---|
1774 |
|
---|
1775 | When you assemble an instruction that may need relaxation, you should allocate
|
---|
1776 | a frag using @code{frag_var} or @code{frag_variant} with a type of
|
---|
1777 | @code{rs_machine_dependent}. You should store some sort of information in the
|
---|
1778 | @code{fr_subtype} field so that you can figure out what to do with the frag
|
---|
1779 | later.
|
---|
1780 |
|
---|
1781 | When GAS reaches the end of the input file, it will look through the frags and
|
---|
1782 | work out their final sizes.
|
---|
1783 |
|
---|
1784 | GAS will first call @code{md_estimate_size_before_relax} on each
|
---|
1785 | @code{rs_machine_dependent} frag. This function must return an estimated size
|
---|
1786 | for the frag.
|
---|
1787 |
|
---|
1788 | GAS will then loop over the frags, calling @code{md_relax_frag} on each
|
---|
1789 | @code{rs_machine_dependent} frag. This function should return the change in
|
---|
1790 | size of the frag. GAS will keep looping over the frags until none of the frags
|
---|
1791 | changes size.
|
---|
1792 |
|
---|
1793 | @node Broken words
|
---|
1794 | @section Broken words
|
---|
1795 | @cindex internals, broken words
|
---|
1796 | @cindex broken words
|
---|
1797 |
|
---|
1798 | Some compilers, including GCC, will sometimes emit switch tables specifying
|
---|
1799 | 16-bit @code{.word} displacements to branch targets, and branch instructions
|
---|
1800 | that load entries from that table to compute the target address. If this is
|
---|
1801 | done on a 32-bit machine, there is a chance (at least with really large
|
---|
1802 | functions) that the displacement will not fit in 16 bits. The assembler
|
---|
1803 | handles this using a concept called @dfn{broken words}. This idea is well
|
---|
1804 | named, since there is an implied promise that the 16-bit field will in fact
|
---|
1805 | hold the specified displacement.
|
---|
1806 |
|
---|
1807 | If broken word processing is enabled, and a situation like this is encountered,
|
---|
1808 | the assembler will insert a jump instruction into the instruction stream, close
|
---|
1809 | enough to be reached with the 16-bit displacement. This jump instruction will
|
---|
1810 | transfer to the real desired target address. Thus, as long as the @code{.word}
|
---|
1811 | value really is used as a displacement to compute an address to jump to, the
|
---|
1812 | net effect will be correct (minus a very small efficiency cost). If
|
---|
1813 | @code{.word} directives with label differences for values are used for other
|
---|
1814 | purposes, however, things may not work properly. For targets which use broken
|
---|
1815 | words, the @samp{-K} option will warn when a broken word is discovered.
|
---|
1816 |
|
---|
1817 | The broken word code is turned off by the @code{WORKING_DOT_WORD} macro. It
|
---|
1818 | isn't needed if @code{.word} emits a value large enough to contain an address
|
---|
1819 | (or, more correctly, any possible difference between two addresses).
|
---|
1820 |
|
---|
1821 | @node Internal functions
|
---|
1822 | @section Internal functions
|
---|
1823 |
|
---|
1824 | This section describes basic internal functions used by GAS.
|
---|
1825 |
|
---|
1826 | @menu
|
---|
1827 | * Warning and error messages:: Warning and error messages
|
---|
1828 | * Hash tables:: Hash tables
|
---|
1829 | @end menu
|
---|
1830 |
|
---|
1831 | @node Warning and error messages
|
---|
1832 | @subsection Warning and error messages
|
---|
1833 |
|
---|
1834 | @deftypefun @{@} int had_warnings (void)
|
---|
1835 | @deftypefunx @{@} int had_errors (void)
|
---|
1836 | Returns non-zero if any warnings or errors, respectively, have been printed
|
---|
1837 | during this invocation.
|
---|
1838 | @end deftypefun
|
---|
1839 |
|
---|
1840 | @deftypefun @{@} void as_perror (const char *@var{gripe}, const char *@var{filename})
|
---|
1841 | Displays a BFD or system error, then clears the error status.
|
---|
1842 | @end deftypefun
|
---|
1843 |
|
---|
1844 | @deftypefun @{@} void as_tsktsk (const char *@var{format}, ...)
|
---|
1845 | @deftypefunx @{@} void as_warn (const char *@var{format}, ...)
|
---|
1846 | @deftypefunx @{@} void as_bad (const char *@var{format}, ...)
|
---|
1847 | @deftypefunx @{@} void as_fatal (const char *@var{format}, ...)
|
---|
1848 | These functions display messages about something amiss with the input file, or
|
---|
1849 | internal problems in the assembler itself. The current file name and line
|
---|
1850 | number are printed, followed by the supplied message, formatted using
|
---|
1851 | @code{vfprintf}, and a final newline.
|
---|
1852 |
|
---|
1853 | An error indicated by @code{as_bad} will result in a non-zero exit status when
|
---|
1854 | the assembler has finished. Calling @code{as_fatal} will result in immediate
|
---|
1855 | termination of the assembler process.
|
---|
1856 | @end deftypefun
|
---|
1857 |
|
---|
1858 | @deftypefun @{@} void as_warn_where (char *@var{file}, unsigned int @var{line}, const char *@var{format}, ...)
|
---|
1859 | @deftypefunx @{@} void as_bad_where (char *@var{file}, unsigned int @var{line}, const char *@var{format}, ...)
|
---|
1860 | These variants permit specification of the file name and line number, and are
|
---|
1861 | used when problems are detected when reprocessing information saved away when
|
---|
1862 | processing some earlier part of the file. For example, fixups are processed
|
---|
1863 | after all input has been read, but messages about fixups should refer to the
|
---|
1864 | original filename and line number that they are applicable to.
|
---|
1865 | @end deftypefun
|
---|
1866 |
|
---|
1867 | @deftypefun @{@} void fprint_value (FILE *@var{file}, valueT @var{val})
|
---|
1868 | @deftypefunx @{@} void sprint_value (char *@var{buf}, valueT @var{val})
|
---|
1869 | These functions are helpful for converting a @code{valueT} value into printable
|
---|
1870 | format, in case it's wider than modes that @code{*printf} can handle. If the
|
---|
1871 | type is narrow enough, a decimal number will be produced; otherwise, it will be
|
---|
1872 | in hexadecimal. The value itself is not examined to make this determination.
|
---|
1873 | @end deftypefun
|
---|
1874 |
|
---|
1875 | @node Hash tables
|
---|
1876 | @subsection Hash tables
|
---|
1877 | @cindex hash tables
|
---|
1878 |
|
---|
1879 | @deftypefun @{@} @{struct hash_control *@} hash_new (void)
|
---|
1880 | Creates the hash table control structure.
|
---|
1881 | @end deftypefun
|
---|
1882 |
|
---|
1883 | @deftypefun @{@} void hash_die (struct hash_control *)
|
---|
1884 | Destroy a hash table.
|
---|
1885 | @end deftypefun
|
---|
1886 |
|
---|
1887 | @deftypefun @{@} PTR hash_delete (struct hash_control *, const char *)
|
---|
1888 | Deletes entry from the hash table, returns the value it had.
|
---|
1889 | @end deftypefun
|
---|
1890 |
|
---|
1891 | @deftypefun @{@} PTR hash_replace (struct hash_control *, const char *, PTR)
|
---|
1892 | Updates the value for an entry already in the table, returning the old value.
|
---|
1893 | If no entry was found, just returns NULL.
|
---|
1894 | @end deftypefun
|
---|
1895 |
|
---|
1896 | @deftypefun @{@} @{const char *@} hash_insert (struct hash_control *, const char *, PTR)
|
---|
1897 | Inserting a value already in the table is an error.
|
---|
1898 | Returns an error message or NULL.
|
---|
1899 | @end deftypefun
|
---|
1900 |
|
---|
1901 | @deftypefun @{@} @{const char *@} hash_jam (struct hash_control *, const char *, PTR)
|
---|
1902 | Inserts if the value isn't already present, updates it if it is.
|
---|
1903 | @end deftypefun
|
---|
1904 |
|
---|
1905 | @node Test suite
|
---|
1906 | @section Test suite
|
---|
1907 | @cindex test suite
|
---|
1908 |
|
---|
1909 | The test suite is kind of lame for most processors. Often it only checks to
|
---|
1910 | see if a couple of files can be assembled without the assembler reporting any
|
---|
1911 | errors. For more complete testing, write a test which either examines the
|
---|
1912 | assembler listing, or runs @code{objdump} and examines its output. For the
|
---|
1913 | latter, the TCL procedure @code{run_dump_test} may come in handy. It takes the
|
---|
1914 | base name of a file, and looks for @file{@var{file}.d}. This file should
|
---|
1915 | contain as its initial lines a set of variable settings in @samp{#} comments,
|
---|
1916 | in the form:
|
---|
1917 |
|
---|
1918 | @example
|
---|
1919 | #@var{varname}: @var{value}
|
---|
1920 | @end example
|
---|
1921 |
|
---|
1922 | The @var{varname} may be @code{objdump}, @code{nm}, or @code{as}, in which case
|
---|
1923 | it specifies the options to be passed to the specified programs. Exactly one
|
---|
1924 | of @code{objdump} or @code{nm} must be specified, as that also specifies which
|
---|
1925 | program to run after the assembler has finished. If @var{varname} is
|
---|
1926 | @code{source}, it specifies the name of the source file; otherwise,
|
---|
1927 | @file{@var{file}.s} is used. If @var{varname} is @code{name}, it specifies the
|
---|
1928 | name of the test to be used in the @code{pass} or @code{fail} messages.
|
---|
1929 |
|
---|
1930 | The non-commented parts of the file are interpreted as regular expressions, one
|
---|
1931 | per line. Blank lines in the @code{objdump} or @code{nm} output are skipped,
|
---|
1932 | as are blank lines in the @code{.d} file; the other lines are tested to see if
|
---|
1933 | the regular expression matches the program output. If it does not, the test
|
---|
1934 | fails.
|
---|
1935 |
|
---|
1936 | Note that this means the tests must be modified if the @code{objdump} output
|
---|
1937 | style is changed.
|
---|
1938 |
|
---|
1939 | @bye
|
---|
1940 | @c Local Variables:
|
---|
1941 | @c fill-column: 79
|
---|
1942 | @c End:
|
---|