| 1 | \input texinfo
|
|---|
| 2 | @c Copyright 1991, 1992, 1993, 1994, 1995, 1996, 1997, 1998, 1999, 2000,
|
|---|
| 3 | @c 2001, 2002, 2003
|
|---|
| 4 | @c Free Software Foundation, Inc.
|
|---|
| 5 | @setfilename internals.info
|
|---|
| 6 | @node Top
|
|---|
| 7 | @top Assembler Internals
|
|---|
| 8 | @raisesections
|
|---|
| 9 | @cindex internals
|
|---|
| 10 |
|
|---|
| 11 | This chapter describes the internals of the assembler. It is incomplete, but
|
|---|
| 12 | it may help a bit.
|
|---|
| 13 |
|
|---|
| 14 | This chapter is not updated regularly, and it may be out of date.
|
|---|
| 15 |
|
|---|
| 16 | @menu
|
|---|
| 17 | * GAS versions:: GAS versions
|
|---|
| 18 | * Data types:: Data types
|
|---|
| 19 | * GAS processing:: What GAS does when it runs
|
|---|
| 20 | * Porting GAS:: Porting GAS
|
|---|
| 21 | * Relaxation:: Relaxation
|
|---|
| 22 | * Broken words:: Broken words
|
|---|
| 23 | * Internal functions:: Internal functions
|
|---|
| 24 | * Test suite:: Test suite
|
|---|
| 25 | @end menu
|
|---|
| 26 |
|
|---|
| 27 | @node GAS versions
|
|---|
| 28 | @section GAS versions
|
|---|
| 29 |
|
|---|
| 30 | GAS has acquired layers of code over time. The original GAS only supported the
|
|---|
| 31 | a.out object file format, with three sections. Support for multiple sections
|
|---|
| 32 | has been added in two different ways.
|
|---|
| 33 |
|
|---|
| 34 | The preferred approach is to use the version of GAS created when the symbol
|
|---|
| 35 | @code{BFD_ASSEMBLER} is defined. The other versions of GAS are documented for
|
|---|
| 36 | historical purposes, and to help anybody who has to debug code written for
|
|---|
| 37 | them.
|
|---|
| 38 |
|
|---|
| 39 | The type @code{segT} is used to represent a section in code which must work
|
|---|
| 40 | with all versions of GAS.
|
|---|
| 41 |
|
|---|
| 42 | @menu
|
|---|
| 43 | * Original GAS:: Original GAS version
|
|---|
| 44 | * MANY_SEGMENTS:: MANY_SEGMENTS gas version
|
|---|
| 45 | * BFD_ASSEMBLER:: BFD_ASSEMBLER gas version
|
|---|
| 46 | @end menu
|
|---|
| 47 |
|
|---|
| 48 | @node Original GAS
|
|---|
| 49 | @subsection Original GAS
|
|---|
| 50 |
|
|---|
| 51 | The original GAS only supported the a.out object file format with three
|
|---|
| 52 | sections: @samp{.text}, @samp{.data}, and @samp{.bss}. This is the version of
|
|---|
| 53 | GAS that is compiled if neither @code{BFD_ASSEMBLER} nor @code{MANY_SEGMENTS}
|
|---|
| 54 | is defined. This version of GAS is still used for the m68k-aout target, and
|
|---|
| 55 | perhaps others.
|
|---|
| 56 |
|
|---|
| 57 | This version of GAS should not be used for any new development.
|
|---|
| 58 |
|
|---|
| 59 | There is still code that is specific to this version of GAS, notably in
|
|---|
| 60 | @file{write.c}. There is no way for this code to loop through all the
|
|---|
| 61 | sections; it simply looks at global variables like @code{text_frag_root} and
|
|---|
| 62 | @code{data_frag_root}.
|
|---|
| 63 |
|
|---|
| 64 | The type @code{segT} is an enum.
|
|---|
| 65 |
|
|---|
| 66 | @node MANY_SEGMENTS
|
|---|
| 67 | @subsection MANY_SEGMENTS gas version
|
|---|
| 68 | @cindex MANY_SEGMENTS
|
|---|
| 69 |
|
|---|
| 70 | The @code{MANY_SEGMENTS} version of gas is only used for COFF. It uses the BFD
|
|---|
| 71 | library, but it writes out all the data itself using @code{bfd_write}. This
|
|---|
| 72 | version of gas supports up to 40 normal sections. The section names are stored
|
|---|
| 73 | in the @code{seg_name} array. Other information is stored in the
|
|---|
| 74 | @code{segment_info} array.
|
|---|
| 75 |
|
|---|
| 76 | The type @code{segT} is an enum. Code that wants to examine all the sections
|
|---|
| 77 | can use a @code{segT} variable as loop index from @code{SEG_E0} up to but not
|
|---|
| 78 | including @code{SEG_UNKNOWN}.
|
|---|
| 79 |
|
|---|
| 80 | Most of the code specific to this version of GAS is in the file
|
|---|
| 81 | @file{config/obj-coff.c}, in the portion of that file that is compiled when
|
|---|
| 82 | @code{BFD_ASSEMBLER} is not defined.
|
|---|
| 83 |
|
|---|
| 84 | This version of GAS is still used for several COFF targets.
|
|---|
| 85 |
|
|---|
| 86 | @node BFD_ASSEMBLER
|
|---|
| 87 | @subsection BFD_ASSEMBLER gas version
|
|---|
| 88 | @cindex BFD_ASSEMBLER
|
|---|
| 89 |
|
|---|
| 90 | The preferred version of GAS is the @code{BFD_ASSEMBLER} version. In this
|
|---|
| 91 | version of GAS, the output file is a normal BFD, and the BFD routines are used
|
|---|
| 92 | to generate the output.
|
|---|
| 93 |
|
|---|
| 94 | @code{BFD_ASSEMBLER} will automatically be used for certain targets, including
|
|---|
| 95 | those that use the ELF, ECOFF, and SOM object file formats, and also all Alpha,
|
|---|
| 96 | MIPS, PowerPC, and SPARC targets. You can force the use of
|
|---|
| 97 | @code{BFD_ASSEMBLER} for other targets with the configure option
|
|---|
| 98 | @samp{--enable-bfd-assembler}; however, it has not been tested for many
|
|---|
| 99 | targets, and can not be assumed to work.
|
|---|
| 100 |
|
|---|
| 101 | @node Data types
|
|---|
| 102 | @section Data types
|
|---|
| 103 | @cindex internals, data types
|
|---|
| 104 |
|
|---|
| 105 | This section describes some fundamental GAS data types.
|
|---|
| 106 |
|
|---|
| 107 | @menu
|
|---|
| 108 | * Symbols:: The symbolS structure
|
|---|
| 109 | * Expressions:: The expressionS structure
|
|---|
| 110 | * Fixups:: The fixS structure
|
|---|
| 111 | * Frags:: The fragS structure
|
|---|
| 112 | @end menu
|
|---|
| 113 |
|
|---|
| 114 | @node Symbols
|
|---|
| 115 | @subsection Symbols
|
|---|
| 116 | @cindex internals, symbols
|
|---|
| 117 | @cindex symbols, internal
|
|---|
| 118 | @cindex symbolS structure
|
|---|
| 119 |
|
|---|
| 120 | The definition for the symbol structure, @code{symbolS}, is located in
|
|---|
| 121 | @file{struc-symbol.h}.
|
|---|
| 122 |
|
|---|
| 123 | In general, the fields of this structure may not be referred to directly.
|
|---|
| 124 | Instead, you must use one of the accessor functions defined in @file{symbol.h}.
|
|---|
| 125 | These accessor functions should work for any GAS version.
|
|---|
| 126 |
|
|---|
| 127 | Symbol structures contain the following fields:
|
|---|
| 128 |
|
|---|
| 129 | @table @code
|
|---|
| 130 | @item sy_value
|
|---|
| 131 | This is an @code{expressionS} that describes the value of the symbol. It might
|
|---|
| 132 | refer to one or more other symbols; if so, its true value may not be known
|
|---|
| 133 | until @code{resolve_symbol_value} is called with @var{finalize_syms} non-zero
|
|---|
| 134 | in @code{write_object_file}.
|
|---|
| 135 |
|
|---|
| 136 | The expression is often simply a constant. Before @code{resolve_symbol_value}
|
|---|
| 137 | is called with @var{finalize_syms} set, the value is the offset from the frag
|
|---|
| 138 | (@pxref{Frags}). Afterward, the frag address has been added in.
|
|---|
| 139 |
|
|---|
| 140 | @item sy_resolved
|
|---|
| 141 | This field is non-zero if the symbol's value has been completely resolved. It
|
|---|
| 142 | is used during the final pass over the symbol table.
|
|---|
| 143 |
|
|---|
| 144 | @item sy_resolving
|
|---|
| 145 | This field is used to detect loops while resolving the symbol's value.
|
|---|
| 146 |
|
|---|
| 147 | @item sy_used_in_reloc
|
|---|
| 148 | This field is non-zero if the symbol is used by a relocation entry. If a local
|
|---|
| 149 | symbol is used in a relocation entry, it must be possible to redirect those
|
|---|
| 150 | relocations to other symbols, or this symbol cannot be removed from the final
|
|---|
| 151 | symbol list.
|
|---|
| 152 |
|
|---|
| 153 | @item sy_next
|
|---|
| 154 | @itemx sy_previous
|
|---|
| 155 | These pointers to other @code{symbolS} structures describe a singly or doubly
|
|---|
| 156 | linked list. (If @code{SYMBOLS_NEED_BACKPOINTERS} is not defined, the
|
|---|
| 157 | @code{sy_previous} field will be omitted; @code{SYMBOLS_NEED_BACKPOINTERS} is
|
|---|
| 158 | always defined if @code{BFD_ASSEMBLER}.) These fields should be accessed with
|
|---|
| 159 | the @code{symbol_next} and @code{symbol_previous} macros.
|
|---|
| 160 |
|
|---|
| 161 | @item sy_frag
|
|---|
| 162 | This points to the frag (@pxref{Frags}) that this symbol is attached to.
|
|---|
| 163 |
|
|---|
| 164 | @item sy_used
|
|---|
| 165 | Whether the symbol is used as an operand or in an expression. Note: Not all of
|
|---|
| 166 | the backends keep this information accurate; backends which use this bit are
|
|---|
| 167 | responsible for setting it when a symbol is used in backend routines.
|
|---|
| 168 |
|
|---|
| 169 | @item sy_mri_common
|
|---|
| 170 | Whether the symbol is an MRI common symbol created by the @code{COMMON}
|
|---|
| 171 | pseudo-op when assembling in MRI mode.
|
|---|
| 172 |
|
|---|
| 173 | @item bsym
|
|---|
| 174 | If @code{BFD_ASSEMBLER} is defined, this points to the BFD @code{asymbol} that
|
|---|
| 175 | will be used in writing the object file.
|
|---|
| 176 |
|
|---|
| 177 | @item sy_name_offset
|
|---|
| 178 | (Only used if @code{BFD_ASSEMBLER} is not defined.) This is the position of
|
|---|
| 179 | the symbol's name in the string table of the object file. On some formats,
|
|---|
| 180 | this will start at position 4, with position 0 reserved for unnamed symbols.
|
|---|
| 181 | This field is not used until @code{write_object_file} is called.
|
|---|
| 182 |
|
|---|
| 183 | @item sy_symbol
|
|---|
| 184 | (Only used if @code{BFD_ASSEMBLER} is not defined.) This is the
|
|---|
| 185 | format-specific symbol structure, as it would be written into the object file.
|
|---|
| 186 |
|
|---|
| 187 | @item sy_number
|
|---|
| 188 | (Only used if @code{BFD_ASSEMBLER} is not defined.) This is a 24-bit symbol
|
|---|
| 189 | number, for use in constructing relocation table entries.
|
|---|
| 190 |
|
|---|
| 191 | @item sy_obj
|
|---|
| 192 | This format-specific data is of type @code{OBJ_SYMFIELD_TYPE}. If no macro by
|
|---|
| 193 | that name is defined in @file{obj-format.h}, this field is not defined.
|
|---|
| 194 |
|
|---|
| 195 | @item sy_tc
|
|---|
| 196 | This processor-specific data is of type @code{TC_SYMFIELD_TYPE}. If no macro
|
|---|
| 197 | by that name is defined in @file{targ-cpu.h}, this field is not defined.
|
|---|
| 198 |
|
|---|
| 199 | @end table
|
|---|
| 200 |
|
|---|
| 201 | Here is a description of the accessor functions. These should be used rather
|
|---|
| 202 | than referring to the fields of @code{symbolS} directly.
|
|---|
| 203 |
|
|---|
| 204 | @table @code
|
|---|
| 205 | @item S_SET_VALUE
|
|---|
| 206 | @cindex S_SET_VALUE
|
|---|
| 207 | Set the symbol's value.
|
|---|
| 208 |
|
|---|
| 209 | @item S_GET_VALUE
|
|---|
| 210 | @cindex S_GET_VALUE
|
|---|
| 211 | Get the symbol's value. This will cause @code{resolve_symbol_value} to be
|
|---|
| 212 | called if necessary.
|
|---|
| 213 |
|
|---|
| 214 | @item S_SET_SEGMENT
|
|---|
| 215 | @cindex S_SET_SEGMENT
|
|---|
| 216 | Set the section of the symbol.
|
|---|
| 217 |
|
|---|
| 218 | @item S_GET_SEGMENT
|
|---|
| 219 | @cindex S_GET_SEGMENT
|
|---|
| 220 | Get the symbol's section.
|
|---|
| 221 |
|
|---|
| 222 | @item S_GET_NAME
|
|---|
| 223 | @cindex S_GET_NAME
|
|---|
| 224 | Get the name of the symbol.
|
|---|
| 225 |
|
|---|
| 226 | @item S_SET_NAME
|
|---|
| 227 | @cindex S_SET_NAME
|
|---|
| 228 | Set the name of the symbol.
|
|---|
| 229 |
|
|---|
| 230 | @item S_IS_EXTERNAL
|
|---|
| 231 | @cindex S_IS_EXTERNAL
|
|---|
| 232 | Return non-zero if the symbol is externally visible.
|
|---|
| 233 |
|
|---|
| 234 | @item S_IS_EXTERN
|
|---|
| 235 | @cindex S_IS_EXTERN
|
|---|
| 236 | A synonym for @code{S_IS_EXTERNAL}. Don't use it.
|
|---|
| 237 |
|
|---|
| 238 | @item S_IS_WEAK
|
|---|
| 239 | @cindex S_IS_WEAK
|
|---|
| 240 | Return non-zero if the symbol is weak.
|
|---|
| 241 |
|
|---|
| 242 | @item S_IS_COMMON
|
|---|
| 243 | @cindex S_IS_COMMON
|
|---|
| 244 | Return non-zero if this is a common symbol. Common symbols are sometimes
|
|---|
| 245 | represented as undefined symbols with a value, in which case this function will
|
|---|
| 246 | not be reliable.
|
|---|
| 247 |
|
|---|
| 248 | @item S_IS_DEFINED
|
|---|
| 249 | @cindex S_IS_DEFINED
|
|---|
| 250 | Return non-zero if this symbol is defined. This function is not reliable when
|
|---|
| 251 | called on a common symbol.
|
|---|
| 252 |
|
|---|
| 253 | @item S_IS_DEBUG
|
|---|
| 254 | @cindex S_IS_DEBUG
|
|---|
| 255 | Return non-zero if this is a debugging symbol.
|
|---|
| 256 |
|
|---|
| 257 | @item S_IS_LOCAL
|
|---|
| 258 | @cindex S_IS_LOCAL
|
|---|
| 259 | Return non-zero if this is a local assembler symbol which should not be
|
|---|
| 260 | included in the final symbol table. Note that this is not the opposite of
|
|---|
| 261 | @code{S_IS_EXTERNAL}. The @samp{-L} assembler option affects the return value
|
|---|
| 262 | of this function.
|
|---|
| 263 |
|
|---|
| 264 | @item S_SET_EXTERNAL
|
|---|
| 265 | @cindex S_SET_EXTERNAL
|
|---|
| 266 | Mark the symbol as externally visible.
|
|---|
| 267 |
|
|---|
| 268 | @item S_CLEAR_EXTERNAL
|
|---|
| 269 | @cindex S_CLEAR_EXTERNAL
|
|---|
| 270 | Mark the symbol as not externally visible.
|
|---|
| 271 |
|
|---|
| 272 | @item S_SET_WEAK
|
|---|
| 273 | @cindex S_SET_WEAK
|
|---|
| 274 | Mark the symbol as weak.
|
|---|
| 275 |
|
|---|
| 276 | @item S_GET_TYPE
|
|---|
| 277 | @item S_GET_DESC
|
|---|
| 278 | @item S_GET_OTHER
|
|---|
| 279 | @cindex S_GET_TYPE
|
|---|
| 280 | @cindex S_GET_DESC
|
|---|
| 281 | @cindex S_GET_OTHER
|
|---|
| 282 | Get the @code{type}, @code{desc}, and @code{other} fields of the symbol. These
|
|---|
| 283 | are only defined for object file formats for which they make sense (primarily
|
|---|
| 284 | a.out).
|
|---|
| 285 |
|
|---|
| 286 | @item S_SET_TYPE
|
|---|
| 287 | @item S_SET_DESC
|
|---|
| 288 | @item S_SET_OTHER
|
|---|
| 289 | @cindex S_SET_TYPE
|
|---|
| 290 | @cindex S_SET_DESC
|
|---|
| 291 | @cindex S_SET_OTHER
|
|---|
| 292 | Set the @code{type}, @code{desc}, and @code{other} fields of the symbol. These
|
|---|
| 293 | are only defined for object file formats for which they make sense (primarily
|
|---|
| 294 | a.out).
|
|---|
| 295 |
|
|---|
| 296 | @item S_GET_SIZE
|
|---|
| 297 | @cindex S_GET_SIZE
|
|---|
| 298 | Get the size of a symbol. This is only defined for object file formats for
|
|---|
| 299 | which it makes sense (primarily ELF).
|
|---|
| 300 |
|
|---|
| 301 | @item S_SET_SIZE
|
|---|
| 302 | @cindex S_SET_SIZE
|
|---|
| 303 | Set the size of a symbol. This is only defined for object file formats for
|
|---|
| 304 | which it makes sense (primarily ELF).
|
|---|
| 305 |
|
|---|
| 306 | @item symbol_get_value_expression
|
|---|
| 307 | @cindex symbol_get_value_expression
|
|---|
| 308 | Get a pointer to an @code{expressionS} structure which represents the value of
|
|---|
| 309 | the symbol as an expression.
|
|---|
| 310 |
|
|---|
| 311 | @item symbol_set_value_expression
|
|---|
| 312 | @cindex symbol_set_value_expression
|
|---|
| 313 | Set the value of a symbol to an expression.
|
|---|
| 314 |
|
|---|
| 315 | @item symbol_set_frag
|
|---|
| 316 | @cindex symbol_set_frag
|
|---|
| 317 | Set the frag where a symbol is defined.
|
|---|
| 318 |
|
|---|
| 319 | @item symbol_get_frag
|
|---|
| 320 | @cindex symbol_get_frag
|
|---|
| 321 | Get the frag where a symbol is defined.
|
|---|
| 322 |
|
|---|
| 323 | @item symbol_mark_used
|
|---|
| 324 | @cindex symbol_mark_used
|
|---|
| 325 | Mark a symbol as having been used in an expression.
|
|---|
| 326 |
|
|---|
| 327 | @item symbol_clear_used
|
|---|
| 328 | @cindex symbol_clear_used
|
|---|
| 329 | Clear the mark indicating that a symbol was used in an expression.
|
|---|
| 330 |
|
|---|
| 331 | @item symbol_used_p
|
|---|
| 332 | @cindex symbol_used_p
|
|---|
| 333 | Return whether a symbol was used in an expression.
|
|---|
| 334 |
|
|---|
| 335 | @item symbol_mark_used_in_reloc
|
|---|
| 336 | @cindex symbol_mark_used_in_reloc
|
|---|
| 337 | Mark a symbol as having been used by a relocation.
|
|---|
| 338 |
|
|---|
| 339 | @item symbol_clear_used_in_reloc
|
|---|
| 340 | @cindex symbol_clear_used_in_reloc
|
|---|
| 341 | Clear the mark indicating that a symbol was used in a relocation.
|
|---|
| 342 |
|
|---|
| 343 | @item symbol_used_in_reloc_p
|
|---|
| 344 | @cindex symbol_used_in_reloc_p
|
|---|
| 345 | Return whether a symbol was used in a relocation.
|
|---|
| 346 |
|
|---|
| 347 | @item symbol_mark_mri_common
|
|---|
| 348 | @cindex symbol_mark_mri_common
|
|---|
| 349 | Mark a symbol as an MRI common symbol.
|
|---|
| 350 |
|
|---|
| 351 | @item symbol_clear_mri_common
|
|---|
| 352 | @cindex symbol_clear_mri_common
|
|---|
| 353 | Clear the mark indicating that a symbol is an MRI common symbol.
|
|---|
| 354 |
|
|---|
| 355 | @item symbol_mri_common_p
|
|---|
| 356 | @cindex symbol_mri_common_p
|
|---|
| 357 | Return whether a symbol is an MRI common symbol.
|
|---|
| 358 |
|
|---|
| 359 | @item symbol_mark_written
|
|---|
| 360 | @cindex symbol_mark_written
|
|---|
| 361 | Mark a symbol as having been written.
|
|---|
| 362 |
|
|---|
| 363 | @item symbol_clear_written
|
|---|
| 364 | @cindex symbol_clear_written
|
|---|
| 365 | Clear the mark indicating that a symbol was written.
|
|---|
| 366 |
|
|---|
| 367 | @item symbol_written_p
|
|---|
| 368 | @cindex symbol_written_p
|
|---|
| 369 | Return whether a symbol was written.
|
|---|
| 370 |
|
|---|
| 371 | @item symbol_mark_resolved
|
|---|
| 372 | @cindex symbol_mark_resolved
|
|---|
| 373 | Mark a symbol as having been resolved.
|
|---|
| 374 |
|
|---|
| 375 | @item symbol_resolved_p
|
|---|
| 376 | @cindex symbol_resolved_p
|
|---|
| 377 | Return whether a symbol has been resolved.
|
|---|
| 378 |
|
|---|
| 379 | @item symbol_section_p
|
|---|
| 380 | @cindex symbol_section_p
|
|---|
| 381 | Return whether a symbol is a section symbol.
|
|---|
| 382 |
|
|---|
| 383 | @item symbol_equated_p
|
|---|
| 384 | @cindex symbol_equated_p
|
|---|
| 385 | Return whether a symbol is equated to another symbol.
|
|---|
| 386 |
|
|---|
| 387 | @item symbol_constant_p
|
|---|
| 388 | @cindex symbol_constant_p
|
|---|
| 389 | Return whether a symbol has a constant value, including being an offset within
|
|---|
| 390 | some frag.
|
|---|
| 391 |
|
|---|
| 392 | @item symbol_get_bfdsym
|
|---|
| 393 | @cindex symbol_get_bfdsym
|
|---|
| 394 | Return the BFD symbol associated with a symbol.
|
|---|
| 395 |
|
|---|
| 396 | @item symbol_set_bfdsym
|
|---|
| 397 | @cindex symbol_set_bfdsym
|
|---|
| 398 | Set the BFD symbol associated with a symbol.
|
|---|
| 399 |
|
|---|
| 400 | @item symbol_get_obj
|
|---|
| 401 | @cindex symbol_get_obj
|
|---|
| 402 | Return a pointer to the @code{OBJ_SYMFIELD_TYPE} field of a symbol.
|
|---|
| 403 |
|
|---|
| 404 | @item symbol_set_obj
|
|---|
| 405 | @cindex symbol_set_obj
|
|---|
| 406 | Set the @code{OBJ_SYMFIELD_TYPE} field of a symbol.
|
|---|
| 407 |
|
|---|
| 408 | @item symbol_get_tc
|
|---|
| 409 | @cindex symbol_get_tc
|
|---|
| 410 | Return a pointer to the @code{TC_SYMFIELD_TYPE} field of a symbol.
|
|---|
| 411 |
|
|---|
| 412 | @item symbol_set_tc
|
|---|
| 413 | @cindex symbol_set_tc
|
|---|
| 414 | Set the @code{TC_SYMFIELD_TYPE} field of a symbol.
|
|---|
| 415 |
|
|---|
| 416 | @end table
|
|---|
| 417 |
|
|---|
| 418 | When @code{BFD_ASSEMBLER} is defined, GAS attempts to store local
|
|---|
| 419 | symbols--symbols which will not be written to the output file--using a
|
|---|
| 420 | different structure, @code{struct local_symbol}. This structure can only
|
|---|
| 421 | represent symbols whose value is an offset within a frag.
|
|---|
| 422 |
|
|---|
| 423 | Code outside of the symbol handler will always deal with @code{symbolS}
|
|---|
| 424 | structures and use the accessor functions. The accessor functions correctly
|
|---|
| 425 | deal with local symbols. @code{struct local_symbol} is much smaller than
|
|---|
| 426 | @code{symbolS} (which also automatically creates a bfd @code{asymbol}
|
|---|
| 427 | structure), so this saves space when assembling large files.
|
|---|
| 428 |
|
|---|
| 429 | The first field of @code{symbolS} is @code{bsym}, the pointer to the BFD
|
|---|
| 430 | symbol. The first field of @code{struct local_symbol} is a pointer which is
|
|---|
| 431 | always set to NULL. This is how the symbol accessor functions can distinguish
|
|---|
| 432 | local symbols from ordinary symbols. The symbol accessor functions
|
|---|
| 433 | automatically convert a local symbol into an ordinary symbol when necessary.
|
|---|
| 434 |
|
|---|
| 435 | @node Expressions
|
|---|
| 436 | @subsection Expressions
|
|---|
| 437 | @cindex internals, expressions
|
|---|
| 438 | @cindex expressions, internal
|
|---|
| 439 | @cindex expressionS structure
|
|---|
| 440 |
|
|---|
| 441 | Expressions are stored in an @code{expressionS} structure. The structure is
|
|---|
| 442 | defined in @file{expr.h}.
|
|---|
| 443 |
|
|---|
| 444 | @cindex expression
|
|---|
| 445 | The macro @code{expression} will create an @code{expressionS} structure based
|
|---|
| 446 | on the text found at the global variable @code{input_line_pointer}.
|
|---|
| 447 |
|
|---|
| 448 | @cindex make_expr_symbol
|
|---|
| 449 | @cindex expr_symbol_where
|
|---|
| 450 | A single @code{expressionS} structure can represent a single operation.
|
|---|
| 451 | Complex expressions are formed by creating @dfn{expression symbols} and
|
|---|
| 452 | combining them in @code{expressionS} structures. An expression symbol is
|
|---|
| 453 | created by calling @code{make_expr_symbol}. An expression symbol should
|
|---|
| 454 | naturally never appear in a symbol table, and the implementation of
|
|---|
| 455 | @code{S_IS_LOCAL} (@pxref{Symbols}) reflects that. The function
|
|---|
| 456 | @code{expr_symbol_where} returns non-zero if a symbol is an expression symbol,
|
|---|
| 457 | and also returns the file and line for the expression which caused it to be
|
|---|
| 458 | created.
|
|---|
| 459 |
|
|---|
| 460 | The @code{expressionS} structure has two symbol fields, a number field, an
|
|---|
| 461 | operator field, and a field indicating whether the number is unsigned.
|
|---|
| 462 |
|
|---|
| 463 | The operator field is of type @code{operatorT}, and describes how to interpret
|
|---|
| 464 | the other fields; see the definition in @file{expr.h} for the possibilities.
|
|---|
| 465 |
|
|---|
| 466 | An @code{operatorT} value of @code{O_big} indicates either a floating point
|
|---|
| 467 | number, stored in the global variable @code{generic_floating_point_number}, or
|
|---|
| 468 | an integer too large to store in an @code{offsetT} type, stored in the global
|
|---|
| 469 | array @code{generic_bignum}. This rather inflexible approach makes it
|
|---|
| 470 | impossible to use floating point numbers or large expressions in complex
|
|---|
| 471 | expressions.
|
|---|
| 472 |
|
|---|
| 473 | @node Fixups
|
|---|
| 474 | @subsection Fixups
|
|---|
| 475 | @cindex internals, fixups
|
|---|
| 476 | @cindex fixups
|
|---|
| 477 | @cindex fixS structure
|
|---|
| 478 |
|
|---|
| 479 | A @dfn{fixup} is basically anything which can not be resolved in the first
|
|---|
| 480 | pass. Sometimes a fixup can be resolved by the end of the assembly; if not,
|
|---|
| 481 | the fixup becomes a relocation entry in the object file.
|
|---|
| 482 |
|
|---|
| 483 | @cindex fix_new
|
|---|
| 484 | @cindex fix_new_exp
|
|---|
| 485 | A fixup is created by a call to @code{fix_new} or @code{fix_new_exp}. Both
|
|---|
| 486 | take a frag (@pxref{Frags}), a position within the frag, a size, an indication
|
|---|
| 487 | of whether the fixup is PC relative, and a type. In a @code{BFD_ASSEMBLER}
|
|---|
| 488 | GAS, the type is nominally a @code{bfd_reloc_code_real_type}, but several
|
|---|
| 489 | targets use other type codes to represent fixups that can not be described as
|
|---|
| 490 | relocations.
|
|---|
| 491 |
|
|---|
| 492 | The @code{fixS} structure has a number of fields, several of which are obsolete
|
|---|
| 493 | or are only used by a particular target. The important fields are:
|
|---|
| 494 |
|
|---|
| 495 | @table @code
|
|---|
| 496 | @item fx_frag
|
|---|
| 497 | The frag (@pxref{Frags}) this fixup is in.
|
|---|
| 498 |
|
|---|
| 499 | @item fx_where
|
|---|
| 500 | The location within the frag where the fixup occurs.
|
|---|
| 501 |
|
|---|
| 502 | @item fx_addsy
|
|---|
| 503 | The symbol this fixup is against. Typically, the value of this symbol is added
|
|---|
| 504 | into the object contents. This may be NULL.
|
|---|
| 505 |
|
|---|
| 506 | @item fx_subsy
|
|---|
| 507 | The value of this symbol is subtracted from the object contents. This is
|
|---|
| 508 | normally NULL.
|
|---|
| 509 |
|
|---|
| 510 | @item fx_offset
|
|---|
| 511 | A number which is added into the fixup.
|
|---|
| 512 |
|
|---|
| 513 | @item fx_addnumber
|
|---|
| 514 | Some CPU backends use this field to convey information between
|
|---|
| 515 | @code{md_apply_fix3} and @code{tc_gen_reloc}. The machine independent code does
|
|---|
| 516 | not use it.
|
|---|
| 517 |
|
|---|
| 518 | @item fx_next
|
|---|
| 519 | The next fixup in the section.
|
|---|
| 520 |
|
|---|
| 521 | @item fx_r_type
|
|---|
| 522 | The type of the fixup. This field is only defined if @code{BFD_ASSEMBLER}, or
|
|---|
| 523 | if the target defines @code{NEED_FX_R_TYPE}.
|
|---|
| 524 |
|
|---|
| 525 | @item fx_size
|
|---|
| 526 | The size of the fixup. This is mostly used for error checking.
|
|---|
| 527 |
|
|---|
| 528 | @item fx_pcrel
|
|---|
| 529 | Whether the fixup is PC relative.
|
|---|
| 530 |
|
|---|
| 531 | @item fx_done
|
|---|
| 532 | Non-zero if the fixup has been applied, and no relocation entry needs to be
|
|---|
| 533 | generated.
|
|---|
| 534 |
|
|---|
| 535 | @item fx_file
|
|---|
| 536 | @itemx fx_line
|
|---|
| 537 | The file and line where the fixup was created.
|
|---|
| 538 |
|
|---|
| 539 | @item tc_fix_data
|
|---|
| 540 | This has the type @code{TC_FIX_TYPE}, and is only defined if the target defines
|
|---|
| 541 | that macro.
|
|---|
| 542 | @end table
|
|---|
| 543 |
|
|---|
| 544 | @node Frags
|
|---|
| 545 | @subsection Frags
|
|---|
| 546 | @cindex internals, frags
|
|---|
| 547 | @cindex frags
|
|---|
| 548 | @cindex fragS structure.
|
|---|
| 549 |
|
|---|
| 550 | The @code{fragS} structure is defined in @file{as.h}. Each frag represents a
|
|---|
| 551 | portion of the final object file. As GAS reads the source file, it creates
|
|---|
| 552 | frags to hold the data that it reads. At the end of the assembly the frags and
|
|---|
| 553 | fixups are processed to produce the final contents.
|
|---|
| 554 |
|
|---|
| 555 | @table @code
|
|---|
| 556 | @item fr_address
|
|---|
| 557 | The address of the frag. This is not set until the assembler rescans the list
|
|---|
| 558 | of all frags after the entire input file is parsed. The function
|
|---|
| 559 | @code{relax_segment} fills in this field.
|
|---|
| 560 |
|
|---|
| 561 | @item fr_next
|
|---|
| 562 | Pointer to the next frag in this (sub)section.
|
|---|
| 563 |
|
|---|
| 564 | @item fr_fix
|
|---|
| 565 | Fixed number of characters we know we're going to emit to the output file. May
|
|---|
| 566 | be zero.
|
|---|
| 567 |
|
|---|
| 568 | @item fr_var
|
|---|
| 569 | Variable number of characters we may output, after the initial @code{fr_fix}
|
|---|
| 570 | characters. May be zero.
|
|---|
| 571 |
|
|---|
| 572 | @item fr_offset
|
|---|
| 573 | The interpretation of this field is controlled by @code{fr_type}. Generally,
|
|---|
| 574 | if @code{fr_var} is non-zero, this is a repeat count: the @code{fr_var}
|
|---|
| 575 | characters are output @code{fr_offset} times.
|
|---|
| 576 |
|
|---|
| 577 | @item line
|
|---|
| 578 | Holds line number info when an assembler listing was requested.
|
|---|
| 579 |
|
|---|
| 580 | @item fr_type
|
|---|
| 581 | Relaxation state. This field indicates the interpretation of @code{fr_offset},
|
|---|
| 582 | @code{fr_symbol} and the variable-length tail of the frag, as well as the
|
|---|
| 583 | treatment it gets in various phases of processing. It does not affect the
|
|---|
| 584 | initial @code{fr_fix} characters; they are always supposed to be output
|
|---|
| 585 | verbatim (fixups aside). See below for specific values this field can have.
|
|---|
| 586 |
|
|---|
| 587 | @item fr_subtype
|
|---|
| 588 | Relaxation substate. If the macro @code{md_relax_frag} isn't defined, this is
|
|---|
| 589 | assumed to be an index into @code{TC_GENERIC_RELAX_TABLE} for the generic
|
|---|
| 590 | relaxation code to process (@pxref{Relaxation}). If @code{md_relax_frag} is
|
|---|
| 591 | defined, this field is available for any use by the CPU-specific code.
|
|---|
| 592 |
|
|---|
| 593 | @item fr_symbol
|
|---|
| 594 | This normally indicates the symbol to use when relaxing the frag according to
|
|---|
| 595 | @code{fr_type}.
|
|---|
| 596 |
|
|---|
| 597 | @item fr_opcode
|
|---|
| 598 | Points to the lowest-addressed byte of the opcode, for use in relaxation.
|
|---|
| 599 |
|
|---|
| 600 | @item tc_frag_data
|
|---|
| 601 | Target specific fragment data of type TC_FRAG_TYPE.
|
|---|
| 602 | Only present if @code{TC_FRAG_TYPE} is defined.
|
|---|
| 603 |
|
|---|
| 604 | @item fr_file
|
|---|
| 605 | @itemx fr_line
|
|---|
| 606 | The file and line where this frag was last modified.
|
|---|
| 607 |
|
|---|
| 608 | @item fr_literal
|
|---|
| 609 | Declared as a one-character array, this last field grows arbitrarily large to
|
|---|
| 610 | hold the actual contents of the frag.
|
|---|
| 611 | @end table
|
|---|
| 612 |
|
|---|
| 613 | These are the possible relaxation states, provided in the enumeration type
|
|---|
| 614 | @code{relax_stateT}, and the interpretations they represent for the other
|
|---|
| 615 | fields:
|
|---|
| 616 |
|
|---|
| 617 | @table @code
|
|---|
| 618 | @item rs_align
|
|---|
| 619 | @itemx rs_align_code
|
|---|
| 620 | The start of the following frag should be aligned on some boundary. In this
|
|---|
| 621 | frag, @code{fr_offset} is the logarithm (base 2) of the alignment in bytes.
|
|---|
| 622 | (For example, if alignment on an 8-byte boundary were desired, @code{fr_offset}
|
|---|
| 623 | would have a value of 3.) The variable characters indicate the fill pattern to
|
|---|
| 624 | be used. The @code{fr_subtype} field holds the maximum number of bytes to skip
|
|---|
| 625 | when doing this alignment. If more bytes are needed, the alignment is not
|
|---|
| 626 | done. An @code{fr_subtype} value of 0 means no maximum, which is the normal
|
|---|
| 627 | case. Target backends can use @code{rs_align_code} to handle certain types of
|
|---|
| 628 | alignment differently.
|
|---|
| 629 |
|
|---|
| 630 | @item rs_broken_word
|
|---|
| 631 | This indicates that ``broken word'' processing should be done (@pxref{Broken
|
|---|
| 632 | words}). If broken word processing is not necessary on the target machine,
|
|---|
| 633 | this enumerator value will not be defined.
|
|---|
| 634 |
|
|---|
| 635 | @item rs_cfa
|
|---|
| 636 | This state is used to implement exception frame optimizations. The
|
|---|
| 637 | @code{fr_symbol} is an expression symbol for the subtraction which may be
|
|---|
| 638 | relaxed. The @code{fr_opcode} field holds the frag for the preceding command
|
|---|
| 639 | byte. The @code{fr_offset} field holds the offset within that frag. The
|
|---|
| 640 | @code{fr_subtype} field is used during relaxation to hold the current size of
|
|---|
| 641 | the frag.
|
|---|
| 642 |
|
|---|
| 643 | @item rs_fill
|
|---|
| 644 | The variable characters are to be repeated @code{fr_offset} times. If
|
|---|
| 645 | @code{fr_offset} is 0, this frag has a length of @code{fr_fix}. Most frags
|
|---|
| 646 | have this type.
|
|---|
| 647 |
|
|---|
| 648 | @item rs_leb128
|
|---|
| 649 | This state is used to implement the DWARF ``little endian base 128''
|
|---|
| 650 | variable length number format. The @code{fr_symbol} is always an expression
|
|---|
| 651 | symbol, as constant expressions are emitted directly. The @code{fr_offset}
|
|---|
| 652 | field is used during relaxation to hold the previous size of the number so
|
|---|
| 653 | that we can determine if the fragment changed size.
|
|---|
| 654 |
|
|---|
| 655 | @item rs_machine_dependent
|
|---|
| 656 | Displacement relaxation is to be done on this frag. The target is indicated by
|
|---|
| 657 | @code{fr_symbol} and @code{fr_offset}, and @code{fr_subtype} indicates the
|
|---|
| 658 | particular machine-specific addressing mode desired. @xref{Relaxation}.
|
|---|
| 659 |
|
|---|
| 660 | @item rs_org
|
|---|
| 661 | The start of the following frag should be pushed back to some specific offset
|
|---|
| 662 | within the section. (Some assemblers use the value as an absolute address; GAS
|
|---|
| 663 | does not handle final absolute addresses, but rather requires that the linker
|
|---|
| 664 | set them.) The offset is given by @code{fr_symbol} and @code{fr_offset}; one
|
|---|
| 665 | character from the variable-length tail is used as the fill character.
|
|---|
| 666 | @end table
|
|---|
| 667 |
|
|---|
| 668 | @cindex frchainS structure
|
|---|
| 669 | A chain of frags is built up for each subsection. The data structure
|
|---|
| 670 | describing a chain is called a @code{frchainS}, and contains the following
|
|---|
| 671 | fields:
|
|---|
| 672 |
|
|---|
| 673 | @table @code
|
|---|
| 674 | @item frch_root
|
|---|
| 675 | Points to the first frag in the chain. May be NULL if there are no frags in
|
|---|
| 676 | this chain.
|
|---|
| 677 | @item frch_last
|
|---|
| 678 | Points to the last frag in the chain, or NULL if there are none.
|
|---|
| 679 | @item frch_next
|
|---|
| 680 | Next in the list of @code{frchainS} structures.
|
|---|
| 681 | @item frch_seg
|
|---|
| 682 | Indicates the section this frag chain belongs to.
|
|---|
| 683 | @item frch_subseg
|
|---|
| 684 | Subsection (subsegment) number of this frag chain.
|
|---|
| 685 | @item fix_root, fix_tail
|
|---|
| 686 | (Defined only if @code{BFD_ASSEMBLER} is defined). Point to first and last
|
|---|
| 687 | @code{fixS} structures associated with this subsection.
|
|---|
| 688 | @item frch_obstack
|
|---|
| 689 | Not currently used. Intended to be used for frag allocation for this
|
|---|
| 690 | subsection. This should reduce frag generation caused by switching sections.
|
|---|
| 691 | @item frch_frag_now
|
|---|
| 692 | The current frag for this subsegment.
|
|---|
| 693 | @end table
|
|---|
| 694 |
|
|---|
| 695 | A @code{frchainS} corresponds to a subsection; each section has a list of
|
|---|
| 696 | @code{frchainS} records associated with it. In most cases, only one subsection
|
|---|
| 697 | of each section is used, so the list will only be one element long, but any
|
|---|
| 698 | processing of frag chains should be prepared to deal with multiple chains per
|
|---|
| 699 | section.
|
|---|
| 700 |
|
|---|
| 701 | After the input files have been completely processed, and no more frags are to
|
|---|
| 702 | be generated, the frag chains are joined into one per section for further
|
|---|
| 703 | processing. After this point, it is safe to operate on one chain per section.
|
|---|
| 704 |
|
|---|
| 705 | The assembler always has a current frag, named @code{frag_now}. More space is
|
|---|
| 706 | allocated for the current frag using the @code{frag_more} function; this
|
|---|
| 707 | returns a pointer to the amount of requested space. Relaxing is done using
|
|---|
| 708 | variant frags allocated by @code{frag_var} or @code{frag_variant}
|
|---|
| 709 | (@pxref{Relaxation}).
|
|---|
| 710 |
|
|---|
| 711 | @node GAS processing
|
|---|
| 712 | @section What GAS does when it runs
|
|---|
| 713 | @cindex internals, overview
|
|---|
| 714 |
|
|---|
| 715 | This is a quick look at what an assembler run looks like.
|
|---|
| 716 |
|
|---|
| 717 | @itemize @bullet
|
|---|
| 718 | @item
|
|---|
| 719 | The assembler initializes itself by calling various init routines.
|
|---|
| 720 |
|
|---|
| 721 | @item
|
|---|
| 722 | For each source file, the @code{read_a_source_file} function reads in the file
|
|---|
| 723 | and parses it. The global variable @code{input_line_pointer} points to the
|
|---|
| 724 | current text; it is guaranteed to be correct up to the end of the line, but not
|
|---|
| 725 | farther.
|
|---|
| 726 |
|
|---|
| 727 | @item
|
|---|
| 728 | For each line, the assembler passes labels to the @code{colon} function, and
|
|---|
| 729 | isolates the first word. If it looks like a pseudo-op, the word is looked up
|
|---|
| 730 | in the pseudo-op hash table @code{po_hash} and dispatched to a pseudo-op
|
|---|
| 731 | routine. Otherwise, the target dependent @code{md_assemble} routine is called
|
|---|
| 732 | to parse the instruction.
|
|---|
| 733 |
|
|---|
| 734 | @item
|
|---|
| 735 | When pseudo-ops or instructions output data, they add it to a frag, calling
|
|---|
| 736 | @code{frag_more} to get space to store it in.
|
|---|
| 737 |
|
|---|
| 738 | @item
|
|---|
| 739 | Pseudo-ops and instructions can also output fixups created by @code{fix_new} or
|
|---|
| 740 | @code{fix_new_exp}.
|
|---|
| 741 |
|
|---|
| 742 | @item
|
|---|
| 743 | For certain targets, instructions can create variant frags which are used to
|
|---|
| 744 | store relaxation information (@pxref{Relaxation}).
|
|---|
| 745 |
|
|---|
| 746 | @item
|
|---|
| 747 | When the input file is finished, the @code{write_object_file} routine is
|
|---|
| 748 | called. It assigns addresses to all the frags (@code{relax_segment}), resolves
|
|---|
| 749 | all the fixups (@code{fixup_segment}), resolves all the symbol values (using
|
|---|
| 750 | @code{resolve_symbol_value}), and finally writes out the file (in the
|
|---|
| 751 | @code{BFD_ASSEMBLER} case, this is done by simply calling @code{bfd_close}).
|
|---|
| 752 | @end itemize
|
|---|
| 753 |
|
|---|
| 754 | @node Porting GAS
|
|---|
| 755 | @section Porting GAS
|
|---|
| 756 | @cindex porting
|
|---|
| 757 |
|
|---|
| 758 | Each GAS target specifies two main things: the CPU file and the object format
|
|---|
| 759 | file. Two main switches in the @file{configure.in} file handle this. The
|
|---|
| 760 | first switches on CPU type to set the shell variable @code{cpu_type}. The
|
|---|
| 761 | second switches on the entire target to set the shell variable @code{fmt}.
|
|---|
| 762 |
|
|---|
| 763 | The configure script uses the value of @code{cpu_type} to select two files in
|
|---|
| 764 | the @file{config} directory: @file{tc-@var{CPU}.c} and @file{tc-@var{CPU}.h}.
|
|---|
| 765 | The configuration process will create a file named @file{targ-cpu.h} in the
|
|---|
| 766 | build directory which includes @file{tc-@var{CPU}.h}.
|
|---|
| 767 |
|
|---|
| 768 | The configure script also uses the value of @code{fmt} to select two files:
|
|---|
| 769 | @file{obj-@var{fmt}.c} and @file{obj-@var{fmt}.h}. The configuration process
|
|---|
| 770 | will create a file named @file{obj-format.h} in the build directory which
|
|---|
| 771 | includes @file{obj-@var{fmt}.h}.
|
|---|
| 772 |
|
|---|
| 773 | You can also set the emulation in the configure script by setting the @code{em}
|
|---|
| 774 | variable. Normally the default value of @samp{generic} is fine. The
|
|---|
| 775 | configuration process will create a file named @file{targ-env.h} in the build
|
|---|
| 776 | directory which includes @file{te-@var{em}.h}.
|
|---|
| 777 |
|
|---|
| 778 | There is a special case for COFF. For historical reason, the GNU COFF
|
|---|
| 779 | assembler doesn't follow the documented behavior on certain debug symbols for
|
|---|
| 780 | the compatibility with other COFF assemblers. A port can define
|
|---|
| 781 | @code{STRICTCOFF} in the configure script to make the GNU COFF assembler
|
|---|
| 782 | to follow the documented behavior.
|
|---|
| 783 |
|
|---|
| 784 | Porting GAS to a new CPU requires writing the @file{tc-@var{CPU}} files.
|
|---|
| 785 | Porting GAS to a new object file format requires writing the
|
|---|
| 786 | @file{obj-@var{fmt}} files. There is sometimes some interaction between these
|
|---|
| 787 | two files, but it is normally minimal.
|
|---|
| 788 |
|
|---|
| 789 | The best approach is, of course, to copy existing files. The documentation
|
|---|
| 790 | below assumes that you are looking at existing files to see usage details.
|
|---|
| 791 |
|
|---|
| 792 | These interfaces have grown over time, and have never been carefully thought
|
|---|
| 793 | out or designed. Nothing about the interfaces described here is cast in stone.
|
|---|
| 794 | It is possible that they will change from one version of the assembler to the
|
|---|
| 795 | next. Also, new macros are added all the time as they are needed.
|
|---|
| 796 |
|
|---|
| 797 | @menu
|
|---|
| 798 | * CPU backend:: Writing a CPU backend
|
|---|
| 799 | * Object format backend:: Writing an object format backend
|
|---|
| 800 | * Emulations:: Writing emulation files
|
|---|
| 801 | @end menu
|
|---|
| 802 |
|
|---|
| 803 | @node CPU backend
|
|---|
| 804 | @subsection Writing a CPU backend
|
|---|
| 805 | @cindex CPU backend
|
|---|
| 806 | @cindex @file{tc-@var{CPU}}
|
|---|
| 807 |
|
|---|
| 808 | The CPU backend files are the heart of the assembler. They are the only parts
|
|---|
| 809 | of the assembler which actually know anything about the instruction set of the
|
|---|
| 810 | processor.
|
|---|
| 811 |
|
|---|
| 812 | You must define a reasonably small list of macros and functions in the CPU
|
|---|
| 813 | backend files. You may define a large number of additional macros in the CPU
|
|---|
| 814 | backend files, not all of which are documented here. You must, of course,
|
|---|
| 815 | define macros in the @file{.h} file, which is included by every assembler
|
|---|
| 816 | source file. You may define the functions as macros in the @file{.h} file, or
|
|---|
| 817 | as functions in the @file{.c} file.
|
|---|
| 818 |
|
|---|
| 819 | @table @code
|
|---|
| 820 | @item TC_@var{CPU}
|
|---|
| 821 | @cindex TC_@var{CPU}
|
|---|
| 822 | By convention, you should define this macro in the @file{.h} file. For
|
|---|
| 823 | example, @file{tc-m68k.h} defines @code{TC_M68K}. You might have to use this
|
|---|
| 824 | if it is necessary to add CPU specific code to the object format file.
|
|---|
| 825 |
|
|---|
| 826 | @item TARGET_FORMAT
|
|---|
| 827 | This macro is the BFD target name to use when creating the output file. This
|
|---|
| 828 | will normally depend upon the @code{OBJ_@var{FMT}} macro.
|
|---|
| 829 |
|
|---|
| 830 | @item TARGET_ARCH
|
|---|
| 831 | This macro is the BFD architecture to pass to @code{bfd_set_arch_mach}.
|
|---|
| 832 |
|
|---|
| 833 | @item TARGET_MACH
|
|---|
| 834 | This macro is the BFD machine number to pass to @code{bfd_set_arch_mach}. If
|
|---|
| 835 | it is not defined, GAS will use 0.
|
|---|
| 836 |
|
|---|
| 837 | @item TARGET_BYTES_BIG_ENDIAN
|
|---|
| 838 | You should define this macro to be non-zero if the target is big endian, and
|
|---|
| 839 | zero if the target is little endian.
|
|---|
| 840 |
|
|---|
| 841 | @item md_shortopts
|
|---|
| 842 | @itemx md_longopts
|
|---|
| 843 | @itemx md_longopts_size
|
|---|
| 844 | @itemx md_parse_option
|
|---|
| 845 | @itemx md_show_usage
|
|---|
| 846 | @itemx md_after_parse_args
|
|---|
| 847 | @cindex md_shortopts
|
|---|
| 848 | @cindex md_longopts
|
|---|
| 849 | @cindex md_longopts_size
|
|---|
| 850 | @cindex md_parse_option
|
|---|
| 851 | @cindex md_show_usage
|
|---|
| 852 | @cindex md_after_parse_args
|
|---|
| 853 | GAS uses these variables and functions during option processing.
|
|---|
| 854 | @code{md_shortopts} is a @code{const char *} which GAS adds to the machine
|
|---|
| 855 | independent string passed to @code{getopt}. @code{md_longopts} is a
|
|---|
| 856 | @code{struct option []} which GAS adds to the machine independent long options
|
|---|
| 857 | passed to @code{getopt}; you may use @code{OPTION_MD_BASE}, defined in
|
|---|
| 858 | @file{as.h}, as the start of a set of long option indices, if necessary.
|
|---|
| 859 | @code{md_longopts_size} is a @code{size_t} holding the size @code{md_longopts}.
|
|---|
| 860 | GAS will call @code{md_parse_option} whenever @code{getopt} returns an
|
|---|
| 861 | unrecognized code, presumably indicating a special code value which appears in
|
|---|
| 862 | @code{md_longopts}. GAS will call @code{md_show_usage} when a usage message is
|
|---|
| 863 | printed; it should print a description of the machine specific options.
|
|---|
| 864 | @code{md_after_pase_args}, if defined, is called after all options are
|
|---|
| 865 | processed, to let the backend override settings done by the generic option
|
|---|
| 866 | parsing.
|
|---|
| 867 |
|
|---|
| 868 | @item md_begin
|
|---|
| 869 | @cindex md_begin
|
|---|
| 870 | GAS will call this function at the start of the assembly, after the command
|
|---|
| 871 | line arguments have been parsed and all the machine independent initializations
|
|---|
| 872 | have been completed.
|
|---|
| 873 |
|
|---|
| 874 | @item md_cleanup
|
|---|
| 875 | @cindex md_cleanup
|
|---|
| 876 | If you define this macro, GAS will call it at the end of each input file.
|
|---|
| 877 |
|
|---|
| 878 | @item md_assemble
|
|---|
| 879 | @cindex md_assemble
|
|---|
| 880 | GAS will call this function for each input line which does not contain a
|
|---|
| 881 | pseudo-op. The argument is a null terminated string. The function should
|
|---|
| 882 | assemble the string as an instruction with operands. Normally
|
|---|
| 883 | @code{md_assemble} will do this by calling @code{frag_more} and writing out
|
|---|
| 884 | some bytes (@pxref{Frags}). @code{md_assemble} will call @code{fix_new} to
|
|---|
| 885 | create fixups as needed (@pxref{Fixups}). Targets which need to do special
|
|---|
| 886 | purpose relaxation will call @code{frag_var}.
|
|---|
| 887 |
|
|---|
| 888 | @item md_pseudo_table
|
|---|
| 889 | @cindex md_pseudo_table
|
|---|
| 890 | This is a const array of type @code{pseudo_typeS}. It is a mapping from
|
|---|
| 891 | pseudo-op names to functions. You should use this table to implement
|
|---|
| 892 | pseudo-ops which are specific to the CPU.
|
|---|
| 893 |
|
|---|
| 894 | @item tc_conditional_pseudoop
|
|---|
| 895 | @cindex tc_conditional_pseudoop
|
|---|
| 896 | If this macro is defined, GAS will call it with a @code{pseudo_typeS} argument.
|
|---|
| 897 | It should return non-zero if the pseudo-op is a conditional which controls
|
|---|
| 898 | whether code is assembled, such as @samp{.if}. GAS knows about the normal
|
|---|
| 899 | conditional pseudo-ops, and you should normally not have to define this macro.
|
|---|
| 900 |
|
|---|
| 901 | @item comment_chars
|
|---|
| 902 | @cindex comment_chars
|
|---|
| 903 | This is a null terminated @code{const char} array of characters which start a
|
|---|
| 904 | comment.
|
|---|
| 905 |
|
|---|
| 906 | @item tc_comment_chars
|
|---|
| 907 | @cindex tc_comment_chars
|
|---|
| 908 | If this macro is defined, GAS will use it instead of @code{comment_chars}.
|
|---|
| 909 |
|
|---|
| 910 | @item tc_symbol_chars
|
|---|
| 911 | @cindex tc_symbol_chars
|
|---|
| 912 | If this macro is defined, it is a pointer to a null terminated list of
|
|---|
| 913 | characters which may appear in an operand. GAS already assumes that all
|
|---|
| 914 | alphanumberic characters, and @samp{$}, @samp{.}, and @samp{_} may appear in an
|
|---|
| 915 | operand (see @samp{symbol_chars} in @file{app.c}). This macro may be defined
|
|---|
| 916 | to treat additional characters as appearing in an operand. This affects the
|
|---|
| 917 | way in which GAS removes whitespace before passing the string to
|
|---|
| 918 | @samp{md_assemble}.
|
|---|
| 919 |
|
|---|
| 920 | @item line_comment_chars
|
|---|
| 921 | @cindex line_comment_chars
|
|---|
| 922 | This is a null terminated @code{const char} array of characters which start a
|
|---|
| 923 | comment when they appear at the start of a line.
|
|---|
| 924 |
|
|---|
| 925 | @item line_separator_chars
|
|---|
| 926 | @cindex line_separator_chars
|
|---|
| 927 | This is a null terminated @code{const char} array of characters which separate
|
|---|
| 928 | lines (null and newline are such characters by default, and need not be
|
|---|
| 929 | listed in this array). Note that line_separator_chars do not separate lines
|
|---|
| 930 | if found in a comment, such as after a character in line_comment_chars or
|
|---|
| 931 | comment_chars.
|
|---|
| 932 |
|
|---|
| 933 | @item EXP_CHARS
|
|---|
| 934 | @cindex EXP_CHARS
|
|---|
| 935 | This is a null terminated @code{const char} array of characters which may be
|
|---|
| 936 | used as the exponent character in a floating point number. This is normally
|
|---|
| 937 | @code{"eE"}.
|
|---|
| 938 |
|
|---|
| 939 | @item FLT_CHARS
|
|---|
| 940 | @cindex FLT_CHARS
|
|---|
| 941 | This is a null terminated @code{const char} array of characters which may be
|
|---|
| 942 | used to indicate a floating point constant. A zero followed by one of these
|
|---|
| 943 | characters is assumed to be followed by a floating point number; thus they
|
|---|
| 944 | operate the way that @code{0x} is used to indicate a hexadecimal constant.
|
|---|
| 945 | Usually this includes @samp{r} and @samp{f}.
|
|---|
| 946 |
|
|---|
| 947 | @item LEX_AT
|
|---|
| 948 | @cindex LEX_AT
|
|---|
| 949 | You may define this macro to the lexical type of the @kbd{@@} character. The
|
|---|
| 950 | default is zero.
|
|---|
| 951 |
|
|---|
| 952 | Lexical types are a combination of @code{LEX_NAME} and @code{LEX_BEGIN_NAME},
|
|---|
| 953 | both defined in @file{read.h}. @code{LEX_NAME} indicates that the character
|
|---|
| 954 | may appear in a name. @code{LEX_BEGIN_NAME} indicates that the character may
|
|---|
| 955 | appear at the beginning of a name.
|
|---|
| 956 |
|
|---|
| 957 | @item LEX_BR
|
|---|
| 958 | @cindex LEX_BR
|
|---|
| 959 | You may define this macro to the lexical type of the brace characters @kbd{@{},
|
|---|
| 960 | @kbd{@}}, @kbd{[}, and @kbd{]}. The default value is zero.
|
|---|
| 961 |
|
|---|
| 962 | @item LEX_PCT
|
|---|
| 963 | @cindex LEX_PCT
|
|---|
| 964 | You may define this macro to the lexical type of the @kbd{%} character. The
|
|---|
| 965 | default value is zero.
|
|---|
| 966 |
|
|---|
| 967 | @item LEX_QM
|
|---|
| 968 | @cindex LEX_QM
|
|---|
| 969 | You may define this macro to the lexical type of the @kbd{?} character. The
|
|---|
| 970 | default value it zero.
|
|---|
| 971 |
|
|---|
| 972 | @item LEX_DOLLAR
|
|---|
| 973 | @cindex LEX_DOLLAR
|
|---|
| 974 | You may define this macro to the lexical type of the @kbd{$} character. The
|
|---|
| 975 | default value is @code{LEX_NAME | LEX_BEGIN_NAME}.
|
|---|
| 976 |
|
|---|
| 977 | @item NUMBERS_WITH_SUFFIX
|
|---|
| 978 | @cindex NUMBERS_WITH_SUFFIX
|
|---|
| 979 | When this macro is defined to be non-zero, the parser allows the radix of a
|
|---|
| 980 | constant to be indicated with a suffix. Valid suffixes are binary (B),
|
|---|
| 981 | octal (Q), and hexadecimal (H). Case is not significant.
|
|---|
| 982 |
|
|---|
| 983 | @item SINGLE_QUOTE_STRINGS
|
|---|
| 984 | @cindex SINGLE_QUOTE_STRINGS
|
|---|
| 985 | If you define this macro, GAS will treat single quotes as string delimiters.
|
|---|
| 986 | Normally only double quotes are accepted as string delimiters.
|
|---|
| 987 |
|
|---|
| 988 | @item NO_STRING_ESCAPES
|
|---|
| 989 | @cindex NO_STRING_ESCAPES
|
|---|
| 990 | If you define this macro, GAS will not permit escape sequences in a string.
|
|---|
| 991 |
|
|---|
| 992 | @item ONLY_STANDARD_ESCAPES
|
|---|
| 993 | @cindex ONLY_STANDARD_ESCAPES
|
|---|
| 994 | If you define this macro, GAS will warn about the use of nonstandard escape
|
|---|
| 995 | sequences in a string.
|
|---|
| 996 |
|
|---|
| 997 | @item md_start_line_hook
|
|---|
| 998 | @cindex md_start_line_hook
|
|---|
| 999 | If you define this macro, GAS will call it at the start of each line.
|
|---|
| 1000 |
|
|---|
| 1001 | @item LABELS_WITHOUT_COLONS
|
|---|
| 1002 | @cindex LABELS_WITHOUT_COLONS
|
|---|
| 1003 | If you define this macro, GAS will assume that any text at the start of a line
|
|---|
| 1004 | is a label, even if it does not have a colon.
|
|---|
| 1005 |
|
|---|
| 1006 | @item TC_START_LABEL
|
|---|
| 1007 | @itemx TC_START_LABEL_WITHOUT_COLON
|
|---|
| 1008 | @cindex TC_START_LABEL
|
|---|
| 1009 | You may define this macro to control what GAS considers to be a label. The
|
|---|
| 1010 | default definition is to accept any name followed by a colon character.
|
|---|
| 1011 |
|
|---|
| 1012 | @item TC_START_LABEL_WITHOUT_COLON
|
|---|
| 1013 | @cindex TC_START_LABEL_WITHOUT_COLON
|
|---|
| 1014 | Same as TC_START_LABEL, but should be used instead of TC_START_LABEL when
|
|---|
| 1015 | LABELS_WITHOUT_COLONS is defined.
|
|---|
| 1016 |
|
|---|
| 1017 | @item NO_PSEUDO_DOT
|
|---|
| 1018 | @cindex NO_PSEUDO_DOT
|
|---|
| 1019 | If you define this macro, GAS will not require pseudo-ops to start with a
|
|---|
| 1020 | @kbd{.} character.
|
|---|
| 1021 |
|
|---|
| 1022 | @item TC_EQUAL_IN_INSN
|
|---|
| 1023 | @cindex TC_EQUAL_IN_INSN
|
|---|
| 1024 | If you define this macro, it should return nonzero if the instruction is
|
|---|
| 1025 | permitted to contain an @kbd{=} character. GAS will call it with two
|
|---|
| 1026 | arguments, the character before the @kbd{=} character, and the value of
|
|---|
| 1027 | @code{input_line_pointer} at that point. GAS uses this macro to decide if a
|
|---|
| 1028 | @kbd{=} is an assignment or an instruction.
|
|---|
| 1029 |
|
|---|
| 1030 | @item TC_EOL_IN_INSN
|
|---|
| 1031 | @cindex TC_EOL_IN_INSN
|
|---|
| 1032 | If you define this macro, it should return nonzero if the current input line
|
|---|
| 1033 | pointer should be treated as the end of a line.
|
|---|
| 1034 |
|
|---|
| 1035 | @item md_parse_name
|
|---|
| 1036 | @cindex md_parse_name
|
|---|
| 1037 | If this macro is defined, GAS will call it for any symbol found in an
|
|---|
| 1038 | expression. You can define this to handle special symbols in a special way.
|
|---|
| 1039 | If a symbol always has a certain value, you should normally enter it in the
|
|---|
| 1040 | symbol table, perhaps using @code{reg_section}.
|
|---|
| 1041 |
|
|---|
| 1042 | @item md_undefined_symbol
|
|---|
| 1043 | @cindex md_undefined_symbol
|
|---|
| 1044 | GAS will call this function when a symbol table lookup fails, before it
|
|---|
| 1045 | creates a new symbol. Typically this would be used to supply symbols whose
|
|---|
| 1046 | name or value changes dynamically, possibly in a context sensitive way.
|
|---|
| 1047 | Predefined symbols with fixed values, such as register names or condition
|
|---|
| 1048 | codes, are typically entered directly into the symbol table when @code{md_begin}
|
|---|
| 1049 | is called. One argument is passed, a @code{char *} for the symbol.
|
|---|
| 1050 |
|
|---|
| 1051 | @item md_operand
|
|---|
| 1052 | @cindex md_operand
|
|---|
| 1053 | GAS will call this function with one argument, an @code{expressionS}
|
|---|
| 1054 | pointer, for any expression that can not be recognized. When the function
|
|---|
| 1055 | is called, @code{input_line_pointer} will point to the start of the
|
|---|
| 1056 | expression.
|
|---|
| 1057 |
|
|---|
| 1058 | @item tc_unrecognized_line
|
|---|
| 1059 | @cindex tc_unrecognized_line
|
|---|
| 1060 | If you define this macro, GAS will call it when it finds a line that it can not
|
|---|
| 1061 | parse.
|
|---|
| 1062 |
|
|---|
| 1063 | @item md_do_align
|
|---|
| 1064 | @cindex md_do_align
|
|---|
| 1065 | You may define this macro to handle an alignment directive. GAS will call it
|
|---|
| 1066 | when the directive is seen in the input file. For example, the i386 backend
|
|---|
| 1067 | uses this to generate efficient nop instructions of varying lengths, depending
|
|---|
| 1068 | upon the number of bytes that the alignment will skip.
|
|---|
| 1069 |
|
|---|
| 1070 | @item HANDLE_ALIGN
|
|---|
| 1071 | @cindex HANDLE_ALIGN
|
|---|
| 1072 | You may define this macro to do special handling for an alignment directive.
|
|---|
| 1073 | GAS will call it at the end of the assembly.
|
|---|
| 1074 |
|
|---|
| 1075 | @item TC_IMPLICIT_LCOMM_ALIGNMENT (@var{size}, @var{p2var})
|
|---|
| 1076 | @cindex TC_IMPLICIT_LCOMM_ALIGNMENT
|
|---|
| 1077 | An @code{.lcomm} directive with no explicit alignment parameter will use this
|
|---|
| 1078 | macro to set @var{p2var} to the alignment that a request for @var{size} bytes
|
|---|
| 1079 | will have. The alignment is expressed as a power of two. If no alignment
|
|---|
| 1080 | should take place, the macro definition should do nothing. Some targets define
|
|---|
| 1081 | a @code{.bss} directive that is also affected by this macro. The default
|
|---|
| 1082 | definition will set @var{p2var} to the truncated power of two of sizes up to
|
|---|
| 1083 | eight bytes.
|
|---|
| 1084 |
|
|---|
| 1085 | @item md_flush_pending_output
|
|---|
| 1086 | @cindex md_flush_pending_output
|
|---|
| 1087 | If you define this macro, GAS will call it each time it skips any space because of a
|
|---|
| 1088 | space filling or alignment or data allocation pseudo-op.
|
|---|
| 1089 |
|
|---|
| 1090 | @item TC_PARSE_CONS_EXPRESSION
|
|---|
| 1091 | @cindex TC_PARSE_CONS_EXPRESSION
|
|---|
| 1092 | You may define this macro to parse an expression used in a data allocation
|
|---|
| 1093 | pseudo-op such as @code{.word}. You can use this to recognize relocation
|
|---|
| 1094 | directives that may appear in such directives.
|
|---|
| 1095 |
|
|---|
| 1096 | @item BITFIELD_CONS_EXPRESSION
|
|---|
| 1097 | @cindex BITFIELD_CONS_EXPRESSION
|
|---|
| 1098 | If you define this macro, GAS will recognize bitfield instructions in data
|
|---|
| 1099 | allocation pseudo-ops, as used on the i960.
|
|---|
| 1100 |
|
|---|
| 1101 | @item REPEAT_CONS_EXPRESSION
|
|---|
| 1102 | @cindex REPEAT_CONS_EXPRESSION
|
|---|
| 1103 | If you define this macro, GAS will recognize repeat counts in data allocation
|
|---|
| 1104 | pseudo-ops, as used on the MIPS.
|
|---|
| 1105 |
|
|---|
| 1106 | @item md_cons_align
|
|---|
| 1107 | @cindex md_cons_align
|
|---|
| 1108 | You may define this macro to do any special alignment before a data allocation
|
|---|
| 1109 | pseudo-op.
|
|---|
| 1110 |
|
|---|
| 1111 | @item TC_CONS_FIX_NEW
|
|---|
| 1112 | @cindex TC_CONS_FIX_NEW
|
|---|
| 1113 | You may define this macro to generate a fixup for a data allocation pseudo-op.
|
|---|
| 1114 |
|
|---|
| 1115 | @item TC_INIT_FIX_DATA (@var{fixp})
|
|---|
| 1116 | @cindex TC_INIT_FIX_DATA
|
|---|
| 1117 | A C statement to initialize the target specific fields of fixup @var{fixp}.
|
|---|
| 1118 | These fields are defined with the @code{TC_FIX_TYPE} macro.
|
|---|
| 1119 |
|
|---|
| 1120 | @item TC_FIX_DATA_PRINT (@var{stream}, @var{fixp})
|
|---|
| 1121 | @cindex TC_FIX_DATA_PRINT
|
|---|
| 1122 | A C statement to output target specific debugging information for
|
|---|
| 1123 | fixup @var{fixp} to @var{stream}. This macro is called by @code{print_fixup}.
|
|---|
| 1124 |
|
|---|
| 1125 | @item TC_FRAG_INIT (@var{fragp})
|
|---|
| 1126 | @cindex TC_FRAG_INIT
|
|---|
| 1127 | A C statement to initialize the target specific fields of frag @var{fragp}.
|
|---|
| 1128 | These fields are defined with the @code{TC_FRAG_TYPE} macro.
|
|---|
| 1129 |
|
|---|
| 1130 | @item md_number_to_chars
|
|---|
| 1131 | @cindex md_number_to_chars
|
|---|
| 1132 | This should just call either @code{number_to_chars_bigendian} or
|
|---|
| 1133 | @code{number_to_chars_littleendian}, whichever is appropriate. On targets like
|
|---|
| 1134 | the MIPS which support options to change the endianness, which function to call
|
|---|
| 1135 | is a runtime decision. On other targets, @code{md_number_to_chars} can be a
|
|---|
| 1136 | simple macro.
|
|---|
| 1137 |
|
|---|
| 1138 | @item md_atof (@var{type},@var{litP},@var{sizeP})
|
|---|
| 1139 | @cindex md_atof
|
|---|
| 1140 | This function is called to convert an ASCII string into a floating point value
|
|---|
| 1141 | in format used by the CPU. It takes three arguments. The first is @var{type}
|
|---|
| 1142 | which is a byte describing the type of floating point number to be created.
|
|---|
| 1143 | Possible values are @var{'f'} or @var{'s'} for single precision, @var{'d'} or
|
|---|
| 1144 | @var{'r'} for double precision and @var{'x'} or @var{'p'} for extended
|
|---|
| 1145 | precision. Either lower or upper case versions of these letters can be used.
|
|---|
| 1146 |
|
|---|
| 1147 | The second parameter is @var{litP} which is a pointer to a byte array where the
|
|---|
| 1148 | converted value should be stored. The third argument is @var{sizeP}, which is
|
|---|
| 1149 | a pointer to a integer that should be filled in with the number of
|
|---|
| 1150 | @var{LITTLENUM}s emitted into the byte array. (@var{LITTLENUM} is defined in
|
|---|
| 1151 | gas/bignum.h). The function should return NULL upon success or an error string
|
|---|
| 1152 | upon failure.
|
|---|
| 1153 |
|
|---|
| 1154 | @item TC_LARGEST_EXPONENT_IS_NORMAL
|
|---|
| 1155 | @cindex TC_LARGEST_EXPONENT_IS_NORMAL (@var{precision})
|
|---|
| 1156 | This macro is used only by @file{atof-ieee.c}. It should evaluate to true
|
|---|
| 1157 | if floats of the given precision use the largest exponent for normal numbers
|
|---|
| 1158 | instead of NaNs and infinities. @var{precision} is @samp{F_PRECISION} for
|
|---|
| 1159 | single precision, @samp{D_PRECISION} for double precision, or
|
|---|
| 1160 | @samp{X_PRECISION} for extended double precision.
|
|---|
| 1161 |
|
|---|
| 1162 | The macro has a default definition which returns 0 for all cases.
|
|---|
| 1163 |
|
|---|
| 1164 | @item md_reloc_size
|
|---|
| 1165 | @cindex md_reloc_size
|
|---|
| 1166 | This variable is only used in the original version of gas (not
|
|---|
| 1167 | @code{BFD_ASSEMBLER} and not @code{MANY_SEGMENTS}). It holds the size of a
|
|---|
| 1168 | relocation entry.
|
|---|
| 1169 |
|
|---|
| 1170 | @item WORKING_DOT_WORD
|
|---|
| 1171 | @itemx md_short_jump_size
|
|---|
| 1172 | @itemx md_long_jump_size
|
|---|
| 1173 | @itemx md_create_short_jump
|
|---|
| 1174 | @itemx md_create_long_jump
|
|---|
| 1175 | @itemx TC_CHECK_ADJUSTED_BROKEN_DOT_WORD
|
|---|
| 1176 | @cindex WORKING_DOT_WORD
|
|---|
| 1177 | @cindex md_short_jump_size
|
|---|
| 1178 | @cindex md_long_jump_size
|
|---|
| 1179 | @cindex md_create_short_jump
|
|---|
| 1180 | @cindex md_create_long_jump
|
|---|
| 1181 | @cindex TC_CHECK_ADJUSTED_BROKEN_DOT_WORD
|
|---|
| 1182 | If @code{WORKING_DOT_WORD} is defined, GAS will not do broken word processing
|
|---|
| 1183 | (@pxref{Broken words}). Otherwise, you should set @code{md_short_jump_size} to
|
|---|
| 1184 | the size of a short jump (a jump that is just long enough to jump around a
|
|---|
| 1185 | number of long jumps) and @code{md_long_jump_size} to the size of a long jump
|
|---|
| 1186 | (a jump that can go anywhere in the function). You should define
|
|---|
| 1187 | @code{md_create_short_jump} to create a short jump around a number of long
|
|---|
| 1188 | jumps, and define @code{md_create_long_jump} to create a long jump.
|
|---|
| 1189 | If defined, the macro TC_CHECK_ADJUSTED_BROKEN_DOT_WORD will be called for each
|
|---|
| 1190 | adjusted word just before the word is output. The macro takes two arguments,
|
|---|
| 1191 | an @code{addressT} with the adjusted word and a pointer to the current
|
|---|
| 1192 | @code{struct broken_word}.
|
|---|
| 1193 |
|
|---|
| 1194 | @item md_estimate_size_before_relax
|
|---|
| 1195 | @cindex md_estimate_size_before_relax
|
|---|
| 1196 | This function returns an estimate of the size of a @code{rs_machine_dependent}
|
|---|
| 1197 | frag before any relaxing is done. It may also create any necessary
|
|---|
| 1198 | relocations.
|
|---|
| 1199 |
|
|---|
| 1200 | @item md_relax_frag
|
|---|
| 1201 | @cindex md_relax_frag
|
|---|
| 1202 | This macro may be defined to relax a frag. GAS will call this with the
|
|---|
| 1203 | segment, the frag, and the change in size of all previous frags;
|
|---|
| 1204 | @code{md_relax_frag} should return the change in size of the frag.
|
|---|
| 1205 | @xref{Relaxation}.
|
|---|
| 1206 |
|
|---|
| 1207 | @item TC_GENERIC_RELAX_TABLE
|
|---|
| 1208 | @cindex TC_GENERIC_RELAX_TABLE
|
|---|
| 1209 | If you do not define @code{md_relax_frag}, you may define
|
|---|
| 1210 | @code{TC_GENERIC_RELAX_TABLE} as a table of @code{relax_typeS} structures. The
|
|---|
| 1211 | machine independent code knows how to use such a table to relax PC relative
|
|---|
| 1212 | references. See @file{tc-m68k.c} for an example. @xref{Relaxation}.
|
|---|
| 1213 |
|
|---|
| 1214 | @item md_prepare_relax_scan
|
|---|
| 1215 | @cindex md_prepare_relax_scan
|
|---|
| 1216 | If defined, it is a C statement that is invoked prior to scanning
|
|---|
| 1217 | the relax table.
|
|---|
| 1218 |
|
|---|
| 1219 | @item LINKER_RELAXING_SHRINKS_ONLY
|
|---|
| 1220 | @cindex LINKER_RELAXING_SHRINKS_ONLY
|
|---|
| 1221 | If you define this macro, and the global variable @samp{linkrelax} is set
|
|---|
| 1222 | (because of a command line option, or unconditionally in @code{md_begin}), a
|
|---|
| 1223 | @samp{.align} directive will cause extra space to be allocated. The linker can
|
|---|
| 1224 | then discard this space when relaxing the section.
|
|---|
| 1225 |
|
|---|
| 1226 | @item TC_LINKRELAX_FIXUP (@var{segT})
|
|---|
| 1227 | @cindex TC_LINKRELAX_FIXUP
|
|---|
| 1228 | If defined, this macro allows control over whether fixups for a
|
|---|
| 1229 | given section will be processed when the @var{linkrelax} variable is
|
|---|
| 1230 | set. The macro is given the N_TYPE bits for the section in its
|
|---|
| 1231 | @var{segT} argument. If the macro evaluates to a non-zero value
|
|---|
| 1232 | then the fixups will be converted into relocs, otherwise they will
|
|---|
| 1233 | be passed to @var{md_apply_fix3} as normal.
|
|---|
| 1234 |
|
|---|
| 1235 | @item md_convert_frag
|
|---|
| 1236 | @cindex md_convert_frag
|
|---|
| 1237 | GAS will call this for each rs_machine_dependent fragment.
|
|---|
| 1238 | The instruction is completed using the data from the relaxation pass.
|
|---|
| 1239 | It may also create any necessary relocations.
|
|---|
| 1240 | @xref{Relaxation}.
|
|---|
| 1241 |
|
|---|
| 1242 | @item TC_FINALIZE_SYMS_BEFORE_SIZE_SEG
|
|---|
| 1243 | @cindex TC_FINALIZE_SYMS_BEFORE_SIZE_SEG
|
|---|
| 1244 | Specifies the value to be assigned to @code{finalize_syms} before the function
|
|---|
| 1245 | @code{size_segs} is called. Since @code{size_segs} calls @code{cvt_frag_to_fill}
|
|---|
| 1246 | which can call @code{md_convert_frag}, this constant governs whether the symbols
|
|---|
| 1247 | accessed in @code{md_convert_frag} will be fully resolved. In particular it
|
|---|
| 1248 | governs whether local symbols will have been resolved, and had their frag
|
|---|
| 1249 | information removed. Depending upon the processing performed by
|
|---|
| 1250 | @code{md_convert_frag} the frag information may or may not be necessary, as may
|
|---|
| 1251 | the resolved values of the symbols. The default value is 1.
|
|---|
| 1252 |
|
|---|
| 1253 | @item TC_VALIDATE_FIX (@var{fixP}, @var{seg}, @var{skip})
|
|---|
| 1254 | @cindex TC_VALIDATE_FIX
|
|---|
| 1255 | This macro is evaluated for each fixup (when @var{linkrelax} is not set).
|
|---|
| 1256 | It may be used to change the fixup in @code{struct fix *@var{fixP}} before
|
|---|
| 1257 | the generic code sees it, or to fully process the fixup. In the latter case,
|
|---|
| 1258 | a @code{goto @var{skip}} will bypass the generic code.
|
|---|
| 1259 |
|
|---|
| 1260 | @item md_apply_fix3 (@var{fixP}, @var{valP}, @var{seg})
|
|---|
| 1261 | @cindex md_apply_fix3
|
|---|
| 1262 | GAS will call this for each fixup that passes the @code{TC_VALIDATE_FIX} test
|
|---|
| 1263 | when @var{linkrelax} is not set. It should store the correct value in the
|
|---|
| 1264 | object file. @code{struct fix *@var{fixP}} is the fixup @code{md_apply_fix3}
|
|---|
| 1265 | is operating on. @code{valueT *@var{valP}} is the value to store into the
|
|---|
| 1266 | object files, or at least is the generic code's best guess. Specifically,
|
|---|
| 1267 | *@var{valP} is the value of the fixup symbol, perhaps modified by
|
|---|
| 1268 | @code{MD_APPLY_SYM_VALUE}, plus @code{@var{fixP}->fx_offset} (symbol addend),
|
|---|
| 1269 | less @code{MD_PCREL_FROM_SECTION} for pc-relative fixups.
|
|---|
| 1270 | @code{segT @var{seg}} is the section the fix is in.
|
|---|
| 1271 | @code{fixup_segment} performs a generic overflow check on *@var{valP} after
|
|---|
| 1272 | @code{md_apply_fix3} returns. If the overflow check is relevant for the target
|
|---|
| 1273 | machine, then @code{md_apply_fix3} should modify *@var{valP}, typically to the
|
|---|
| 1274 | value stored in the object file.
|
|---|
| 1275 |
|
|---|
| 1276 | @item TC_FORCE_RELOCATION (@var{fix})
|
|---|
| 1277 | @cindex TC_FORCE_RELOCATION
|
|---|
| 1278 | If this macro returns non-zero, it guarantees that a relocation will be emitted
|
|---|
| 1279 | even when the value can be resolved locally, as @code{fixup_segment} tries to
|
|---|
| 1280 | reduce the number of relocations emitted. For example, a fixup expression
|
|---|
| 1281 | against an absolute symbol will normally not require a reloc. If undefined,
|
|---|
| 1282 | a default of @w{@code{(S_FORCE_RELOC ((@var{fix})->fx_addsy))}} is used.
|
|---|
| 1283 |
|
|---|
| 1284 | @item TC_FORCE_RELOCATION_ABS (@var{fix})
|
|---|
| 1285 | @cindex TC_FORCE_RELOCATION_ABS
|
|---|
| 1286 | Like @code{TC_FORCE_RELOCATION}, but used only for fixup expressions against an
|
|---|
| 1287 | absolute symbol. If undefined, @code{TC_FORCE_RELOCATION} will be used.
|
|---|
| 1288 |
|
|---|
| 1289 | @item TC_FORCE_RELOCATION_LOCAL (@var{fix})
|
|---|
| 1290 | @cindex TC_FORCE_RELOCATION_LOCAL
|
|---|
| 1291 | Like @code{TC_FORCE_RELOCATION}, but used only for fixup expressions against a
|
|---|
| 1292 | symbol in the current section. If undefined, fixups that are not
|
|---|
| 1293 | @code{fx_pcrel} or @code{fx_plt} or for which @code{TC_FORCE_RELOCATION}
|
|---|
| 1294 | returns non-zero, will emit relocs.
|
|---|
| 1295 |
|
|---|
| 1296 | @item TC_FORCE_RELOCATION_SUB_SAME (@var{fix}, @var{seg})
|
|---|
| 1297 | @cindex TC_FORCE_RELOCATION_SUB_SAME
|
|---|
| 1298 | This macro controls resolution of fixup expressions involving the
|
|---|
| 1299 | difference of two symbols in the same section. If this macro returns zero,
|
|---|
| 1300 | the subtrahend will be resolved and @code{fx_subsy} set to @code{NULL} for
|
|---|
| 1301 | @code{md_apply_fix3}. If undefined, the default of
|
|---|
| 1302 | @w{@code{! SEG_NORMAL (@var{seg}) || TC_FORCE_RELOCATION (@var{fix})}} will
|
|---|
| 1303 | be used.
|
|---|
| 1304 |
|
|---|
| 1305 | @item TC_FORCE_RELOCATION_SUB_ABS (@var{fix})
|
|---|
| 1306 | @cindex TC_FORCE_RELOCATION_SUB_ABS
|
|---|
| 1307 | Like @code{TC_FORCE_RELOCATION_SUB_SAME}, but used when the subtrahend is an
|
|---|
| 1308 | absolute symbol. If the macro is undefined a default of @code{0} is used.
|
|---|
| 1309 |
|
|---|
| 1310 | @item TC_FORCE_RELOCATION_SUB_LOCAL (@var{fix})
|
|---|
| 1311 | @cindex TC_FORCE_RELOCATION_SUB_LOCAL
|
|---|
| 1312 | Like @code{TC_FORCE_RELOCATION_SUB_ABS}, but the subtrahend is a symbol in the
|
|---|
| 1313 | same section as the fixup.
|
|---|
| 1314 |
|
|---|
| 1315 | @item TC_VALIDATE_FIX_SUB (@var{fix})
|
|---|
| 1316 | @cindex TC_VALIDATE_FIX_SUB
|
|---|
| 1317 | This macro is evaluated for any fixup with a @code{fx_subsy} that
|
|---|
| 1318 | @code{fixup_segment} cannot reduce to a number. If the macro returns
|
|---|
| 1319 | @code{false} an error will be reported.
|
|---|
| 1320 |
|
|---|
| 1321 | @item MD_APPLY_SYM_VALUE (@var{fix})
|
|---|
| 1322 | @cindex MD_APPLY_SYM_VALUE
|
|---|
| 1323 | This macro controls whether the symbol value becomes part of the value passed
|
|---|
| 1324 | to @code{md_apply_fix3}. If the macro is undefined, or returns non-zero, the
|
|---|
| 1325 | symbol value will be included. For ELF, a suitable definition might simply be
|
|---|
| 1326 | @code{0}, because ELF relocations don't include the symbol value in the addend.
|
|---|
| 1327 |
|
|---|
| 1328 | @item S_FORCE_RELOC (@var{sym}, @var{strict})
|
|---|
| 1329 | @cindex S_FORCE_RELOC
|
|---|
| 1330 | This macro (or function, for @code{BFD_ASSEMBLER} gas) returns true for symbols
|
|---|
| 1331 | that should not be reduced to section symbols or eliminated from expressions,
|
|---|
| 1332 | because they may be overridden by the linker. ie. for symbols that are
|
|---|
| 1333 | undefined or common, and when @var{strict} is set, weak, or global (for ELF
|
|---|
| 1334 | assemblers that support ELF shared library linking semantics).
|
|---|
| 1335 |
|
|---|
| 1336 | @item EXTERN_FORCE_RELOC
|
|---|
| 1337 | @cindex EXTERN_FORCE_RELOC
|
|---|
| 1338 | This macro controls whether @code{S_FORCE_RELOC} returns true for global
|
|---|
| 1339 | symbols. If undefined, the default is @code{true} for ELF assemblers, and
|
|---|
| 1340 | @code{false} for non-ELF.
|
|---|
| 1341 |
|
|---|
| 1342 | @item tc_gen_reloc
|
|---|
| 1343 | @cindex tc_gen_reloc
|
|---|
| 1344 | A @code{BFD_ASSEMBLER} GAS will call this to generate a reloc. GAS will pass
|
|---|
| 1345 | the resulting reloc to @code{bfd_install_relocation}. This currently works
|
|---|
| 1346 | poorly, as @code{bfd_install_relocation} often does the wrong thing, and
|
|---|
| 1347 | instances of @code{tc_gen_reloc} have been written to work around the problems,
|
|---|
| 1348 | which in turns makes it difficult to fix @code{bfd_install_relocation}.
|
|---|
| 1349 |
|
|---|
| 1350 | @item RELOC_EXPANSION_POSSIBLE
|
|---|
| 1351 | @cindex RELOC_EXPANSION_POSSIBLE
|
|---|
| 1352 | If you define this macro, it means that @code{tc_gen_reloc} may return multiple
|
|---|
| 1353 | relocation entries for a single fixup. In this case, the return value of
|
|---|
| 1354 | @code{tc_gen_reloc} is a pointer to a null terminated array.
|
|---|
| 1355 |
|
|---|
| 1356 | @item MAX_RELOC_EXPANSION
|
|---|
| 1357 | @cindex MAX_RELOC_EXPANSION
|
|---|
| 1358 | You must define this if @code{RELOC_EXPANSION_POSSIBLE} is defined; it
|
|---|
| 1359 | indicates the largest number of relocs which @code{tc_gen_reloc} may return for
|
|---|
| 1360 | a single fixup.
|
|---|
| 1361 |
|
|---|
| 1362 | @item tc_fix_adjustable
|
|---|
| 1363 | @cindex tc_fix_adjustable
|
|---|
| 1364 | You may define this macro to indicate whether a fixup against a locally defined
|
|---|
| 1365 | symbol should be adjusted to be against the section symbol. It should return a
|
|---|
| 1366 | non-zero value if the adjustment is acceptable.
|
|---|
| 1367 |
|
|---|
| 1368 | @item MD_PCREL_FROM_SECTION (@var{fixp}, @var{section})
|
|---|
| 1369 | @cindex MD_PCREL_FROM_SECTION
|
|---|
| 1370 | If you define this macro, it should return the position from which the PC
|
|---|
| 1371 | relative adjustment for a PC relative fixup should be made. On many
|
|---|
| 1372 | processors, the base of a PC relative instruction is the next instruction,
|
|---|
| 1373 | so this macro would return the length of an instruction, plus the address of
|
|---|
| 1374 | the PC relative fixup. The latter can be calculated as
|
|---|
| 1375 | @var{fixp}->fx_where + @var{fixp}->fx_frag->fr_address .
|
|---|
| 1376 |
|
|---|
| 1377 | @item md_pcrel_from
|
|---|
| 1378 | @cindex md_pcrel_from
|
|---|
| 1379 | This is the default value of @code{MD_PCREL_FROM_SECTION}. The difference is
|
|---|
| 1380 | that @code{md_pcrel_from} does not take a section argument.
|
|---|
| 1381 |
|
|---|
| 1382 | @item tc_frob_label
|
|---|
| 1383 | @cindex tc_frob_label
|
|---|
| 1384 | If you define this macro, GAS will call it each time a label is defined.
|
|---|
| 1385 |
|
|---|
| 1386 | @item md_section_align
|
|---|
| 1387 | @cindex md_section_align
|
|---|
| 1388 | GAS will call this function for each section at the end of the assembly, to
|
|---|
| 1389 | permit the CPU backend to adjust the alignment of a section. The function
|
|---|
| 1390 | must take two arguments, a @code{segT} for the section and a @code{valueT}
|
|---|
| 1391 | for the size of the section, and return a @code{valueT} for the rounded
|
|---|
| 1392 | size.
|
|---|
| 1393 |
|
|---|
| 1394 | @item md_macro_start
|
|---|
| 1395 | @cindex md_macro_start
|
|---|
| 1396 | If defined, GAS will call this macro when it starts to include a macro
|
|---|
| 1397 | expansion. @code{macro_nest} indicates the current macro nesting level, which
|
|---|
| 1398 | includes the one being expanded.
|
|---|
| 1399 |
|
|---|
| 1400 | @item md_macro_info
|
|---|
| 1401 | @cindex md_macro_info
|
|---|
| 1402 | If defined, GAS will call this macro after the macro expansion has been
|
|---|
| 1403 | included in the input and after parsing the macro arguments. The single
|
|---|
| 1404 | argument is a pointer to the macro processing's internal representation of the
|
|---|
| 1405 | macro (macro_entry *), which includes expansion of the formal arguments.
|
|---|
| 1406 |
|
|---|
| 1407 | @item md_macro_end
|
|---|
| 1408 | @cindex md_macro_end
|
|---|
| 1409 | Complement to md_macro_start. If defined, it is called when finished
|
|---|
| 1410 | processing an inserted macro expansion, just before decrementing macro_nest.
|
|---|
| 1411 |
|
|---|
| 1412 | @item DOUBLEBAR_PARALLEL
|
|---|
| 1413 | @cindex DOUBLEBAR_PARALLEL
|
|---|
| 1414 | Affects the preprocessor so that lines containing '||' don't have their
|
|---|
| 1415 | whitespace stripped following the double bar. This is useful for targets that
|
|---|
| 1416 | implement parallel instructions.
|
|---|
| 1417 |
|
|---|
| 1418 | @item KEEP_WHITE_AROUND_COLON
|
|---|
| 1419 | @cindex KEEP_WHITE_AROUND_COLON
|
|---|
| 1420 | Normally, whitespace is compressed and removed when, in the presence of the
|
|---|
| 1421 | colon, the adjoining tokens can be distinguished. This option affects the
|
|---|
| 1422 | preprocessor so that whitespace around colons is preserved. This is useful
|
|---|
| 1423 | when colons might be removed from the input after preprocessing but before
|
|---|
| 1424 | assembling, so that adjoining tokens can still be distinguished if there is
|
|---|
| 1425 | whitespace, or concatenated if there is not.
|
|---|
| 1426 |
|
|---|
| 1427 | @item tc_frob_section
|
|---|
| 1428 | @cindex tc_frob_section
|
|---|
| 1429 | If you define this macro, a @code{BFD_ASSEMBLER} GAS will call it for each
|
|---|
| 1430 | section at the end of the assembly.
|
|---|
| 1431 |
|
|---|
| 1432 | @item tc_frob_file_before_adjust
|
|---|
| 1433 | @cindex tc_frob_file_before_adjust
|
|---|
| 1434 | If you define this macro, GAS will call it after the symbol values are
|
|---|
| 1435 | resolved, but before the fixups have been changed from local symbols to section
|
|---|
| 1436 | symbols.
|
|---|
| 1437 |
|
|---|
| 1438 | @item tc_frob_symbol
|
|---|
| 1439 | @cindex tc_frob_symbol
|
|---|
| 1440 | If you define this macro, GAS will call it for each symbol. You can indicate
|
|---|
| 1441 | that the symbol should not be included in the object file by defining this
|
|---|
| 1442 | macro to set its second argument to a non-zero value.
|
|---|
| 1443 |
|
|---|
| 1444 | @item tc_frob_file
|
|---|
| 1445 | @cindex tc_frob_file
|
|---|
| 1446 | If you define this macro, GAS will call it after the symbol table has been
|
|---|
| 1447 | completed, but before the relocations have been generated.
|
|---|
| 1448 |
|
|---|
| 1449 | @item tc_frob_file_after_relocs
|
|---|
| 1450 | If you define this macro, GAS will call it after the relocs have been
|
|---|
| 1451 | generated.
|
|---|
| 1452 |
|
|---|
| 1453 | @item md_post_relax_hook
|
|---|
| 1454 | If you define this macro, GAS will call it after relaxing and sizing the
|
|---|
| 1455 | segments.
|
|---|
| 1456 |
|
|---|
| 1457 | @item LISTING_HEADER
|
|---|
| 1458 | A string to use on the header line of a listing. The default value is simply
|
|---|
| 1459 | @code{"GAS LISTING"}.
|
|---|
| 1460 |
|
|---|
| 1461 | @item LISTING_WORD_SIZE
|
|---|
| 1462 | The number of bytes to put into a word in a listing. This affects the way the
|
|---|
| 1463 | bytes are clumped together in the listing. For example, a value of 2 might
|
|---|
| 1464 | print @samp{1234 5678} where a value of 1 would print @samp{12 34 56 78}. The
|
|---|
| 1465 | default value is 4.
|
|---|
| 1466 |
|
|---|
| 1467 | @item LISTING_LHS_WIDTH
|
|---|
| 1468 | The number of words of data to print on the first line of a listing for a
|
|---|
| 1469 | particular source line, where each word is @code{LISTING_WORD_SIZE} bytes. The
|
|---|
| 1470 | default value is 1.
|
|---|
| 1471 |
|
|---|
| 1472 | @item LISTING_LHS_WIDTH_SECOND
|
|---|
| 1473 | Like @code{LISTING_LHS_WIDTH}, but applying to the second and subsequent line
|
|---|
| 1474 | of the data printed for a particular source line. The default value is 1.
|
|---|
| 1475 |
|
|---|
| 1476 | @item LISTING_LHS_CONT_LINES
|
|---|
| 1477 | The maximum number of continuation lines to print in a listing for a particular
|
|---|
| 1478 | source line. The default value is 4.
|
|---|
| 1479 |
|
|---|
| 1480 | @item LISTING_RHS_WIDTH
|
|---|
| 1481 | The maximum number of characters to print from one line of the input file. The
|
|---|
| 1482 | default value is 100.
|
|---|
| 1483 |
|
|---|
| 1484 | @item TC_COFF_SECTION_DEFAULT_ATTRIBUTES
|
|---|
| 1485 | @cindex TC_COFF_SECTION_DEFAULT_ATTRIBUTES
|
|---|
| 1486 | The COFF @code{.section} directive will use the value of this macro to set
|
|---|
| 1487 | a new section's attributes when a directive has no valid flags or when the
|
|---|
| 1488 | flag is @code{w}. The default value of the macro is @code{SEC_LOAD | SEC_DATA}.
|
|---|
| 1489 |
|
|---|
| 1490 | @item DWARF2_FORMAT ()
|
|---|
| 1491 | @cindex DWARF2_FORMAT
|
|---|
| 1492 | If you define this, it should return one of @code{dwarf2_format_32bit},
|
|---|
| 1493 | @code{dwarf2_format_64bit}, or @code{dwarf2_format_64bit_irix} to indicate
|
|---|
| 1494 | the size of internal DWARF section offsets and the format of the DWARF initial
|
|---|
| 1495 | length fields. When @code{dwarf2_format_32bit} is returned, the initial
|
|---|
| 1496 | length field will be 4 bytes long and section offsets are 32 bits in size.
|
|---|
| 1497 | For @code{dwarf2_format_64bit} and @code{dwarf2_format_64bit_irix}, section
|
|---|
| 1498 | offsets are 64 bits in size, but the initial length field differs. An 8 byte
|
|---|
| 1499 | initial length is indicated by @code{dwarf2_format_64bit_irix} and
|
|---|
| 1500 | @code{dwarf2_format_64bit} indicates a 12 byte initial length field in
|
|---|
| 1501 | which the first four bytes are 0xffffffff and the next 8 bytes are
|
|---|
| 1502 | the section's length.
|
|---|
| 1503 |
|
|---|
| 1504 | If you don't define this, @code{dwarf2_format_32bit} will be used as
|
|---|
| 1505 | the default.
|
|---|
| 1506 |
|
|---|
| 1507 | This define only affects @code{.debug_info} and @code{.debug_line}
|
|---|
| 1508 | sections generated by the assembler. DWARF 2 sections generated by
|
|---|
| 1509 | other tools will be unaffected by this setting.
|
|---|
| 1510 |
|
|---|
| 1511 | @item DWARF2_ADDR_SIZE (@var{bfd})
|
|---|
| 1512 | @cindex DWARF2_ADDR_SIZE
|
|---|
| 1513 | It should return the size of an address, as it should be represented in
|
|---|
| 1514 | debugging info. If you don't define this macro, the default definition uses
|
|---|
| 1515 | the number of bits per address, as defined in @var{bfd}, divided by 8.
|
|---|
| 1516 |
|
|---|
| 1517 | @end table
|
|---|
| 1518 |
|
|---|
| 1519 | @node Object format backend
|
|---|
| 1520 | @subsection Writing an object format backend
|
|---|
| 1521 | @cindex object format backend
|
|---|
| 1522 | @cindex @file{obj-@var{fmt}}
|
|---|
| 1523 |
|
|---|
| 1524 | As with the CPU backend, the object format backend must define a few things,
|
|---|
| 1525 | and may define some other things. The interface to the object format backend
|
|---|
| 1526 | is generally simpler; most of the support for an object file format consists of
|
|---|
| 1527 | defining a number of pseudo-ops.
|
|---|
| 1528 |
|
|---|
| 1529 | The object format @file{.h} file must include @file{targ-cpu.h}.
|
|---|
| 1530 |
|
|---|
| 1531 | This section will only define the @code{BFD_ASSEMBLER} version of GAS. It is
|
|---|
| 1532 | impossible to support a new object file format using any other version anyhow,
|
|---|
| 1533 | as the original GAS version only supports a.out, and the @code{MANY_SEGMENTS}
|
|---|
| 1534 | GAS version only supports COFF.
|
|---|
| 1535 |
|
|---|
| 1536 | @table @code
|
|---|
| 1537 | @item OBJ_@var{format}
|
|---|
| 1538 | @cindex OBJ_@var{format}
|
|---|
| 1539 | By convention, you should define this macro in the @file{.h} file. For
|
|---|
| 1540 | example, @file{obj-elf.h} defines @code{OBJ_ELF}. You might have to use this
|
|---|
| 1541 | if it is necessary to add object file format specific code to the CPU file.
|
|---|
| 1542 |
|
|---|
| 1543 | @item obj_begin
|
|---|
| 1544 | If you define this macro, GAS will call it at the start of the assembly, after
|
|---|
| 1545 | the command line arguments have been parsed and all the machine independent
|
|---|
| 1546 | initializations have been completed.
|
|---|
| 1547 |
|
|---|
| 1548 | @item obj_app_file
|
|---|
| 1549 | @cindex obj_app_file
|
|---|
| 1550 | If you define this macro, GAS will invoke it when it sees a @code{.file}
|
|---|
| 1551 | pseudo-op or a @samp{#} line as used by the C preprocessor.
|
|---|
| 1552 |
|
|---|
| 1553 | @item OBJ_COPY_SYMBOL_ATTRIBUTES
|
|---|
| 1554 | @cindex OBJ_COPY_SYMBOL_ATTRIBUTES
|
|---|
| 1555 | You should define this macro to copy object format specific information from
|
|---|
| 1556 | one symbol to another. GAS will call it when one symbol is equated to
|
|---|
| 1557 | another.
|
|---|
| 1558 |
|
|---|
| 1559 | @item obj_sec_sym_ok_for_reloc
|
|---|
| 1560 | @cindex obj_sec_sym_ok_for_reloc
|
|---|
| 1561 | You may define this macro to indicate that it is OK to use a section symbol in
|
|---|
| 1562 | a relocation entry. If it is not, GAS will define a new symbol at the start
|
|---|
| 1563 | of a section.
|
|---|
| 1564 |
|
|---|
| 1565 | @item EMIT_SECTION_SYMBOLS
|
|---|
| 1566 | @cindex EMIT_SECTION_SYMBOLS
|
|---|
| 1567 | You should define this macro with a zero value if you do not want to include
|
|---|
| 1568 | section symbols in the output symbol table. The default value for this macro
|
|---|
| 1569 | is one.
|
|---|
| 1570 |
|
|---|
| 1571 | @item obj_adjust_symtab
|
|---|
| 1572 | @cindex obj_adjust_symtab
|
|---|
| 1573 | If you define this macro, GAS will invoke it just before setting the symbol
|
|---|
| 1574 | table of the output BFD. For example, the COFF support uses this macro to
|
|---|
| 1575 | generate a @code{.file} symbol if none was generated previously.
|
|---|
| 1576 |
|
|---|
| 1577 | @item SEPARATE_STAB_SECTIONS
|
|---|
| 1578 | @cindex SEPARATE_STAB_SECTIONS
|
|---|
| 1579 | You may define this macro to a nonzero value to indicate that stabs should be
|
|---|
| 1580 | placed in separate sections, as in ELF.
|
|---|
| 1581 |
|
|---|
| 1582 | @item INIT_STAB_SECTION
|
|---|
| 1583 | @cindex INIT_STAB_SECTION
|
|---|
| 1584 | You may define this macro to initialize the stabs section in the output file.
|
|---|
| 1585 |
|
|---|
| 1586 | @item OBJ_PROCESS_STAB
|
|---|
| 1587 | @cindex OBJ_PROCESS_STAB
|
|---|
| 1588 | You may define this macro to do specific processing on a stabs entry.
|
|---|
| 1589 |
|
|---|
| 1590 | @item obj_frob_section
|
|---|
| 1591 | @cindex obj_frob_section
|
|---|
| 1592 | If you define this macro, GAS will call it for each section at the end of the
|
|---|
| 1593 | assembly.
|
|---|
| 1594 |
|
|---|
| 1595 | @item obj_frob_file_before_adjust
|
|---|
| 1596 | @cindex obj_frob_file_before_adjust
|
|---|
| 1597 | If you define this macro, GAS will call it after the symbol values are
|
|---|
| 1598 | resolved, but before the fixups have been changed from local symbols to section
|
|---|
| 1599 | symbols.
|
|---|
| 1600 |
|
|---|
| 1601 | @item obj_frob_symbol
|
|---|
| 1602 | @cindex obj_frob_symbol
|
|---|
| 1603 | If you define this macro, GAS will call it for each symbol. You can indicate
|
|---|
| 1604 | that the symbol should not be included in the object file by defining this
|
|---|
| 1605 | macro to set its second argument to a non-zero value.
|
|---|
| 1606 |
|
|---|
| 1607 | @item obj_frob_file
|
|---|
| 1608 | @cindex obj_frob_file
|
|---|
| 1609 | If you define this macro, GAS will call it after the symbol table has been
|
|---|
| 1610 | completed, but before the relocations have been generated.
|
|---|
| 1611 |
|
|---|
| 1612 | @item obj_frob_file_after_relocs
|
|---|
| 1613 | If you define this macro, GAS will call it after the relocs have been
|
|---|
| 1614 | generated.
|
|---|
| 1615 |
|
|---|
| 1616 | @item SET_SECTION_RELOCS (@var{sec}, @var{relocs}, @var{n})
|
|---|
| 1617 | @cindex SET_SECTION_RELOCS
|
|---|
| 1618 | If you define this, it will be called after the relocations have been set for
|
|---|
| 1619 | the section @var{sec}. The list of relocations is in @var{relocs}, and the
|
|---|
| 1620 | number of relocations is in @var{n}. This is only used with
|
|---|
| 1621 | @code{BFD_ASSEMBLER}.
|
|---|
| 1622 | @end table
|
|---|
| 1623 |
|
|---|
| 1624 | @node Emulations
|
|---|
| 1625 | @subsection Writing emulation files
|
|---|
| 1626 |
|
|---|
| 1627 | Normally you do not have to write an emulation file. You can just use
|
|---|
| 1628 | @file{te-generic.h}.
|
|---|
| 1629 |
|
|---|
| 1630 | If you do write your own emulation file, it must include @file{obj-format.h}.
|
|---|
| 1631 |
|
|---|
| 1632 | An emulation file will often define @code{TE_@var{EM}}; this may then be used
|
|---|
| 1633 | in other files to change the output.
|
|---|
| 1634 |
|
|---|
| 1635 | @node Relaxation
|
|---|
| 1636 | @section Relaxation
|
|---|
| 1637 | @cindex relaxation
|
|---|
| 1638 |
|
|---|
| 1639 | @dfn{Relaxation} is a generic term used when the size of some instruction or
|
|---|
| 1640 | data depends upon the value of some symbol or other data.
|
|---|
| 1641 |
|
|---|
| 1642 | GAS knows to relax a particular type of PC relative relocation using a table.
|
|---|
| 1643 | You can also define arbitrarily complex forms of relaxation yourself.
|
|---|
| 1644 |
|
|---|
| 1645 | @menu
|
|---|
| 1646 | * Relaxing with a table:: Relaxing with a table
|
|---|
| 1647 | * General relaxing:: General relaxing
|
|---|
| 1648 | @end menu
|
|---|
| 1649 |
|
|---|
| 1650 | @node Relaxing with a table
|
|---|
| 1651 | @subsection Relaxing with a table
|
|---|
| 1652 |
|
|---|
| 1653 | If you do not define @code{md_relax_frag}, and you do define
|
|---|
| 1654 | @code{TC_GENERIC_RELAX_TABLE}, GAS will relax @code{rs_machine_dependent} frags
|
|---|
| 1655 | based on the frag subtype and the displacement to some specified target
|
|---|
| 1656 | address. The basic idea is that several machines have different addressing
|
|---|
| 1657 | modes for instructions that can specify different ranges of values, with
|
|---|
| 1658 | successive modes able to access wider ranges, including the entirety of the
|
|---|
| 1659 | previous range. Smaller ranges are assumed to be more desirable (perhaps the
|
|---|
| 1660 | instruction requires one word instead of two or three); if this is not the
|
|---|
| 1661 | case, don't describe the smaller-range, inferior mode.
|
|---|
| 1662 |
|
|---|
| 1663 | The @code{fr_subtype} field of a frag is an index into a CPU-specific
|
|---|
| 1664 | relaxation table. That table entry indicates the range of values that can be
|
|---|
| 1665 | stored, the number of bytes that will have to be added to the frag to
|
|---|
| 1666 | accommodate the addressing mode, and the index of the next entry to examine if
|
|---|
| 1667 | the value to be stored is outside the range accessible by the current
|
|---|
| 1668 | addressing mode. The @code{fr_symbol} field of the frag indicates what symbol
|
|---|
| 1669 | is to be accessed; the @code{fr_offset} field is added in.
|
|---|
| 1670 |
|
|---|
| 1671 | If the @code{TC_PCREL_ADJUST} macro is defined, which currently should only happen
|
|---|
| 1672 | for the NS32k family, the @code{TC_PCREL_ADJUST} macro is called on the frag to
|
|---|
| 1673 | compute an adjustment to be made to the displacement.
|
|---|
| 1674 |
|
|---|
| 1675 | The value fitted by the relaxation code is always assumed to be a displacement
|
|---|
| 1676 | from the current frag. (More specifically, from @code{fr_fix} bytes into the
|
|---|
| 1677 | frag.)
|
|---|
| 1678 | @ignore
|
|---|
| 1679 | This seems kinda silly. What about fitting small absolute values? I suppose
|
|---|
| 1680 | @code{md_assemble} is supposed to take care of that, but if the operand is a
|
|---|
| 1681 | difference between symbols, it might not be able to, if the difference was not
|
|---|
| 1682 | computable yet.
|
|---|
| 1683 | @end ignore
|
|---|
| 1684 |
|
|---|
| 1685 | The end of the relaxation sequence is indicated by a ``next'' value of 0. This
|
|---|
| 1686 | means that the first entry in the table can't be used.
|
|---|
| 1687 |
|
|---|
| 1688 | For some configurations, the linker can do relaxing within a section of an
|
|---|
| 1689 | object file. If call instructions of various sizes exist, the linker can
|
|---|
| 1690 | determine which should be used in each instance, when a symbol's value is
|
|---|
| 1691 | resolved. In order for the linker to avoid wasting space and having to insert
|
|---|
| 1692 | no-op instructions, it must be able to expand or shrink the section contents
|
|---|
| 1693 | while still preserving intra-section references and meeting alignment
|
|---|
| 1694 | requirements.
|
|---|
| 1695 |
|
|---|
| 1696 | For the i960 using b.out format, no expansion is done; instead, each
|
|---|
| 1697 | @samp{.align} directive causes extra space to be allocated, enough that when
|
|---|
| 1698 | the linker is relaxing a section and removing unneeded space, it can discard
|
|---|
| 1699 | some or all of this extra padding and cause the following data to be correctly
|
|---|
| 1700 | aligned.
|
|---|
| 1701 |
|
|---|
| 1702 | For the H8/300, I think the linker expands calls that can't reach, and doesn't
|
|---|
| 1703 | worry about alignment issues; the cpu probably never needs any significant
|
|---|
| 1704 | alignment beyond the instruction size.
|
|---|
| 1705 |
|
|---|
| 1706 | The relaxation table type contains these fields:
|
|---|
| 1707 |
|
|---|
| 1708 | @table @code
|
|---|
| 1709 | @item long rlx_forward
|
|---|
| 1710 | Forward reach, must be non-negative.
|
|---|
| 1711 | @item long rlx_backward
|
|---|
| 1712 | Backward reach, must be zero or negative.
|
|---|
| 1713 | @item rlx_length
|
|---|
| 1714 | Length in bytes of this addressing mode.
|
|---|
| 1715 | @item rlx_more
|
|---|
| 1716 | Index of the next-longer relax state, or zero if there is no next relax state.
|
|---|
| 1717 | @end table
|
|---|
| 1718 |
|
|---|
| 1719 | The relaxation is done in @code{relax_segment} in @file{write.c}. The
|
|---|
| 1720 | difference in the length fields between the original mode and the one finally
|
|---|
| 1721 | chosen by the relaxing code is taken as the size by which the current frag will
|
|---|
| 1722 | be increased in size. For example, if the initial relaxing mode has a length
|
|---|
| 1723 | of 2 bytes, and because of the size of the displacement, it gets upgraded to a
|
|---|
| 1724 | mode with a size of 6 bytes, it is assumed that the frag will grow by 4 bytes.
|
|---|
| 1725 | (The initial two bytes should have been part of the fixed portion of the frag,
|
|---|
| 1726 | since it is already known that they will be output.) This growth must be
|
|---|
| 1727 | effected by @code{md_convert_frag}; it should increase the @code{fr_fix} field
|
|---|
| 1728 | by the appropriate size, and fill in the appropriate bytes of the frag.
|
|---|
| 1729 | (Enough space for the maximum growth should have been allocated in the call to
|
|---|
| 1730 | frag_var as the second argument.)
|
|---|
| 1731 |
|
|---|
| 1732 | If relocation records are needed, they should be emitted by
|
|---|
| 1733 | @code{md_estimate_size_before_relax}. This function should examine the target
|
|---|
| 1734 | symbol of the supplied frag and correct the @code{fr_subtype} of the frag if
|
|---|
| 1735 | needed. When this function is called, if the symbol has not yet been defined,
|
|---|
| 1736 | it will not become defined later; however, its value may still change if the
|
|---|
| 1737 | section it is in gets relaxed.
|
|---|
| 1738 |
|
|---|
| 1739 | Usually, if the symbol is in the same section as the frag (given by the
|
|---|
| 1740 | @var{sec} argument), the narrowest likely relaxation mode is stored in
|
|---|
| 1741 | @code{fr_subtype}, and that's that.
|
|---|
| 1742 |
|
|---|
| 1743 | If the symbol is undefined, or in a different section (and therefore moveable
|
|---|
| 1744 | to an arbitrarily large distance), the largest available relaxation mode is
|
|---|
| 1745 | specified, @code{fix_new} is called to produce the relocation record,
|
|---|
| 1746 | @code{fr_fix} is increased to include the relocated field (remember, this
|
|---|
| 1747 | storage was allocated when @code{frag_var} was called), and @code{frag_wane} is
|
|---|
| 1748 | called to convert the frag to an @code{rs_fill} frag with no variant part.
|
|---|
| 1749 | Sometimes changing addressing modes may also require rewriting the instruction.
|
|---|
| 1750 | It can be accessed via @code{fr_opcode} or @code{fr_fix}.
|
|---|
| 1751 |
|
|---|
| 1752 | If you generate frags separately for the basic insn opcode and any relaxable
|
|---|
| 1753 | operands, do not call @code{fix_new} thinking you can emit fixups for the
|
|---|
| 1754 | opcode field from the relaxable frag. It is not guaranteed to be the same frag.
|
|---|
| 1755 | If you need to emit fixups for the opcode field from inspection of the
|
|---|
| 1756 | relaxable frag, then you need to generate a common frag for both the basic
|
|---|
| 1757 | opcode and relaxable fields, or you need to provide the frag for the opcode to
|
|---|
| 1758 | pass to @code{fix_new}. The latter can be done for example by defining
|
|---|
| 1759 | @code{TC_FRAG_TYPE} to include a pointer to it and defining @code{TC_FRAG_INIT}
|
|---|
| 1760 | to set the pointer.
|
|---|
| 1761 |
|
|---|
| 1762 | Sometimes @code{fr_var} is increased instead, and @code{frag_wane} is not
|
|---|
| 1763 | called. I'm not sure, but I think this is to keep @code{fr_fix} referring to
|
|---|
| 1764 | an earlier byte, and @code{fr_subtype} set to @code{rs_machine_dependent} so
|
|---|
| 1765 | that @code{md_convert_frag} will get called.
|
|---|
| 1766 |
|
|---|
| 1767 | @node General relaxing
|
|---|
| 1768 | @subsection General relaxing
|
|---|
| 1769 |
|
|---|
| 1770 | If using a simple table is not suitable, you may implement arbitrarily complex
|
|---|
| 1771 | relaxation semantics yourself. For example, the MIPS backend uses this to emit
|
|---|
| 1772 | different instruction sequences depending upon the size of the symbol being
|
|---|
| 1773 | accessed.
|
|---|
| 1774 |
|
|---|
| 1775 | When you assemble an instruction that may need relaxation, you should allocate
|
|---|
| 1776 | a frag using @code{frag_var} or @code{frag_variant} with a type of
|
|---|
| 1777 | @code{rs_machine_dependent}. You should store some sort of information in the
|
|---|
| 1778 | @code{fr_subtype} field so that you can figure out what to do with the frag
|
|---|
| 1779 | later.
|
|---|
| 1780 |
|
|---|
| 1781 | When GAS reaches the end of the input file, it will look through the frags and
|
|---|
| 1782 | work out their final sizes.
|
|---|
| 1783 |
|
|---|
| 1784 | GAS will first call @code{md_estimate_size_before_relax} on each
|
|---|
| 1785 | @code{rs_machine_dependent} frag. This function must return an estimated size
|
|---|
| 1786 | for the frag.
|
|---|
| 1787 |
|
|---|
| 1788 | GAS will then loop over the frags, calling @code{md_relax_frag} on each
|
|---|
| 1789 | @code{rs_machine_dependent} frag. This function should return the change in
|
|---|
| 1790 | size of the frag. GAS will keep looping over the frags until none of the frags
|
|---|
| 1791 | changes size.
|
|---|
| 1792 |
|
|---|
| 1793 | @node Broken words
|
|---|
| 1794 | @section Broken words
|
|---|
| 1795 | @cindex internals, broken words
|
|---|
| 1796 | @cindex broken words
|
|---|
| 1797 |
|
|---|
| 1798 | Some compilers, including GCC, will sometimes emit switch tables specifying
|
|---|
| 1799 | 16-bit @code{.word} displacements to branch targets, and branch instructions
|
|---|
| 1800 | that load entries from that table to compute the target address. If this is
|
|---|
| 1801 | done on a 32-bit machine, there is a chance (at least with really large
|
|---|
| 1802 | functions) that the displacement will not fit in 16 bits. The assembler
|
|---|
| 1803 | handles this using a concept called @dfn{broken words}. This idea is well
|
|---|
| 1804 | named, since there is an implied promise that the 16-bit field will in fact
|
|---|
| 1805 | hold the specified displacement.
|
|---|
| 1806 |
|
|---|
| 1807 | If broken word processing is enabled, and a situation like this is encountered,
|
|---|
| 1808 | the assembler will insert a jump instruction into the instruction stream, close
|
|---|
| 1809 | enough to be reached with the 16-bit displacement. This jump instruction will
|
|---|
| 1810 | transfer to the real desired target address. Thus, as long as the @code{.word}
|
|---|
| 1811 | value really is used as a displacement to compute an address to jump to, the
|
|---|
| 1812 | net effect will be correct (minus a very small efficiency cost). If
|
|---|
| 1813 | @code{.word} directives with label differences for values are used for other
|
|---|
| 1814 | purposes, however, things may not work properly. For targets which use broken
|
|---|
| 1815 | words, the @samp{-K} option will warn when a broken word is discovered.
|
|---|
| 1816 |
|
|---|
| 1817 | The broken word code is turned off by the @code{WORKING_DOT_WORD} macro. It
|
|---|
| 1818 | isn't needed if @code{.word} emits a value large enough to contain an address
|
|---|
| 1819 | (or, more correctly, any possible difference between two addresses).
|
|---|
| 1820 |
|
|---|
| 1821 | @node Internal functions
|
|---|
| 1822 | @section Internal functions
|
|---|
| 1823 |
|
|---|
| 1824 | This section describes basic internal functions used by GAS.
|
|---|
| 1825 |
|
|---|
| 1826 | @menu
|
|---|
| 1827 | * Warning and error messages:: Warning and error messages
|
|---|
| 1828 | * Hash tables:: Hash tables
|
|---|
| 1829 | @end menu
|
|---|
| 1830 |
|
|---|
| 1831 | @node Warning and error messages
|
|---|
| 1832 | @subsection Warning and error messages
|
|---|
| 1833 |
|
|---|
| 1834 | @deftypefun @{@} int had_warnings (void)
|
|---|
| 1835 | @deftypefunx @{@} int had_errors (void)
|
|---|
| 1836 | Returns non-zero if any warnings or errors, respectively, have been printed
|
|---|
| 1837 | during this invocation.
|
|---|
| 1838 | @end deftypefun
|
|---|
| 1839 |
|
|---|
| 1840 | @deftypefun @{@} void as_perror (const char *@var{gripe}, const char *@var{filename})
|
|---|
| 1841 | Displays a BFD or system error, then clears the error status.
|
|---|
| 1842 | @end deftypefun
|
|---|
| 1843 |
|
|---|
| 1844 | @deftypefun @{@} void as_tsktsk (const char *@var{format}, ...)
|
|---|
| 1845 | @deftypefunx @{@} void as_warn (const char *@var{format}, ...)
|
|---|
| 1846 | @deftypefunx @{@} void as_bad (const char *@var{format}, ...)
|
|---|
| 1847 | @deftypefunx @{@} void as_fatal (const char *@var{format}, ...)
|
|---|
| 1848 | These functions display messages about something amiss with the input file, or
|
|---|
| 1849 | internal problems in the assembler itself. The current file name and line
|
|---|
| 1850 | number are printed, followed by the supplied message, formatted using
|
|---|
| 1851 | @code{vfprintf}, and a final newline.
|
|---|
| 1852 |
|
|---|
| 1853 | An error indicated by @code{as_bad} will result in a non-zero exit status when
|
|---|
| 1854 | the assembler has finished. Calling @code{as_fatal} will result in immediate
|
|---|
| 1855 | termination of the assembler process.
|
|---|
| 1856 | @end deftypefun
|
|---|
| 1857 |
|
|---|
| 1858 | @deftypefun @{@} void as_warn_where (char *@var{file}, unsigned int @var{line}, const char *@var{format}, ...)
|
|---|
| 1859 | @deftypefunx @{@} void as_bad_where (char *@var{file}, unsigned int @var{line}, const char *@var{format}, ...)
|
|---|
| 1860 | These variants permit specification of the file name and line number, and are
|
|---|
| 1861 | used when problems are detected when reprocessing information saved away when
|
|---|
| 1862 | processing some earlier part of the file. For example, fixups are processed
|
|---|
| 1863 | after all input has been read, but messages about fixups should refer to the
|
|---|
| 1864 | original filename and line number that they are applicable to.
|
|---|
| 1865 | @end deftypefun
|
|---|
| 1866 |
|
|---|
| 1867 | @deftypefun @{@} void fprint_value (FILE *@var{file}, valueT @var{val})
|
|---|
| 1868 | @deftypefunx @{@} void sprint_value (char *@var{buf}, valueT @var{val})
|
|---|
| 1869 | These functions are helpful for converting a @code{valueT} value into printable
|
|---|
| 1870 | format, in case it's wider than modes that @code{*printf} can handle. If the
|
|---|
| 1871 | type is narrow enough, a decimal number will be produced; otherwise, it will be
|
|---|
| 1872 | in hexadecimal. The value itself is not examined to make this determination.
|
|---|
| 1873 | @end deftypefun
|
|---|
| 1874 |
|
|---|
| 1875 | @node Hash tables
|
|---|
| 1876 | @subsection Hash tables
|
|---|
| 1877 | @cindex hash tables
|
|---|
| 1878 |
|
|---|
| 1879 | @deftypefun @{@} @{struct hash_control *@} hash_new (void)
|
|---|
| 1880 | Creates the hash table control structure.
|
|---|
| 1881 | @end deftypefun
|
|---|
| 1882 |
|
|---|
| 1883 | @deftypefun @{@} void hash_die (struct hash_control *)
|
|---|
| 1884 | Destroy a hash table.
|
|---|
| 1885 | @end deftypefun
|
|---|
| 1886 |
|
|---|
| 1887 | @deftypefun @{@} PTR hash_delete (struct hash_control *, const char *)
|
|---|
| 1888 | Deletes entry from the hash table, returns the value it had.
|
|---|
| 1889 | @end deftypefun
|
|---|
| 1890 |
|
|---|
| 1891 | @deftypefun @{@} PTR hash_replace (struct hash_control *, const char *, PTR)
|
|---|
| 1892 | Updates the value for an entry already in the table, returning the old value.
|
|---|
| 1893 | If no entry was found, just returns NULL.
|
|---|
| 1894 | @end deftypefun
|
|---|
| 1895 |
|
|---|
| 1896 | @deftypefun @{@} @{const char *@} hash_insert (struct hash_control *, const char *, PTR)
|
|---|
| 1897 | Inserting a value already in the table is an error.
|
|---|
| 1898 | Returns an error message or NULL.
|
|---|
| 1899 | @end deftypefun
|
|---|
| 1900 |
|
|---|
| 1901 | @deftypefun @{@} @{const char *@} hash_jam (struct hash_control *, const char *, PTR)
|
|---|
| 1902 | Inserts if the value isn't already present, updates it if it is.
|
|---|
| 1903 | @end deftypefun
|
|---|
| 1904 |
|
|---|
| 1905 | @node Test suite
|
|---|
| 1906 | @section Test suite
|
|---|
| 1907 | @cindex test suite
|
|---|
| 1908 |
|
|---|
| 1909 | The test suite is kind of lame for most processors. Often it only checks to
|
|---|
| 1910 | see if a couple of files can be assembled without the assembler reporting any
|
|---|
| 1911 | errors. For more complete testing, write a test which either examines the
|
|---|
| 1912 | assembler listing, or runs @code{objdump} and examines its output. For the
|
|---|
| 1913 | latter, the TCL procedure @code{run_dump_test} may come in handy. It takes the
|
|---|
| 1914 | base name of a file, and looks for @file{@var{file}.d}. This file should
|
|---|
| 1915 | contain as its initial lines a set of variable settings in @samp{#} comments,
|
|---|
| 1916 | in the form:
|
|---|
| 1917 |
|
|---|
| 1918 | @example
|
|---|
| 1919 | #@var{varname}: @var{value}
|
|---|
| 1920 | @end example
|
|---|
| 1921 |
|
|---|
| 1922 | The @var{varname} may be @code{objdump}, @code{nm}, or @code{as}, in which case
|
|---|
| 1923 | it specifies the options to be passed to the specified programs. Exactly one
|
|---|
| 1924 | of @code{objdump} or @code{nm} must be specified, as that also specifies which
|
|---|
| 1925 | program to run after the assembler has finished. If @var{varname} is
|
|---|
| 1926 | @code{source}, it specifies the name of the source file; otherwise,
|
|---|
| 1927 | @file{@var{file}.s} is used. If @var{varname} is @code{name}, it specifies the
|
|---|
| 1928 | name of the test to be used in the @code{pass} or @code{fail} messages.
|
|---|
| 1929 |
|
|---|
| 1930 | The non-commented parts of the file are interpreted as regular expressions, one
|
|---|
| 1931 | per line. Blank lines in the @code{objdump} or @code{nm} output are skipped,
|
|---|
| 1932 | as are blank lines in the @code{.d} file; the other lines are tested to see if
|
|---|
| 1933 | the regular expression matches the program output. If it does not, the test
|
|---|
| 1934 | fails.
|
|---|
| 1935 |
|
|---|
| 1936 | Note that this means the tests must be modified if the @code{objdump} output
|
|---|
| 1937 | style is changed.
|
|---|
| 1938 |
|
|---|
| 1939 | @bye
|
|---|
| 1940 | @c Local Variables:
|
|---|
| 1941 | @c fill-column: 79
|
|---|
| 1942 | @c End:
|
|---|