riscv-non-isa · kito-cheng · Sep 26, 2024 · Oct 18, 2024 · Oct 18, 2024 · Oct 18, 2024
diff --git a/riscv-elf.adoc b/riscv-elf.adoc
@@ -548,7 +548,9 @@ Description:: Additional information about the relocation
                                             <| S - P
 .2+| 65      .2+| TLSDESC_CALL      .2+| Static  |                   .2+| Annotate call to TLS descriptor resolver function, `%tlsdesc_call(address of %tlsdesc_hi)`, for relaxation purposes only
                                             <|
-.2+| 66-190  .2+| *Reserved*                          .2+| -       |                   .2+| Reserved for future standard use
+.2+| 66      .2+| LPAD              .2+| Static  |                   .2+| Annotates the landing pad instruction inserted at the beginning of the function. The addend indicates the label value of the landing pad, and the symbol value is the address of the mapping symbol for the function signature, which will have the same address as the function.
+                                            <|
+.2+| 67-190  .2+| *Reserved*                          .2+| -       |                   .2+| Reserved for future standard use
                                             <|
 .2+| 191     .2+| VENDOR        .2+| Static  |                   .2+| Paired with a vendor-specific relocation and must be placed immediately before it, indicates which vendor owns the relocation.
                                             <|
@@ -1210,6 +1212,7 @@ The defined processor-specific section types are listed in <<rv-section-type>>.
 | Name                  | Value       | Attributes
 
 | SHT_RISCV_ATTRIBUTES  | 0x70000003  | none
+| SHT_RISCV_LADING_PAD_INFO  | 0x70000004  | none
 |===
 
 ==== Special Sections
@@ -1224,12 +1227,16 @@ The defined processor-specific section types are listed in <<rv-section-type>>.
 | Name                       | Type                 | Attributes
 
 | .riscv.attributes          | SHT_RISCV_ATTRIBUTES | none
+| .riscv.lpadinfo            | SHT_RISCV_LADING_PAD_INFO | none
 | .riscv.jvt                 | SHT_PROGBITS         | SHF_ALLOC + SHF_EXECINSTR
 | .note.gnu.property         | SHT_NOTE             | SHF_ALLOC
 |===
 
 +++.riscv.attributes+++ names a section that contains RISC-V ELF attributes.
 
++++.riscv.lpadinfo+++ names a section that contains RISC-V landing pad
+information, which used for generating PLT and also can be used for debugging.
+
 +++.riscv.jvt+++ is a linker-created section to store table jump
 target addresses. The minimum alignment of this section is 64 bytes.
 
@@ -1568,6 +1575,51 @@ the `Zicfilp` extension. An executable or shared library with this bit set is
 required to generate PLTs with the landing pad (`lpad`) instruction, and all
 label are set to a value which hashed from its function signature.
 
+=== Landing Pad Information Section (`.riscv.lpadinfo`)
+
+Landing pad information section is a section that contains the nessary information
+for generating function signature based landing pad PLT, this section also may
+exsiting when the unlabeled landing pad scheme is used.
+
+This section is consist by the entries of the following structure:
+
+```
+typedef struct
+{
+  Elf32_Word    lpi_name;                /* Symbol name (string tbl index) */
+  Elf32_Word    lpi_sig;                 /* Signature for the symbol (string tbl index) */
+  Elf32_Word    lpi_value;               /* Landing pad value for the symbol */
+} Elf32_Lpadinfo;
+
+typedef struct
+{
+  Elf64_Word    lpi_name;                /* Symbol name (string tbl index) */
+  Elf64_Word    lpi_sig;                 /* Signature for the symbol (string tbl index) */
+  Elf64_Word    lpi_value;               /* Landing pad value for the symbol */
+} Elf64_Lpadinfo;
+```
+
+The `lpi_name` field is the index into the string table for the symbol name,
+the `lpi_signature` field is the index into the string table for the function
+signature, it can be 0 if the signature string is not present,
+and the `lpi_value` field is the landing pad value for the symbol.
+
+The string hold by `lpi_signature` field is the function signature string, which
+is encoded as same as the mapping symbol of the function signature.
+
+NOTE: Using same encoding as mapping symbol aims to reduce the size of the
+string table
+
+Every symbol with global or weak bind must has a corresponding entry in this
+section, the `lpi_name` field must be the same as the symbol name string table
+index.
+
+This section can be discard after static linking stage.
+
+Static linker should emit error if objects with same symbol but different
+landing pad value are beging merged, however it may suppress the error if
+linker enable the landing pad schem relaxation.
+
 === Mapping Symbol
 
 The section can have a mixture of code and data or code with different ISAs.
@@ -1582,6 +1634,7 @@ A number of symbols, named mapping symbols, describe the boundaries.
 | $x.<any>
 | $x<ISA>  .2+| Start of a sequence of instructions with <ISA> extension.
 | $x<ISA>.<any>
+| $s<function-signature-string> | Marker for the landing pad instruction. This should only be used with the function signature-based scheme and should be placed only at the beginning of the function.
 |===
 
 The mapping symbol should set the type to `STT_NOTYPE`, binding to `STB_LOCAL`,
@@ -2317,6 +2370,96 @@ instructions. It is recommended to initialize `jvt` CSR immediately after
     csrw  jvt, a0
 ----
 
+==== Landing Pad Relaxation
+
+  Target Relocation::: R_RISCV_LPAD
+
+  Description:: This relaxation type allows the `lpad` instruction to be removed.
+  However, if `R_RISCV_RELAX` is not present, the `lpad` instruction can only be
+  replaced with a sequence of `nop` instructions of the same length as the
+  original instruction.
+
+  Description:: This relaxation type can relax lpad instruction into a none,
+  which removed the lpad instruction.
+  This relaxation type can be performed even without `R_RISCV_RELAX`,
+  but the linker should pad nop instruction to the same length of the original
+  instruction sequence.
+
+  Condition:: This relaxation can only be applied if the symbol is **NOT**
+  exported to the dynamic symbol table and is only referenced by `R_RISCV_CALL`
+  or `R_RISCV_CALL_PLT` relocations. If the symbol is exported or referenced by
+  other relocations, relaxation cannot be performed.
+
+  Relaxation::
+  - Lpad instruction associated with `R_RISCV_LPAD` can be removed.
+  - Lpad instruction associated with `R_RISCV_LPAD` can be replaced with nop
+    instruction if the relacation isn't paired with `R_RISCV_RELAX`.
+
+  Example::
++
+--
+Relaxation candidate:
+[,asm]
+----
+    lpad  0x123           # R_RISCV_LPAD, R_RISCV_RELAX
+----
+
+Relaxation result:
+[,asm]
+----
+    # No instruction
+----
+Can be relaxed into `nop` if no `R_RISCV_RELAX` is paired with `R_RISCV_LPAD`.
+[,asm]
+----
+    nop
+----
+--
+
+==== Landing Pad Scheme Relaxation
+
+  Target Relocation::: R_RISCV_LPAD
+
+  Description:: This relaxation type allows an `lpad` instruction to be relaxed
+  into `lpad 0`, which is a universal landing pad that ignores the label value
+  comparison. This relaxation is used when the label value is not computed
+  correctly.
+
+  Condition:: This relaxation can be performed without `R_RISCV_RELAX`, and
+  should not be enabled by default. The user must explicitly enable this
+  relaxation. Additionally, if this relaxation is applied, it must be applied
+  consistently to all `R_RISCV_LPAD` relocations in the entire binary.
+
+  Relaxation::
+  - Lpad instruction associated with `R_RISCV_LPAD` will be replaced with
+    `lpad 0`.
+  - The GNU property must be adjusted to reflect the use of this relaxation.
+  - The format of the PLT entries must also be adjusted accordingly.
+
+  Example::
++
+--
+Relaxation candidate:
+[,asm]
+----
+    lpad  0x123           # R_RISCV_LPAD
+----
+
+Relaxation result:
+[,asm]
+----
+    lpad 0
+----
+--
+
+NOTE: This relaxation is designed to be compatible with legacy programs that
+      may not declare the function signature correctly.
+
+NOTE: Dependent shared libraries will not undergo the corresponding
+transformation. Therefore, if this Landing Pad Scheme Relaxation is used in a
+dynamically linked environment, ensure that all dependent shared libraries are
+rebuilt with the corresponding version.
+
 [bibliography]
 == References