<table border="1" cellspacing="0" cellpadding="8">
<tr>
<th>Issue</th>
<td>
<a href=https://github.com/llvm/llvm-project/issues/133446>133446</a>
</td>
</tr>
<tr>
<th>Summary</th>
<td>
[riscv] Incorrect disassembly of c.lui when imm=0
</td>
</tr>
<tr>
<th>Labels</th>
<td>
new issue
</td>
</tr>
<tr>
<th>Assignees</th>
<td>
</td>
</tr>
<tr>
<th>Reporter</th>
<td>
paulhuggett
</td>
</tr>
</table>
<pre>
# context:
During disassembly, a binary can contain an invalid instruction and `objdump`
tells that this instruction is unknown.
For example, Bytecode 0x6101 represents instruction `c.addi16sp x2, 0` (where
`c.addi16sp` shares the opcode with `c.lui`, but has a destination field of
`x2`) and `c.addi16sp` is only valid when the immediate is not equal to zero.
If I try to disassemble a binary with Bytecode 0x6101, I get:
```
$ cat disasm.s
.insn 0x6101 # c.addi16sp x2, 0
$ clang -c --target=riscv32 -march=rv32imafdc disasm.s -o disasm.o
$ llvm-objdump -M no-aliases -d disasm.o
disasm.o: file format elf32-littleriscv
Disassembly of section .text:
00000000 <.text>:
0: 6101 <unknown>
```
# description of the problem:
According to RISC-V Instruction Set Manual, instruction `c.lui rd, imm` is only
valid when `rd != {x0, x2}`, and when the immediate is not equal to zero.
As a consequence, the following instruction is invalid:
```
$ cat compile.s
c.lui x1, 0
```
`llvm` verifies this before compilation:
```
$ clang -c --target=riscv32 -march=rv32imafdc compile.s -o compile.o
compile.s:1:11: error: immediate must be in [0xfffe0, 0xfffff] or [1, 31]
c.lui x1, 0
^
```
But, if a binary contains such invalid instruction, `objdump` says that this is
valid:
```
$ cat problem.s
.insn 0x6081 # c.lui x1, 0
$ clang -c --target=riscv32 -march=rv32imafdc problem.s -o problem.o
$ llvm-objdump -M no-aliases -d problem.o
problem.o: file format elf32-littleriscv
Disassembly of section .text:
00000000 <.text>:
0: 6081 c.lui ra, 0x0
```
The problem occurs with the 31 following Bytecodes:
- 0x6001 is disassembled to `c.lui zero, 0x0`
- 0x6081 is disassembled to `c.lui ra, 0x0`
- 0x6181 is disassembled to `c.lui gp, 0x0`
- 0x6201 is disassembled to `c.lui tp, 0x0`
- 0x6281 is disassembled to `c.lui t0, 0x0`
- 0x6301 is disassembled to `c.lui t1, 0x0`
- 0x6381 is disassembled to `c.lui t2, 0x0`
- 0x6401 is disassembled to `c.lui s0, 0x0`
- 0x6481 is disassembled to `c.lui s1, 0x0`
- 0x6501 is disassembled to `c.lui a0, 0x0`
- 0x6581 is disassembled to `c.lui a1, 0x0`
- 0x6601 is disassembled to `c.lui a2, 0x0`
- 0x6681 is disassembled to `c.lui a3, 0x0`
- 0x6701 is disassembled to `c.lui a4, 0x0`
- 0x6781 is disassembled to `c.lui a5, 0x0`
- 0x6801 is disassembled to `c.lui a6, 0x0`
- 0x6881 is disassembled to `c.lui a7, 0x0`
- 0x6901 is disassembled to `c.lui s2, 0x0`
- 0x6981 is disassembled to `c.lui s3, 0x0`
- 0x6a01 is disassembled to `c.lui s4, 0x0`
- 0x6a81 is disassembled to `c.lui s5, 0x0`
- 0x6b01 is disassembled to `c.lui s6, 0x0`
- 0x6b81 is disassembled to `c.lui s7, 0x0`
- 0x6c01 is disassembled to `c.lui s8, 0x0`
- 0x6c81 is disassembled to `c.lui s9, 0x0`
- 0x6d01 is disassembled to `c.lui s10, 0x0`
- 0x6d81 is disassembled to `c.lui s11, 0x0`
- 0x6e01 is disassembled to `c.lui t3, 0x0`
- 0x6e81 is disassembled to `c.lui t4, 0x0`
- 0x6f01 is disassembled to `c.lui t5, 0x0`
- 0x6f81 is disassembled to `c.lui t6, 0x0`
# expected result:
Since these 31 Bytecodes represent invalid instructions, `objdump` should
say that the resulting instructions are unknown.
</pre>
<img width="1" height="1" alt="" src="http://email.email.llvm.org/o/eJy8V02T4ygS_TX4kmEHAku2Dz64ylsRdejL9sbekZSy6EXgBlRl76-fAMlfZU0zPYdRuLskQfJSL18miXBOHjTiluQvJN_PRO9bY7dH0au2PxzQ-1lp6vOW0B1hHCqjPZ484bvwgu72vZX6ALV0wjnsSnUm7BUElFILe4ZK6GgipAahQeoPoWQNUjtv-8pLE17XQApqyh913x1JQQndeVTKgW-FB99K9zBfOuj1_7T51IvBhTdjAU-iOyoM2C9nj5WpEeipyGgGFo8WHWr_uAwpaLUQdS2zwh3hxIIpJQUFwtafLVoMi9_PCWOuFRaDYwjmGEE-pW-HtVQvg_PsFcreQyscCKjRealFBGwkqhpMM6x7YnHy5vL5jzjSgdHqDANbny3qiCm7DmspPIYJ2njAn71Q4A38H60Z6Xhv4B28PYfXt7DgLSbR5S8sBbff4YC3wAb3hl8I_BIq4YfluoUjdLeQ2ukLxRCV8czmaKmEPsC8gvncCxsx9la66oMzmHfCVm148cGZ7ERTV1cUmJvLvRmXUuqjm49Kgfk30GYulBQOHczrh8l0d33iO2ikQmiM7YQHVA1ncyW9VxjdGIV8UzCYBhwOMlk8qJ2OFxD-Oo78axgEGnAGNi4X4a-jUsO0L5QOP8aDRiorjxHNNDHOR2tKhd0VdldVxtYhz7yBf79_f53_F97vtPwdPXwTuhcq8P6kctVLsHUc6ro7eRG6uxMYKaitgbCM8D2Q1cspavnEyGo_6jpI9fe0uAtJUBnt8GePuor5GYwbo5T5DB_0JbPHAvFLFVamO0qFUYbDx52yq96-UFzQIJnwzR9oZSNj8koHJTbG4rhUzM8_h_xN-V7dC_q9PARN3vzmuyz8C_8BWmtsuLkR2vXOQ4kgNZD8hZ6apsEYgHjbNCTfg7FhLH43z0i-n-DiToj5hPxeeh8l0dxV66FSO3B91U4V62DwUKvBifNDoXYXUf0yhKPAHysJXV8ryZeg_o0oXBFCFC4Pf62MPMymu9sj38E_V0kiG5drzGExiGBC6P-5VQ0wVdVbN5T5kGw8u8u3S913V0_mkXuahfS72y_qkMrX8hGT-oIeQeeXkP3S7ObyzShLGR2Oz0Ys5aCfMkohefpsxJNI2YRREok9Gy1TSG7CvWUKyU24l6eQxARSnkISE0hFEmmCiCKJxJ-NVkmk5YRREil_NlonkYoJoyTS6tlok1TEBHubpCIm2BNJpAn2RBJpgr0yiTTBXplEmmCvSiKtJ4ySSJtnozqJlE0kVJ1O3YmMwmQ9mggvJuvRRHibJNJEeJsk0mN4b40vno5YeazBouvVbWv8LnWFYedyce-67li3k9xUa-ImepPW9KomdOfE-dKj4Ij2pfd0ICx-OVfO6i2vN3wjZrjNVkuerbKMZ7N2y5eUM-RLkRcrusrztcAmz7O84JtNsS7FTG4ZZTnlbJ0tM5bRhViVjG7Wm1VRN_mKMrKk2AmpFqETWRh7mEnnetxmnC-XxUyJEpWLJ3LGNH5CHCWMhQO63cb2pewPjiypks672zJeehWP8kM_ku_hXVfGWqz8_Rk99CVDeGI_H44FfE9nvVXb1vtjbA7YG2FvB-nbvlxUpiPsLbbSw5_50ZofWHnC3qJvjrC30fmPLfsjAAD__w_i9T4">