Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Fix asm modifiers in add_dpbusd_epi32x2 implementations
The accumulator should be an earlyclobber because it is written before all input operands are read. Otherwise, the asm code computes a wrong result if the accumulator shares a register with one of the other input operands (which happens if we pass in the same expression for the accumulator and the operand). Closes #4339 No functional change
- Loading branch information