Complete vld1 instructions with some corrections #1216

SparrowLii · 2021-09-10T03:04:09Z

This PR does the following:
(1) completes the vld1 NEON instructions, mainly the *_p64 variants
(2) replaces the crypto feature with aes for arm
(3) adds the "core_arch/src/arm_shared/neon" dir to stdarch-verify. Based on this, the function names of vmull_n_s16, vmull_n_s32, vmull_n_u16, vmull_n_u32, vqdmulhq_n_s16, vqdmulhq_n_s32 have been corrected
(4) optimizes stdarch-gen so that vld2* (and other) instructions can be generated more clearly (for ease of review, they are not generated in this PR)
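
For readers unfamiliar with the renamed intrinsics in (3), here is a portable sketch of what two of them compute, written with plain arrays instead of NEON vector types. The `_model` function names are hypothetical and are not part of stdarch. The lane semantics follow the usual Arm description of "multiply long by scalar" and "saturating doubling multiply returning high half".

```rust
/// Portable model of `vmull_n_s16` (hypothetical helper, not stdarch code):
/// widen each `i16` lane to `i32`, then multiply by the scalar.
pub fn vmull_n_s16_model(a: [i16; 4], b: i16) -> [i32; 4] {
    a.map(|lane| i32::from(lane) * i32::from(b))
}

/// Portable model of `vqdmulhq_n_s16` (hypothetical helper, not stdarch code):
/// double the product of each lane and the scalar, keep the high 16 bits,
/// and saturate to the `i16` range.
pub fn vqdmulhq_n_s16_model(a: [i16; 8], b: i16) -> [i16; 8] {
    a.map(|lane| {
        // Compute in i64 so that doubling cannot overflow.
        let doubled = 2 * i64::from(lane) * i64::from(b);
        let high = doubled >> 16;
        high.clamp(i64::from(i16::MIN), i64::from(i16::MAX)) as i16
    })
}
```

The only input that saturates in the second model is `i16::MIN` multiplied by `i16::MIN`, where the doubled product no longer fits in 32 bits.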

rust-highfive · 2021-09-10T03:04:12Z

r? @Amanieu

(rust-highfive has picked a reviewer for you, use r? to override)

hkratz · 2021-09-10T08:25:24Z

crates/stdarch-gen/neon.spec

-name = vmull
-n-suffix
+name = vmull_n
+no-q


Maybe this change should be documented in the PR or in an extra commit/extra PR as it changes the names of public intrinsics (to their correct names).

Thanks for pointing it out! I added a note about this to the PR description.

hkratz · 2021-09-10T08:25:37Z

crates/stdarch-gen/neon.spec

@@ -3568,7 +3733,7 @@ generate int16x4_t:i16:int16x4_t, int32x2_t:i32:int32x2_t

 /// Vector saturating doubling multiply high with scalar
 name = vqdmulhq_n
-out-suffix
+no-q


Amanieu · 2021-09-10T10:06:47Z

crates/core_arch/src/aarch64/neon/mod.rs

+#[target_feature(enable = "neon,aes")]
+#[cfg_attr(test, assert_instr(ldr))]
+pub unsafe fn vld1q_p64(ptr: *const p64) -> poly64x2_t {
+    transmute(u64x2::new(*ptr, *ptr.offset(1)))


There was a concern raised in #1148 that this doesn't get optimized down to a single instruction.

I agree. All the others have already been changed to use read_unaligned() instead, which cannot be seen in the diff because this PR is based on an older commit.

That makes sense. I will rebase the PR and make changes accordingly.
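
As an illustration of the two load styles discussed in this thread, here is a minimal portable sketch. `Lanes2` is a hypothetical stand-in for `poly64x2_t`, and both function names are made up for this example; this is not the stdarch implementation. Whether either form compiles to a single `ldr` depends on the compiler version and target and is not checked here.

```rust
use std::ptr;

/// Hypothetical stand-in for a two-lane 64-bit vector such as `poly64x2_t`.
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
#[repr(C)]
pub struct Lanes2(pub [u64; 2]);

/// Element-wise load, mirroring the shape of the code under review.
///
/// # Safety
/// `ptr` must be aligned for `u64` and valid for reading two consecutive `u64` values.
pub unsafe fn load_elementwise(ptr: *const u64) -> Lanes2 {
    unsafe { Lanes2([*ptr, *ptr.offset(1)]) }
}

/// Whole-vector load through `read_unaligned`, the style the reviewers prefer.
///
/// # Safety
/// `ptr` must be valid for reading 16 bytes; no alignment is required.
pub unsafe fn load_unaligned(ptr: *const u64) -> Lanes2 {
    unsafe { ptr::read_unaligned(ptr.cast::<Lanes2>()) }
}
```

Both functions return the same lanes for the same input. The second expresses the load as one 16-byte read rather than two scalar reads combined afterwards.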

hkratz · 2021-09-14T07:25:25Z

@Amanieu Can this be merged? Fixing #1212 and #1217 requires changes to stdarch-gen, and this PR contains a big refactoring of it. I would rather build on this than run into rebase conflicts.

Update stdarch submodule

This is mainly to fix the critical issue of aarch64 store intrinsics overwriting additional memory, see rust-lang/stdarch#1220

Changes:
* aarch64/armv7: additional vld1/vst1 intrinsics + perf fixes for existing ones
  * rust-lang/stdarch#1205
  * rust-lang/stdarch#1207
  * rust-lang/stdarch#1216
* armv7: Make FMA work with vfpv4 and optimize
  * rust-lang/stdarch#1219
* Non-visible changes to the testing framework
  * rust-lang/stdarch#1208
  * rust-lang/stdarch#1211
  * rust-lang/stdarch#1213
  * rust-lang/stdarch#1215
  * rust-lang/stdarch#1218

rust-highfive assigned Amanieu Sep 10, 2021

hkratz reviewed Sep 10, 2021

SparrowLii changed the title ~~Complete vld1 instructions with some optimization~~ Complete vld1 instructions with some corrections Sep 10, 2021

Amanieu reviewed Sep 10, 2021

SparrowLii added 3 commits September 10, 2021 18:26

Complete vld1 instructions with some corrections

07b2e4b

correct assert_instr

a4ca211

use read_unaligned in vld1_p64

3e0efe2

SparrowLii force-pushed the vld2 branch from 285da9e to 3e0efe2 September 10, 2021 10:32

Amanieu merged commit 30b3eb3 into rust-lang:master Sep 18, 2021

hkratz mentioned this pull request Sep 21, 2021

Update stdarch submodule rust-lang/rust#89145

Merged
