Use probe-stack=inline-asm in LLVM 11+ #77885

erikdesjardins · 2020-10-13T02:37:48Z

Fixes (?) #74405, related to #43241

r? @cuviper

src/test/assembly/stack-probes.rs

cuviper · 2020-10-14T21:50:00Z

Thanks! This may have perf implications, so I'm excluding it from rollups.

@bors r+ rollup=never

bors · 2020-10-14T21:50:01Z

📌 Commit 95269c2 has been approved by cuviper

bors · 2020-10-15T01:10:46Z

⌛ Testing commit 95269c2 with merge 1272a1a...

Use probe-stack=inline-asm in LLVM 11+ Fixes (?) rust-lang#74405, related to rust-lang#43241 r? `@cuviper`

bors · 2020-10-15T01:43:17Z

💔 Test failed - checks-actions

cuviper · 2020-10-15T16:22:37Z

The specific failures start here. I'm not sure why this change would cause new stack overflows, but it needs investigation.

failures:

---- [ui] ui/unsized-locals/by-value-trait-object-safety-rpass.rs stdout ----

error: test run failed!
status: signal: 6
command: "/checkout/obj/build/i686-unknown-linux-gnu/test/ui/unsized-locals/by-value-trait-object-safety-rpass/a"
stdout:
------------------------------------------

------------------------------------------
stderr:
------------------------------------------

thread 'main' has overflowed its stack
fatal runtime error: stack overflow

------------------------------------------


---- [ui] ui/unsized-locals/by-value-trait-object-safety-withdefault.rs stdout ----

error: test run failed!
status: signal: 6
command: "/checkout/obj/build/i686-unknown-linux-gnu/test/ui/unsized-locals/by-value-trait-object-safety-withdefault/a"
stdout:
------------------------------------------

------------------------------------------
stderr:
------------------------------------------

thread 'main' has overflowed its stack
fatal runtime error: stack overflow

------------------------------------------


---- [ui] ui/unsized-locals/autoderef.rs stdout ----

error: test run failed!
status: signal: 6
command: "/checkout/obj/build/i686-unknown-linux-gnu/test/ui/unsized-locals/autoderef/a"
stdout:
------------------------------------------

------------------------------------------
stderr:
------------------------------------------

thread 'main' has overflowed its stack
fatal runtime error: stack overflow

------------------------------------------



failures:
    [ui] ui/unsized-locals/autoderef.rs
    [ui] ui/unsized-locals/by-value-trait-object-safety-rpass.rs
    [ui] ui/unsized-locals/by-value-trait-object-safety-withdefault.rs

test result: FAILED. 10827 passed; 3 failed; 109 ignored; 0 measured; 0 filtered out

erikdesjardins · 2020-10-17T18:39:48Z

Looks like a comparison is inverted. This is a code snippet from by-value-trait-object-safety-rpass. Immediately before this, the size of the alloca is loaded into ebx, which in our case is 0 because the unsized trait object has size 0.

	add	ebx, 15
	and	ebx, -16
	mov	eax, esp
	sub	eax, ebx
	mov	dword ptr [ebp - 368], edx
	mov	dword ptr [ebp - 372], esi
	mov	dword ptr [ebp - 376], edi
	mov	dword ptr [ebp - 380], eax
.LBB87_43:
	mov	eax, dword ptr [ebp - 380]
	cmp	eax, esp
	jl	.LBB87_45
	mov	dword ptr [esp], 0
	sub	esp, 4096
	jmp	.LBB87_43
.LBB87_45:
	mov	eax, dword ptr [ebp - 380]
	mov	esp, eax

Due to:

cmp	eax, esp
jl	.LBB87_45

...if ebx is nonzero, the new stack pointer in eax will be below esp, and it will exit the loop without probing.
...if ebx is zero, the new stack pointer in eax will be equal to, and then greater than esp, and it will probe infinitely.
(and that's what happens running it locally under gdb)

The same thing shows up in Clang 11 with -fstack-clash-protection (https://godbolt.org/z/vxbaxd) with just:

int size;

void foo(void*);

int main() {
    foo(alloca(size));
}

I suppose nobody noticed this in C because it's rare to call alloca(0), and alloca(<nonzero>) will still run, it just won't actually probe anything.

Assuming I haven't misinterpreted something, can you report this to LLVM? I don't have an account yet.

pietroalbini · 2020-10-19T10:53:58Z

@bors r-

The PR failed but bors forgot about that during synchronize.

cuviper · 2020-10-19T17:36:49Z

Hmm, I thought we fixed that cmp direction in D82867, but it's possible we missed a case...
(edit: actually, I think that change may have mis-corrected the cmp for dynamic alloca... still investigating...)

erikdesjardins · 2020-10-19T22:19:35Z

Yeah, the changes to stack-clash-dynamic-alloca.ll in that revision look wrong; it was correct before (although maybe replace jl .LBB0_3 with jle to avoid probing for zero size allocas).
stack-clash-large.ll looks right if it's guaranteed that we always probe an exact multiple of the page size for static allocas.

serge-sans-paille · 2020-10-27T10:22:47Z

@erikdesjardins / @cuviper https://reviews.llvm.org/D90216 should do the trick. Thanks for spotting this!

cuviper · 2020-10-27T19:55:31Z

Ah, so there are two parts to this...

I suppose nobody noticed this in C because it's rare to call alloca(0),

This actually was already reported in bug 47657 and fixed in D88548.

and alloca(<nonzero>) will still run, it just won't actually probe anything.

This problem remains, but I just confirmed locally that it is fixed by D90216. I stepped through the assembly in gdb just to be sure. 🙂

I think both fixes would be good to have in LLVM 11.0.1. For Rust's part, we can backport the fixes to our bundled LLVM fork, but we should be careful about how this is enabled for external LLVM. I don't think we usually check the patch version for functionality, but maybe this one justifies it -- assuming 11.0.1 does get fixed.

- Perform the probing in the correct direction. Related to rust-lang/rust#77885 (comment) - The first touch on a dynamic alloca cannot use a mov because it clobbers existing space. Use a xor 0 instead Differential Revision: https://reviews.llvm.org/D90216

crlf0710 · 2020-11-13T12:21:03Z

Triage: So it seems this is blocked on a llvm bug.

- Perform the probing in the correct direction. Related to rust-lang/rust#77885 (comment) - The first touch on a dynamic alloca cannot use a mov because it clobbers existing space. Use a xor 0 instead Differential Revision: https://reviews.llvm.org/D90216 (cherry picked from commit 0f60bcc)

rust-timer · 2021-01-08T17:08:36Z

Awaiting bors try build completion.

bors · 2021-01-08T17:08:46Z

⌛ Trying commit b28b231 with merge 088a806...

Use probe-stack=inline-asm in LLVM 11+ Fixes (?) rust-lang#74405, related to rust-lang#43241 r? `@cuviper`

bors · 2021-01-08T18:09:09Z

☀️ Try build successful - checks-actions
Build commit: 088a806 (088a806ae338b7577aa6f01e5700a94f9feb139c)

rust-timer · 2021-01-08T18:09:11Z

Queued 088a806 with parent 937f629, future comparison URL.

@rustbot label: +S-waiting-on-perf

rust-timer · 2021-01-08T19:45:10Z

Finished benchmarking try commit (088a806): comparison url.

Benchmarking this pull request likely means that it is perf-sensitive, so we're automatically marking it as not fit for rolling up. Please note that if the perf results are neutral, you should likely undo the rollup=never given below by specifying rollup- to bors.

Importantly, though, if the results of this run are non-neutral do not roll this PR up -- it will mask other regressions or improvements in the roll up.

@bors rollup=never
@rustbot label: +S-waiting-on-review -S-waiting-on-perf

src/test/assembly/stack-probes.rs

cuviper · 2021-01-13T23:02:35Z

Now that #80796 has merged, can you rebase again to separate this PR from that change?

erikdesjardins · 2021-01-15T03:51:48Z

Rebased.

cuviper · 2021-01-15T21:54:46Z

@bors r+

bors · 2021-01-15T21:54:47Z

📌 Commit cd25807 has been approved by cuviper

bors · 2021-01-16T03:10:58Z

⌛ Testing commit cd25807 with merge 635ccfe...

bors · 2021-01-16T06:09:14Z

☀️ Test successful - checks-actions
Approved by: cuviper
Pushing 635ccfe to master...

…iper Target stack-probe support configurable finely This adds capability to configure the target's stack probe support in a more precise manner than just on/off. In particular now we allow choosing between always inline-asm, always call or either one of those depending on the LLVM version. Note that this removes the ability to turn off the generation of the stack-probe attribute. This is valid to replace it with inline-asm for all targets because `probe-stack="inline-asm"` will not generate any machine code on targets that do not currently support stack probes. This makes support for stack probes on targets that don't have any right now automatic with LLVM upgrades in the future. (This is valid to do based on the fact that clang unconditionally sets this attribute when `-fstack-clash-protection` is used, AFAICT) cc rust-lang#77885 r? `@cuviper`

rust-highfive assigned cuviper Oct 13, 2020

rust-highfive added the S-waiting-on-review label Oct 13, 2020

erikdesjardins reviewed Oct 13, 2020

View changes

src/test/assembly/stack-probes.rs Outdated Show resolved Hide resolved

bors added S-waiting-on-bors and removed S-waiting-on-review labels Oct 14, 2020

bors added S-waiting-on-review and removed S-waiting-on-bors labels Oct 15, 2020

cuviper added S-waiting-on-author and removed S-waiting-on-review labels Oct 15, 2020

bjorn3 mentioned this pull request Oct 18, 2020

Cranelift: Support stack probes without external function call bytecodealliance/wasmtime#2299

Open

crlf0710 added S-blocked and removed S-waiting-on-author labels Nov 13, 2020

bors added a commit to rust-lang-ci/rust that referenced this pull request Jan 8, 2021

Auto merge of rust-lang#77885 - erikdesjardins:probeasm, r=<try>

Loading status checks…

088a806

Use probe-stack=inline-asm in LLVM 11+ Fixes (?) rust-lang#74405, related to rust-lang#43241 r? `@cuviper`

rustbot added the S-waiting-on-perf label Jan 8, 2021

rustbot added S-waiting-on-review and removed S-waiting-on-perf labels Jan 8, 2021

nagisa reviewed Jan 8, 2021

View changes

src/test/assembly/stack-probes.rs Show resolved Hide resolved

nagisa mentioned this pull request Jan 9, 2021

Target stack-probe support configurable finely #80838

Merged

Use probe-stack=inline-asm in LLVM 11+

Loading status checks…

cd25807

erikdesjardins force-pushed the erikdesjardins:probeasm branch from b28b231 to cd25807 Jan 15, 2021

bors added S-waiting-on-bors and removed S-blocked S-waiting-on-review labels Jan 15, 2021

bors added the merged-by-bors label Jan 16, 2021

rustbot added this to the 1.51.0 milestone Jan 16, 2021

erikdesjardins deleted the erikdesjardins:probeasm branch Jan 16, 2021

addrianyy mentioned this pull request Jan 19, 2021

inline-asm stack probe used but not defined on x86_64-unknown-uefi #81196

Closed

rust-lang / rust

Use probe-stack=inline-asm in LLVM 11+ #77885

Use probe-stack=inline-asm in LLVM 11+ #77885

erikdesjardins commented Oct 13, 2020

cuviper commented Oct 14, 2020

bors commented Oct 14, 2020

bors commented Oct 15, 2020

bors commented Oct 15, 2020

cuviper commented Oct 15, 2020

erikdesjardins commented Oct 17, 2020 •

edited

pietroalbini commented Oct 19, 2020

cuviper commented Oct 19, 2020 •

edited

erikdesjardins commented Oct 19, 2020 •

edited

serge-sans-paille commented Oct 27, 2020

cuviper commented Oct 27, 2020

crlf0710 commented Nov 13, 2020

rust-timer commented Jan 8, 2021

bors commented Jan 8, 2021

bors commented Jan 8, 2021

rust-timer commented Jan 8, 2021

rust-timer commented Jan 8, 2021

cuviper commented Jan 13, 2021

erikdesjardins commented Jan 15, 2021

cuviper commented Jan 15, 2021

bors commented Jan 15, 2021

bors commented Jan 16, 2021

bors commented Jan 16, 2021

rust-lang / rust

Use probe-stack=inline-asm in LLVM 11+ #77885

Use probe-stack=inline-asm in LLVM 11+ #77885

Conversation

erikdesjardins commented Oct 13, 2020

cuviper commented Oct 14, 2020

bors commented Oct 14, 2020

bors commented Oct 15, 2020

bors commented Oct 15, 2020

cuviper commented Oct 15, 2020

erikdesjardins commented Oct 17, 2020 • edited

pietroalbini commented Oct 19, 2020

cuviper commented Oct 19, 2020 • edited

erikdesjardins commented Oct 19, 2020 • edited

serge-sans-paille commented Oct 27, 2020

cuviper commented Oct 27, 2020

crlf0710 commented Nov 13, 2020

rust-timer commented Jan 8, 2021

bors commented Jan 8, 2021

bors commented Jan 8, 2021

rust-timer commented Jan 8, 2021

rust-timer commented Jan 8, 2021

cuviper commented Jan 13, 2021

erikdesjardins commented Jan 15, 2021

cuviper commented Jan 15, 2021

bors commented Jan 15, 2021

bors commented Jan 16, 2021

bors commented Jan 16, 2021

erikdesjardins commented Oct 17, 2020 •

edited

cuviper commented Oct 19, 2020 •

edited

erikdesjardins commented Oct 19, 2020 •

edited