Value array stack alloc alt #111284

AndyAyersMS · 2025-01-10T19:46:18Z

No description provided.

These flags are strictly optimizations. Having them required to be set for certain indirs based on context of the operand introduces IR invariants that are annoying to work with since it suddenly becomes necessary for all transformations to consider "did we just introduce this IR shape?". Instead, handle these patterns during morph as the optimization it is and remove the strict flags checking from `fgDebugCheckFlags`.

Leave the newarr helper call in place, and don't rewrite the uses to be uses of the local. Remove special handling in local morph and morph. Lower the helper to the proper stores later on. Update a few utilities to understand array base addresses may not be TYP_REF.

dotnet-policy-service · 2025-01-10T19:47:15Z

Tagging subscribers to this area: @JulieLeeMSFT, @jakobbotsch
See info in area-owners.md if you want to be subscribed.

hez2010 · 2025-01-11T02:06:17Z

src/coreclr/jit/morph.cpp

@@ -3660,7 +3633,7 @@ GenTree* Compiler::fgMorphIndexAddr(GenTreeIndexAddr* indexAddr)
        }

        if (((index->gtFlags & (GTF_ASG | GTF_CALL | GTF_GLOB_REF)) != 0) ||
-            gtComplexityExceeds(index, MAX_INDEX_COMPLEXITY) || index->OperIs(GT_LCL_FLD) ||
+            gtComplexityExceeds(index, MAX_ARR_COMPLEXITY) || index->OperIs(GT_LCL_FLD) ||


This line is a correctness fix. We were checking the index tree using the arr complexity.

Ok, thanks. We should probably PR things like that separately.

Guess it's not actually a functional change so no need for a separate PR

const int MAX_ARR_COMPLEXITY = 4; const int MAX_INDEX_COMPLEXITY = 4;

Yeah I think it's a copy-paste mistake.

AndyAyersMS · 2025-01-11T02:28:56Z

I can't repro the x86 failure. Haven't looked into the R2R issues yet.

AndyAyersMS · 2025-01-12T20:02:28Z

One flaw with this approach is that the explicit zero inits are getting dead-coded by early liveness, because there are no evident uses of the array temp. We can fix this by using a custom new helper which takes the array address as an argument. And this introduces a dependence between the zero init and the new helper, which was absent before.

That is, instead of the current

   lArray = 0;
   pArray = new(type, size);
          = pArray[i];

we produce

   lArray = 0;
   pArray = new(type, size, &lArray);
          = pArray[i];

and (both) get later expanded to

   lArray = 0
   pArray = &lArray;
   pArray[0] = type;
   pArray[lenOffset] = size;
          = pArray[i];

The downside is we now have a jit helper with no runtime counterpart, which is a new concept, or we just tack on an extra arg to the existing helpers and hope it doesn't cause problems.

AndyAyersMS · 2025-01-15T23:03:55Z

This should show a decent set of SPMI diffs (though with 160ish misses in coreclr_tests we can remedy once we do yet another round of collection). Most of the diffs are now just extra prolog zeroing as the code shape in the method body is generally the same. Only rarely can we propagate through an array element, though I think we can improve on this some over time.

I think it's in good shape and we should seriously consider it as the way forward. We can either use this PR or merge back into #104906.

@hez2010, @jakobbotsch thoughts?

AndyAyersMS · 2025-01-16T01:43:44Z

For the R2R failures, it is failing on

TestRangeCheckElimination
Found:False
Not equal!
actual   = True
expected = False

This test seems to be checking what methods are inlineable across assembly boundaries, and evidently this method is now missing when it's expected to be in the allow list (I think the actual and expected labels are swapped here, the method is expected to be found in the list but isn't.)

Need to dig into this further. When I looked at it earlier I couldn't figure out where things were going wrong.

hez2010 · 2025-01-16T05:11:55Z

I'm seeing some regressions like this:

 
 G_M59007_IG01:        ; bbWeight=1, gcrefRegs=0000 {}, byrefRegs=0000 {}, byref, nogc <-- Prolog IG
-       sub      rsp, 40
+       sub      rsp, 24
 						;; size=4 bbWeight=1 PerfScore 0.25
 G_M59007_IG02:        ; bbWeight=1, gcrefRegs=0000 {}, byrefRegs=0000 {}, byref
+       vxorps   xmm0, xmm0, xmm0
+       vmovdqu  xmmword ptr [rsp], xmm0
+       vmovdqu  xmmword ptr [rsp+0x04], xmm0
+       mov      rax, 0xD1FFAB1E      ; <unknown class>
+       mov      qword ptr [rsp], rax
+       mov      dword ptr [rsp+0x08], 2
        xor      eax, eax
-						;; size=2 bbWeight=1 PerfScore 0.25
+						;; size=39 bbWeight=1 PerfScore 4.83
 G_M59007_IG03:        ; bbWeight=1, epilog, nogc, extend
-       add      rsp, 40
+       add      rsp, 24
        ret

It seems that we don't eliminate a dead stack allocated array. Is it expected?

AndyAyersMS · 2025-01-16T15:58:17Z

It seems that we don't eliminate a dead stack allocated array. Is it expected?

Yes, we will need some enhancements to detect cases like this.

AndyAyersMS · 2025-01-17T16:38:42Z

Pushed the changes over to #104906. So will close this.

hez2010 and others added 30 commits July 15, 2024 20:23

initial prototype

1b0e3d3

Morph ARR_LENGTH and INDEX_ADDR

57b7e42

Fix incorrect array length storage

1b5b25e

Use offset and correct type

395b735

handle reassignment

17de70b

range check

5443c42

throw range check failure

b2d07da

update comments

b5ae9e7

add metrics

87b29de

minor cleanup

eeb681d

Introduce new temp and implement local address morphing

dee9f38

handle index out-of-range

94c103b

Refactor to remove duplicates

12b297b

Remove invalid asserts

e0fa91e

make compiler happy

9e0a04f

Address review feedbacks

ae822f8

Fix INDEX_ADDR and add Sub

a4588bb

Support IsAddressLessThan and its friends

32b9e26

Fix assertions

39d1ad9

Merge remote-tracking branch 'origin/main' into value-array-stack-alloc

0df0d58

Use new overload

9f408b2

Remove old comment

4572408

Expose jitconfig

9255762

Remove another assert

1af84b9

Count

629c793

Try 2 at counting

b578203

Introduce BBF_HAS_NEWARR

b4445f6

Early exit on debug as well

af9c40e

Update computed flags

8b54f5a

hez2010 and others added 4 commits December 21, 2024 22:47

Oops

47114c6

Merge branch 'main' into value-array-stack-alloc

66e98cd

basic VN support

5137a5c

dotnet-issue-labeler bot added the area-CodeGen-coreclr CLR JIT compiler in src/coreclr/src/jit and related components such as SuperPMI label Jan 10, 2025

dotnet-policy-service bot assigned AndyAyersMS Jan 10, 2025

AndyAyersMS mentioned this pull request Jan 10, 2025

JIT: Extend escape analysis to account for arrays with non-gcref elements #104906

Merged

AndyAyersMS mentioned this pull request Jan 10, 2025

JIT: run extra SPMI queries for arrays #111293

Merged

build-analysis bot mentioned this pull request Jan 11, 2025

The hosted runner encountered an error while running your job. (Error Type: Disconnect). dotnet/dnceng#1919

Open

3 tasks

hez2010 reviewed Jan 11, 2025

View reviewed changes

restore complexity change

aa09187

AndyAyersMS added 5 commits January 14, 2025 11:25

pass address of stack local to new helper

c451435

Merge branch 'main' into value-array-stack-alloc-alt

18d3677

temp hack to boost SPMI coverage

f6b012a

avoid pessimizing tail calls. implement configurable size limit

a6e7bd5

add missing well known arg string

b3edc07

This was referenced Jan 16, 2025

slow macOS - "##[error]The job running on agent Azure Pipelines 9 ran longer than the maximum time of 60 minutes." dotnet/dnceng#1883

Open

The Operation will be canceled. The next steps may not contain expected logs. dotnet/dnceng#3008

Open

AndyAyersMS added 3 commits January 16, 2025 13:38

fix array length check

9b63e2d

use ClrSafeInt directly

e535657

bypass for R2R for now, since it may inhibit prejitting

07cd310

AndyAyersMS closed this Jan 17, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Value array stack alloc alt #111284

Value array stack alloc alt #111284

AndyAyersMS commented Jan 10, 2025

dotnet-policy-service bot commented Jan 10, 2025

hez2010 Jan 11, 2025 •

edited

Loading

AndyAyersMS Jan 11, 2025

AndyAyersMS Jan 11, 2025

hez2010 Jan 11, 2025

AndyAyersMS commented Jan 11, 2025

AndyAyersMS commented Jan 12, 2025

AndyAyersMS commented Jan 15, 2025

AndyAyersMS commented Jan 16, 2025

hez2010 commented Jan 16, 2025

AndyAyersMS commented Jan 16, 2025

AndyAyersMS commented Jan 17, 2025

Value array stack alloc alt #111284

Value array stack alloc alt #111284

Conversation

AndyAyersMS commented Jan 10, 2025

dotnet-policy-service bot commented Jan 10, 2025

hez2010 Jan 11, 2025 • edited Loading

Choose a reason for hiding this comment

AndyAyersMS Jan 11, 2025

Choose a reason for hiding this comment

AndyAyersMS Jan 11, 2025

Choose a reason for hiding this comment

hez2010 Jan 11, 2025

Choose a reason for hiding this comment

AndyAyersMS commented Jan 11, 2025

AndyAyersMS commented Jan 12, 2025

AndyAyersMS commented Jan 15, 2025

AndyAyersMS commented Jan 16, 2025

hez2010 commented Jan 16, 2025

AndyAyersMS commented Jan 16, 2025

AndyAyersMS commented Jan 17, 2025

hez2010 Jan 11, 2025 •

edited

Loading