- Apr 23, 2008
- Anton Korobeynikov authored
  llvm-svn: 50173
- Anton Korobeynikov authored
  llvm-svn: 50169
- Anton Korobeynikov authored
  llvm-svn: 50168
- Anton Korobeynikov authored
  Be over-conservative: scan all used virtual registers and calculate the maximal stack alignment on the assumption that a vector register will be spilled. llvm-svn: 50167
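
  A minimal C++ sketch of the over-conservative estimate described above (the types and names are hypothetical, not LLVM's actual API): take the maximum spill-slot alignment over every virtual register in use, as if each one might be spilled.

    #include <algorithm>
    #include <vector>

    // Each register class knows how strictly a spill slot for one of
    // its registers must be aligned (e.g. 16 bytes for a vector class).
    struct RegClass { unsigned SpillAlignment; };

    // Assume every used virtual register may be spilled, so the frame
    // must satisfy the largest spill alignment any of them could need.
    unsigned estimateMaxStackAlignment(
        const std::vector<const RegClass *> &UsedVRegs,
        unsigned DefaultAlignment) {
      unsigned MaxAlign = DefaultAlignment;
      for (const RegClass *RC : UsedVRegs)
        MaxAlign = std::max(MaxAlign, RC->SpillAlignment);
      return MaxAlign;
    }

  Scanning everything up front trades precision for safety: some registers will never be spilled, but the frame layout must be decided before that is known.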
- Anton Korobeynikov authored
  llvm-svn: 50166
- Anton Korobeynikov authored
  llvm-svn: 50165
- Anton Korobeynikov authored
  llvm-svn: 50164
- Anton Korobeynikov authored
  llvm-svn: 50163
- Anton Korobeynikov authored
  llvm-svn: 50162
- Anton Korobeynikov authored
  Estimate the required stack alignment early, so we can decide whether we will need a frame pointer or not. llvm-svn: 50161
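
  A hedged sketch of the decision rule this early estimate enables (simplified; real targets weigh more factors than this): if the function needs more alignment than the ABI guarantees at entry, the prologue must realign the stack, and a frame pointer is then needed to reach incoming arguments at a fixed offset.

    // Hypothetical helper, not LLVM's actual API: decide up front
    // whether stack realignment forces a frame pointer.
    bool needsFramePointer(unsigned RequiredStackAlign,
                           unsigned ABIStackAlign) {
      return RequiredStackAlign > ABIStackAlign;
    }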
- Anton Korobeynikov authored
  llvm-svn: 50160
- Anton Korobeynikov authored
  llvm-svn: 50159
- Anton Korobeynikov authored
  llvm-svn: 50158
- Anton Korobeynikov authored
  llvm-svn: 50157
- Anton Korobeynikov authored
  llvm-svn: 50156
- Anton Korobeynikov authored
  llvm-svn: 50155
- Anton Korobeynikov authored
  llvm-svn: 50154
- Anton Korobeynikov authored
  llvm-svn: 50153
- Anton Korobeynikov authored
  llvm-svn: 50152
- Chris Lattner authored
  callees. llvm-svn: 50142
- Chris Lattner authored
  fix read after free bug (PR2238). llvm-svn: 50141
- Chris Lattner authored
  would turn every getresult instruction into undef. This helps with rdar://5778210 llvm-svn: 50140
- Chris Lattner authored
  llvm-svn: 50139
- Chris Lattner authored
  logic with vmcore. llvm-svn: 50138
- Chris Lattner authored
  llvm-svn: 50137
- Chris Lattner authored
  call/invoke or undef. llvm-svn: 50129
- Dale Johannesen authored
  type of a different size. llvm-svn: 50121
- Evan Cheng authored
  Don't do: "(X & 4) >> 1 == 2 --> (X & 4) == 4" if there is more than one use of the shift result. llvm-svn: 50118
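
  The fold itself is sound; a small self-contained C++ check (my illustration, not LLVM code) exercises the identity exhaustively over the relevant bit:

    #include <cassert>
    #include <cstdint>

    int main() {
      // ((X & 4) >> 1) == 2 holds exactly when bit 2 of X is set,
      // which is exactly when (X & 4) == 4.
      for (uint32_t X = 0; X < 16; ++X)
        assert((((X & 4u) >> 1) == 2u) == ((X & 4u) == 4u));
      return 0;
    }

  The commit restricts the fold presumably because when the shift result has other uses, the shift stays live anyway, so rewriting the compare no longer eliminates an instruction.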
- Apr 22, 2008
- Chris Lattner authored
  where a comparison has a phi input and that phi is a constant. For example, stuff like:

    Threading edge through bool from 'bb2149' to 'bb2231' with cost: 1, across block:

    bb2237:  ; preds = %bb2231, %bb2149
      %tmp2328.rle = phi i32 [ %tmp2232, %bb2231 ], [ %tmp2232439, %bb2149 ]  ; <i32> [#uses=2]
      %done.0 = phi i32 [ %done.2, %bb2231 ], [ 0, %bb2149 ]  ; <i32> [#uses=1]
      %tmp2239 = icmp eq i32 %done.0, 0  ; <i1> [#uses=1]
      br i1 %tmp2239, label %bb2231, label %bb2327

  or

    bb38.i298:  ; preds = %bb33.i295, %bb1693
      %tmp39.i296.rle = phi %struct.ibox* [ null, %bb1693 ], [ %tmp39.i296.rle1109, %bb33.i295 ]  ; <%struct.ibox*> [#uses=2]
      %minspan.1.i291.reg2mem.1 = phi i32 [ 32000, %bb1693 ], [ %minspan.0.i288, %bb33.i295 ]  ; <i32> [#uses=1]
      %tmp40.i297 = icmp eq %struct.ibox* %tmp39.i296.rle, null  ; <i1> [#uses=1]
      br i1 %tmp40.i297, label %implfeeds.exit311, label %bb43.i301

  This triggers thousands of times in spec.

  llvm-svn: 50110
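
  A toy C++ analogue of the pattern above (hypothetical helper names, not LLVM code): when the compared value is a phi with a constant incoming value, the branch is decidable on that edge, so the edge can be threaded straight to the known successor.

    int compute();     // stand-ins for the surrounding code
    void loopBody();
    void finish();

    // Before threading: both edges merge, then the merged value is tested.
    void before(bool fromInit) {
      int done = fromInit ? 0 : compute();   // plays the role of the phi
      if (done == 0) loopBody(); else finish();
    }

    // After threading: on the edge where the phi is the constant 0 the
    // test is known true, so that edge bypasses the comparison entirely.
    void after(bool fromInit) {
      if (fromInit) { loopBody(); return; }
      if (compute() == 0) loopBody(); else finish();
    }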
- Chris Lattner authored
  llvm-svn: 50106
- Chris Lattner authored
  br (and X, phi(Y, Z, false)), label L1, label L2

  This triggers once on 252.eon and 6 times on 176.gcc. Blocks in question often look like this:

    bb262:  ; preds = %bb261, %bb248
      %iftmp.251.0 = phi i1 [ true, %bb261 ], [ false, %bb248 ]  ; <i1> [#uses=4]
      %tmp270 = icmp eq %struct.rtx_def* %tmp.0.i, null  ; <i1> [#uses=1]
      %bothcond = or i1 %iftmp.251.0, %tmp270  ; <i1> [#uses=1]
      br i1 %bothcond, label %bb288, label %bb273

  In this case, it is clear that it doesn't matter if tmp.0.i is null when coming from bb261. When coming from bb248, it is all that matters.

  Another random example:

    check_asm_operands.exit:  ; preds = %check_asm_operands.exit.thr_comm, %bb30.i, %bb12.i, %bb6.i413
      %tmp.0.i420 = phi i1 [ true, %bb6.i413 ], [ true, %bb12.i ], [ true, %bb30.i ], [ false, %check_asm_operands.exit.thr_comm ]  ; <i1> [#uses=1]
      call void @llvm.stackrestore( i8* %savedstack ) nounwind
      %tmp4389 = icmp eq i32 %added_sets_1.0, 0  ; <i1> [#uses=1]
      %tmp4394 = icmp eq i32 %added_sets_2.0, 0  ; <i1> [#uses=1]
      %bothcond80 = and i1 %tmp4389, %tmp4394  ; <i1> [#uses=1]
      %bothcond81 = and i1 %bothcond80, %tmp.0.i420  ; <i1> [#uses=1]
      br i1 %bothcond81, label %bb4398, label %bb4397

  Here is the case from 252.eon:

    bb290.i.i:  ; preds = %bb23.i57.i.i, %bb8.i39.i.i, %bb100.i.i, %bb100.i.i, %bb85.i.i110
      %myEOF.1.i.i = phi i1 [ true, %bb100.i.i ], [ true, %bb100.i.i ], [ true, %bb85.i.i110 ], [ true, %bb8.i39.i.i ], [ false, %bb23.i57.i.i ]  ; <i1> [#uses=2]
      %i.4.i.i = phi i32 [ %i.1.i.i, %bb85.i.i110 ], [ %i.0.i.i, %bb100.i.i ], [ %i.0.i.i, %bb100.i.i ], [ %i.3.i.i, %bb8.i39.i.i ], [ %i.3.i.i, %bb23.i57.i.i ]  ; <i32> [#uses=3]
      %tmp292.i.i = load i8* %tmp16.i.i100, align 1  ; <i8> [#uses=1]
      %tmp293.not.i.i = icmp ne i8 %tmp292.i.i, 0  ; <i1> [#uses=1]
      %bothcond.i.i = and i1 %tmp293.not.i.i, %myEOF.1.i.i  ; <i1> [#uses=1]
      br i1 %bothcond.i.i, label %bb202.i.i, label %bb301.i.i

  Factoring out 3 common predecessors. On the path from any blocks other than bb23.i57.i.i, the load and compare are dead.

  llvm-svn: 50096
- Chris Lattner authored
  llvm-svn: 50094
- Chris Lattner authored
  llvm-svn: 50080
- Chris Lattner authored
  getelementptr-seteq.ll into:

    define i1 @test(i64 %X, %S* %P) {
      %C = icmp eq i64 %X, -1  ; <i1> [#uses=1]
      ret i1 %C
    }

  instead of:

    define i1 @test(i64 %X, %S* %P) {
      %A.idx.mask = and i64 %X, 4611686018427387903  ; <i64> [#uses=1]
      %C = icmp eq i64 %A.idx.mask, 4611686018427387903  ; <i1> [#uses=1]
      ret i1 %C
    }

  And fixes the second half of PR2235. This speeds up the insertion sort case by 45%, from 1.12s to 0.77s. In practice, this will significantly speed up for loops structured like:

    for (double *P = Base + N; P != Base; --P)
      ...

  Which happens frequently for C++ iterators.

  llvm-svn: 50079
- Chris Lattner authored
  llvm-svn: 50078
- Dan Gohman authored
  argument. The x86-64 ABI requires the incoming value of %rdi to be copied to %rax on exit from a function that is returning a large C struct. Also, add a README-X86-64 entry detailing the missed optimization opportunity and proposing an alternative approach. llvm-svn: 50075
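
  For illustration, a minimal C++ example of the situation the message describes (my example, not from the commit): a function returning a struct too large for registers receives a hidden pointer to the return slot in %rdi, and the x86-64 ABI requires that same pointer to come back in %rax.

    struct Big { long a, b, c, d; };   // 32 bytes: returned via memory

    // Lowered roughly as: void makeBig(Big *ret /* hidden, in %rdi */);
    // on return, %rax must hold the same pointer that arrived in %rdi,
    // so callers can find the result without keeping a copy themselves.
    Big makeBig() {
      Big B = {1, 2, 3, 4};
      return B;                        // stores through the hidden pointer
    }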
- Apr 21, 2008
- Dan Gohman authored
  empty ScheduleDAG. llvm-svn: 50054
- Dan Gohman authored
  llvm-svn: 50053
- Dan Gohman authored
  llvm-svn: 50051
- Chris Lattner authored
  llvm-svn: 50047