Commits · decee183ee347b4f56148d3f938808cda6f3a88f · Roger Ferrer / llvm-epi-0.8

Mar 13, 2009

Fix FastISel's assumption that i1 values are always zero-extended · c0bb9595

Dan Gohman authored Mar 13, 2009

by inserting explicit zero extensions where necessary. Included
is a testcase where SelectionDAG produces a virtual register
holding an i1 value which FastISel previously mistakenly assumed
to be zero-extended.

llvm-svn: 66941

c0bb9595

remove a test that depends on -debug. · 2150eb9f
Chris Lattner authored Mar 13, 2009
```
llvm-svn: 66937
```
2150eb9f
remove a testcase that depends on -debug existing. · 033a654d
Chris Lattner authored Mar 13, 2009
```
llvm-svn: 66936
```
033a654d
add 8 and 16 bit TLS moves. · 997b74ac
Rafael Espindola authored Mar 13, 2009
```
add a fixme note on how to remove code duplication.

llvm-svn: 66932
```
997b74ac
One more place where debug info affects codegen. · c6583051
Dale Johannesen authored Mar 13, 2009
```
llvm-svn: 66930
```
c6583051
Test case for rev. 66925 · a01646ef
Devang Patel authored Mar 13, 2009
```
llvm-svn: 66927
```
a01646ef
Improve sext and zext of TLS variables. · 71144973
Rafael Espindola authored Mar 13, 2009
```
llvm-svn: 66922
```
71144973

Second installment of "BasicBlock operands to the back" · 258232fb

Gabor Greif authored Mar 13, 2009

changes.

For InvokeInst now all arguments begin at op_begin().
The Callee, Cont and Fail are now faster to get by
access relative to op_end().

This patch introduces some temporary uglyness in CallSite.
Next I'll bring CallInst up to a similar scheme and then
the uglyness will magically vanish.

This patch also exposes all the reliance of the libraries
on InvokeInst's operand ordering. I am thinking of taking
care of that too.

llvm-svn: 66920

258232fb

remove a buggy test, it is not ok to use -debug in RUN line. · a18c768e
Chris Lattner authored Mar 13, 2009
```
llvm-svn: 66918
```
a18c768e

generalize this code so that fast isel handles integer truncates to i1, which · 3fb71c8f

Chris Lattner authored Mar 13, 2009

codegen to the same thing as integer truncates to i8 (the top bits are 
just undefined).  This implements rdar://6667338

llvm-svn: 66902

3fb71c8f

add a new TGError class and use it to propagate location info with · ba42e49c

Chris Lattner authored Mar 13, 2009

errors when thrown.  This gets us nice errors like this from tblgen:

CMOVL32rr: 	(set GR32:i32:$dst, (X86cmov GR32:$src1, GR32:$src2))
/Users/sabre/llvm/Debug/bin/tblgen: error:
Included from X86.td:116:
Parsing X86InstrInfo.td:922: In CMOVL32rr: X86cmov node requires exactly 4 operands!
def CMOVL32rr : I<0x4C, MRMSrcReg,       // if <s, GR32 = GR32
^

instead of just:

CMOVL32rr: 	(set GR32:i32:$dst, (X86cmov GR32:$src1, GR32:$src2))
/Users/sabre/llvm/Debug/bin/tblgen: In CMOVL32rr: X86cmov node requires exactly 4 operands!

This is all I plan to do with this, but it should be easy enough to improve if anyone 
cares (e.g. keeping more loc info in "dag" expr records in tblgen.

llvm-svn: 66898

ba42e49c

give each Record a location. · bd9b9210
Chris Lattner authored Mar 13, 2009
```
llvm-svn: 66897
```
bd9b9210
make "locations" a class instead of a typedef. · 87710ca5
Chris Lattner authored Mar 13, 2009
```
llvm-svn: 66895
```
87710ca5
Update these for the 2.5 release. · 0c9742c3
Duncan Sands authored Mar 13, 2009
```
llvm-svn: 66890
```
0c9742c3
These instructions have special lowering that may lower them to SSE · 798fd56d
Bill Wendling authored Mar 13, 2009
```
instructions. Prevent that if we don't want implicit uses of SSE.

llvm-svn: 66877
```
798fd56d
Unbreak build, bring in std::string for GCC 4.3 · afc74e23
Argyrios Kyrtzidis authored Mar 13, 2009
```
llvm-svn: 66876
```
afc74e23

Fix some significant problems with constant pools that resulted in unnecessary... · 1fb8aedd

Evan Cheng authored Mar 13, 2009

Fix some significant problems with constant pools that resulted in unnecessary paddings between constant pool entries, larger than necessary alignments (e.g. 8 byte alignment for .literal4 sections), and potentially other issues.

1. ConstantPoolSDNode alignment field is log2 value of the alignment requirement. This is not consistent with other SDNode variants.
2. MachineConstantPool alignment field is also a log2 value.
3. However, some places are creating ConstantPoolSDNode with alignment value rather than log2 values. This creates entries with artificially large alignments, e.g. 256 for SSE vector values.
4. Constant pool entry offsets are computed when they are created. However, asm printer group them by sections. That means the offsets are no longer valid. However, asm printer uses them to determine size of padding between entries.
5. Asm printer uses expensive data structure multimap to track constant pool entries by sections.
6. Asm printer iterate over SmallPtrSet when it's emitting constant pool entries. This is non-deterministic.

Solutions:
1. ConstantPoolSDNode alignment field is changed to keep non-log2 value.
2. MachineConstantPool alignment field is also changed to keep non-log2 value.
3. Functions that create ConstantPool nodes are passing in non-log2 alignments.
4. MachineConstantPoolEntry no longer keeps an offset field. It's replaced with an alignment field. Offsets are not computed when constant pool entries are created. They are computed on the fly in asm printer and JIT.
5. Asm printer uses cheaper data structure to group constant pool entries.
6. Asm printer compute entry offsets after grouping is done.
7. Change JIT code to compute entry offsets on the fly.

llvm-svn: 66875

1fb8aedd

Unbreak build. · bd561627
Evan Cheng authored Mar 13, 2009
```
llvm-svn: 66874
```
bd561627
split buffer management and diagnostic printing out of the tblgen · 8db9bc7e
Chris Lattner authored Mar 13, 2009
```
lexer into its own TGSourceMgr class.

llvm-svn: 66873
```
8db9bc7e
Convert VirtRegMap to a MachineFunctionPass. · d37ddf5b
Owen Anderson authored Mar 13, 2009
```
llvm-svn: 66870
```
d37ddf5b

generalize the previous code to use the full generality of LEA · 99cc1337

Chris Lattner authored Mar 13, 2009

for i32/i64 expressions (we could also do i16 on cpus where
i16 lea is fast, but I didn't add this).  On the example, we now
generate:

_test:
	movl	4(%esp), %eax
	cmpl	$42, (%eax)
	setl	%al
	movzbl	%al, %eax
	leal	4(%eax,%eax,8), %eax
	ret

instead of:

_test:
	movl	4(%esp), %eax
	cmpl	$41, (%eax)
	movl	$4, %ecx
	movl	$13, %eax
	cmovg	%ecx, %eax
	ret

llvm-svn: 66869

99cc1337

optimize the case of cond ? 42 : 41 and friends. This compiles the · 4be6df5d

Chris Lattner authored Mar 13, 2009

example to:

_test:
	movl	4(%esp), %eax
	cmpl	$41, (%eax)
	setg	%al
	movzbl	%al, %eax
	orl	$4294967294, %eax
	ret

instead of:

        movl    4(%esp), %eax
        cmpl    $41, (%eax)
	movl	$4294967294, %ecx
	movl	$4294967295, %eax
	cmova	%ecx, %eax
	ret

which is smaller in code size and faster. rdar://6668608

llvm-svn: 66868

4be6df5d

Oops...I committed too much. · fa54bc20
Bill Wendling authored Mar 13, 2009
```
llvm-svn: 66867
```
fa54bc20
Temporarily XFAIL this test. · b02eadf6
Bill Wendling authored Mar 13, 2009
```
llvm-svn: 66866
```
b02eadf6

Enhance address-mode folding of ISD::ADD to handle cases where the · a1d92423

Dan Gohman authored Mar 13, 2009

operands can't both be fully folded at the same time. For example,
in the included testcase, a global variable is being added with
an add of two values. The global variable wants RIP-relative
addressing, so it can't share the address with another base
register, but it's still possible to fold the initial add.

llvm-svn: 66865

a1d92423

Fix one more place where debug info affected · cecfa6e0
Dale Johannesen authored Mar 13, 2009
```
codegen (speculative execution).

llvm-svn: 66859
```
cecfa6e0
just initialize the first element, we don't need to set the rest to zeros. · b858c0eb
Chris Lattner authored Mar 13, 2009
```
llvm-svn: 66850
```
b858c0eb
Eliminate a 9640 byte static mutable initialized data item by moving it · 0bf18690
Chris Lattner authored Mar 13, 2009
```
to the stack.  This shrinks all llvm tools by 9k, and improves reentrancy.

llvm-svn: 66847
```
0bf18690
static functions don't need an anonymous namespace. · 91702096
Chris Lattner authored Mar 12, 2009
```
llvm-svn: 66845
```
91702096
Fix a typo in a comment. · a19c662a
Dan Gohman authored Mar 12, 2009
```
llvm-svn: 66843
```
a19c662a

Previous debug info fix to this code wasn't quite · ed6f5a82

Dale Johannesen authored Mar 12, 2009

right; did the wrong thing when there are exactly 11
non-debug instructions, followed by debug info.
Remove a FIXME since it's apparently been fixed along the way.

llvm-svn: 66840

ed6f5a82

cosmetic change, in preparation of future change · af76c34b
Gabor Greif authored Mar 12, 2009
```
llvm-svn: 66839
```
af76c34b
Add this test back. · 50a839e6
Evan Cheng authored Mar 12, 2009
```
llvm-svn: 66838
```
50a839e6

Mar 12, 2009

raw_ostream: unbuffered streams weren't being immediately flushed on · db948ffa
Daniel Dunbar authored Mar 12, 2009
```
single character writes.

llvm-svn: 66827
```
db948ffa

Revert commit 66140 since it caused several failures · 1f853d6a

Duncan Sands authored Mar 12, 2009

in the Ada testcase.  Reverting this only covers up
the real problem, which is a nasty conceptual difficulty
in the phi elimination pass: when eliminating phi nodes
in landing pads, the register copies need to come before
the invoke, not at the end of the basic block which is
too late...  See PR3784.

llvm-svn: 66826

1f853d6a

Darwin 10.4.x: "-rpath" is unnecessary when linking shared libraries. · b1a830ab
Scott Michel authored Mar 12, 2009
```
llvm-svn: 66825
```
b1a830ab
There already was a class to force deterministic · 7f99d22f
Dale Johannesen authored Mar 12, 2009
```
sorting of ConstantInt's; unreinvent wheel.

llvm-svn: 66824
```
7f99d22f

Fix an inconsistent use of LLVMGCCDIR. In all other cases, this directory · e4467e46

Bob Wilson authored Mar 12, 2009

refers to the "prefix" directory, i.e., one level above "bin".  LLVMGCCPATH
is used as the directory containing the llvm-gcc executable, so add a "/bin"
suffix to get from LLVMGCCDIR to LLVMGCCPATH.

llvm-svn: 66823

e4467e46

Rearrange operands of the BranchInst, to be able to · c91aa9b8

Gabor Greif authored Mar 12, 2009

access each with a fixed negative index from op_end().

This has two important implications:
- getUser() will work faster, because there are less iterations
  for the waymarking algorithm to perform. This is important
  when running various analyses that want to determine callers
  of basic blocks.
- getSuccessor() now runs faster, because the indirection via OperandList
  is not necessary: Uses corresponding to the successors are at fixed
  offset to "this".

The price we pay is the slightly more complicated logic in the operator
User::delete, as it has to pick up the information whether it has to free
the memory of an original unconditional BranchInst or a BranchInst that
was originally conditional, but has been shortened to unconditional.
I was not able to come up with a nicer solution to this problem. (And
rest assured, I tried *a lot*).

Similar reorderings will follow for InvokeInst and CallInst. After that
some optimizations to pred_iterator and CallSite will fall out naturally.

llvm-svn: 66815

c91aa9b8

Re-apply 66024 with fixes: 1. Fixed indirect call to immediate address... · 2a332aa8

Evan Cheng authored Mar 12, 2009

Re-apply 66024 with fixes: 1. Fixed indirect call to immediate address assembly. 2. Fixed JIT encoding by making the address pc-relative.

llvm-svn: 66803

2a332aa8