Commits · a88cd4ea2a205346302760b5f0c8b815a1c20c16 · Roger Ferrer / llvm-epi-0.8

Jun 13, 2008

Disable some DAG combiner optimizations that may be · 8651e9c5

Duncan Sands authored Jun 13, 2008

wrong for volatile loads and stores.  In fact this
is almost all of them!  There are three types of
problems: (1) it is wrong to change the width of
a volatile memory access.  These may be used to
do memory mapped i/o, in which case a load can have
an effect even if the result is not used.  Consider
loading an i32 but only using the lower 8 bits.  It
is wrong to change this into a load of an i8, because
you are no longer tickling the other three bytes.  It
is also unwise to make a load/store wider.  For
example, changing an i16 load into an i32 load is
wrong no matter how aligned things are, since the
fact of loading an additional 2 bytes can have
i/o side-effects.  (2) it is wrong to change the
number of volatile load/stores: they may be counted
by the hardware.  (3) it is wrong to change a volatile
load/store that requires one memory access into one
that requires several.  For example on x86-32, you
can store a double in one processor operation, but to
store an i64 requires two (two i32 stores).  In a
multi-threaded program you may want to bitcast an i64
to a double and store as a double because that will
occur atomically, and be indivisible to other threads.
So it would be wrong to convert the store-of-double
into a store of an i64, because this will become two
i32 stores - no longer atomic.  My policy here is
to say that the number of processor operations for
an illegal operation is undefined.  So it is alright
to change a store of an i64 (requires at least two
stores; but could be validly lowered to memcpy for
example) into a store of double (one processor op).
In short, if the new store is legal and has the same
size then I say that the transform is ok.  It would
also be possible to say that transforms are always
ok if before they were illegal, whether after they
are illegal or not, but that's more awkward to do
and I doubt it buys us anything much.
However this exposed an interesting thing - on x86-32
a store of i64 is considered legal!  That is because
operations are marked legal by default, regardless of
whether the type is legal or not.  In some ways this
is clever: before type legalization this means that
operations on illegal types are considered legal;
after type legalization there are no illegal types
so now operations are only legal if they really are.
But I consider this to be too cunning for mere mortals.
Better to do things explicitly by testing AfterLegalize.
So I have changed things so that operations with illegal
types are considered illegal - indeed they can never
map to a machine operation.  However this means that
the DAG combiner is more conservative because before
it was "accidentally" performing transforms where the
type was illegal because the operation was nonetheless
marked legal.  So in a few such places I added a check
on AfterLegalize, which I suppose was actually just
forgotten before.  This causes the DAG combiner to do
slightly more than it used to, which resulted in the X86
backend blowing up because it got a slightly surprising
node it wasn't expecting, so I tweaked it.

llvm-svn: 52254

8651e9c5

Jun 11, 2008
- Properly lower DYNAMIC_STACKALLOC - bracket all black magic with · 729c4e95
  Anton Korobeynikov authored Jun 11, 2008
```
CALLSEQ_BEGIN & CALLSEQ_END.

llvm-svn: 52225
```
  729c4e95
Jun 09, 2008
- CPPBackend support for extractvalue and insertvalue. · 6e384fc2
  Dan Gohman authored Jun 09, 2008
```
llvm-svn: 52147
```
  6e384fc2
- Abort on an unrecognized opcode. · 7be3fc7c
  Dan Gohman authored Jun 09, 2008
```
llvm-svn: 52146
```
  7be3fc7c
- Update the CPP backend for the ConstantFP::get API change. · 62f63f43
  Dan Gohman authored Jun 09, 2008
```
llvm-svn: 52144
```
  62f63f43
- add support for PIC on linux x86-64 · 29479df2
  Rafael Espindola authored Jun 09, 2008
```
llvm-svn: 52139
```
  29479df2
Jun 08, 2008

Remove comparison methods for MVT. The main cause · 11dd4245

Duncan Sands authored Jun 08, 2008

of apint codegen failure is the DAG combiner doing
the wrong thing because it was comparing MVT's using
< rather than comparing the number of bits.  Removing
the < method makes this mistake impossible to commit.
Instead, add helper methods for comparing bits and use
them.

llvm-svn: 52098

11dd4245

Added FP instruction formats. · 041604ba
Bruno Cardoso Lopes authored Jun 08, 2008
```
llvm-svn: 52086
```
041604ba
Temporarily reverting r52056. It's causing PPC to fail to bootstrap. · b7272db9
Bill Wendling authored Jun 08, 2008
```
llvm-svn: 52085
```
b7272db9

Jun 07, 2008
- Added support for FP Registers · f09c3721
  Bruno Cardoso Lopes authored Jun 07, 2008
```
llvm-svn: 52079
```
  f09c3721
- Revert r52046. It broke cbe on x86 / Mac OS X. · 1a083501
  Evan Cheng authored Jun 07, 2008
```
llvm-svn: 52071
```
  1a083501
Jun 06, 2008

Typo. · 0b8f2c53
Evan Cheng authored Jun 06, 2008
```
llvm-svn: 52062
```
0b8f2c53
PPC preferred loop alignment is 16. · 9bf9110d
Evan Cheng authored Jun 06, 2008
```
llvm-svn: 52056
```
9bf9110d
Handle assembler identifiers specially in CBE. This fixes PR2418. · f69bc3df
Anton Korobeynikov authored Jun 06, 2008
```
llvm-svn: 52046
```
f69bc3df

Wrap MVT::ValueType in a struct to get type safety · 13237ac3

Duncan Sands authored Jun 06, 2008

and better control the abstraction.  Rename the type
to MVT.  To update out-of-tree patches, the main
thing to do is to rename MVT::ValueType to MVT, and
rewrite expressions like MVT::getSizeInBits(VT) in
the form VT.getSizeInBits().  Use VT.getSimpleVT()
to extract a MVT::SimpleValueType for use in switch
statements (you will get an assert failure if VT is
an extended value type - these shouldn't exist after
type legalization).
This results in a small speedup of codegen and no
new testsuite failures (x86-64 linux).

llvm-svn: 52044

13237ac3

Added custom isel for MUL, SDIVREM, UDIVREM, SMUL_LOHI and UMUL_LOHI nodes · 1a6e0d61

Bruno Cardoso Lopes authored Jun 06, 2008

MUL is not anymore directly matched because its a pseudoinstruction.
LogicI class fixed to zero-extend immediates. 

llvm-svn: 52036

1a6e0d61

Added custom SELECT_CC lowering · 4eed3afd
Bruno Cardoso Lopes authored Jun 06, 2008
```
Added special isel for ADDE,SUBE and new patterns to match SUBC,ADDC

llvm-svn: 52031
```
4eed3afd
Don't break strict aliasing. · 9e76c047
Evan Cheng authored Jun 05, 2008
```
llvm-svn: 52026
```
9e76c047

Jun 04, 2008

Rewrite a bunch of the CBE's inline asm code, giving it the · c596ec04
Chris Lattner authored Jun 04, 2008
```
ability to handle indirect input operands.  This fixes PR2407.

llvm-svn: 51952
```
c596ec04

Change packed struct layout so that field sizes · fc3c489b

Duncan Sands authored Jun 04, 2008

are the same as in unpacked structs, only field
positions differ.  This only matters for structs
containing x86 long double or an apint; it may
cause backwards compatibility problems if someone
has bitcode containing a packed struct with a
field of one of those types.
The issue is that only 10 bytes are needed to
hold an x86 long double: the store size is 10
bytes, but the ABI size is 12 or 16 bytes (linux/
darwin) which comes from rounding the store size
up by the alignment.  Because it seemed silly not
to pack an x86 long double into 10 bytes in a
packed struct, this is what was done.  I now
think this was a mistake.  Reserving the ABI size
for an x86 long double field even in a packed
struct makes things more uniform: the ABI size is
now always used when reserving space for a type.
This means that developers are less likely to
make mistakes.  It also makes life easier for the
CBE which otherwise could not represent all LLVM
packed structs (PR2402).
Front-end people might need to adjust the way
they create LLVM structs - see following change
to llvm-gcc.

llvm-svn: 51928

fc3c489b

Some Mips minor fixes · 326a0373
Bruno Cardoso Lopes authored Jun 04, 2008
```
Added support for mips little endian arch => mipsel

llvm-svn: 51923
```
326a0373

Jun 03, 2008
- Add StringConstantPrefix to control what the · 355b74ac
  Dale Johannesen authored Jun 03, 2008
```
assembler names of string constants look like.

llvm-svn: 51909
```
  355b74ac
- Add necessary 64-bit support so that gcc frontend compiles (mostly). Current · d831cc49
  Scott Michel authored Jun 02, 2008
```
issue is operand promotion for setcc/select... but looks like the fundamental
stuff is implemented for CellSPU.

llvm-svn: 51884
```
  d831cc49
Jun 02, 2008

Implement CBE support for first-class structs and array values, · 4e8a512f

Dan Gohman authored Jun 02, 2008

and insertvalue and extractvalue instructions.

First-class array values are not trivial because C doesn't
support them. The approach I took here is to wrap all arrays
in structs. Feedback is welcome.

The 2007-01-15-NamedArrayType.ll test needed to be modified
because it has a "not grep" for a string that now exists,
because array types now have associated struct types, and
those struct types have names.

llvm-svn: 51881

4e8a512f

Don't use the GOT for symbols that are not externally visible. · d04cd22f
Rafael Espindola authored Jun 02, 2008
```
llvm-svn: 51865
```
d04cd22f

Jun 01, 2008
- Fixed flag issue that was generating infinite loop while in list scheduling. · bdedc148
  Bruno Cardoso Lopes authored Jun 01, 2008
```
llvm-svn: 51833
```
  bdedc148
May 31, 2008

Peer through sext/zext when looking for not(cmp). · 035fe6f7
Nick Lewycky authored May 31, 2008
```
llvm-svn: 51819
```
035fe6f7
Yay us! Every one of these examples turns into icmp/zext/ret. · 69a51cbd
Nick Lewycky authored May 31, 2008
```
llvm-svn: 51818
```
69a51cbd

Fix the CBE's handling of instructions whose result is an i1. Previously, · 666d6645

Chris Lattner authored May 31, 2008

we did not truncate the value down to i1 with (x&1).  This caused a problem
when the computation of x was nontrivial, for example, "add i1 1, 1" would 
return 2 instead of 0.

This makes the testcase compile into:

...
  llvm_cbe_t = (((llvm_cbe_r == 0u) + (llvm_cbe_r == 0u))&1);
  llvm_cbe_u = (((unsigned int )(bool )llvm_cbe_t));
...

instead of:

...
  llvm_cbe_t = ((llvm_cbe_r == 0u) + (llvm_cbe_r == 0u));
  llvm_cbe_u = (((unsigned int )(bool )llvm_cbe_t));
...

This fixes a miscompilation of mediabench/adpcm/rawdaudio/rawdaudio and
403.gcc with the CBE, regressions from LLVM 2.2. Tanya, please pull 
this into the release branch.

llvm-svn: 51813

666d6645

Teach the DAGISelEmitter to not compute the variable_ops operand · bd3390c7

Dan Gohman authored May 31, 2008

index for the input pattern in terms of the output pattern. Instead
keep track of how many fixed operands the input pattern actually
has, and have the input matching code pass the output-emitting
function that index value. This simplifies the code, disentangles
variables_ops from the support for predication operations, and
makes variable_ops more robust.

llvm-svn: 51808

bd3390c7

Fix indentation. · 864541aa
Evan Cheng authored May 30, 2008
```
llvm-svn: 51792
```
864541aa

May 30, 2008
- Add the "AsCheapAsAMove" flag to some 64-bit xor instructions. · b0aa6512
  Bill Wendling authored May 30, 2008
```
llvm-svn: 51761
```
  b0aa6512
May 29, 2008

Add patterns for CALL32m and CALL64m. They aren't matched in most · 96af4ddb

Dan Gohman authored May 29, 2008

cases due to an isel deficiency already noted in
lib/Target/X86/README.txt, but they can be matched in this fold-call.ll
testcase, for example.

This is interesting mainly because it exposes a tricky tblgen bug;
tblgen was incorrectly computing the starting index for variable_ops
in the case of a complex pattern.

llvm-svn: 51706

96af4ddb

Remove more iostream header includes. Needed to implement a "FlushStream" · 33e396d0
Bill Wendling authored May 29, 2008
```
function to flush a specified std::ostream.

llvm-svn: 51705
```
33e396d0

Fix a tblgen problem handling variable_ops in tblgen instruction · 6e582c44

Dan Gohman authored May 29, 2008

definitions. This adds a new construct, "discard", for indicating
that a named node in the input matching pattern is to be discarded,
instead of corresponding to a node in the output pattern. This
allows tblgen to know where the arguments for the varaible_ops are
supposed to begin.

This fixes "rdar://5791600", whatever that is ;-).

llvm-svn: 51699

6e582c44

Expand small memmovs using inline code. Set the X86 threshold for expanding · 714663ab
Dan Gohman authored May 29, 2008
```
memmove to a more plausible value, now that it's actually being used.

llvm-svn: 51696
```
714663ab
Implement vector shift up / down and insert zero with ps{rl}lq / ps{rl}ldq. · 5e28227d
Evan Cheng authored May 29, 2008
```
llvm-svn: 51667
```
5e28227d
XOR?RI instructions aren't as cheap as moves. · 0252be17
Bill Wendling authored May 29, 2008
```
llvm-svn: 51664
```
0252be17
Implement "AsCheapAsAMove" for some obviously cheap instructions: xor and the · 7a1a8eb6
Bill Wendling authored May 29, 2008
```
like.

llvm-svn: 51662
```
7a1a8eb6

Add a flag to indicate that an instruction is as cheap (or cheaper) than a move · 3f6bb271

Bill Wendling authored May 28, 2008

instruction to execute. This can be used for transformations (like two-address
conversion) to remat an instruction instead of generating a "move"
instruction. The idea is to decrease the live ranges and register pressure and
all that jazz.

llvm-svn: 51660

3f6bb271