Commits · 1e4272085d666ac02affbbd70d621fe386ab66b3 · Roger Ferrer / llvm-epi-0.8

Mar 07, 2013

Debug Info: store the files and directories for each compile unit. · 1e427208

Manman Ren authored Mar 07, 2013

We now emit a line table for each compile unit. To reduce the prologue size
of each line table, the files and directories used by each compile unit are
stored in std::map<unsigned, std::vector< > > instead of std::vector< >.

The prologue for a lto'ed image can be as big as 93K. Duplicating 93K for each
compile unit causes a huge increase of debug info. With this patch, each
prologue will only emit the files required by the compile unit.

rdar://problem/13342023

llvm-svn: 176605

1e427208

ArrayRef has a OneElt constructor. Beautify the code. · 96a4aa67
Nadav Rotem authored Mar 07, 2013
```
llvm-svn: 176604
```
96a4aa67
Switch from std::vector to ArrayRef. Speedup FoldBitCast by 5x. · 88330433
Nadav Rotem authored Mar 07, 2013
```
llvm-svn: 176602
```
88330433

SimplifyCFG fix for volatile load/store. · a0a5ca06

Andrew Trick authored Mar 07, 2013

Fixes rdar:13349374.

Volatile loads and stores need to be preserved even if the language
standard says they are undefined. "volatile" in this context means "get
out of the way compiler, let my platform handle it".

Additionally, this is the only way I know of with llvm to write to the
first page (when hardware allows) without dropping to assembly.

llvm-svn: 176599

a0a5ca06

Fix two remaining issue after fixing PR15355 when CMOV is not available · d5cac37d

Michael Liao authored Mar 07, 2013

- Phi nodes should be replaced/updated after lowering CMOV into branch
  because 'mainMBB' updating operand in Phi node is changed.
- Add EFLAGS in livein before lowering the 2nd CMOV. It's necessary as
  we will reuse the EFLAGS generated before the 1st lowered CMOV, which
  won't clobber EFLAGS. However, we need explicitly specify that.
- '-attr=-cmov' test case are added.

llvm-svn: 176598

d5cac37d

Mar 06, 2013

[mips] Custom-legalize BR_JT. · 0f693a8a

Akira Hatanaka authored Mar 06, 2013

In N64-static, GOT address is needed to compute the branch address.

llvm-svn: 176580

0f693a8a

Generalize my previous fix for -print-options. · fcb37243

Andrew Trick authored Mar 06, 2013

Always print options that differ from their implicit default. At least
for simple option types.

llvm-svn: 176572

fcb37243

Give -loop-vectorize an explicit default. · 946c2b32

Andrew Trick authored Mar 06, 2013

This way, clang -mllvm -print-options shows that the driver is overriding it.

llvm-svn: 176569

946c2b32

Memory Dependence Analysis (not mem-dep test) take advantage of "invariant.load" metadata. · 408bdad5

Shuxin Yang authored Mar 06, 2013

The "invariant.load" metadata indicates the memory unit being accessed is immutable.
A load annotated with this metadata can be moved across any store.

As I am not sure if it is legal to move such loads across barrier/fence, this
change dose not allow such transformation.

rdar://11311484

Thank Arnold for code review.

llvm-svn: 176562

408bdad5

InstCombine: Don't shrink allocas when combining with a bitcast. · 95d2eb95

Jim Grosbach authored Mar 06, 2013

When considering folding a bitcast of an alloca into the alloca itself,
make sure we don't shrink the amount of memory being allocated, or
things rapidly go sideways.

rdar://13324424

llvm-svn: 176547

95d2eb95

Fix PR15355 · da22b30b

Michael Liao authored Mar 06, 2013

- Clear 'mayStore' flag when loading from the atomic variable before the
  spin loop
- Clear kill flag from one use to multiple use in registers forming the
  address to that atomic variable
- don't use a physical register as live-in register in BB (neither entry
  nor landing pad.) by copying it into virtual register

(patch by Cameron Zwarich)

llvm-svn: 176538

da22b30b

Use dyn_cast instead of isa && cast. No functionality change. · b7129f21
Jakub Staszak authored Mar 06, 2013
```
llvm-svn: 176537
```
b7129f21

[mips] Remove android calling convention. · 1454ed8a

Akira Hatanaka authored Mar 05, 2013

This calling convention was added just to handle functions which return vector
of floats. The fix committed in r165585 solves the problem.

llvm-svn: 176530

1454ed8a

Mar 05, 2013

[mips] Fix MipsCC::analyzeReturn so that, in soft-float mode, fp128 gets · e092f729
Akira Hatanaka authored Mar 05, 2013
```
returned in registers $2 and $4.

llvm-svn: 176527
```
e092f729
[mips] Fix MipsTargetLowering::LowerCallResult and LowerReturn to correctly · 5f3ba9e5
Akira Hatanaka authored Mar 05, 2013
```
handle fp128 returns.

llvm-svn: 176523
```
5f3ba9e5
[mips] Fix MipsTargetLowering::LowerCall to pass fp128 arguments in floating · 3b7391d1
Akira Hatanaka authored Mar 05, 2013
```
point registers.

llvm-svn: 176521
```
3b7391d1
[mips] Correct handling of fp128 (long double) formals and read long double · 4b634fa3
Akira Hatanaka authored Mar 05, 2013
```
parameters from floating point registers if target is mips64 hard float.

llvm-svn: 176520
```
4b634fa3

Add more functions to the TLI. · b904e6e4

Meador Inge authored Mar 05, 2013



This patch adds many more functions to the target library information.
All of the functions being added were discovered while doing the migration
of the simplify-libcalls attribute annotation functionality to the
functionattrs pass.  As a part of that work the attribute annotation logic
will query TLI to determine if a function should be annotated or not.

Signed-off-by: Meador Inge <meadori@codesourcery.com>
llvm-svn: 176514

b904e6e4

reverting patch 176508. · 457801f7
Jyotsna Verma authored Mar 05, 2013
```
llvm-svn: 176513
```
457801f7
Hexagon: Add support for lowering block address. · 7179e712
Jyotsna Verma authored Mar 05, 2013
```
llvm-svn: 176508
```
7179e712
R600: Do not predicate vector op · fe32bd87
Vincent Lejeune authored Mar 05, 2013
```
llvm-svn: 176507
```
fe32bd87
Hexagon: Expand addc, adde, subc and sube. · 0eeea14e
Jyotsna Verma authored Mar 05, 2013
```
llvm-svn: 176505
```
0eeea14e
Update cmake build. · 5dc83180
Benjamin Kramer authored Mar 05, 2013
```
llvm-svn: 176501
```
5dc83180
Hexagon: Use MO operand flags to mark constant extended instructions. · f1214a8a
Jyotsna Verma authored Mar 05, 2013
```
llvm-svn: 176500
```
f1214a8a
Hexagon: Add encoding bits to the TFR64 instructions. · f4e324f4
Jyotsna Verma authored Mar 05, 2013
```
Set imMoveImm, isAsCheapAsAMove flags for TFRI instructions.

llvm-svn: 176499
```
f4e324f4

R600: initial scheduler code · 68b6b6dd

Vincent Lejeune authored Mar 05, 2013

This is a skeleton for a pre-RA MachineInstr scheduler strategy. Currently
it only tries to expose more parallelism for ALU instructions (this also
makes the distribution of GPR channels more uniform and increases the
chances of ALU instructions to be packed together in a single VLIW group).
Also it tries to reduce clause switching by grouping instruction of the
same kind (ALU/FETCH/CF) together.

Vincent Lejeune:
 - Support for VLIW4 Slot assignement
 - Recomputation of ScheduleDAG to get more parallelism opportunities

Tom Stellard:
 - Fix assertion failure when trying to determine an instruction's slot
   based on its destination register's class
 - Fix some compiler warnings

Vincent Lejeune: [v2]
 - Remove recomputation of ScheduleDAG (will be provided in a later patch)
 - Improve estimation of an ALU clause size so that heuristic does not emit cf
 instructions at the wrong position.
 - Make schedule heuristic smarter using SUnit Depth
 - Take constant read limitations into account

Vincent Lejeune: [v3]
 - Fix some uninitialized values in ConstPair
 - Add asserts to ensure an ALU slot is always populated

llvm-svn: 176498

68b6b6dd

R600: Remove LowerConstCopyPass and lower CONST_COPY right after ISel. · 0b72f102

Vincent Lejeune authored Mar 05, 2013

Maintaining CONST_COPY Instructions until Pre Emit may prevent some ifcvt case
and taking them in account for scheduling is difficult for no real benefit.

llvm-svn: 176488

0b72f102

R600: Turn BUILD_VECTOR into Reg_Sequence · 3b6f20e9
Vincent Lejeune authored Mar 05, 2013
```
Reviewed-by: Tom Stellard <thomas.stellard at amd.com>
llvm-svn: 176487
```
3b6f20e9

R600: CONST_ADDRESS node is not marked as mayLoad anymore · 10a5e477

Vincent Lejeune authored Mar 05, 2013

Reviewed-by: Tom Stellard <thomas.stellard at amd.com>

mayLoad complexify scheduling and does not bring any usefull info
as the location is not writeable at all.

llvm-svn: 176486

10a5e477

R600: Use MUL_IEEE for trig/fdiv intrinsic · a199d01e
Vincent Lejeune authored Mar 05, 2013
```
Reviewed-by: Tom Stellard <thomas.stellard at amd.com>
llvm-svn: 176485
```
a199d01e
R600: Add support for indirect addressing of non default const buffer · 743dca04
Vincent Lejeune authored Mar 05, 2013
```
NOTE: This is a candidate for the Mesa stable branch.
llvm-svn: 176484
```
743dca04
Remove unused #includes. · a69d0aaa
Bill Wendling authored Mar 05, 2013
```
llvm-svn: 176467
```
a69d0aaa

The current X86 NOP padding uses one long NOP followed by the remainder in · 4c8979cd

David Sehr authored Mar 05, 2013

one-byte NOPs.  If the processor actually executes those NOPs, as it sometimes
does with aligned bundling, this can have a performance impact.  From my
micro-benchmarks run on my one machine, a 15-byte NOP followed by twelve
one-byte NOPs is about 20% worse than a 15 followed by a 12.  This patch
changes NOP emission to emit as many 15-byte (the maximum) as possible followed
by at most one shorter NOP.

llvm-svn: 176464

4c8979cd

Mar 04, 2013

Check isDiscardableIfUnused, rather than hasLocalLinkage, when bumping · 30be8a30

Lang Hames authored Mar 04, 2013

GlobalValue linkage up to ExternalLinkage in the ExtractGV pass. This
prevents linkonce and linkonce_odr symbols from being DCE'd.

llvm-svn: 176459

30be8a30

[mips] Print move instructions. · c7828356
Akira Hatanaka authored Mar 04, 2013
```
"move $4, $5" is printed instead of "or $4, $5, $zero".

llvm-svn: 176455
```
c7828356

Mips specific inline assembler constraint 'R' · 0e149b04

Jack Carter authored Mar 04, 2013

'R' An address that can be sued in a non-macro load or store.
This patch includes a positive test case.

llvm-svn: 176452

0e149b04

Bypass Slow Divides · 485296d1

Preston Gurd authored Mar 04, 2013

* Only apply divide bypass optimization when not optimizing for size. 
* Fixed bug caused by constant for 0 value of type Int32,
  used dividend type to generate the constant instead.
* For atom x86-64 apply the divide bypass to use 16-bit divides instead of
  64-bit divides when operand values are small enough.
* Added lit tests for 64-bit divide bypass.

Patch by Tyler Nowicki!

llvm-svn: 176442

485296d1

R600: Clean up datalayout strings so they better match hardware capabilities · b2f2f960
Tom Stellard authored Mar 04, 2013
```
llvm-svn: 176439
```
b2f2f960
Mips ISD typo · 434874db
Jia Liu authored Mar 04, 2013
```
llvm-svn: 176426
```
434874db

Mar 02, 2013

ARM: Creating a vector from a lane of another. · a3c5c769

Jim Grosbach authored Mar 02, 2013

The VDUP instruction source register doesn't allow a non-constant lane
index, so make sure we don't construct a ARM::VDUPLANE node asking it to
do so.

rdar://13328063
http://llvm.org/bugs/show_bug.cgi?id=13963

llvm-svn: 176413

a3c5c769