- Jan 16, 2015
-
-
Rui Ueyama authored
This patch makes File::parse() multi-thread safe. If one thread is running File::parse(), other threads will block if they try to call the same method. File::parse() is idempotent, so you can safely call multiple times. With this change, we don't have to wait for all worker threads to finish in Driver::link(). Previously, Driver::link() calls TaskGroup::sync() to wait for all threads running File::parse(). This was not ideal because we couldn't start the resolver until we parse all files. This patch increase parallelism by making Driver::link() to not wait for worker threads. The resolver calls parse() to make sure that the file being read has been parsed, and then uses the file. In this approach, the resolver can run with the parser threads in parallel. http://reviews.llvm.org/D6994 llvm-svn: 226281
-
Rafael Espindola authored
llvm-svn: 226280
-
Evgeniy Stepanov authored
llvm-svn: 226279
-
Nathan Sidwell authored
Correct logic concerning 'T &&' deduction against lvalues. llvm-svn: 226278
-
Andrea Di Biagio authored
This patch disables target specific combine on X86ISD::INSERTPS dag nodes if optlevel is CodeGenOpt::None. The backend currently implements a target specific combine rule that converts a vector load used by an INSERTPS dag node into a scalar load plus a scalar_to_vector. This allows ISel to select a single INSERTPSrm instead of two instructions (i.e. a vector load plus INSERTPSrr). However, the existing target combine rule on INSERTPS nodes only works under the assumption that ISel will always be able to match an INSERTPSrm. This is not true in general at -O0, since the backend only allows folding a load into the memory operand of an instruction if the optimization level is not CodeGenOpt::None. In the example below: // __m128 test(__m128 a, __m128 *b) { __m128 c = _mm_insert_ps(a, *b, 1 << 6); return c; } // Before this patch, at -O0, the backend would have canonicalized the load to 'b' into a scalar load plus scalar_to_vector. Later on, ISel would have selected an INSERTPSrr leaving the insertps mask in an inconsistent state: movss 4(%rdi), %xmm1 insertps $64, %xmm1, %xmm0 # xmm0 = xmm1[1],xmm0[1,2,3]. With this patch, the backend avoids folding the vector load into the operand of the INSERTPS. The new codegen at -O0 is: movaps (%rdi), %xmm1 insertps $64, %xmm1, %xmm0 # %xmm1[1],xmm0[1,2,3]. llvm-svn: 226277
-
Viktor Kutuzov authored
Committed unreviewed with permission. llvm-svn: 226276
-
Viktor Kutuzov authored
Committed unreviewed with permission. llvm-svn: 226275
-
Rui Ueyama authored
The previous default behavior of LLD is --as-needed. LLD linked against a DSO only if the DSO file was actually used to link an executable (i.e. at least one symbol was resolved using the shared library file.) In this patch I added a boolean flag to FileNode for --as-needed. I also added an accessor to DSO name to shared library file class. llvm-svn: 226274
-
Evgeniy Stepanov authored
llvm-svn: 226273
-
Andrey Churbanov authored
llvm-svn: 226272
-
Andrey Churbanov authored
llvm-svn: 226271
-
Evgeniy Stepanov authored
Trying to debug a buildbot-only failure. llvm-svn: 226270
-
Toma Tabacu authored
llvm-svn: 226269
-
Evgeniy Stepanov authored
Wrong include order. llvm-svn: 226268
-
Evgeniy Stepanov authored
llvm-svn: 226267
-
Evgeniy Stepanov authored
llvm-svn: 226266
-
Simon Atanasyan authored
These classes contain only abstract virtual functions. Explicit constructors are redundant. llvm-svn: 226265
-
Simon Pilgrim authored
The current 'big vectors' stack folded reload testing pattern is very bulky and makes it difficult to test all instructions as big vectors will tend to use only the ymm instruction implementations. This patch changes the tests to use a nop call that lists explicit xmm registers as sideeffects, with this we can force a partial register spill of the relevant registers and then check that the reload is correctly folded. The asm generated only adds the forced spill, a nop instruction and a couple of extra labels (a fraction of the current approach). More exhaustive tests will follow shortly, I've added some extra tests (the xmm versions of some of the existing folding tests) as a starting point. Differential Revision: http://reviews.llvm.org/D6932 llvm-svn: 226264
-
Evgeniy Stepanov authored
llvm-svn: 226263
-
Simon Atanasyan authored
No functional changes. llvm-svn: 226262
-
Simon Atanasyan authored
No functional changes. llvm-svn: 226261
-
Simon Atanasyan authored
No functional changes. llvm-svn: 226260
-
Simon Atanasyan authored
This class defines a relocation handler interface. The interface does not depend on the template argument so the argument is redundant. llvm-svn: 226259
-
Simon Atanasyan authored
No functional changes. llvm-svn: 226258
-
Simon Atanasyan authored
No functional changes. llvm-svn: 226257
-
Simon Atanasyan authored
No functional changes. llvm-svn: 226256
-
Simon Atanasyan authored
That reduce class dependencies and simplify the code a bit. No functional changes. llvm-svn: 226255
-
Simon Atanasyan authored
No functional changes. llvm-svn: 226254
-
Simon Atanasyan authored
If object files satisfy O32 ABI they can be linked together even if some of them contains MIPS64 or MIPS64R2 instructions. llvm-svn: 226253
-
Timur Iskhodzhanov authored
This breaks AddressSanitizer (ninja check-asan) on Windows llvm-svn: 226251
-
Alexey Bataev authored
storage. This fix allows to use non-constant global variables, static local variables and static data members in data-sharing attribute clauses in parallel and task regions. llvm-svn: 226250
-
Vince Harron authored
Differential review: http://reviews.llvm.org/D6919 llvm-svn: 226249
-
Filipe Cabecinhas authored
llvm-svn: 226248
-
Hal Finkel authored
Bill Schmidt pointed out that some adjustments would be needed to properly support powerpc64le (using the ELF V2 ABI). For one thing, R11 is not available as a scratch register, so we need to use R12. R12 is also available under ELF V1, so to maintain consistency, I flipped the order to make R12 the first scratch register in the array under both ABIs. llvm-svn: 226247
-
Saleem Abdulrasool authored
The target may be a synthetic symbol like __ImageBase. cast_or_null will ensure that the atom is a DefinedAtom, which is not guaranteed, which was the original reason for the cast_or_null. Switch this to dyn_cast, which should enable building of executables for WoA. Unfortunately, the issue of missing base relocations still needs to be investigated. llvm-svn: 226246
-
Mehdi Amini authored
http://reviews.llvm.org/D6993 llvm-svn: 226245
-
Jason Molenda authored
the register state when debugging AArch32 programs (armv7 programs running on an armv8 processor). Most notably, there is no "fpscr" register in the register context - there is an fpsr and an fpcr. Also fix a bug where the floating point values could not be written in armv7 processes. <rdar://problem/18977767> llvm-svn: 226244
-
Dan Albert authored
This isn't actually used for anything, and is broken on Darwin (currently causing build failures now that the triple is passed to aid cross compiling). Rather than fix unused code, just remove it. llvm-svn: 226243
-
Rafael Espindola authored
This reverts commit r226173, adding r226038 back. No change in this commit, but clang was changed to also produce trivial comdats for costructors, destructors and vtables when needed. Original message: Don't create new comdats in CodeGen. This patch stops the implicit creation of comdats during codegen. Clang now sets the comdat explicitly when it is required. With this patch clang and gcc now produce the same result in pr19848. llvm-svn: 226242
-
Kevin Enderby authored
removing the macho-archive-headers.test added with r226228 that it is failing on for now while I try to figure out what is going on. llvm-svn: 226241
-