[polly] r275392 - [NFC] Add full title/author information to "Apply the BLIS matmul optimization pattern"
Roman Gareev via llvm-commits
llvm-commits at lists.llvm.org
Thu Jul 14 03:40:15 PDT 2016
Author: romangareev
Date: Thu Jul 14 05:40:15 2016
New Revision: 275392
URL: http://llvm.org/viewvc/llvm-project?rev=275392&view=rev
Log:
[NFC] Add full title/author information to "Apply the BLIS matmul optimization pattern"
Modified:
polly/trunk/include/polly/ScheduleOptimizer.h
Modified: polly/trunk/include/polly/ScheduleOptimizer.h
URL: http://llvm.org/viewvc/llvm-project/polly/trunk/include/polly/ScheduleOptimizer.h?rev=275392&r1=275391&r2=275392&view=diff
==============================================================================
--- polly/trunk/include/polly/ScheduleOptimizer.h (original)
+++ polly/trunk/include/polly/ScheduleOptimizer.h Thu Jul 14 05:40:15 2016
@@ -107,12 +107,17 @@ private:
/// @brief Apply the BLIS matmul optimization pattern
///
- /// Apply the BLIS matmul optimization pattern
- /// (http://www.cs.utexas.edu/users/flame/pubs/TOMS-BLIS-Analytical.pdf).
- /// BLIS implements gemm as three nested loops around a macro-kernel,
- /// plus two packing routines. The macro-kernel is implemented in terms
- /// of two additional loops around a micro-kernel. The micro-kernel
- /// is a loop around a rank-1 (i.e., outer product) update.
+ /// Apply the BLIS matmul optimization pattern. BLIS implements gemm
+ /// as three nested loops around a macro-kernel, plus two packing routines.
+ /// The macro-kernel is implemented in terms of two additional loops around
+ /// a micro-kernel. The micro-kernel is a loop around a rank-1
+ /// (i.e., outer product) update.
+ ///
+ /// For a detailed description please see:
+ /// Analytical Modeling is Enough for High Performance BLIS
+ /// Tze Meng Low, Francisco D Igual, Tyler M Smith, Enrique S Quintana-Orti
+ /// Technical Report, 2014
+ /// http://www.cs.utexas.edu/users/flame/pubs/TOMS-BLIS-Analytical.pdf
///
/// We create the BLIS micro-kernel by applying a combination of tiling
/// and unrolling. In subsequent changes we will add the extraction
More information about the llvm-commits
mailing list