用户名: 密码: 验证码:
Avoiding Communication in Dense Linear Algebra.
详细信息   
  • 作者:Ballard ; Grey Malone.
  • 学历:Doctor
  • 年:2013
  • 毕业院校:University of California
  • Department:Computer Science.
  • ISBN:9781303832604
  • CBH:3616413
  • Country:USA
  • 语种:English
  • FileSize:2031708
  • Pages:216
文摘
Dense linear algebra computations are essential to nearly every problem in scientific computing and to countless other fields. Most matrix computations enjoy a high computational intensity i.e.,ratio of computation to data),and therefore the algorithms for the computations have a potential for high efficiency. However,performance for many linear algebra algorithms is limited by the cost of moving data between processors on a parallel computer or throughout the memory hierarchy of a single processor,which we will refer to generally as communication. Technological trends indicate that algorithmic performance will become even more limited by communication in the future. In this thesis,we consider the fundamental computations within dense linear algebra and address the following question: can we significantly improve the current algorithms for these computations,in terms of the communication they require and their performance in practice? To answer the question,we analyze algorithms on sequential and parallel architectural models that are simple enough to determine coarse communication costs but accurate enough to predict performance of implementations on real hardware. For most of the computations,we prove lower bounds on the communication that any algorithm must perform. If an algorithm exists with communication costs that match the lower bounds at least in an asymptotic sense),we call the algorithm communication optimal. In many cases,the most commonly used algorithms are not communication optimal,and we can develop new algorithms that require less data movement and attain the communication lower bounds. In this thesis,we develop both new communication lower bounds and new algorithms,tightening and in many cases closing) the gap between best known lower bound and best known algorithm or upper bound). We consider both sequential and parallel algorithms,and we asses both classical and fast algorithms e.g.,Strassens matrix multiplication algorithm). In particular,the central contributions of this thesis are: proving new communication lower bounds for nearly all classical direct linear algebra computations dense or sparse),including factorizations for solving linear systems,least squares problems,and eigenvalue and singular value problems; proving new communication lower bounds for Strassens and other fast matrix multiplication algorithms; proving new parallel communication lower bounds for classical and fast computations that set limits on an algorithms ability to perfectly strong scale; summarizing the state-of-the-art in communication efficiency for both sequential and parallel algorithms for the computations to which the lower bounds apply; developing a new communication-optimal algorithm for computing a symmetric-indefinite factorization observing speedups of up to 2.8x compared to alternative shared-memory parallel algorithms); developing new,more communication-efficient algorithms for reducing a symmetric band matrix to tridiagonal form via orthogonal similar transformations observing speedups of 2--6x compared to alternative sequential and parallel algorithms); and developing a new communication-optimal parallelization of Strassens matrix multiplication algorithm observing speedups of up to 2.84x compared to alternative distributed-memory parallel algorithms).

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700