Key points are not available for this paper at this time.
This paper describes an extension to the set of Basic Linear Algebra Subprograms. The extensions proposed are targeted at matrix vector operations which should provide for more efficient and portable implementations of algorithms for high performance computers.
Dongarra et al. (Tue,) studied this question.