Optimistic Parallelization of Floating-Point AccumulationNachiket Kapre and André DeHon
Proceedings of the IEEE Symposium on Computer Arithmetic, pp. 205--213 (Arith18, June 25--27, 2007)
Floating-point arithmetic is notoriously nonassociative due to the limited precision representation which demands intermediate values be rounded to fit in the available precision. The resulting cyclic dependency in floating-point accumulation inhibits parallelization of the computation, including efficient use of pipelining. In practice, however, we observe that floating-point operations are
© 2007 IEEE. Authors/employers may reproduce or authorize others to reproduce The Work, material extracted verbatim from the Work, or derivative works to
the extent permissible under United States law for works authored by U.S. Government employees, and for the author's personal use or for
company or organizational use, provided that the source and any IEEE copyright notice are indicated, the copies are not used in any way that
implies IEEE endorsement of a product or service of any employer, and the copies themselves are not offered for sale.