dc.contributor.author: Larsen, Samuel [en_US]
dc.contributor.author: Amarasinghe, Saman [en_US]
dc.date.accessioned: 2023-03-29T14:41:39Z
dc.date.available: 2023-03-29T14:41:39Z
dc.date.issued: 1999-11
dc.identifier.uri: https://hdl.handle.net/1721.1/149292
dc.description.abstract: Increasing focus on multimedia applications has prompted the addition of multimedia extensions to most existing general-purpose microprocessors. This added functionality comes primarily in the addition of short SIMD instructions. Unfortunately, access to these instructions is limited to in-line assembly and library calls. Some researchers have proposed using vector compilers as a means of exploiting multimedia instructions. Although vectorization technology is well understood, it is inherently complex and fragile. In addition, it is incapable of locating SIMD-style parallelism within a basic block. In this paper we introduce the concept of Superword Level Parallelism (SLP), a novel way of viewing parallelism in multimedia applications. We believe SLP is fundamentally different from the loop-level parallelism exploited by traditional vector processing, and therefore warrants a different method for extracting it. We have developed a simple and robust compiler technique for detecting SLP that targets basic blocks rather than loop nests. As with techniques designed to extract ILP, ours is able to exploit parallelism both across loop iterations and within basic blocks. The result is an algorithm that provides excellent performance in several application domains. Experiments on scientific and multimedia benchmarks have yielded average performance improvements of 84%, and range as high as 253%. [en_US]
dc.relation.ispartofseries: MIT-LCS-TM-601
dc.title: Exploiting Superword Level Parallelism with Multimedia Instruction Sets [en_US]

