題名: | An Efficient Data and Computation Decomposition Technique for Nested Loops on NUMA Multiprocessor Systems |
作者: | Lai, Guan-Joe Lee, Haw-Jaw Chen, Cheng |
期刊名/會議名稱: | 1996 ICS會議 |
摘要: | This paper presents an automatic computation/data decomposition technique for nested loops on NUMA (Non-Uniform Memory Access) systems. In NUMA systems, the remote memory access time is longer than the local one, and computation/data decomposition affects the amount of remote accesses incurred by parallel processing. Therefore, the system performance is dependent on how to decompose computation/data onto parallel processors. Here, we propose a modified locality algorithm to improve the one in [6] for the case when the decomposition is not communication-free. In addition, a new performance estimating method is also presented. The whole method has been implemented on SUIF [8]. Experimental results demonstrate the superiority of our proposed algorithm over that in previous littature. |
日期: | 2006-10-31T08:58:20Z |
分類: | 1996年 ICS 國際計算機會議 |
文件中的檔案:
檔案 | 描述 | 大小 | 格式 | |
---|---|---|---|---|
ce07ics001996000221.pdf | 1.19 MB | Adobe PDF | 檢視/開啟 |
在 DSpace 系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。