TY - JOUR
T1 - A DSM-based fragmented data sharing framework for grids
AU - Chen, Po Cheng
AU - Chang, Jyh Biau
AU - Shieh, Ce Kuen
AU - Lin, Chia Han
AU - Zhuang, Yi Chang
N1 - Funding Information:
This work was supported by the National Science Council of Taiwan, ROC, under project No. 96-2221-E-426-004-.
PY - 2010/4
Y1 - 2010/4
N2 - Sharing scientific and data capture files of gigabyte and terabyte size in conventional data grid systems is inefficient because conventional approaches copy the entire shared file to a user's local storage even when only a tiny file fragment is required. Such transfer schemes consume unnecessary data transmission time and local storage space, with the additional problem of maintaining replica synchronization. Traditionally, replica consistency treats shared files as read-only, consequently sacrificing guaranteed replica consistency. This paper presents a DSM-based fragmented data sharing framework called "Spigot" which transfers only the necessary fragments of large files on user demand, thereby reducing data transmission time, wasted network bandwidth and required storage space. Data waiting time is further reduced by overlapping data transmission and data analysis. The DSM concept maintains replica synchronization. Real experiments show reduced turnaround time in data-intensive applications, particularly when fragment size is low and analysis time and network latency are high.
AB - Sharing scientific and data capture files of gigabyte and terabyte size in conventional data grid systems is inefficient because conventional approaches copy the entire shared file to a user's local storage even when only a tiny file fragment is required. Such transfer schemes consume unnecessary data transmission time and local storage space, with the additional problem of maintaining replica synchronization. Traditionally, replica consistency treats shared files as read-only, consequently sacrificing guaranteed replica consistency. This paper presents a DSM-based fragmented data sharing framework called "Spigot" which transfers only the necessary fragments of large files on user demand, thereby reducing data transmission time, wasted network bandwidth and required storage space. Data waiting time is further reduced by overlapping data transmission and data analysis. The DSM concept maintains replica synchronization. Real experiments show reduced turnaround time in data-intensive applications, particularly when fragment size is low and analysis time and network latency are high.
UR - http://www.scopus.com/inward/record.url?scp=75049084370&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=75049084370&partnerID=8YFLogxK
U2 - 10.1016/j.future.2009.12.008
DO - 10.1016/j.future.2009.12.008
M3 - Article
AN - SCOPUS:75049084370
SN - 0167-739X
VL - 26
SP - 668
EP - 677
JO - Future Generation Computer Systems
JF - Future Generation Computer Systems
IS - 4
ER -