TY - GEN
T1 - On the feasibility of utilizing correlations between user populations for traffic inference
AU - Lan, Kun Chan
AU - Heidemann, John
PY - 2005
Y1 - 2005
N2 - Network models today are often derived from two different methods. On one hand, detailed traffic models are generated based on traces from a single tap into the network. Alternatively, one can collect higher-level traffic-matrix data with SNMP from many routers. However, inferring flow-level details from such data is still an open research issue. Today it is infeasible to collect a fine-grained, packet-level representation of a complete, multi-router network. Even if it were economically feasible to synchronize and monitor every router in a large network, the amount of data generated would tax storage and computation resources. In this work, we propose a methodology to infer flow-level traffic across a network by exploiting the correlations between user populations across different networks. The contribution of this paper is twofold. First, based on traces of web traffic collected from two different sources, we observe that the user-behavior parameters of the traffic (such as user "think" time in web traffic) are correlated across time, while the application-specific parameters of the traffic (such as object size) are correlated across "similar" networks. Second, by utilizing the correlations between similar networks, we propose a methodology for inferring traffic at places where continuously taking measurements is infeasible. We evaluate the effectiveness of our methodology via simulation.
AB - Network models today are often derived from two different methods. On one hand, detailed traffic models are generated based on traces from a single tap into the network. Alternatively, one can collect higher-level traffic-matrix data with SNMP from many routers. However, inferring flow-level details from such data is still an open research issue. Today it is infeasible to collect a fine-grained, packet-level representation of a complete, multi-router network. Even if it were economically feasible to synchronize and monitor every router in a large network, the amount of data generated would tax storage and computation resources. In this work, we propose a methodology to infer flow-level traffic across a network by exploiting the correlations between user populations across different networks. The contribution of this paper is twofold. First, based on traces of web traffic collected from two different sources, we observe that the user-behavior parameters of the traffic (such as user "think" time in web traffic) are correlated across time, while the application-specific parameters of the traffic (such as object size) are correlated across "similar" networks. Second, by utilizing the correlations between similar networks, we propose a methodology for inferring traffic at places where continuously taking measurements is infeasible. We evaluate the effectiveness of our methodology via simulation.
UR - http://www.scopus.com/inward/record.url?scp=33751421041&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=33751421041&partnerID=8YFLogxK
U2 - 10.1109/LCN.2005.97
DO - 10.1109/LCN.2005.97
M3 - Conference contribution
AN - SCOPUS:33751421041
SN - 0769524214
SN - 9780769524214
T3 - Proceedings - Conference on Local Computer Networks, LCN
SP - 132
EP - 139
BT - Proceedings - The IEEE Conference on Local Computer Networks - 30th Anniversary, LCN 2005
T2 - IEEE Conference on Local Computer Networks - 30th Anniversary, LCN 2005
Y2 - 15 November 2005 through 17 November 2005
ER -