# Grid'5000 user report for Patrick Loiseau

## User information

Patrick Loiseau (users, user, lyon user)

## Experiments

• Metrology on Grid5000 (Networking) [in progress]
Description: We want to analyse network traffic on the grid in terms of statistical properties such as long-range dependence and self-similarity. In particular, we want to understand how the statistical characteristics of the traffic affect the QoS. The experiments consist of generating traffic with imposed characteristics (flow size distribution, etc.) and observing the resulting arrival and bandwidth processes and the QoS.
Results: not yet
• Investigating self-similarity and heavy tailed distributions on a large scale experimental facility (Networking) [achieved]
Description: After the seminal work by Taqqu et al. relating self-similarity to heavy-tailed distributions, a number of research articles verified that aggregated Internet traffic time series show self-similarity and that Internet attributes, such as Web file sizes and flow lengths, are heavy-tailed. However, the validation of the theoretical prediction relating self-similarity and heavy tails remains unsatisfactorily addressed: it has been investigated either with numerical or network simulations, or from uncontrolled Web traffic data. Notably, this prediction has never been conclusively verified on real networks using controlled and stationary scenarios, prescribing specific heavy-tailed distributions, and estimating confidence intervals. In the present work, we use the capabilities of the large-scale, deeply reconfigurable and fully controllable experimental Grid5000 instrument to investigate whether the prediction can be observed on real networks. To this end, we organize a large number of controlled traffic sessions on a nation-wide real network involving two hundred independent hosts. We use an FPGA-based measurement system to collect the corresponding traffic at packet level. We then independently estimate both the self-similarity exponent of the aggregated time series and the heavy-tail index of the flow size distributions. Comparing these two estimated parameters enables us to discuss the practical applicability conditions of the theoretical prediction.
Results:
• TCP traffic self-similarity under loss (Networking) [in progress]
Description: Over the last decade, many research efforts have been devoted to the study of aggregated traffic time series collected at the core of networks. The pioneering works by Paxson and Leland showed that the Poisson hypothesis, which is appropriate for phone networks, is not suitable for describing computer networks. Instead, self-similarity proved to be a much more appropriate paradigm. The theoretical work of Taqqu and collaborators then identified the heavy-tailed nature of the file size distribution as a possible origin of the observed self-similarity. In addition, it gave the exact relation between the self-similarity index and the tail index that should be observed when the source behavior is modeled with the ON/OFF model. After a controversial debate on the question, it has more recently been stated that the TCP congestion control mechanism cannot be responsible for the self-similarity observed at large time scales. In contrast, we show in this work that when the file size is heavy-tailed, the TCP congestion control mechanism under sufficiently high loss can suppress the self-similarity that would be observed without any loss. For this work, we use large-scale controlled experiments performed on Grid5000. Independent TCP sources send files in an ON/OFF scenario with heavy-tailed ON periods, and a constant loss rate is created via UDP cross traffic.
Results:
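
As an illustration of the prediction discussed in the experiment descriptions above, the following minimal Python sketch draws heavy-tailed flow sizes, estimates their tail index, and maps it to a self-similarity (Hurst) exponent. It rests on assumptions that this report does not state. The relation is taken in its commonly cited form for the ON/OFF model, H = (3 - alpha) / 2 with 1 < alpha < 2. Flow sizes are assumed to be Pareto distributed. The Hill estimator stands in for whatever tail-index estimator was actually used in the experiments. All function names and parameter values are hypothetical.

```python
import math
import random


def sample_pareto(alpha, x_min, n, rng):
    """Draw n Pareto(alpha) sizes with minimum x_min (inverse-transform sampling)."""
    # 1 - rng.random() lies in (0, 1], so the power below is always defined.
    return [x_min * (1.0 - rng.random()) ** (-1.0 / alpha) for _ in range(n)]


def hill_tail_index(samples, k):
    """Hill estimate of the tail index from the k largest samples (assumed estimator)."""
    if not 0 < k < len(samples):
        raise ValueError("k must satisfy 0 < k < len(samples)")
    ordered = sorted(samples, reverse=True)
    threshold = ordered[k]  # (k+1)-th largest value, used as the tail threshold
    mean_log_excess = sum(math.log(x / threshold) for x in ordered[:k]) / k
    return 1.0 / mean_log_excess


def predicted_hurst(alpha):
    """Hurst exponent predicted from the tail index, assuming H = (3 - alpha) / 2."""
    if not 1.0 < alpha < 2.0:
        raise ValueError("the relation is assumed to hold only for 1 < alpha < 2")
    return (3.0 - alpha) / 2.0


if __name__ == "__main__":
    rng = random.Random(42)  # fixed seed so the run is repeatable
    # Hypothetical setup: 20000 flows, tail index 1.5, minimum size 1000 bytes.
    sizes = sample_pareto(alpha=1.5, x_min=1000.0, n=20000, rng=rng)
    alpha_hat = hill_tail_index(sizes, k=2000)
    print("estimated tail index:", round(alpha_hat, 3))
    print("predicted Hurst exponent:", round(predicted_hurst(alpha_hat), 3))
```

In an experiment of the kind described above, the predicted value would be compared with a Hurst exponent estimated independently from the aggregated traffic time series. That second estimation is not covered by this sketch.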

