Application Agnostic Network Traffic Modeling for Realistic Traffic Generation
Date
Authors
Journal Title
Journal ISSN
Volume Title
Publisher
Abstract
Research and testing in networking sometimes require experiments that utilize real application network traffic. However, the process for obtaining production network traffic data from industry partners for testing novel algorithms, protocols, and network functions is a significant pain point for many researchers in academia. Many industry operators are reluctant to share network traffic data with third parties to avoid violating privacy policies and avoid unintentional exposure of proprietary information to competitors. Therefore, many researchers resort to the use of synthetic traffic generators in networking experiments. Our survey of over 7000 networking research papers revealed that most research projects exclusively use constant/maximum throughput traffic generators in their evaluation experiments. These generators do not always generate traffic that is similar to real production traffic. They often blast out packets at fixed rates or rates based on statistical distributions. Existing realistic traffic generators are rarely used, and there is no standardized evaluation system for realistic traffic generators. Therefore, this work focuses on developing a new application-agnostic framework for producing abstract, high-fidelity models of application network traffic patterns for realistic application traffic generation in laboratory environments. The framework includes a comprehensive evaluation system for realistic traffic generation models. We evaluated the methods and algorithms applied in the framework, then we created and evaluated a new application traffic modeling method that combines clustering methods with stochastic modeling for realistic traffic modeling. The evaluation results reveal that traffic generated is similar to actual production traffic for many types of applications. This work's outcome is vital to researchers and industry operators in computer networking, especially those involved in large scale enterprise, data-center, and internet of things (IoT) network testing. The methods presented make it easy to investigate how various changes in a network's traffic patterns and infrastructure can impact its performance. Researchers can test new protocols and algorithms with realistic traffic derived from actual applications, without violating privacy policies or replaying extra-large traffic trace files.