Generating representative, live network traffic out of millions of code repositories

5Citations
Citations of this article
9Readers
Mendeley users who have this article in their library.

Abstract

In theory, any network operator, developer, or vendor should have access to large amounts of live network traffic for testing their solutions. In practice, though, that is not the case. Network actors instead have to use packet traces or synthetic traffic, which is highly suboptimal: today's generated traffic is unrealistic. We propose a system for generating live application traffic leveraging massive codebases such as GitHub. Our key observation is that many repositories have now become "orchestrable"thanks to the rise of container technologies. To showcase the practicality of the approach, we iterate through >293k GitHub repositories and manage to capture >74k traces containing meaningful and diverse network traffic. Based on this first success, we outline the design of a system, Dynamo, which analyzes these traces to select and orchestrate open-source projects to automatically generate live application traffic matching a user's specification.

Cite

CITATION STYLE

APA

Bühler, T., Schmid, R., Lutz, S., & Vanbever, L. (2022). Generating representative, live network traffic out of millions of code repositories. In HotNets 2022 - Proceedings of the 2022 21st ACM Workshop on Hot Topics in Networks (pp. 1–7). Association for Computing Machinery, Inc. https://doi.org/10.1145/3563766.3564084

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free