We introduce a simple and useful view for observing graph streams. They are viewed as collections of edge events where each edge has associated a set of time-dependent statistics that include firing rate, recency, and persistence. The activity rate of any subgraph is expressed as an aggregation of its corresponding edge statistics. Salient subgraphs are detected by isolating through time those edges whose activity rate deviates substantially from the activity rate of the entire stream. These salient subgraphs exhibit some peculiar "herding" and "straying" behaviors that are humanly interpretable. The vertices involved in the creation of these salient behaviors cover a substantial portion of the entire graph stream. This coverage can be subject to both human and computer verification. All our computations are incremental and are accompanied by a visualization platform that integrates dynamic node link views of "recent" graph substreams with a tape view of the Top-K edge statistics to provide a compact overview of the graph stream. This platform has also been coupled to our modular Degree-of-Interest system for a closer investigation of those patterns found in the overview. We use Twitter data to illustrate our tools, but our approach is by no means confined to microblog data.
Mendeley saves you time finding and organizing research
Choose a citation style from the tabs below