Dataset Info

American College Football (American college football club network)

Mark Newman provides a 'football.gml' file which contains the network of American football games between Division IA colleges during the regular season of Fall 2000. There are two issues with the original GN (Girvan-Newman) file.

First, three teams met twice in one season, so the graph contains repeated edges and is not simple. This is easily dealt with if required.
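
One way to deal with the repeated games is to collapse them into single undirected edges. The following is a minimal sketch, assuming the games have already been read into a list of (team, team) pairs; the function name simplify_edges is illustrative and is not part of the footballTSE* files.

```python
def simplify_edges(edges):
    """Return undirected edges with repeats and self-loops removed.

    The first-seen order of the edges is preserved, and each edge is
    stored with its smaller endpoint first so that (a, b) and (b, a)
    are treated as the same game.
    """
    seen = set()
    simple = []
    for u, v in edges:
        if u == v:
            continue  # a team cannot play itself; drop self-loops
        key = (u, v) if u <= v else (v, u)
        if key not in seen:
            seen.add(key)
            simple.append(key)
    return simple


# Hypothetical toy input: teams 1 and 2 meet twice in the season.
games = [(1, 2), (2, 3), (2, 1), (3, 4)]
print(simplify_edges(games))
```

If the number of meetings matters for the analysis, the repeat count could instead be kept as an edge weight rather than discarded.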

Secondly, the assignments to conferences (the node values) seem to be for the 2001 season and not the 2000 season. The games do appear to be for the 2000 season as stated. For instance, the Big West conference existed for football until 2000, while the Sun Belt conference only started in 2001. Also, there were 11 conferences and 5 independents in 2001 but 10 conferences and 8 independents in 2000. I have provided a set of files footballTSE* which define a simple graph with the correct conference assignments in the archive here.
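
The conference counts quoted above give a quick way to tell which season a set of node labels describes. The sketch below assumes the labels are available as a mapping from team to conference name and that independent teams share a single label; the label string "Independent" and the function name are hypothetical and may not match how the actual files encode independents.

```python
from collections import Counter


def summarize_conferences(labels, independent_label="Independent"):
    """Return (number of conferences, number of independent teams).

    `labels` maps each team to its conference name. Teams carrying
    `independent_label` are counted separately and are not treated
    as a conference.
    """
    sizes = Counter(labels.values())
    independents = sizes.pop(independent_label, 0)
    return len(sizes), independents


# Hypothetical toy labels, not taken from the dataset.
toy_labels = {
    "Team A": "Big West",
    "Team B": "Big West",
    "Team C": "Independent",
    "Team D": "Pac 10",
}
print(summarize_conferences(toy_labels))
```

Under these assumptions, labels for the 2000 season should give 10 conferences and 8 independents, while labels for the 2001 season should give 11 and 5.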

There is a README file included with more details. Further information about the problems with this data and their solutions is given in T.S. Evans, “Clique Graphs and Overlapping Communities”, J. Stat. Mech. (2010) P12037 [arXiv:1009.0638], which would be the appropriate source to cite along with the original GN publication.

Label data (ground truth): 节点标签(ground truth).xls
Girvan M, Newman M E J. Community structure in social and biological networks. Proceedings of the National Academy of Sciences, 2002, 99(12): 7821-7826.

Graph Information
Basic statistics

N and E are the numbers of nodes and links. 〈k〉 and 〈d〉 are the average degree and the average shortest-path distance, respectively. C and r are the average clustering coefficient and the assortativity coefficient. H is the degree heterogeneity. βc is the epidemic threshold of the SIR model.

N 115
E 613
〈k〉 10.6609
〈d〉 2.5082
C 0.4032
r 0.1624
H 0.04
βc 0.1027
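
Several of the statistics above can be reproduced from an edge list with the standard library alone. The sketch below is illustrative and is not necessarily how this page calculates its values: it assumes a connected, simple, undirected graph, counts the local clustering of nodes with fewer than two neighbours as zero, and assumes the common mean-field form βc = 〈k〉 / (〈k²〉 − 〈k〉) for the SIR threshold. H and r are left out because their exact definitions are not stated here. All function names are hypothetical.

```python
from collections import deque


def build_adjacency(edges):
    """Build an undirected adjacency map: node -> set of neighbours."""
    adj = {}
    for u, v in edges:
        if u == v:
            continue  # ignore self-loops
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)
    return adj


def average_degree(adj):
    """Mean number of neighbours per node, equal to 2E/N."""
    return sum(len(nbrs) for nbrs in adj.values()) / len(adj)


def average_clustering(adj):
    """Mean local clustering coefficient over all nodes."""
    total = 0.0
    for nbrs in adj.values():
        k = len(nbrs)
        if k < 2:
            continue  # assumption: contributes zero
        # Each link between two neighbours is counted twice in the sum.
        links = sum(len(adj[n] & nbrs) for n in nbrs) / 2
        total += 2 * links / (k * (k - 1))
    return total / len(adj)


def average_distance(adj):
    """Mean shortest-path length over reachable node pairs, via BFS."""
    total, pairs = 0, 0
    for source in adj:
        dist = {source: 0}
        queue = deque([source])
        while queue:
            u = queue.popleft()
            for v in adj[u]:
                if v not in dist:
                    dist[v] = dist[u] + 1
                    queue.append(v)
        total += sum(dist.values())
        pairs += len(dist) - 1
    return total / pairs


def epidemic_threshold(adj):
    """Assumed mean-field SIR threshold <k> / (<k^2> - <k>)."""
    degrees = [len(nbrs) for nbrs in adj.values()]
    k1 = sum(degrees) / len(degrees)
    k2 = sum(d * d for d in degrees) / len(degrees)
    return k1 / (k2 - k1)


# Hypothetical toy graph: a triangle (0, 1, 2) with a pendant node 3.
toy = build_adjacency([(0, 1), (1, 2), (2, 0), (2, 3)])
print(average_degree(toy), average_clustering(toy))
print(average_distance(toy), epidemic_threshold(toy))
```

As a sanity check on the listed values, 2E/N = 2 × 613 / 115 ≈ 10.6609, which agrees with the reported average degree.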