NEUSNCP

全部数据集上传数据集


1. LFR1.txt
2. LFR6.txt
3. Euroroad.txt
4. Router.txt
5. lfr9000-0.5.txt
6. lfr8000-0.5.txt
7. lfr7000-0.5.txt
8. lfr6000-0.5.txt
9. lfr5000-0.5.txt
10. lfr4000-0.5.txt
11. lfr3000-0.5.txt
12. lfr2000-0.5.txt
13. lfr1000-0.5.txt
14. LFR_1000_0.1.txt
15. LFR_2500_0.1.txt
16. LFR随机生成数据集
17. facebook.txt
18. authors.net
19. journals.net
20. 罗格同义词词典
21. keywords.net
22. malaria.net
23. Geom Collaboration network in computational geometry
24. CSphd.net
25. The friendships of registered users in NEUSNCP
26. E. coli transcription networks. 大肠杆菌转录
27. leader2inter_st.txt
28. Social networks of positive sentiment
29. C.elegans.秀丽隐杆线虫代谢网络
30. SUNBELT 2013 DATA
31. eron邮件网络
32. tree15.txt
33. 环形结构网络示意图
34. 树形结构示例网络
35. 线性网络示例
36. Padgett.gml
37. London Multiplex Transport Network
38. EUAirTransportation.gml
39. mammalia-voles-plj-trapping-60 （田鼠诱捕网络）
40. Lesmis.gml 小说《悲惨世界》人物共现网络
41. bio-DM-LC.gml
42. road-roadNet-PA.gml
43. qspk_road.gml
44. dblp_author.gml
45. CORA
46. WV (a network of Wikipedia who-votes-on-whom).gml
47. Sex(a bipartite network in which nodes are females (sex sellers) and males (sex buyers) ).gml
48. PG (a snapshot of the Gnutella peer-to-peer file sharing network).gml
49. Facebook
50. Email (Rovirai Virgili University).gml
51. GrQc (is a collaboration network of eprint articles in arXiv categories General Relativity and Quantum Cosmology.).gml
52. A network of coauthorships between 379 scientists
53. Weibo user_relationships (预览失败，请直接下载)
54. Weibo_relationships (预览失败，请直接下载).gml
55. Status_user_bipartite.txt
56. PPI_Homo_sapiens_mcc.gml
57. PPI_Homo_sapiens.gml
58. web-EPA.gml
59. reactome.gml
60. facebook.gml
61. vidal.gml
62. Power (美国西部电力网络)
63. openflights.gml
64. LFR_2500.gml
65. LFR_5000_0.1.gml
66. LFR_100.gml
67. LFR_500.gml
68. book2.gml
69. sx-superuser1.gml
70. europe-airport3_network.gml
71. email22_network.gml
72. lncDN159_network.gml
73. email_network.gml
74. europe-airports_network.gml
75. brazil-airports_network.gml
76. Jazz musicians network 爵士乐音乐家合作网络（1）
77. email430.gml
78. tea_link.gml2
79. trans_polblogs.gml
80. (EEC) trans_email.gml
81. tea_link.gml
82. cora_edgelist.gml
83. citeseer.gml
84. cora.gml
85. terrorist_attack_loc_org.gml
86. terrorist_attack_loc.gml
87. sanguo_2.gml
88. sanguo.gml
89. students.gml
90. Indonesian_terrorists.gml
91. tudents.gml
92. 冰与火之歌-书5
93. 冰与火之歌-书4
94. 冰与火之歌-书3
95. 冰与火之歌-书2
96. 冰与火之歌-书1
97. The relationships of characters in the novel ``A Song of Ice and Fire''
98. Email-Europe-Research-Insisute-core network
99. netscience.gml (科学家合作网络，From Newman, 2006 )
100. sciencenet.gml
101. Jazz musicians network 爵士乐音乐家合作网络
102. USAir (USA_Air_Lines).gml
103. NEU信息安全专业学生选课数据集（手动采集）
104. BLS.gml
105. neusncp dataset [txt]
106. Tmall online shopping records
107. krackhardt_kite （风筝网络）
108. Southern Women Activities Networks (南部妇女活动网络)
109. Scotland Enterprise Network（苏格兰连锁企业网络）
110. 911_attack.gml
111. Books about US politics（美国政治书籍网络）.gml
112. American College football（美国大学足球俱乐部网络）
113. Dolphin social network（海豚社交网络）
114. Zachary's karate club（空手道俱乐部网络）

图预览 · 显示

数据集介绍

malaria.net

You have downloaded the "malaria" data set that was used by Larremore, Clauset, and Jacobs in the paper "Efficiently inferring community structure in bipartite networks."

http://danlarremore.com/bipartiteSBM
larremor@hsph.harvard.edu

// FILE LIST

There are 4 files:
   1. malaria.edgelist - a tab-separated list of edges in the malaria network, in the form: i j w. In this case, all weights w are equal to 1.
   2. malaria.types - a list of the types of all vertices in the malaria network, which is bipartite and comprises genes (type 1) and substrings (type 2).
   3. malaria.partition - the partition shown in Figures 6 and 7 of the paper.
   4. malaria.mat - A MATLAB file that contains:
       A - the adjacency matrix
       B - the bipartite adjacency matrix
       N_a, N_b - the numbers of genes and substrings, respectively.
       P_a, P_b - both weighted one-mode projections.
       geneSequences - the genes themselves, at the amino acid level. See note below.
       geneSequenceHeaders - the names of the genes
       substrings - the substrings that were extracted from the sequences
       g - the partition shown in Figures 6 and 7 of the paper.

// A NOTE ABOUT SEQUENCE DATA

These sequences were initially published by Thomas S. Rask, et al. but were analyzed using more traditional genetic techniques:

   Rask, T. S., Hansen, D. A., Theander, T. G., Gorm Pedersen, A., & Lavstsen, T. (2010). Plasmodium falciparum Erythrocyte Membrane Protein 1 Diversity in Seven Genomes – Divide and Conquer. PLoS Computational Biology, 6(9), e1000933. doi:10.1371/journal.pcbi.1000933

The same sequences were reanalyzed using complex networks in their Highly Variable Regions by Daniel B. Larremore, Aaron Clauset, and Caroline O. Buckee. The sequence data provided here correspond to HVR6 of their paper.

   Larremore, D. B., Clauset, A., & Buckee, C. O. (2013). A Network Approach to Analyzing Highly Recombinant Malaria Parasite Genes. PLoS Computational Biology, 9(10), e1003268. doi:10.1371/journal.pcbi.1003268.s010

数据预览

起点	终点	数值

社团结果

社团	大小	节点

图信息

基本统计 · 计算 说明

N and E are the number of nodes and links. 〈k〉 and 〈d〉 are the average degree and the average distance, respectively. C and r are the average clustering coefficient and the assortative coefficient. H is the degree heterogeneity. βc is the epidemic threshold of the SIR model.

N	1103
E	2964
<k>
<d>
<C>
r
H
beta_c

度分布 · 绘制

结果

社团个数：

模块度（Q）

运行时间（秒）

平均值

AUC:

准确率

召回率

F值