社交网络分析(Social Network Analysis)

这是一门由Lada Adamic教授在Coursera上的课程,介绍社交网络分析的基本概念,如Centrality、Betweeness、Modularity等,并探讨如何使用Gephi、NetLogo等工具进行分析。课程适合没有专业背景的程序员,涉及如何用R语言进行社交网络分析。课程内容包括社交网络图的表示方法、强弱连通组件和巨组件的概念,以及后续课程的问答环节。
摘要由CSDN通过智能技术生成

Soical Network Analysis 是密歇根大学Lada Adamic教授在Coursera上开的一门课。主要介绍了Social Network中的一些基本概念,比如Centrality,Betweeness,Modularity等等,如何使用工具(Gephi,NetLogo等等)去分析Social Network,如何检测Social Network中的Community,不需要任何专业基础。对于程序员出身的同学,课程中也提到了如何用R去分析Social Network。

第一周 

这周只是讲了一些 Social Network Graph的基本概念: 

Node 

Edge(directed edge, undirected edge)

Indegree

Outdegree

这些概念比较简单,不再重复了。


Social Network Graph的表示主要有三种方式:

1. Adjacency matrix

2. Edge list

3. Adjacency list

对下面这张简单的图,三种表示方式分别为:




1. Adjacency matrix



2. Edge list

2,3

2,4

3,2

3,4

4,5

5,1

5,2


3. Adjacency list

1:

2: 3 4

3: 2 4

4: 5

5: 1 2


接下来是最重要的一个概念了:strongly connected component, weakly connected component, giant component

Strongly connected component: Each node within the component can be reached from every other node in the component by following directed links.

任意两个节点必须相互能到达

Weakly connected component: Each node within the component can be reached from every other node by following either direction.

任意两个节点只需要有一个方向能到达就可以

Giant component:If the largest connect component encompasses a significant fraction of the graph, it is called the giant component.

到底多大比例算significant呢?maybe 5%, maybe 10%……


第二周

这周主要讲了两个随机网络模型,一个是Erdos-Renyi模型,另一个是 Barabasi-Albert模型。这周的概念涉及: 连通组件(强连通,弱连通),最大连通分支,平均最短距离,直径,广度优先搜索

第五周

Question 1

Download Lada's Facebook network. Load it in Gephi (as undirected). Note the number of nodes and edges present. Calculate the clustering coefficient and average shortest path (this is OK to take as-is even though the network is not connected). Next close the project, and after you have blank slate, generate an Erdos Renyi random graph (File > Generate > Random graph...) with the same number of nodes and edges (you'll have to figure out the corresponding wiring probability to achieve this). It will produce a directed network. Calculate the clustering coefficient and average shortest path for this random network, making sure to treat the network as undirected. Which of the following observations is true?

Here is the result:


Lada's Facebook network Erdros Renyi Random Graph(wiring probablity=0.049)
Nodes 388 388
Edges 3598 3620
Network Diameter 8 4
Connect Components 20 1
Average Clustering Coefficient 0.534 0.049
Average Shortest Path 2.781 2.34



Question 2

Download the a snapshot of the Gnutella peer-to-peer filesharing network (now over a decade old). Go through the same procedure: Load it in Gephi (as undirected). Note the number of n
评论 2
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值