networkx读取csv,从具有行和列标题的csv文件读取networkx图

I have a CSV file that represents the adjacency matrix of a graph. However the file has as the first row the labels of the nodes and as the first column also the labels of the nodes. How can I read this file into a networkx graph object? Is there a neat pythonic way to do it without hacking around?

My trial so far:

x = np.loadtxt('file.mtx', delimiter='\t', dtype=np.str)

row_headers = x[0,:]

col_headers = x[:,0]

A = x[1:, 1:]

A = np.array(A, dtype='int')

But of course this doesn't solve the problem since I need the labels for the nodes in the graph creation.

Example of the data:

Attribute,A,B,C

A,0,1,1

B,1,0,0

C,1,0,0

A Tab is the delimiter, not a comma tho.

解决方案

You could read the data into a structured array. The labels can be obtained from x.dtype.names, and then the networkx graph can be generated using nx.from_numpy_matrix:

import numpy as np

import networkx as nx

import matplotlib.pyplot as plt

# read the first line to determine the number of columns

with open('file.mtx', 'rb') as f:

ncols = len(next(f).split('\t'))

x = np.genfromtxt('file.mtx', delimiter='\t', dtype=None, names=True,

usecols=range(1,ncols) # skip the first column

)

labels = x.dtype.names

# y is a view of x, so it will not require much additional memory

y = x.view(dtype=('int', len(x.dtype)))

G = nx.from_numpy_matrix(y)

G = nx.relabel_nodes(G, dict(zip(range(ncols-1), labels)))

print(G.edges(data=True))

# [('A', 'C', {'weight': 1}), ('A', 'B', {'weight': 1})]

The nx.from_numpy_matrix has a create_using parameter you can use to specify the type of networkx Graph you wish to create. For example,

G = nx.from_numpy_matrix(y, create_using=nx.DiGraph())

makes G a DiGraph.

  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值