我正在尝试编写一个函数来使用python3.4中的matplotlib生成Matlab风格的相关图(示例here)。但是,我想改变曲线图,使对角线子图显示变量名称,下三角子图显示皮尔逊相关系数,上三角子图显示散点图。下面是一些生成示例数据的代码和我编写的函数。它在正确的位置显示带有变量名和相关系数的4x4子图网格,但散点图没有显示出来。在import numpy as np
import matplotlib.pyplot as plt
means = [0, 1, 0, 2]
sig = [[1, 0.5, 0, -0.1], [0.5, 3, 0, 0.2], [0, -0.1, 1, -0.3], [-0.1, 0.2, -0.3, 1]]
data = np.random.multivariate_normal(means, sig, 50)
names = ['Var' + str(i) for i in range(data.shape[1])]
def corrplot(data, names):
corrMat = np.corrcoef(data, rowvar = 0)
numVars = data.shape[1]
fig, ax = plt.subplots(numVars, numVars, sharex = "col", sharey = "row")
fig.subplots_adjust(wspace = 0, hspace = 0)
for i in range(numVars):
for j in range(numVars):
if i == j: # On the diagonal
ax[i, j].text(0.5, 0.5, names[i], transform = ax[i, j].transAxes)
elif i < j: # In the upper triangle
ax[i, j].scatter(data[:, i], data[:, j], marker = '.')
elif i > j: # In the lower triangle
ax[i, j].text(0.5, 0.5, str(round(corrMat[i, j], 3)), transform = ax[i, j].transAxes)
plt.show()
为了找出问题的根源,我使用以下代码手动重建了2变量情况下的绘图,生成了所需的绘图:
^{pr2}$
因为这样做是可行的,我假设问题与在子批次中混合文本和数据无关。我编写了下一个函数来测试使用for循环填充子批次,它在每个子批次中按预期生成散点图。在def test1(data):
numVars = data.shape[1]
fig, ax = plt.subplots(numVars, numVars, sharex = "col", sharey = "row")
fig.subplots_adjust(wspace = 0, hspace = 0)
for i in range(numVars):
for j in range(numVars):
ax[i, j].scatter(data[:, i], data[:, j], marker = '.')
plt.show()
接下来,我尝试使用for循环只填充子批次的一个子集。这将生成一个如下所示的空白栅格。在def test2(data):
numVars = data.shape[1]
fig, ax = plt.subplots(numVars, numVars, sharex = "col", sharey = "row")
fig.subplots_adjust(wspace = 0, hspace = 0)
for i in range(numVars):
for j in range(i + 1, numVars):
ax[i, j].scatter(data[:, i], data[:, j], marker = '.')
plt.show()
这使我相信在for循环和散点图的创建过程中存在一些错误,但是我还没有找到错误。在