对有重复的结果筛选

最新推荐文章于 2022-04-04 22:01:27 发布

沈帅杰

最新推荐文章于 2022-04-04 22:01:27 发布

阅读量213

点赞数 1

分类专栏：数据处理文章标签： python

本文链接：https://blog.csdn.net/weixin_45452300/article/details/107614355

版权

数据处理专栏收录该内容

14 篇文章 5 订阅

订阅专栏

结果每三个为一个重复，选择其中两个差距小的平均
数据如下
在这里插入图片描述

"""
author: shuaijie
intro: 在三个测氮的值中选择两个相近的平均
date: 07/27/2020 11:28
"""
import pandas as pd


def main():
    fp = pd.read_excel(r'C:\Users\admire\Desktop\测氮结果示例.xlsx')  # 读取数据
    result = []
    identify = []
    for i in range(int(len(fp)/3)):
        std1 = (fp.iloc[i*3, 1] - fp.iloc[i*3+1, 1])**2
        std2 = (fp.iloc[i*3, 1] - fp.iloc[i*3+2, 1])**2
        std3 = (fp.iloc[i*3+2, 1] - fp.iloc[i*3+1, 1])**2  # 计算方差
        if min(std1, std2, std3) == std1:  # 选择方差小的两个值
            result.append((fp.iloc[i*3, 1] + fp.iloc[i*3+1, 1])/2)
        elif min(std1, std2, std3) == std2:
            result.append((fp.iloc[i*3, 1] + fp.iloc[i*3+2, 1])/2)
        else:
            result.append((fp.iloc[i*3+2, 1] + fp.iloc[i*3+1, 1])/2)
        identify.append(i)
        identify.append(i)
        identify.append(i)  # 定义位置三个一组，计算原始方差和均值
    fp.insert(0, 'ID', identify)
    data_mean = fp.groupby(by='ID').mean()
    data_std = fp.groupby(by='ID').std()
    final_data = pd.merge(data_mean, data_std, on='ID', how='left')
    final_data['最终值'] = pd.Series(result)
    final = final_data[['值_x', '值_y', '最终值']]  # 提取需要的值
    final_2 = final.rename(columns={'值_x': '平均', '值_y': '方差'})  # 改变列名
    final_2.to_excel(r'C:\Users\admire\Desktop\测氮结果筛选.xlsx')  # 输出结果


if __name__ == '__main__':
    main()

结部分
部分结果

沈帅杰

关注

1
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
对有重复的结果筛选

结果每三个为一个重复，选择其中两个差距小的平均数据如下"""author: shuaijieintro: 在三个测氮的值中选择两个相近的平均date: 07/27/2020 11:28"""import pandas as pddef main(): fp = pd.read_excel(r'C:\Users\admire\Desktop\测氮结果示例.xlsx') # 读取数据 result = [] identify = [] for i in r
复制链接

扫一扫

专栏目录