我有以下CSV数据:
id,gene,celltype,stem,bcell,tcell id,organs,bm,fl,pt,bm 134,foo,about_foo,20,10,11,23,22,79 222,bar,about_bar,17,13,55,12,88@H_301_3@我可以用这种方式成功地总结出来:
import pandas as pd df = pd.read_csv("http://dpaste.com/1X74TNP.txt",header=None,index_col=[1,2]).iloc[:,1:] df.columns = pd.MultiIndex.from_arrays(df.ix[:2].values) df = df.ix[2:].astype(int) df.index.names = ['cell','organ'] df = df.reset_index('organ',drop=True) result = df.groupby(level=[0,1],axis=1).mean() result = result.stack().replace(np.nan,0).unstack() result = result.swaplevel(0,1,axis=1).sort_index(axis=1)@H_301_3@看起来像:
In [341]: result Out[341]: bm fl pt bcell stem tcell bcell stem tcell bcell stem tcell cell foo 0 15 79 0 11 0 22.5 0 0 bar 0 15 88 0 55 0 12.5 0 0@H_301_3@我的问题是,从结果如何获得第一级列列索引作为列表:
['bm','fl','pt']@H_301_3@