Pandas）のピボットテーブルの小計

Question

私は次のデータを持っています：

Employee Account Currency Amount Location Test 2 Basic USD 3000 Airport Test 2 Net USD 2000 Airport Test 1 Basic USD 4000 Town Test 1 Net USD 3000 Town Test 3 Basic GBP 5000 Town Test 3 Net GBP 4000 Town

次のようにすることで、なんとかピボットできます。

import pandas as pd table = pd.pivot_table(df, values=['Amount'], index=['Location', 'Employee'], columns=['Account', 'Currency'], fill_value=0, aggfunc=np.sum, dropna=True)

出力：

 Amount Account Basic Net Currency GBP USD GBP USD Location Employee Airport Test 2 0 3000 0 2000 Town Test 1 0 4000 0 3000 Test 3 5000 0 4000 0

場所ごとに小計を作成し、最後に最終的な総計を作成するにはどうすればよいですか。必要な出力：

 Amount Account Basic Net Currency GBP USD GBP USD Location Employee Airport Test 2 0 3000 0 2000 Airport Total 3000 0 2000 Town Test 1 0 4000 0 3000 Test 3 5000 0 4000 0 Town Total 5000 4000 4000 3000 Grand Total 5000 7000 4000 5000

following をフォローしてみました。しかし、それは望ましい出力を与えません。ありがとうございました。

piRSquared · Accepted Answer

ピボットテーブル

table = pd.pivot_table(df, values=['Amount'], index=['Location', 'Employee'], columns=['Account', 'Currency'], fill_value=0, aggfunc=np.sum, dropna=True, ) print(table) Amount Account Basic Net Currency GBP USD GBP USD Location Employee Airport Test 2 0 3000 0 2000 Town Test 1 0 4000 0 3000 Test 3 5000 0 4000 0

`pandas.concat`

pd.concat([ d.append(d.sum().rename((k, 'Total'))) for k, d in table.groupby(level=0) ]).append(table.sum().rename(('Grand', 'Total'))) Amount Account Basic Net Currency GBP USD GBP USD Location Employee Airport 2 0 3000 0 2000 Total 0 3000 0 2000 Town 1 0 4000 0 3000 3 5000 0 4000 0 Total 5000 4000 4000 3000 Grand Total 5000 7000 4000 5000

古い答え

後世のために

小計を作成する

tab_tots = table.groupby(level='Location').sum() tab_tots.index = [tab_tots.index, ['Total'] * len(tab_tots)] print(tab_tots) Amount Account Basic Net Currency GBP USD GBP USD Location Airport Total 0 3000 0 2000 Town Total 5000 4000 4000 3000

すべて一緒に

pd.concat( [table, tab_tots] ).sort_index().append( table.sum().rename(('Grand', 'Total')) )

lmo · Answer

これが機能するはずの2つのライナーです。 locメソッドでは、インデックスで行をサブセット化できます。multiIndexがあるため、左側の行挿入ポイントにlocタプルをフィードします。タプルなしで「Town」を使用すると、インデックスの対応するすべてのレベルがプルされます。

2行目では、DataFrameの最後の行をsumから削除する必要があり、その形状属性を使用してこれを行います。

In[1]: table.loc[('Town Total', ''),:] = table.loc['Town'].sum() table.loc[('Grand Total', ''),:] = table.iloc[:(table.shape[0]-1), :].sum() In[2]: table Out[2]: Amount Account Basic Net Currency GBP USD GBP USD Location Employee Airport 2 0 3000 0 2000 Town 1 0 4000 0 3000 3 5000 0 4000 0 Town Total 5000 4000 4000 3000 Grand Total 5000 7000 4000 5000