Normalize a pandas DataFrame by row

Question

What is the most idiomatic way to normalize each row of a pandas DataFrame? Normalizing the columns is easy, so one (very ugly!) option is:

(df.T / df.T.sum()).T

pandas' broadcasting rules prevent df / df.sum(axis=1) from doing this.
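To make the problem concrete, here is a minimal sketch. The two-row frame and its labels are made up for illustration and are not from the original question. Dividing a DataFrame by a Series matches the Series index against the DataFrame's columns, so the row sums (indexed by row label) find no matching column and every cell comes out as NaN. The transpose workaround succeeds because the row labels become columns first.

```python
import pandas as pd

# Hypothetical example data: two rows labelled "x" and "y".
df = pd.DataFrame({"a": [1.0, 3.0], "b": [3.0, 1.0]}, index=["x", "y"])

row_sums = df.sum(axis=1)  # Series indexed by the row labels "x", "y"

# The Series index ("x", "y") is aligned with the columns ("a", "b").
# No labels match, so the result contains only NaN.
naive = df / row_sums
print(naive)

# The transpose workaround from the question: after .T the row labels
# are columns, so the alignment finds matching labels.
via_transpose = (df.T / df.T.sum()).T
print(via_transpose)
```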

joris · Accepted Answer

To overcome the broadcasting issue, you can use the div method:

df.div(df.sum(axis=1), axis=0)

See http://pandas.pydata.org/pandas-docs/stable/basics.html#matching-broadcasting-behavior
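As a quick check of the approach, the sketch below applies div with axis=0 to a small made-up frame (the data is an assumption for illustration). With axis=0 the Series of row sums is matched against the row index rather than the columns, so each row is divided by its own sum.

```python
import pandas as pd

# Hypothetical example data.
df = pd.DataFrame({"a": [1.0, 3.0], "b": [3.0, 1.0]})

# axis=0 aligns the row sums with the DataFrame's index (the rows).
normalized = df.div(df.sum(axis=1), axis=0)
print(normalized)

# Every row of the result should now sum to 1.
print(normalized.sum(axis=1))
```

Note that a row whose sum is zero would produce NaN or inf values, so such rows may need separate handling.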

Rafa · Answer

I would suggest using the scikit-learn preprocessing library and transposing your DataFrame as required:

'''
Created on 05/11/2015

@author: rafaelcastillo
'''

import matplotlib.pyplot as plt
import pandas
import random
import numpy as np
from sklearn import preprocessing


def create_cos(number_graphs, length, amp):
    # This function is used to generate cos-kind graphs for testing
    # number_graphs: number of graphs to plot
    # length: number of points included in the x axis
    # amp: Y domain modifications to draw different shapes
    x = np.arange(length)
    amp = np.pi * amp
    xx = np.linspace(np.pi * 0.3 * amp, -np.pi * 0.3 * amp, length)
    for i in range(number_graphs):
        # Each graph is a cosine curve with a little random noise
        iterable = (2 * np.cos(value) + random.random() * 0.1 for value in xx)
        # np.float was removed from NumPy; the built-in float is equivalent
        y = np.fromiter(iterable, float)
        if i == 0:
            yfinal = y
            continue
        yfinal = np.vstack((yfinal, y))
    return x, yfinal


x, y = create_cos(70, 24, 3)
data = pandas.DataFrame(y)

x_values = data.columns.values
num_rows = data.shape[0]

# Plot every row of the raw data
fig, ax = plt.subplots()
for i in range(num_rows):
    ax.plot(x_values, data.iloc[i])
ax.set_title('Raw data')
plt.show()

# MinMaxScaler works column-wise, so transpose to scale each original row
std_scale = preprocessing.MinMaxScaler().fit(data.transpose())
df_std = std_scale.transform(data.transpose())
data = pandas.DataFrame(np.transpose(df_std))

# Plot every row of the normalized data
fig, ax = plt.subplots()
for i in range(num_rows):
    ax.plot(x_values, data.iloc[i])
ax.set_title('Data Normalized')
plt.show()
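For readers who only want the normalization step, here is a condensed sketch of the same transpose-and-scale idea without the plotting. The small frame is made up for illustration. Note that MinMaxScaler rescales each row to the range [0, 1] (min-max scaling), which is a different normalization from dividing each row by its sum.

```python
import pandas as pd
from sklearn.preprocessing import MinMaxScaler

# Hypothetical example data: two rows on very different scales.
df = pd.DataFrame([[1.0, 2.0, 3.0], [10.0, 20.0, 40.0]])

# MinMaxScaler scales each column independently, so transpose first
# to make every original row a column, then transpose the result back.
scaled_values = MinMaxScaler().fit_transform(df.T.to_numpy()).T
scaled = pd.DataFrame(scaled_values, index=df.index, columns=df.columns)
print(scaled)
```

Passing the original index and columns back to the DataFrame constructor keeps the labels, which the longer example above discards.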