DataFrameをパンダにするJSON

Question

次のように、緯度経度座標で指定されたパスに沿って、Google Maps APIから標高データを抽出します。

from urllib2 import Request, urlopen import json path1 = '42.974049,-81.205203|42.974298,-81.195755' request=Request('http://maps.googleapis.com/maps/api/elevation/json?locations='+path1+'&sensor=false') response = urlopen(request) elevations = response.read()

これは私にこのように見えるデータを与える：

elevations.splitlines() ['{', ' "results" : [', ' {', ' "elevation" : 243.3462677001953,', ' "location" : {', ' "lat" : 42.974049,', ' "lng" : -81.205203', ' },', ' "resolution" : 19.08790397644043', ' },', ' {', ' "elevation" : 244.1318664550781,', ' "location" : {', ' "lat" : 42.974298,', ' "lng" : -81.19575500000001', ' },', ' "resolution" : 19.08790397644043', ' }', ' ],', ' "status" : "OK"', '}']

ここにDataFrameとして入れたときに私が得るものです：

enter image description here

pd.read_json(elevations)

そしてここに私が欲しいものがあります：

enter image description here

これが可能かどうかはわかりませんが、主に私が探しているのは、標高、緯度、経度のデータをパンダデータフレームにまとめることができる方法です（派手なmutilineヘッダは必要ありません）。

誰かがこのデータを使って作業するのを手伝ったり助言をしたりすることができればそれは素晴らしいことです！あなたが言うことができないなら私は前にjsonデータであまり働いていないと言うことができます...

編集：

この方法はそれほど魅力的ではありませんが、うまくいくようです。

data = json.loads(elevations) lat,lng,el = [],[],[] for result in data['results']: lat.append(result[u'location'][u'lat']) lng.append(result[u'location'][u'lng']) el.append(result[u'elevation']) df = pd.DataFrame([lat,lng,el]).T

緯度、経度、高度の列を持つデータフレームになる

enter image description here

pbreach · Accepted Answer

私は、pandas 0.13の最新リリースに含まれているjson_normalize関数を使用したいことに対する迅速で簡単な解決策を見つけました。

from urllib2 import Request, urlopen import json from pandas.io.json import json_normalize path1 = '42.974049,-81.205203|42.974298,-81.195755' request=Request('http://maps.googleapis.com/maps/api/elevation/json?locations='+path1+'&sensor=false') response = urlopen(request) elevations = response.read() data = json.loads(elevations) json_normalize(data['results'])

これは私がグーグルマップAPIから得たjsonデータを持つニースフラットデータフレームを与えます。

Rishu · Answer

この断片をチェックしてください。

# reading the JSON data using json.load() file = 'data.json' with open(file) as train_file: dict_train = json.load(train_file) # converting json dataset from dictionary to dataframe train = pd.DataFrame.from_dict(dict_train, orient='index') train.reset_index(level=0, inplace=True)

それが役に立てば幸い：）

Rapha&#235;l Braud · Answer

最初にPythonの辞書にあなたのjsonデータをインポートすることができます：

data = json.loads(elevations)

その後、その場でデータを変更します。

for result in data['results']: result[u'lat']=result[u'location'][u'lat'] result[u'lng']=result[u'location'][u'lng'] del result[u'location']

JSON文字列を再構築します。

elevations = json.dumps(data)

最後に：

pd.read_json(elevations)

Pandaは辞書から直接DataFrameを作成できると思います（長い間使っていません：p

billmanH · Answer

問題は、データフレーム内にいくつかの列があり、その中に小さい方の幅の辞書が含まれていることです。便利なJsonは頻繁に入れ子になっています。私は欲しい情報を新しいコラムに引き出す小さな関数を書いています。そのように私は私が使いたいフォーマットでそれを持っています。

for row in range(len(data)): #First I load the dict (one at a time) n = data.loc[row,'dict_column'] #Now I make a new column that pulls out the data that I want. data.loc[row,'new_column'] = n.get('key')

niltoid · Answer

billmanHの解決策は私を助けましたが、私が切り替えたまではうまくいきませんでした。

n = data.loc[row,'json_column']

に：

n = data.iloc[[row]]['json_column']

残りはここにあります、辞書に変換することはjsonデータを扱うのに役立ちます。

import json for row in range(len(data)): n = data.iloc[[row]]['json_column'].item() jsonDict = json.loads(n) if ('mykey' in jsonDict): display(jsonDict['mykey'])

AB Abhi · Answer

python3.xはurllib2をサポートしていないため、承認された回答の新しいバージョンです。

from requests import request import json from pandas.io.json import json_normalize path1 = '42.974049,-81.205203|42.974298,-81.195755' response=request(url='http://maps.googleapis.com/maps/api/elevation/json?locations='+path1+'&sensor=false', method='get') elevations = response.json() elevations data = json.loads(elevations) json_normalize(data['results'])

loganbvh · Answer

受け入れられた回答によってフラット化されたDataFrameが得られたら、次のように列をMultiIndex（ "fancy multiline header"）にすることができます。

df.columns = pd.MultiIndex.from_tuples([Tuple(c.split('.')) for c in df.columns])

MIKHIL NAGARALE · Answer

#Use the small trick to make the data json interpret-able #Since your data is not directly interpreted by json.loads() >>> import json >>> f=open("sampledata.txt","r+") >>> data = f.read() >>> for x in data.split("
"): ... strlist = "["+x+"]" ... datalist=json.loads(strlist) ... for y in datalist: ... print(type(y)) ... print(y) ... ... <type 'dict'> {u'0': [[10.8, 36.0], {u'10': 0, u'1': 0, u'0': 0, u'3': 0, u'2': 0, u'5': 0, u'4': 0, u'7': 0, u'6': 0, u'9': 0, u'8': 0}]} <type 'dict'> {u'1': [[10.8, 36.1], {u'10': 0, u'1': 0, u'0': 0, u'3': 0, u'2': 0, u'5': 0, u'4': 0, u'7': 0, u'6': 0, u'9': 0, u'8': 0}]} <type 'dict'> {u'2': [[10.8, 36.2], {u'10': 0, u'1': 0, u'0': 0, u'3': 0, u'2': 0, u'5': 0, u'4': 0, u'7': 0, u'6': 0, u'9': 0, u'8': 0}]} <type 'dict'> {u'3': [[10.8, 36.300000000000004], {u'10': 0, u'1': 0, u'0': 0, u'3': 0, u'2': 0, u'5': 0, u'4': 0, u'7': 0, u'6': 0, u'9': 0, u'8': 0}]} <type 'dict'> {u'4': [[10.8, 36.4], {u'10': 0, u'1': 0, u'0': 0, u'3': 0, u'2': 0, u'5': 0, u'4': 0, u'7': 0, u'6': 0, u'9': 0, u'8': 0}]} <type 'dict'> {u'5': [[10.8, 36.5], {u'10': 0, u'1': 0, u'0': 0, u'3': 0, u'2': 0, u'5': 0, u'4': 0, u'7': 0, u'6': 0, u'9': 0, u'8': 0}]} <type 'dict'> {u'6': [[10.8, 36.6], {u'10': 0, u'1': 0, u'0': 0, u'3': 0, u'2': 0, u'5': 0, u'4': 0, u'7': 0, u'6': 0, u'9': 0, u'8': 0}]} <type 'dict'> {u'7': [[10.8, 36.7], {u'10': 0, u'1': 0, u'0': 0, u'3': 0, u'2': 0, u'5': 0, u'4': 0, u'7': 0, u'6': 0, u'9': 0, u'8': 0}]} <type 'dict'> {u'8': [[10.8, 36.800000000000004], {u'1': 0, u'0': 0, u'3': 0, u'2': 0, u'5': 0, u'4': 0, u'7': 0, u'6': 0, u'9': 0, u'8': 0}]} <type 'dict'> {u'9': [[10.8, 36.9], {u'1': 0, u'0': 0, u'3': 0, u'2': 0, u'5': 0, u'4': 0, u'7': 0, u'6': 0, u'9': 0, u'8': 0}]}

Siva · Answer

JSONからDataFrameへの変換とその逆の変換を行う小さなユーティリティクラスは、次のとおりです。

# -*- coding: utf-8 -*- from pandas.io.json import json_normalize class DFConverter: #Converts the input JSON to a DataFrame def convertToDF(self,dfJSON): return(json_normalize(dfJSON)) #Converts the input DataFrame to JSON def convertToJSON(self, df): resultJSON = df.to_json(orient='records') return(resultJSON)