Using python to process Big Data

Pandas is a great lib to process BIg Data.

1) pandas.pivot_table(data,values=None,columns=None,aggfunc=func)

func can be any function in python

2) pandas.merge(left,right,hpw='inner')

combine left with right based on the inner columns.

3) pandas.read_table(filepath_or_buffer,sep=' ',names=None)

I think《powerful Python data analysis toolkit》 is useful. And It's enough for us to use pandas.

原文地址:https://www.cnblogs.com/Victory-walt/p/5120927.html