您现在的位置：程式師世界 >> 編程語言 > >> 更多編程語言 >> Python

Tips | share several pandas efficient functions

編輯：Python

Hello everyone ~

In this issue, I recommend some pandas Efficient data processing functions ( Continuous updating ), I hope it helped you ：

Dictionary creation Dataframe
Column splitting （split/extract）
columns （cat）
Fill left and right （pad）
Filter columns by type （select_dtypes）
Sort （rank）

1. Dictionary creation Dataframe

df_dict = {'name':['Alice_001','Bob_002','Cindy_003','Eric_004','Helen_005','Grace_006'],'sex':['female','male','female','male','female','male'],'math':[90,89,99,78,97,93],'english':[95,94,80,94,94,90]}
#[1]. Write parameters directly test_dict
df = pd.DataFrame(df_dict)
#[2]. Dictionary assignment
df = pd.DataFrame(data=df_dict)

2. Column splitting （split/extract）

Character splitting ：

df1[['name', 'id']] = df1['name'].str.split('_', 2, expand = True)

Regular expression splitting ：

df2 = df.copy()
df2['name2'] = df2['name'].str.extract('([A-Z]+[a-z]+)')
df2['id2'] = df2['name'].str.extract('(\d+)')

3. columns （cat）

Custom connector ：

df1["name_id"] = df1["name"].str.cat(df1["id"],sep='_'*3)

Merge output of a column ：

df1["name"].str.cat(sep='*'*5)

4. Fill left and right （pad）

padding-left ：

df1["id"] = df1["id"].str.pad(10,fillchar="*")
# amount to ljust()
df1["id"] = df1["id"].str.rjust(10,fillchar="*")

Right fill ：

df1["id"] = df1["id"].str.pad(10,side="right",fillchar="*")

Fill both sides ：

df1["id"] = df1["id"].str.pad(10,side="both",fillchar="*")

5. Filter columns by type （select_dtypes）

Filter numeric Columns ：

df1.select_dtypes(include=['float64', 'int64'])

Screening object Column ：

df1.select_dtypes(include=['object'])

6. Sort （rank）

English achievement ranking ：

df1['e_rank'] = df1['english'].rank(method='min',ascending=False)

94 There are three , So the three are tied for the first place 2.

The above is all the contents sorted out for you in this issue , Practice quickly

上一篇文章： Dry goods | variables in Python
下一篇文章： 90% of people dont know that Python already supports Chinese variable names!

Python

Python - Matplotlib drawing library use detailed 1 (column chart, line chart, pie chart, scatter chart, box chart)

一、基本介紹（1） Matplotlib 是 Pytho

Python bits and bits or how to understand

python The binary bit and Bit

《Python 常用技能》爬蟲入門必備—ip代理的優勢與使用方法

本文由呆呆敲代碼的小Y 原創學習專欄推薦：Unity系統

想用python編寫一個普查管理軟件，想問問思路和涉及的軟件

想設計一個普查管理軟件，要求大概如下圖所示。基本功能就是可以

計算機畢業設計Python+django的基於協同過濾算法的電影推薦系統(源碼+系統+mysql數據庫+Lw文檔）

項目介紹隨著社會的發展，人們生活水平的提高，欣賞電影逐漸成為

Django builds an integrated celery blog

Why integrate celery When inte