General usage of pandas data processing | apply() function

This article introduces several common uses of the apply() function in pandas. apply() is very flexible: it can be called directly on a Series or a DataFrame and runs a function over the data (element by element for a Series, column by column or row by row for a DataFrame). It is convenient and efficient, in a style similar to NumPy.

When using apply(), you usually pass in a lambda expression or a named function as the operation. The official signature of apply() is:

DataFrame.apply(self, func, axis=0, raw=False, result_type=None, args=(), **kwds)



func: the function or lambda expression to apply;
axis: the axis along which the function is applied; two values are accepted, and the default is 0 (column-wise):
    0 or 'index': the function is applied to each column;
    1 or 'columns': the function is applied to each row;
raw: bool, default False:
    False: each row or column is passed to the function as a Series;
    True: each row or column is passed to the function as an ndarray.

After the function has been applied, apply() returns the result as a Series or a DataFrame.
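The effect of the axis and raw parameters described above can be checked with a minimal sketch. The small DataFrame and the variable names here are illustrative assumptions, not part of the original examples.

```python
import numpy as np
import pandas as pd

df = pd.DataFrame([[4, 9]] * 3, columns=["A", "B"])

# axis=0 (the default): the function is called once per column.
col_sums = df.apply(lambda col: col.sum())

# axis=1: the function is called once per row.
row_sums = df.apply(lambda row: row.sum(), axis=1)

# raw=False (the default): each column arrives as a pandas Series.
got_series = df.apply(lambda col: isinstance(col, pd.Series))

# raw=True: each column arrives as a NumPy ndarray instead.
got_ndarray = df.apply(lambda col: isinstance(col, np.ndarray), raw=True)

print(col_sums.to_dict())
print(row_sums.tolist())
print(got_series.tolist())
print(got_ndarray.tolist())
```

The column-wise result is indexed by column name, while the row-wise result is indexed by the row labels of the original DataFrame.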

The following examples show how apply() is used in practice.

Using apply() on a DataFrame

1. Calculate the square root of each element

For convenience, this example uses NumPy's sqrt function directly:

>>> import numpy as np
>>> import pandas as pd
>>> df = pd.DataFrame([[4, 9]] * 3, columns=['A', 'B'])
>>> df
   A  B
0  4  9
1  4  9
2  4  9
>>> df.apply(np.sqrt)
     A    B
0  2.0  3.0
1  2.0  3.0
2  2.0  3.0
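Because np.sqrt is a NumPy universal function, it can also be called on the DataFrame directly. The sketch below, which rebuilds the same DataFrame, compares the two spellings; the variable names are illustrative assumptions.

```python
import numpy as np
import pandas as pd

df = pd.DataFrame([[4, 9]] * 3, columns=["A", "B"])

# Square root through apply(), as in the example above.
via_apply = df.apply(np.sqrt)

# Square root by calling the NumPy function on the DataFrame itself.
via_numpy = np.sqrt(df)

# Both approaches give a DataFrame with the same values and labels.
print(via_apply.equals(via_numpy))
```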



2. Calculate the mean of each column

Here the data is passed to the function column by column, so axis = 0. Since 0 is the default, the argument can be omitted:

>>> df.apply(np.mean)
A    4.0
B    9.0
dtype: float64



3. Calculate the mean of each row

The difference from example 2 is that the data is passed to the function row by row, so the argument axis = 1 must be added:

>>> df.apply(np.mean, axis=1)
0    6.5
1    6.5
2    6.5
dtype: float64
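As a cross-check on the two mean examples, the results of apply(np.mean) can be compared with the built-in DataFrame.mean() method. This is only a sketch on the same sample data, and the variable names are illustrative assumptions.

```python
import numpy as np
import pandas as pd

df = pd.DataFrame([[4, 9]] * 3, columns=["A", "B"])

# Mean of each column (axis=0 is the default).
col_means = df.apply(np.mean)

# Mean of each row.
row_means = df.apply(np.mean, axis=1)

# The built-in method takes the same axis argument.
builtin_col_means = df.mean()
builtin_row_means = df.mean(axis=1)

print(col_means.tolist(), builtin_col_means.tolist())
print(row_means.tolist(), builtin_row_means.tolist())
```

For a plain aggregation like this, the built-in method is usually the shorter spelling; apply() becomes useful when the function is one you wrote yourself.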



4. Add a new column C whose values are the sum of columns A and B

The simplest way to do this takes a single line of code:

df['C'] = df.A + df.B



Here, however, we want to do it with apply() to show how it handles operations between columns. There are two steps:

1. Define a function that computes column A + column B;

2. Pass this function to apply(). Because the values must be added row by row, set axis = 1.

>>> def add_a(x):
...     # x is one row of the DataFrame, passed in as a Series
...     return x.A + x.B
...
>>> df['C'] = df.apply(add_a, axis=1)
>>> df
   A  B   C
0  4  9  13
1  4  9  13
2  4  9  13
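The signature shown earlier also includes args and **kwds, which forward extra arguments to the function. The sketch below uses them so that the row function is not tied to fixed column names; add_columns and its parameters first and second are hypothetical names introduced only for illustration.

```python
import pandas as pd

df = pd.DataFrame([[4, 9]] * 3, columns=["A", "B"])


def add_columns(row, first, second):
    """Return the sum of two named columns for a single row (hypothetical helper)."""
    return row[first] + row[second]


# Extra positional arguments are forwarded through args=...
df["C"] = df.apply(add_columns, axis=1, args=("A", "B"))

# ...and extra keyword arguments are forwarded as they are.
df["C_kw"] = df.apply(add_columns, axis=1, first="A", second="B")

print(df)
```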



Using apply() on a Series

Using apply() on a Series is similar to using it on a DataFrame. The main difference is that you first select a column, in the form DataFrame.column_name, and the function is then called on each element of that column.

1. Add 1 to column A

Without apply():

df.A = df.A + 1



With apply(), using a lambda function:

>>> df.A = df.A.apply(lambda x: x + 1)
>>> df
   A  B   C
0  5  9  13
1  5  9  13
2  5  9  13



2. Check whether each element of column A is divisible by 2, and mark it with Yes or No next to the value

>>> df.A = df.A.apply(lambda x: str(x) + "\tYes" if x % 2 == 0 else str(x) + "\tNo")
>>> df
        A  B   C
0  5\tNo  9  13
1  5\tNo  9  13
2  5\tNo  9  13
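The example above overwrites column A with strings. One possible variation, sketched below, keeps A numeric and writes the label to a separate column. The function parity_label, the column name A_even, and the sample values are hypothetical and chosen only for illustration.

```python
import pandas as pd

df = pd.DataFrame({"A": [5, 6, 7], "B": [9, 9, 9]})


def parity_label(value):
    """Return 'Yes' if value is divisible by 2, otherwise 'No' (hypothetical helper)."""
    return "Yes" if value % 2 == 0 else "No"


# Series.apply calls the function once per element of column A.
df["A_even"] = df["A"].apply(parity_label)

print(df)
```

Keeping the label in its own column means later arithmetic on A still works, which it would not if A held strings.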



Most uses of apply() follow the patterns above. The examples here are simple, but they are enough to understand the basic usage.

That's all for this article. Thank you for reading!
