您现在的位置：程式師世界 >> 編程語言 > >> 更多編程語言 >> Python

Some tips for data analysis - based on Python

編輯：Python

1. Reading data

We're passing pandas When reading data , There are many reading methods you can choose , Here are some tips .

import pandas as pd
df = pd.read_csv('../data/XXX.csv',sep='\t',nrows = 100)

1.1 When setting the path , Pay attention to the direction of the slash , At the same time, it is recommended to write as follows

df = pd.read_csv(r'C:\Users\12810\Desktop\temp\XXX.csv',sep='\t',nrows = 100) # The previous regular ‘r’, Do not change the slash direction

1.2 Set a read only 100 Samples , It is of great help to speed up the operation when writing code initially .

2. How to add a new column in the data frame

df['new_col'] = XX # Additional data

3.apply And lambda Mixed use of functions

df.apply(lambda x: x.split(','))
# more zip()、map()、lambda() For continuous use, please refer to ：
# https://blog.csdn.net/HG0724/article/details/117374802

4.‘_’ What's the use of defining variables

According to custom , Sometimes a single independent underscore is used as a name , To indicate that a variable is temporary or irrelevant .·

for _ in range(10):
print(' I am a ：',_)

 I am a ： 0
I am a ： 1
I am a ： 2
I am a ： 3
I am a ： 4
I am a ： 5
I am a ： 6
I am a ： 7
I am a ： 8
I am a ： 9

5. Count the element frequency of a column of data and express it with histogram

df['label'].value_counts().plot(kind='bar')

Used in data analysis , Find the data in a column , What is the number of elements , And visualize , Very practical .

6.Counter Module introduction

from collections import Counter # A counter module

Reference resources ：https://blog.csdn.net/ch_improve/article/details/89388389

# Count the number of characters 
str_1 = 'wdqdqwdqwqwd11dq2wd'
count_result = Counter(str_1) # 
print(count_result)

Counter({'d': 6, 'w': 5, 'q': 5, '1': 2, '2': 1})

count_result.most_common(3)

[('d', 6), ('w', 5), ('q', 5)]

上一篇文章： Python closure, decorator, syntax sugar
下一篇文章： python——使用API

Python

python繪制折線圖遇見的一些問題

1、我想要達到的結果繪制一個折線圖大概是橫坐標為產品名，縱坐

Detailed explanation of newline parameter instance in Python open function

Catalog The origin of the pro

python畢業設計作品基於django框架新聞信息管理系統畢設成品（1）開發概要

整個項目包含了：開題報告 + 開題報告PPT + 任務書 +

【機器學習】數據准備--python爬蟲

前言我們在學習機器學習相關內容時，一般是不需要我們自己去爬取

Python foundation 12 (iterator, generator, calculation of Fibonacci sequence)

Iterators and generators Iter

python opencv 同窗口顯示多個圖像