This book is about using Python Data control 、 Handle 、 Arrangement 、 Analysis and other aspects of the specific details and basic points . My goal is to introduce Python Library and tool environment for programming and data processing , Master these , It can make you a data analysis expert . Although the title of this book is “ Data analysis ”, The point is Python Programming 、 library , And tools for data analysis . This is what data analysis needs Python Programming .
When the book appears “ data ” when , What exactly does it mean ? It mainly refers to structured data (structured data), This deliberately vague term refers to data in all common formats , for example :
This is by no means a complete list . Most data sets can be transformed into structured forms more suitable for analysis and modeling , Although sometimes this is not obvious . If not , The features of the data set can also be extracted into a structured form . for example , A group of news articles can be processed into a word frequency table , And this word frequency table can be used for emotional analysis .
Most spreadsheet software ( such as Microsoft Excel, It is probably the most widely used data analysis tool in the world ) Users will not be unfamiliar with such data .
A lot of people ( Including myself )