程序師世界是廣大編程愛好者互助、分享、學習的平台，程序師世界有你更精彩！


設為首頁	加入收藏

首頁
編程語言: C語言|JAVA編程
 Python編程
網頁編程: ASP編程|PHP編程
 JSP編程
數據庫知識: MYSQL數據庫|SqlServer數據庫
 Oracle數據庫|DB2數據庫

您现在的位置：程式師世界 >> 編程語言 > >> 更多編程語言 >> Python

Python data extraction BS4

編輯：Python

Use bs4 Extract local html When you file , An encoding error occurred . as follows

#-*- coding = utf-8 -*-
#@Time : 2022/2/20 17:46
#@File : bs4 Data analysis .py
#@software : PyCharm
#bs4 Data analysis
# Principle of data analysis 1. Label positioning ,2. Extract tags , Data values stored in label properties
#bs4 1. Label positioning 1. Instantiate a BeautifulSoup object , And load the page source code into the object
#2. By calling BeautifulSoup Object for tag location and data extraction
# Environmental installation ：install bs4 pip install lxml
from bs4 import BeautifulSoup
# Object instantiation
#1. Local HTML You can only get the text content directly below the tag
# Will local html Load with this object
fp =open('./sogou.html','r',encoding='utf-8')
soup = BeautifulSoup(fp,'lxml')
fp.close()
print(soup)
#2. Load the source code of the page obtained from the Internet into the object

An error has occurred UnicodeEncodeError: 'gbk' codec can't encode character '\xa0' in position 7819: illegal multibyte

terms of settlement ：

import sys
import io
sys.stdout = io.TextIOWrapper(sys.stdout.buffer,encoding='utf8') # Change the default encoding of standard output
fp =open('./sogou.html','r',encoding='utf-8')
soup = BeautifulSoup(fp,'lxml')
fp.close()
print(soup.decode('utf-8'))

上一篇文章： Time module of Python
下一篇文章： Python crawler case

Python

Python separates different types of files

1. one2two Save the files in t

Summary of Python interview questions (II)

1、 sketch any() and all() Meth

Graphical Python | basic grammar

author ： Han Xinzi @ShowMeAI T

Django form

Django To ensure the correct d

Python efficient identification verification code minimal dddd

https://github.com/sml2h3/dddd

python自動化測試全棧之ApiFox批量運行腳本生成報告

【文章末尾給大家留下了大量的福利】安裝nodeN

相關文章

Pandas uses the split function to split the specific string data column of dataframe into two new data columns and generate a new dataframe

51job crawler + data visualization Python

Python data structure problems

Introduction to Python data structure and algorithm

Django project - order module (next) and data statistics_ 11 [more readable version]

Python data analysis - pandas data structure (dataframe)

Python data analysis science library pandas (statistical analysis and decision)

Python -- data visualization using Matplotlib Library

I read a value from a file. How can I make this data value locate according to the value I read (Language Python)

Python implements the cell filling color of data required in Excel

閱讀排行榜

How to become a senior digital IC Design Engineer (4-4) script: file comparison operation implemented by Python script Python線程池ThreadPoolExecutor詳細介紹 The python live server library is extremely stubborn, and the file cannot be updated after modification. What should I do? python-opencv rotation correction python中pytorch、torchaudio、torchvision版本對應信息如何在ubuntu下安裝任意版本python Python processing Excel Python魔法方法(4):__call __(self, *args, **kwargs) 方法【機器學習基礎】用Python畫出幾種常見機器學習二分類損失函數 Talking about the K-means algorithm and implementation (based on Python) Ive just learned python. I want to ask you some questions

熱門圖文

Java Servlet 編程及應用之Cookie的使用方法 java系列1 環境變量配置，java環境變量 php有效防止同一用戶多次登錄，php有效防止登錄 [javaSE] GUI（圖形用戶界面），javasegui poj3414 Pots C# 公式計算(字符串)，關於JSP Basic表單驗證的資料及個人實踐總結 PHP 和 MySQL 基礎教程（四）

欄目導航

編程綜合問答

更多關於編程

編程問題解答

Copyright © 程式師世界 All Rights Reserved