
When reading a file, can you skip characters that cannot be decoded and continue reading? (Language: Python)

Editor: Python

I have recently been learning to write a crawler for novels, and one web page comes out garbled. The page is declared as gb2312. I tried gb2312, gbk, and utf-8 in turn, and none of them could decode it. Because I crawl the text chapter by chapter, every error means a missing chapter, which is a real problem.
Is there a way to simply ignore the characters that cannot be decoded and write out the rest of the extracted content?
The download code is as follows:

    # download
    async def download(url, name):
        async with semaphore:
            async with aiohttp.ClientSession() as session:
                async with session.get(url) as reques:
                    reques.encoding = 'gbk'
                    page = bs4.BeautifulSoup(await reques.text(), 'html.parser')
                    div = page.find('div', class_="read_chapterDetail")
                    p = div.find_all('p')
                    # Open file, open mode, the data is binary
                    with open(f'{name}.txt', mode='wb') as f:
                        for i in p:
                            text = i.text + '\n'
                            f.write(text.encode('utf-8'))
                    print(f'{name} Download complete ！')
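One possible approach, offered as a sketch and not as a confirmed fix for this particular site: fetch the raw bytes and decode them yourself with a lenient error handler. Python's `bytes.decode` accepts `errors="ignore"` (drop undecodable bytes) or `errors="replace"` (substitute U+FFFD). The helper names `decode_lenient` and `save_text` below are hypothetical, and the choice of `gb18030` is an assumption: it is a superset of gb2312 and gbk, so it often decodes pages that are labeled gb2312 but contain characters outside that set.

```python
def decode_lenient(raw: bytes, encoding: str = "gb18030", errors: str = "ignore") -> str:
    """Decode raw page bytes without raising on invalid byte sequences.

    errors="ignore" silently drops undecodable bytes;
    errors="replace" inserts U+FFFD so the damage stays visible.
    """
    return raw.decode(encoding, errors=errors)


def save_text(path: str, text: str) -> None:
    """Write text as UTF-8; text mode with an explicit encoding avoids manual .encode()."""
    with open(path, mode="w", encoding="utf-8") as f:
        f.write(text)


if __name__ == "__main__":
    # Simulated page body: valid GBK text with one stray invalid byte in the middle.
    raw = "你好".encode("gbk") + b"\xff" + "世界".encode("gbk")
    print(decode_lenient(raw))                    # invalid byte dropped
    print(decode_lenient(raw, errors="replace"))  # invalid byte shown as U+FFFD
```

In the download coroutine above, the bytes could come from `await reques.read()` and be passed through `decode_lenient` before being handed to BeautifulSoup. Note that assigning `reques.encoding = 'gbk'` is the `requests` idiom; as far as I know it has no effect on what aiohttp's `text()` does, which may be why switching encodings seemed to change nothing.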
