程序師世界是廣大編程愛好者互助、分享、學習的平台，程序師世界有你更精彩！


設為首頁	加入收藏

首頁
編程語言: C語言|JAVA編程
 Python編程
網頁編程: ASP編程|PHP編程
 JSP編程
數據庫知識: MYSQL數據庫|SqlServer數據庫
 Oracle數據庫|DB2數據庫

您现在的位置：程式師世界 >> 編程語言 > >> 更多編程語言 >> Python

python高考專業數據爬取

編輯：Python

# coding=utf-8
import json
import pandas as pd
import requests
def detail(page_num):
heads = {
'user-agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/91.0.4472.106 Safari/537.36'} # 請求頭
url = 'https://static-data.gaokao.cn/www/2.0/special/%s/pc_special_detail.json'#Url
d2 = pd.DataFrame()
#分頁爬取一頁10個,需要對pandas進行安裝pip install openpyxl
for i in range(1,page_num):
response = requests.get(url % (i), headers=heads)
if response!=None:
json_data = json.loads(response.text)
my_json = json_data['data'] # 獲得josn 數據的根目錄
df3 = pd.DataFrame({#d對my_json中文件進行獲取
'id':my_json['id'],
'name':my_json['name'],
'內容':my_json['content'],
'工作':my_json['job'],
'code':my_json['code'],
'degree':my_json['degree'],
'年限':my_json['limit_year'],
'男女比例':my_json['rate'],
'type':my_json['type'],
'type_detail':my_json['type_detail']
}, index=[0])
d2 = d2.append(df3, ignore_index=True)
print(d2)
d2.to_excel("major.xlsx", index=False)

detail(5)
————————————————
版權聲明：本文為CSDN博主「螺旋大西瓜」的原創文章，遵循CC 4.0 BY-SA版權協議，轉載請附上原文出處鏈接及本聲明。
原文鏈接：https://blog.csdn.net/weixin_45208256/article/details/124950788

上一篇文章： Python點運算符左右可以有任意多個空格、Tab
下一篇文章：【Python】數組整列賦值問題

Python

《大秦賦》最近有點火！於是我用Python抓取了“相關數據”，發現了這些秘密......

前言最近，最火的電視劇莫過於《大秦賦了》，自12月1日開播後

Python crawler series of a hi spelling reverse algorithm

Python A reverse algorithm of

這段代碼各部分代表啥意思呀，Python

第二行，第三行，還有讀完文件之後的部分，寫入前面那部分，寫入

Django項目——報錯處理

目錄DB Browser (SQLite)報錯情況DB Br

Python development environment setup (Windows)

The following provides a link

1.python program icon making

We usually use the program wil

相關文章

没有相关文章

閱讀排行榜

python django數據庫詳解第 4 天迭代器、生成器、裝飾器、正則表達式 A treasure cartoon avatar 50 yuan? 1 line of Python code, dont pay IQ tax again Python pycharm in version 3.8 is also the lxml successfully installed in the new version, but the etree object has no attribute HTML Django 運行時發生異常ValueError: save() prohibited to prevent data loss due to unsaved related object ‘user 【Python例子】列表綜合應用 - 「隨機分配辦公室」 Python USES the mkdir error: FileNotFoundError: [3] WinError system can not find the specified path. Briefly talk about the application of Python in algorithm, back-end and quantification Python環境搭建與輸入輸出 How to quickly implement a thread pool in Python?most complete (1) Ganglia python metric extension

熱門圖文

J2EE新手入門篇：“Spring”的名詞解釋調試-關於怎麼運行python 有以下幾個程序怎麼放一起運行得出結果 PHP加密解密函數 ASP.NET仿新浪微博下拉加載更多數據瀑布流效果 php explode函數實例代碼 ASP操作XML文件的完整實例代碼獲得connect string簡單方法 Delphi 注冊快捷鍵

欄目導航

編程綜合問答

更多關於編程

編程問題解答

Copyright © 程式師世界 All Rights Reserved