您当前的位置: 首页 >  Python

浪荡子爱自由

暂无认证

  • 2浏览

    0关注

    74博文

    0收益

  • 0浏览

    0点赞

    0打赏

    0留言

私信
关注
热门博文

【Python】B站详细信息抓取

浪荡子爱自由 发布时间:2022-03-08 22:27:00 ,浏览量:2

import time
import requests
import pandas as pd
import random
headers = {
    "user-agent":
    "Mozilla/5.0 (iPhone; CPU iPhone OS 11_0 like Mac OS X) AppleWebKit/604.1.38 (KHTML, like Gecko) Version/11.0 Mobile/15A372 Safari/604.1",
    "Referer": "https://www.bilibili.com/",
}

video_data = []
for page in range(1,2):
    api_url = f'https://api.bilibili.com/x/space/arc/search?mid=10330740&ps=30&tid=0&pn={page}&keyword=&order=pubdate&jsonp=jsonp'
    res = requests.get(api_url, headers=headers).json()
    video_list = res['data']['list']['vlist']
    for video in video_list:
        videos_dic = {}
        videos_dic["title"]=video['title']
        videos_dic["img"] = video["pic"]
        videos_dic['video_time'] = video['length']
        videos_dic['play_num'] = video['play']
        videos_dic['video_description'] = video['description']
        videos_dic['author'] = video['author']
        videos_dic['comment_num'] = video['comment']
        videos_dic['play_url'] = 'https://www.bilibili.com/video/' + video['bvid']
        video_data.append(videos_dic)
    time.sleep(random.randint(2, 5))

video_df = pd.DataFrame(video_data)

关注
打赏
1647272175
查看更多评论
立即登录/注册

微信扫码登录

0.0463s