[Python] Fetching the sub-brands and car series for each brand from a list of car brands and their links

浪荡子爱自由, published 2022-03-14 21:39:39

import requests
from bs4 import BeautifulSoup


def get_series_data(url_b, car_brand):
    """Fetch the sub-brand and car-series links for one new-energy vehicle brand.

    url_b is the brand page URL and car_brand is the brand name. Returns a dict
    mapping (car_brand, series_name) to the URL of that series' page.
    """
    series_data = {}
    headers = {
        'authority': 'car.autohome.com.cn',
        'method': 'GET',
        'scheme': 'https',
        'accept': '*/*',
        # 'br' is left out: requests only decodes Brotli when the brotli package is installed
        'accept-encoding': 'gzip, deflate',
        'accept-language': 'zh-CN,zh;q=0.9',
        'cookie': 'fvlid=156974583432110wygoXZiH; sessionid=D7FE9717-245E-4F8D-8D42-AAF453D1F470%7C%7C2019-09-29+16%3A30%3A35.298%7C%7C0; autoid=851072202da5829e1b4e6cbb05975388; cookieCityId=110100; __ah_uuid_ng=c_D7FE9717-245E-4F8D-8D42-AAF453D1F470; area=460106; ahpau=1; sessionuid=D7FE9717-245E-4F8D-8D42-AAF453D1F470%7C%7C2019-09-29+16%3A30%3A35.298%7C%7C0; ahsids=3170; sessionip=153.0.3.115; Hm_lvt_9924a05a5a75caf05dbbfb51af638b07=1585205934,1585207311,1585266321; clubUserShow=87236155|692|2|%E6%B8%B8%E5%AE%A2|0|0|0||2020-03-27+08%3A35%3A50|0; clubUserShowVersion=0.1; sessionvid=0F2198AC-5A75-47E2-B476-EAEC2AF05F04; Hm_lpvt_9924a05a5a75caf05dbbfb51af638b07=1585269508; ahpvno=45; v_no=8; visit_info_ad=D7FE9717-245E-4F8D-8D42-AAF453D1F470||0F2198AC-5A75-47E2-B476-EAEC2AF05F04||-1||-1||8; ref=www.baidu.com%7C0%7C0%7C0%7C2020-03-27+08%3A38%3A40.425%7C2019-10-07+22%3A52%3A34.733',
        'sec-fetch-mode': 'no-cors',
        'sec-fetch-site': 'same-origin',
        'user-agent': 'Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/77.0.3865.90 Safari/537.36',
    }
    # headers must be passed by keyword: the second positional argument of requests.get is params
    response = requests.get(url_b, headers=headers)
    soup = BeautifulSoup(response.text, 'lxml')  # parse the page directly
    # The brand's <li> node has the id "b" + whatever sits between 'brand-' and '.html' in the URL
    brand_id = "b" + url_b[url_b.index('brand-') + 6: url_b.index('.html')]
    li_node = soup.find('li', id=brand_id)
    if li_node is None:  # brand node not found on the page: nothing to collect
        return series_data
    for i in li_node.find_all('dd'):
        for j in i.find_all('a', href=True):  # skip anchors that carry no link
            series_url = 'https://car.autohome.com.cn' + j['href']
            print("Series data:", (car_brand, j.text), "=", series_url)
            series_data[(car_brand, j.text)] = series_url
    return series_data
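
The function above handles one brand at a time, so something still has to walk the whole brand list and merge the per-brand results. Below is a minimal sketch of that driver loop, together with a regex version of the brand-id extraction. The names `brand_li_id`, `collect_series` and `SeriesMap` are hypothetical and not part of the original code, and the sketch assumes that brand URLs end in `brand-<digits>.html`. The fetcher is passed in as a parameter so the loop can be tried without touching the network.

```python
import re
from typing import Callable, Dict, Tuple

# (brand name, series name) -> series page URL, the same shape get_series_data returns
SeriesMap = Dict[Tuple[str, str], str]

# Assumption: brand page URLs end in "brand-<digits>.html"
_BRAND_ID = re.compile(r'brand-(\d+)\.html')


def brand_li_id(url_b: str) -> str:
    """Return the id of the brand's <li> node, i.e. 'b' followed by the number in the URL."""
    match = _BRAND_ID.search(url_b)
    if match is None:
        raise ValueError(f'no brand id found in {url_b!r}')
    return 'b' + match.group(1)


def collect_series(brand_links: Dict[str, str],
                   fetch: Callable[[str, str], SeriesMap]) -> SeriesMap:
    """Call fetch(url_b, car_brand) for every brand and merge the results into one dict."""
    all_series: SeriesMap = {}
    for car_brand, url_b in brand_links.items():
        all_series.update(fetch(url_b, car_brand))
    return all_series
```

With the function from this post, the call would look like `collect_series(brand_links, get_series_data)`, where `brand_links` maps each brand name to its brand page URL. Unlike the slicing approach, `brand_li_id` raises a clear `ValueError` when a URL does not match the assumed shape.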

Full source code download: https://download.csdn.net/download/weixin_56516468/84861598
