您当前的位置：首页 >

魔王不会哭

暂无认证

5浏览

0关注

149博文

0收益
0浏览

0点赞

0打赏

0留言

私信

关注

热门博文

别人的六一兴高彩烈,我的六一苦逼的敲代码采集壁纸~

魔王不会哭发布时间：2022-06-01 15:17:18 ，浏览量：5

前言

嗨喽，大家好呀！这里是魔王呐~

在这里插入图片描述

环境使用:

Python 3.8 解释器
Pycharm 编辑器

所使用模块

import re
import requests >>> pip install requests

如果安装python第三方模块:

win + R 输入 cmd 点击确定, 输入安装命令 pip install 模块名 (pip install requests) 回车
在pycharm中点击Terminal(终端) 输入安装命令

基本思路流程:

发送请求模拟浏览器对于url地址发送请求, 获取服务器返回响应数据伪装 headers 请求头
获取数据
解析数据提取我们想要的内容
保存数据

在这里插入图片描述

代码

import requests  # 用来发送请求模块
import re  # 提取数据工具
for page in range(6, 11):
    url = f'http://www.netbian.com/index_{page}.htm' # 发送请求
    # headers 字典数据类型,
    headers = {
        'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/101.0.4951.54 Safari/537.36',
    }
    response = requests.get(url=url, headers=headers)
    response.encoding = 'gbk'   # 获取网页内容,返回出现乱码
    print(response.text)  # 获取网页源代码
    # 获取壁纸名字以及壁纸详情页url地址  从什么地方找什么样数据内容,  从response.text 里面找
    # (.*?) 就是我们想要数据
    html_info = re.findall('
', response.text)
    print(html_info)
    for link, title in html_info:
        # http://www.netbian.com/desk/27062.htm
        link_url = 'http://www.netbian.com' + link  # 字符串拼接
        response_1 = requests.get(url=link_url, headers=headers)
        response_1.encoding = 'gbk'
        # print(response_1.text)
        img_url = re.findall('


    
        
        
        
            最近更新
            
                深拷贝和浅拷贝的区别（重点）
【Vue】走进Vue框架世界
【云服务器】项目部署—搭建网站—vue电商后台管理系统
【React介绍】 一文带你深入React
【React】React组件实例的三大属性之state，props，refs（你学废了吗）
【脚手架VueCLI】从零开始，创建一个VUE项目
【React】深入理解React组件生命周期----图文详解（含代码）
【React】DOM的Diffing算法是什么？以及DOM中key的作用----经典面试题
【React】1_使用React脚手架创建项目步骤--------详解(含项目结构说明)
【React】2_如何使用react脚手架写一个简单的页面？
            
        
        
        
            热门博客
            
                优秀的代码都是如何分层的？
Spring 最常用的 7 大类注解，史上最强整理！
别再用currentTimeMillis统计耗时了，太 Low，试试StopWatch吧！
HTTP 3.0彻底放弃TCP，TCP到底做错了什么？
为什么有些大公司技术弱爆了？
聊聊8 种架构模式，你经过几种？
同事写了一个责任链模式，bug无数...
那些让你起飞的计算机知识。。
3行代码写出8个接口，开挂了？
AI 加持实时互动｜ZegoAvatar ⾯部表情随动技术解析





        
        [ 申请 ]友情链接：
        
            ClashX教程
            绘画宝宝
            配音宝宝
        
    


    
        
            关于我们
            服务条款
            广告服务
            联系我们
            网站地图
            免责声明
            WAP
        
        技术支持：
            武汉快勤科技有限公司
            XML网站地图 
            备案号：鄂ICP备18027844号-9
            
        
    




    
        立即登录/注册
        
    
    
        
        微信扫码登录
    













	    基本
        文件
        流程
        错误
        SQL
        调试
    

		    
    
	请求信息 : 2025-07-04 17:19:34 HTTP/2.0 GET : /home/article/detail/id/406683.html
运行时间 : 0.1486s ( Load:0.0153s Init:0.0027s Exec:0.1254s Template:0.0051s )
吞吐率 : 6.73req/s
内存开销 : 1,635.23 kb
查询信息 : 11 queries 0 writes 
文件加载 : 18
缓存信息 : 5 gets 0 writes 
配置加载 : 132
会话信息 : SESSION_ID=9jl0ev8kev8mgd75vhis12uvko
    
    
        
    
	/www/wwwroot/www.chaojiit.com/index.php ( 1.30 KB )
/www/wwwroot/www.chaojiit.com/ThinkPHP/ThinkPHP.php ( 4.71 KB )
/www/wwwroot/www.chaojiit.com/ThinkPHP/Library/Think/Think.class.php ( 12.32 KB )
/www/wwwroot/www.chaojiit.com/ThinkPHP/Library/Think/Storage.class.php ( 1.38 KB )
/www/wwwroot/www.chaojiit.com/ThinkPHP/Library/Think/Storage/Driver/File.class.php ( 3.56 KB )
/www/wwwroot/www.chaojiit.com/Application/Runtime/common~runtime.php ( 76.63 KB )
/www/wwwroot/www.chaojiit.com/Application/Home/Conf/config.php ( 0.05 KB )
/www/wwwroot/www.chaojiit.com/ThinkPHP/Library/Behavior/ReadHtmlCacheBehavior.class.php ( 5.62 KB )
/www/wwwroot/www.chaojiit.com/Application/Home/Controller/ArticleController.class.php ( 6.10 KB )
/www/wwwroot/www.chaojiit.com/ThinkPHP/Library/Think/Model.class.php ( 67.27 KB )
/www/wwwroot/www.chaojiit.com/ThinkPHP/Library/Think/Db.class.php ( 5.70 KB )
/www/wwwroot/www.chaojiit.com/ThinkPHP/Library/Think/Db/Driver/Mysql.class.php ( 8.73 KB )
/www/wwwroot/www.chaojiit.com/ThinkPHP/Library/Think/Db/Driver.class.php ( 41.60 KB )
/www/wwwroot/www.chaojiit.com/ThinkPHP/Library/Think/Cache.class.php ( 3.84 KB )
/www/wwwroot/www.chaojiit.com/ThinkPHP/Library/Think/Cache/Driver/File.class.php ( 5.90 KB )
/www/wwwroot/www.chaojiit.com/Application/Runtime/Cache/Home/3c8a1a47a3534a7b1252c226abfc3928.php ( 15.77 KB )
/www/wwwroot/www.chaojiit.com/ThinkPHP/Library/Behavior/WriteHtmlCacheBehavior.class.php ( 1.43 KB )
/www/wwwroot/www.chaojiit.com/ThinkPHP/Library/Behavior/ShowPageTraceBehavior.class.php ( 5.27 KB )
    
    
        
    
	    
    
        
    
	[2] session_save_path(): open_basedir restriction in effect. File(/var/lib/php/session) is not within the allowed path(s): (/www/wwwroot/www.chaojiit.com/:/tmp/) /www/wwwroot/www.chaojiit.com/Application/Runtime/common~runtime.php 第 1 行.
[8192] Array and string offset access syntax with curly braces is deprecated /www/wwwroot/www.chaojiit.com/ThinkPHP/Library/Think/Cache/Driver/File.class.php 第 59 行.
[8] Undefined variable: user /www/wwwroot/www.chaojiit.com/Application/Runtime/Cache/Home/3c8a1a47a3534a7b1252c226abfc3928.php 第 35 行.
[8] Undefined variable: user /www/wwwroot/www.chaojiit.com/Application/Runtime/Cache/Home/3c8a1a47a3534a7b1252c226abfc3928.php 第 140 行.
[8] Trying to access array offset on value of type null /www/wwwroot/www.chaojiit.com/Application/Runtime/Cache/Home/3c8a1a47a3534a7b1252c226abfc3928.php 第 140 行.
[8] Undefined variable: user /www/wwwroot/www.chaojiit.com/Application/Runtime/Cache/Home/3c8a1a47a3534a7b1252c226abfc3928.php 第 141 行.
[8] Trying to access array offset on value of type null /www/wwwroot/www.chaojiit.com/Application/Runtime/Cache/Home/3c8a1a47a3534a7b1252c226abfc3928.php 第 141 行.
[8] Undefined variable: pinglun_list /www/wwwroot/www.chaojiit.com/Application/Runtime/Cache/Home/3c8a1a47a3534a7b1252c226abfc3928.php 第 150 行.
[8] Undefined variable: top_list /www/wwwroot/www.chaojiit.com/Application/Runtime/Cache/Home/3c8a1a47a3534a7b1252c226abfc3928.php 第 185 行.
    
    
        
    
	SELECT `value` FROM `configuration` WHERE `name` = 'site_name' LIMIT 1   [ RunTime:0.0006s ]
SELECT * FROM `menu` WHERE `fid` = 0 AND `status` = 1  [ RunTime:0.0008s ]
SELECT * FROM `menu` WHERE `fid` = 1 AND `status` = 1  [ RunTime:0.0005s ]
SELECT * FROM `menu` WHERE `fid` = 2 AND `status` = 1  [ RunTime:0.0004s ]
SELECT * FROM `menu` WHERE `fid` = 3 AND `status` = 1  [ RunTime:0.0055s ]
SELECT * FROM `menu` WHERE `fid` = 4 AND `status` = 1  [ RunTime:0.0005s ]
SELECT * FROM `article` WHERE `id` = 406683 LIMIT 1   [ RunTime:0.0006s ]
SELECT * FROM `bloger` WHERE `id` = 666 LIMIT 1   [ RunTime:0.0007s ]
SELECT COUNT(*) AS tp_count FROM `article` WHERE `bloger_id` = 666 LIMIT 1   [ RunTime:0.0006s ]
SELECT `content` FROM `article_content` WHERE `article_id` = 406683 LIMIT 1   [ RunTime:0.0330s ]
SELECT * FROM `article` WHERE `bloger_id` = 666 ORDER BY view_count desc LIMIT 0,10   [ RunTime:0.0137s ]
    
    
        
    
	    
    
    



0.1486s