Logging spider visits to static pages with nginx

Posted on 2021-5-14 09:18:59
# File: host.conf; add this inside the server block
==================================================
body_filter_by_lua_file /usr/local/nginx/conf/spider.lua;
==================================================

# File: spider.lua
==================================================
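-- Keep the log under /tmp (memory-backed file); with a file on disk, entries get dropped
local log_path = "/tmp/log_spider.txt"

-- A body filter runs once per response chunk, so skip requests that were already logged
if ngx.ctx.spider_logged then
        return
end

-- First check: only track article URLs (plain-text match, no Lua patterns)
local uri = ngx.var.request_uri
local ua = ngx.var.http_user_agent or "" -- the User-Agent header may be missing
if string.find(uri, "/article/", 1, true) ~= nil
then
        local check_baidu = string.find(ua, "Baiduspider", 1, true)
        local check_360 = string.find(ua, "360Spider", 1, true)
        local check_sogou = string.find(ua, "Sogou web spider", 1, true)

        -- Second check: is the visitor one of the known spiders?
        if check_baidu ~= nil or check_360 ~= nil or check_sogou ~= nil
        then
                ngx.ctx.spider_logged = true
                -- Binary mode "b" is usually unnecessary; make sure the nginx worker user has write permission
                local f, err = io.open(log_path, "a+")
                if f then
                        f:write(ngx.var.time_iso8601, "|", ngx.var.host, "|", uri, "|", ua, "|", ngx.var.remote_addr, "\r\n")
                        f:close()
                else
                        ngx.log(ngx.ERR, "spider log: cannot open ", log_path, ": ", err)
                end
        end

end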
==================================================
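
The three User-Agent checks in spider.lua could also be driven by a table, so that adding another spider is a one-line change. The following is only a minimal sketch in plain Lua, not part of the original script: the names SPIDERS, match_spider and format_line are hypothetical, and the sample host, URL and IP address are made up. It runs without nginx, which makes the matching logic easy to try out before wiring it into the filter.

```lua
-- Hypothetical helper, not from the original post: table-driven spider detection.
local SPIDERS = {
    { name = "baidu", token = "Baiduspider" },
    { name = "360",   token = "360Spider" },
    { name = "sogou", token = "Sogou web spider" },
}

-- Return the short name of the first spider whose token appears in the
-- User-Agent string, or nil when nothing matches or the header is missing.
local function match_spider(user_agent)
    if not user_agent then
        return nil
    end
    for _, spider in ipairs(SPIDERS) do
        -- Fourth argument true: treat the token as literal text, not a Lua pattern
        if string.find(user_agent, spider.token, 1, true) then
            return spider.name
        end
    end
    return nil
end

-- Build one log line with the same field order and separator as spider.lua.
local function format_line(time, host, uri, user_agent, addr)
    return table.concat({ time, host, uri, user_agent, addr }, "|") .. "\r\n"
end

-- Standalone usage with made-up values (no nginx required)
local ua = "Mozilla/5.0 (compatible; Baiduspider/2.0; +http://www.baidu.com/search/spider.html)"
if match_spider(ua) then
    io.write(format_line("2021-05-14T09:18:59+08:00", "example.com", "/article/1.html", ua, "203.0.113.7"))
end
```

Inside the filter, the same functions would take ngx.var.http_user_agent and the other ngx.var fields as arguments instead of the sample strings.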