# 慧企智投 robots — Hyperf SitemapController::robots() 动态替换 https://www.qihuizhitou.com/慧企智投 # 主搜索引擎默认全允许;吸血爬虫(SEO 工具/抓取库)显式 disallow # Sitemap 由 `php bin/hyperf.php sitemap` 命令生成到 public/sitemap.xml + public/sitemap/*.xml # ─── 全局规则:所有爬虫共享的 Disallow ─────────────────────────────── User-agent: * Allow: / # 用户中心 / 个人订单 / 支付回调 等登录后才能用的页面,搜索引擎收录无意义 Disallow: /user/ Disallow: /user.html Disallow: /pay/ # 安全页(登录 / 注册 / 找回密码) Disallow: /login.html Disallow: /register.html Disallow: /register_success.html Disallow: /forget.html Disallow: /afterRegister.html # 投诉 / 询价提交端点 Disallow: /suggestion.html Disallow: /inquiry/ # 带 keyword 参数的搜索结果(避免无限组合形成爬虫陷阱;品类页 /search/product?catid= 仍可被抓) Disallow: /search/product?keyword= Disallow: /search/company?keyword= Sitemap: https://www.qihuizhitou.com/sitemap.xml # ─── 显式放行:百度系搜索引擎(B2B 站点的核心流量来源)───────────── User-agent: Baiduspider Allow: / User-agent: Baiduspider-render Allow: / # 百度爱采购商业爬虫 —— 站点是百度爱采购官方合作伙伴,关键 User-agent: Baiduspider-cpro Allow: / # 百度图片搜索(商品图片是 B2B 站点重要流量来源) User-agent: Baiduspider-image Allow: / # 百度新闻 / 资讯 / 视频(资讯页 + 产品视频页) User-agent: Baiduspider-news Allow: / User-agent: Baiduspider-video Allow: / User-agent: Baiduspider-favo Allow: / # ─── 显式放行:国内其他主流搜索引擎 ─────────────────────────────── User-agent: Sogou web spider Allow: / User-agent: Sogou inst spider Allow: / User-agent: Sogou Pic Spider Allow: / User-agent: Sogou News Spider Allow: / User-agent: 360Spider Allow: / User-agent: 360Spider-Image Allow: / User-agent: 360Spider-Video Allow: / User-agent: HaosouSpider Allow: / # 神马搜索(UC / 阿里系) User-agent: YisouSpider Allow: / # ─── 显式放行:国际搜索引擎 ─────────────────────────────────────── User-agent: Googlebot Allow: / User-agent: Googlebot-Image Allow: / User-agent: Googlebot-News Allow: / User-agent: Bingbot Allow: / User-agent: BingPreview Allow: / User-agent: DuckDuckBot Allow: / User-agent: YandexBot Allow: / # ─── 显式放行:AI 训练 / AI 搜索爬虫(豆包 / 文心 / Kimi / 通义 / # ChatGPT / Claude / Gemini / Perplexity / DeepSeek)─ # 字节豆包 / 抖音搜索 User-agent: Bytespider Allow: / # OpenAI User-agent: GPTBot Allow: / User-agent: ChatGPT-User Allow: / User-agent: OAI-SearchBot Allow: / # Google Bard / Gemini 训练 User-agent: Google-Extended Allow: / # Anthropic Claude User-agent: ClaudeBot Allow: / User-agent: Claude-Web Allow: / User-agent: anthropic-ai Allow: / # Perplexity User-agent: PerplexityBot Allow: / User-agent: Perplexity-User Allow: / # Apple Intelligence User-agent: Applebot Allow: / User-agent: Applebot-Extended Allow: / # 其他 AI 平台 User-agent: cohere-ai Allow: / User-agent: Meta-ExternalAgent Allow: / User-agent: Meta-ExternalFetcher Allow: / User-agent: FacebookBot Allow: / User-agent: DeepSeekBot Allow: / User-agent: MistralAI-User Allow: / User-agent: YouBot Allow: / User-agent: Diffbot Allow: / User-agent: Amazonbot Allow: / # ─── 屏蔽:对收录无贡献的 SEO 工具 / 数据采集爬虫 ────────────────── User-agent: SemrushBot Disallow: / User-agent: AhrefsBot Disallow: / User-agent: MJ12bot Disallow: / User-agent: DotBot Disallow: / User-agent: MegaIndex.ru Disallow: / User-agent: MauiBot Disallow: / User-agent: BLEXBot Disallow: / User-agent: PetalBot Disallow: / User-agent: DataForSeoBot Disallow: / User-agent: SeznamBot Disallow: / User-agent: serpstatbot Disallow: / User-agent: SiteAuditBot Disallow: /