Yahoo不遵循robots.txt?
Yeeseo Team Posted on 2009-04-23 at 5:12
Yahoo Slurp抓走我从robots.txt中禁止的文件,原来我以为只有百度不太遵守robots.txt规范,现在看来也不尽然。
查看最近的蜘蛛抓取日志,我发现Yahoo! Slurp抓走了两个css文件,css是放在我用robots.txt禁止抓取的文件夹中,不知道Slurp什么时候开始也这么流氓了?
虽然不是见不得人的文件,可之前我一直相信大型搜索引擎除了百度都能很守规矩地遵循robots.txt,现在看来不见得。
从支持的http版本来看,Googlebot和Baiduspider不愧为搜索引擎市场领头羊手下的蜘蛛,支持http 1.1 Gzip的特性有效地减少了中间的传输流量。
This post is filed under 易优团队http://yeeseo.com/html/yahoo-slurp-disobey-robots-txt.html
成都SEO专家
- 相关阅读
- 随机文章
- SEO问答
- 热门标签
- 站内搜索
»»3条评论
How I Lost Thirty Pounds in Thirty Days posted on 09-05-04 at 14:24Reply
Hi, good post. I have been thinking about this issue,so thanks for writing. I will certainly be subscribing to your blog.
Yeeseo Team posted on 09-05-04 at 14:53Reply
@How I Lost Thirty Pounds in Thirty Days:
hi, sob guy!
You don't have to say this, your mom told me you've always been a liar.
God Father posted on 09-05-04 at 15:02Reply
@How I Lost Thirty Pounds in Thirty Days:
I subscribed a blog on mars~