How do I keep Yahoo robots from scanning my webpage? I have a robots.txt file in place, but it's not working.

Question

We have "Slurp" as a disallow, which is supposedly the Yahoo search robot, but it's not working at all.  We've managed to keep Google out for the most part, but have had no luck with Yahoo and can't get any answers from anyone working for Yahoo.  Help!

bobsmith2089 · Accepted Answer

my understanding is that you don't disallow slurp to keep him away but that you dissalow to allow them. It's odd I know but dissalow means dissalowing them from reading that part and putting it's name in doesn't help

To keep them all away you need to put in

# go away
User-agent: *
Disallow: /

with no disallows and the user agent being and asteric that means all robots. In this case the dissallow is asking for for subdirectories of your webapage to disallow robots to, leaving it blank means the entire site.

Lie Ryan · Answer

Yahoo is a search engine that obeys robot.txt, your robot.txt mus be written in a good manner though. 
Have you done these?
- robot.txt must be written in a certain syntax, which is (very) strict, even a single extra space might make some robots unable to understand your robot.txt
- robot.txt must be placed in root, not subfolders
- Be aware that other robot may also be named same to Yahoo's robot, and these robots might ignore the robot.txt

Trending News

How do I keep Yahoo robots from scanning my webpage? I have a robots.txt file in place, but it's not working.

2 Answers