Yahoo Answers is shutting down on May 4th, 2021 (Eastern Time) and beginning April 20th, 2021 (Eastern Time) the Yahoo Answers website will be in read-only mode. There will be no changes to other Yahoo properties or services, or your Yahoo account. You can find more information about the Yahoo Answers shutdown and how to download your data on this help page.

How do I keep Yahoo robots from scanning my webpage? I have a robots.txt file in place, but it's not working.

We have "Slurp" as a disallow, which is supposedly the Yahoo search robot, but it's not working at all. We've managed to keep Google out for the most part, but have had no luck with Yahoo and can't get any answers from anyone working for Yahoo. Help!

2 Answers

Relevance
  • 2 decades ago
    Favorite Answer

    my understanding is that you don't disallow slurp to keep him away but that you dissalow to allow them. It's odd I know but dissalow means dissalowing them from reading that part and putting it's name in doesn't help

    To keep them all away you need to put in

    # go away

    User-agent: *

    Disallow: /

    with no disallows and the user agent being and asteric that means all robots. In this case the dissallow is asking for for subdirectories of your webapage to disallow robots to, leaving it blank means the entire site.

  • 2 decades ago

    Yahoo is a search engine that obeys robot.txt, your robot.txt mus be written in a good manner though.

    Have you done these?

    - robot.txt must be written in a certain syntax, which is (very) strict, even a single extra space might make some robots unable to understand your robot.txt

    - robot.txt must be placed in root, not subfolders

    - Be aware that other robot may also be named same to Yahoo's robot, and these robots might ignore the robot.txt

Still have questions? Get your answers by asking now.