
Which library to use in order to simulate Google's full text search in my Python project?

I'm working on a personal project to try to simulate a search engine, which uses the PageRank algorithm and some sort of full text phrase search (pretty much what Google does). I have the contents of the web pages stored in a dictionary and I would like to index them based on some sort of match with a given phrase. Is there a recommended library which can do that? I found Elasticsearch and Whoosh, but I don't know enough about them to know if I'll be able to use them for my example. I need it to be usable in Python.

1 Answer

  • 2 years ago

    I don't know if this is helpful, but there's a free course at Udacity.com that has building a search engine (with the PageRank algorithm) as its course-long project.

    https://www.udacity.com/course/intro-to-computer-s...

    It's rated at 3 months, but if you already code, and know enough Python for this project, you can spin through the videos in a week or two to get the new information. I did just that a couple of years ago, before they added a little bit and expanded it from the old 7-week estimate.
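
If you'd rather see the idea before picking a library, here is a rough sketch of phrase search over a dictionary of pages using only the standard library. It is not how Google, Whoosh, or Elasticsearch actually do it; it just builds a positional inverted index (word -> page -> word positions) and checks that the words of the phrase appear at consecutive positions. The names `tokenize`, `build_index`, `phrase_search`, and the sample `pages` dictionary are all made up for illustration.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphanumeric words."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(pages):
    """Build a positional index: word -> {page key -> [word positions]}."""
    index = defaultdict(lambda: defaultdict(list))
    for url, text in pages.items():
        for pos, word in enumerate(tokenize(text)):
            index[word][url].append(pos)
    return index


def phrase_search(index, phrase):
    """Return {page key: number of exact phrase occurrences}."""
    words = tokenize(phrase)
    # Use `in` so a missing word is not silently added to the defaultdict.
    if not words or any(w not in index for w in words):
        return {}

    # Only pages that contain every word of the phrase can match.
    candidates = set(index[words[0]])
    for w in words[1:]:
        candidates &= set(index[w])

    hits = {}
    for url in candidates:
        later = [set(index[w][url]) for w in words[1:]]
        # A match starts at `start` if each following word sits at the
        # next consecutive position.
        count = sum(
            all(start + i + 1 in positions for i, positions in enumerate(later))
            for start in index[words[0]][url]
        )
        if count:
            hits[url] = count
    return hits


# Hypothetical sample data standing in for the asker's dictionary of pages.
pages = {
    "a.html": "The quick brown fox jumps over the lazy dog",
    "b.html": "A brown quick fox; the quick brown fox again, quick brown fox",
    "c.html": "Nothing relevant here",
}
index = build_index(pages)
results = phrase_search(index, "quick brown fox")
print(results)
```

From there you could order the matching pages by whatever PageRank score you compute for them. Both Whoosh and Elasticsearch support phrase queries out of the box, so something like this is mostly useful for understanding what they do for you, or for a small project where a dependency feels like overkill.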
