Skip to content

A search engine built from scratch including - crawler, indexer and a ranker. Please check out the code here :)

Notifications You must be signed in to change notification settings

HackStrix/search-engine-v2

Repository files navigation

Vision Search Engine Project

Project Logo

Welcome to Your Search Engine Project! This open-source search engine is designed to help you efficiently crawl, index, and search through web content. Whether you're building a web application, conducting research, or just curious about how search engines work, this project can serve as a valuable resource.

Table of Contents

Features

  • Web Crawler: A robust web crawler that efficiently traverses websites and collects data for indexing.
  • Indexer: An indexing mechanism that organizes crawled data for quick and efficient searching.
  • Frontend: A user-friendly web interface for searching and browsing the indexed content.
  • Customization: Easily customize and configure the crawler, indexer, and frontend to suit your needs.
  • Scalability: Designed to scale with the size of your data and user traffic.
  • Open Source: Built with open-source technologies and available for collaboration.

Demo

Demo Screenshot

Check out our live demo to see the search engine in action.

Installation

Follow these steps to get your search engine up and running:

  1. Clone the Repository:

    git clone https://github.com/yourusername/your-search-engine.git
    cd your-search-engine
  2. Install Dependencies:

    # Install backend dependencies
    pip install -r requirements.txt
    
    # Install frontend dependencies
    cd frontend
    npm install
  3. Configuration:

    • Configure the crawler settings in crawler.py.
    • Customize the indexer settings in indexer.py.
    • Set up your frontend configuration in frontend/src/config.js.
  4. Database Setup:

    • Create a database for storing indexed data.
    • Update the database connection settings in config.py.
  5. Run the Application:

    # Start the crawler
    python crawler.py
    
    # Start the indexer
    python indexer.py
    
    # Start the frontend
    cd frontend
    npm start
  6. Access the Application: Open your web browser and go to http://localhost:3000 to use the search engine.

Usage

  • Use the search bar on the frontend to perform searches.
  • Browse through search results and click on links to view full content.
  • Customize the project to integrate with your own website or application.

Contributing

We welcome contributions from the community! If you'd like to contribute to the development of this search engine, please follow our Contributing Guidelines.

License

This project is licensed under the MIT License - see the LICENSE file for details.

About

A search engine built from scratch including - crawler, indexer and a ranker. Please check out the code here :)

Resources

Stars

Watchers

Forks