mohamedsamirz/Searchly

 
 

Searchly

Searchly is a basic search engine, written in Java, that demonstrates web crawling, indexing, ranking, and query processing.

Getting Started

To run the crawler:

make crawler

To run the indexer:

make indexer

To run the query test:

make query-test

Components

Web Crawler

  • Collects documents starting from a set of seed URLs.
  • Follows crawling etiquette and crawls with multiple threads.
  • Collects 6000 pages for the project.
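
The crawl bookkeeping described above can be sketched as a thread-safe frontier that deduplicates URLs and stops accepting new ones at a page limit. This is a minimal illustration, not Searchly's actual crawler: the class name `CrawlFrontier`, its methods, and the normalization rules are assumptions, and fetching and etiquette checks are left out.

```java
import java.net.URI;
import java.util.Locale;
import java.util.Queue;
import java.util.Set;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ConcurrentLinkedQueue;
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical helper: a URL frontier that several crawler threads can share.
class CrawlFrontier {
    private final Queue<String> pending = new ConcurrentLinkedQueue<>();
    private final Set<String> seen = ConcurrentHashMap.newKeySet();
    private final AtomicInteger accepted = new AtomicInteger();
    private final int pageLimit;

    CrawlFrontier(int pageLimit) {
        this.pageLimit = pageLimit;
    }

    // Lowercases scheme and host, resolves "." and ".." segments, drops the fragment.
    static String normalize(String url) {
        URI uri = URI.create(url.trim()).normalize();
        String scheme = uri.getScheme() == null ? "http" : uri.getScheme().toLowerCase(Locale.ROOT);
        String host = uri.getHost() == null ? "" : uri.getHost().toLowerCase(Locale.ROOT);
        String port = uri.getPort() == -1 ? "" : ":" + uri.getPort();
        String path = (uri.getPath() == null || uri.getPath().isEmpty()) ? "/" : uri.getPath();
        String query = uri.getQuery() == null ? "" : "?" + uri.getQuery();
        return scheme + "://" + host + port + path + query;
    }

    // Returns true only if the URL is new and the page limit has not been reached.
    boolean offer(String url) {
        String key = normalize(url);
        if (!seen.add(key)) {
            return false; // already queued or crawled
        }
        while (true) {
            int count = accepted.get();
            if (count >= pageLimit) {
                return false; // limit reached, e.g. 6000 pages
            }
            if (accepted.compareAndSet(count, count + 1)) {
                break; // slot reserved atomically
            }
        }
        pending.add(key);
        return true;
    }

    // Next URL to fetch, or null when the queue is currently empty.
    String poll() {
        return pending.poll();
    }
}
```

Worker threads would call `offer` with the seed URLs and with every link they extract, then `poll` for the next page to fetch; a frontier built with `new CrawlFrontier(6000)` would match the page count stated above.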

Indexer

  • Indexes documents for fast retrieval.
  • Maintains index in secondary storage.
  • Supports incremental updates.
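
As a rough illustration of indexing with incremental updates, the sketch below keeps an inverted index from terms to per-document term frequencies and replaces a document's postings when it is re-indexed. The class `InvertedIndex` and its methods are hypothetical, and the index lives in memory here; writing it to secondary storage, as the indexer above does, is omitted.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashMap;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Set;

// Hypothetical in-memory inverted index: term -> (document id -> term frequency).
class InvertedIndex {
    private final Map<String, Map<String, Integer>> postings = new HashMap<>();
    private final Map<String, Set<String>> termsByDoc = new HashMap<>();

    // Splits on anything that is not a lowercase letter or digit.
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        for (String token : text.toLowerCase(Locale.ROOT).split("[^a-z0-9]+")) {
            if (!token.isEmpty()) {
                tokens.add(token);
            }
        }
        return tokens;
    }

    // Incremental update: drop the document's old postings, then add the new ones.
    void addOrUpdate(String docId, String text) {
        remove(docId);
        Set<String> terms = new HashSet<>();
        for (String term : tokenize(text)) {
            postings.computeIfAbsent(term, k -> new HashMap<>()).merge(docId, 1, Integer::sum);
            terms.add(term);
        }
        termsByDoc.put(docId, terms);
    }

    // Removes every posting of the document without touching other documents.
    void remove(String docId) {
        Set<String> terms = termsByDoc.remove(docId);
        if (terms == null) {
            return;
        }
        for (String term : terms) {
            Map<String, Integer> docs = postings.get(term);
            docs.remove(docId);
            if (docs.isEmpty()) {
                postings.remove(term);
            }
        }
    }

    // Documents containing the term, with how often it occurs in each.
    Map<String, Integer> lookup(String term) {
        return postings.getOrDefault(term.toLowerCase(Locale.ROOT), Collections.emptyMap());
    }
}
```

Tracking the terms of each document is what makes the update incremental: re-indexing one page touches only that page's postings instead of rebuilding the whole index.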

Query Processor

  • Receives and processes user queries.
  • Supports stem matching.
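
Stem matching can be pictured with a deliberately naive suffix stripper: two words match when they reduce to the same stem. This is only a sketch under that assumption. `NaiveStemmer` is a hypothetical class, its suffix list is made up for illustration, and it is much cruder than an established stemmer such as Porter's; the source does not say which stemmer Searchly uses.

```java
import java.util.Locale;

// Hypothetical, intentionally simple stemmer used only to illustrate stem matching.
class NaiveStemmer {
    // Longer suffixes come first so that "ing" is tried before "s".
    private static final String[] SUFFIXES = {"ing", "ed", "es", "s"};

    // Strips the first matching suffix, keeping a stem of at least three characters.
    static String stem(String word) {
        String lower = word.toLowerCase(Locale.ROOT);
        for (String suffix : SUFFIXES) {
            if (lower.endsWith(suffix) && lower.length() - suffix.length() >= 3) {
                return lower.substring(0, lower.length() - suffix.length());
            }
        }
        return lower;
    }

    // A query term matches a document term when both share the same stem.
    static boolean matches(String queryTerm, String documentTerm) {
        return stem(queryTerm).equals(stem(documentTerm));
    }
}
```

With this sketch, `NaiveStemmer.matches("crawling", "crawls")` holds because both words reduce to `crawl`. Applying the same `stem` function at index time and at query time is what keeps the two sides comparable.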

Phrase Searching

  • Supports phrase searching with quotation marks.
  • Maintains word order in results.
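
One common way to honor word order in a quoted phrase is to record the position of every word and require the phrase words to appear at consecutive positions. The sketch below assumes that approach; `PhraseMatcher` and its methods are hypothetical names, and it checks a single document's text rather than a stored index.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

// Hypothetical phrase matcher built on word positions within one document.
class PhraseMatcher {
    // Lowercases and splits on non-alphanumerics, which also drops the quotation marks.
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        for (String token : text.toLowerCase(Locale.ROOT).split("[^a-z0-9]+")) {
            if (!token.isEmpty()) {
                tokens.add(token);
            }
        }
        return tokens;
    }

    // Maps each word to the list of positions at which it occurs.
    static Map<String, List<Integer>> positions(String text) {
        Map<String, List<Integer>> result = new HashMap<>();
        List<String> tokens = tokenize(text);
        for (int i = 0; i < tokens.size(); i++) {
            result.computeIfAbsent(tokens.get(i), k -> new ArrayList<>()).add(i);
        }
        return result;
    }

    // True if the phrase words occur consecutively and in order in the document.
    static boolean containsPhrase(String documentText, String quotedPhrase) {
        List<String> words = tokenize(quotedPhrase);
        if (words.isEmpty()) {
            return false;
        }
        Map<String, List<Integer>> index = positions(documentText);
        List<Integer> starts = index.getOrDefault(words.get(0), Collections.emptyList());
        for (int start : starts) {
            boolean allFollow = true;
            for (int offset = 1; offset < words.size(); offset++) {
                List<Integer> candidates = index.get(words.get(offset));
                if (candidates == null || !candidates.contains(start + offset)) {
                    allFollow = false;
                    break;
                }
            }
            if (allFollow) {
                return true;
            }
        }
        return false;
    }
}
```

For example, `PhraseMatcher.containsPhrase("A basic search engine in Java", "\"search engine\"")` succeeds, while the same phrase does not match the text `engine for search`, because the words appear in the wrong order.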

Ranker

  • Ranks documents based on relevance and popularity.
  • Considers various relevance calculation methods.
  • Uses algorithms such as PageRank to measure popularity.
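
To make the popularity side concrete, here is a minimal PageRank sketch using power iteration over a small link graph. The class `PageRankSketch`, the adjacency-array input, and the conventional damping factor of 0.85 are assumptions for illustration; how Searchly combines popularity with relevance is not specified above.

```java
import java.util.Arrays;

// Hypothetical PageRank sketch: pages are numbered 0..n-1, outLinks[i] lists pages linked from i.
class PageRankSketch {
    static double[] pageRank(int[][] outLinks, double damping, int iterations) {
        int n = outLinks.length;
        double[] rank = new double[n];
        Arrays.fill(rank, 1.0 / n); // start from a uniform distribution

        for (int iteration = 0; iteration < iterations; iteration++) {
            double[] next = new double[n];
            double danglingMass = 0.0;

            for (int page = 0; page < n; page++) {
                if (outLinks[page].length == 0) {
                    // A page without links spreads its rank over all pages.
                    danglingMass += rank[page];
                    continue;
                }
                double share = rank[page] / outLinks[page].length;
                for (int target : outLinks[page]) {
                    next[target] += share;
                }
            }

            for (int page = 0; page < n; page++) {
                // Random jump term plus damped rank received through links.
                next[page] = (1.0 - damping) / n + damping * (next[page] + danglingMass / n);
            }
            rank = next;
        }
        return rank;
    }
}
```

For a three-page graph where pages 0 and 1 both link to page 2 and page 2 links back to page 0, `PageRankSketch.pageRank(new int[][] {{2}, {2}, {0}}, 0.85, 50)` gives page 2 the highest score and page 1, which has no incoming links, the lowest. The scores sum to 1, so they can be read as a probability distribution over pages.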
