Searchly

Searchly is a basic search engine, written in Java, that demonstrates web crawling, indexing, ranking, and query processing.

Getting Started

To run the crawler:

make crawler

To run the indexer:

make indexer

To run the query test:

make query-test

Components

Web Crawler

Collects documents starting from a set of seed URLs.
Follows crawling etiquette and crawls with multiple threads.
Collects 6,000 pages for the project.
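
The points above can be illustrated with a small, thread-safe URL frontier. This is a minimal sketch and not the project's actual crawler: the class name CrawlFrontier, its methods, and the normalization rules are assumptions made for illustration. It shows how several crawler threads could share one queue, skip URLs they have already seen, and stop accepting new URLs once a page budget is reached.

```java
import java.net.URI;
import java.net.URISyntaxException;
import java.util.Locale;
import java.util.Set;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ConcurrentLinkedQueue;
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical sketch: a URL frontier that is safe to share between crawler threads.
class CrawlFrontier {
    private final Set<String> seen = ConcurrentHashMap.newKeySet();
    private final ConcurrentLinkedQueue<String> queue = new ConcurrentLinkedQueue<>();
    private final AtomicInteger accepted = new AtomicInteger();
    private final int maxPages;

    CrawlFrontier(int maxPages) {
        this.maxPages = maxPages;
    }

    // Normalizes a URL so that trivially different spellings map to one key.
    // Returns null for input that is not an absolute URL with a host.
    static String normalize(String url) {
        try {
            URI u = new URI(url.trim()).normalize();
            if (u.getScheme() == null || u.getHost() == null) {
                return null;
            }
            String scheme = u.getScheme().toLowerCase(Locale.ROOT);
            String host = u.getHost().toLowerCase(Locale.ROOT);
            String port = u.getPort() == -1 ? "" : ":" + u.getPort();
            String path = (u.getPath() == null || u.getPath().isEmpty()) ? "/" : u.getPath();
            String query = u.getQuery() == null ? "" : "?" + u.getQuery();
            return scheme + "://" + host + port + path + query; // the fragment is dropped
        } catch (URISyntaxException e) {
            return null;
        }
    }

    // Queues the URL if it is valid, not seen before, and the page budget allows it.
    boolean offer(String url) {
        String key = normalize(url);
        if (key == null || !seen.add(key)) {
            return false;
        }
        if (accepted.incrementAndGet() > maxPages) {
            return false;
        }
        queue.add(key);
        return true;
    }

    // Returns the next URL to fetch, or null if the queue is currently empty.
    String poll() {
        return queue.poll();
    }
}
```

Under these assumptions, a crawler would create the frontier with new CrawlFrontier(6000), offer the seed URLs, and let each worker thread loop on poll(), offering every link it extracts. Fetching pages and any politeness checks are deliberately left out of the sketch.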

Indexer

Indexes documents for fast retrieval.
Maintains the index in secondary storage.
Supports incremental updates.
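
One way to picture an index that supports incremental updates is an inverted index that remembers which terms each document contributed, so that re-indexing a document first retracts its old postings. The sketch below is an assumption-laden illustration, not the project's indexer: the class InvertedIndex, its tokenizer, and its method names are hypothetical, and it keeps everything in memory rather than in secondary storage.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashMap;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Set;

// Hypothetical sketch: an in-memory inverted index with per-document updates.
class InvertedIndex {
    // term -> (document id -> term frequency)
    private final Map<String, Map<String, Integer>> postings = new HashMap<>();
    // document id -> terms currently indexed for it, used to retract old postings
    private final Map<String, Set<String>> docTerms = new HashMap<>();

    // Splits text into lowercase alphanumeric tokens.
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        for (String t : text.toLowerCase(Locale.ROOT).split("[^a-z0-9]+")) {
            if (!t.isEmpty()) {
                tokens.add(t);
            }
        }
        return tokens;
    }

    // Indexes a new document, or replaces the postings of an existing one.
    void addOrUpdate(String docId, String text) {
        remove(docId);
        Set<String> terms = new HashSet<>();
        for (String t : tokenize(text)) {
            postings.computeIfAbsent(t, k -> new HashMap<>()).merge(docId, 1, Integer::sum);
            terms.add(t);
        }
        docTerms.put(docId, terms);
    }

    // Removes every posting that belongs to the given document.
    void remove(String docId) {
        Set<String> terms = docTerms.remove(docId);
        if (terms == null) {
            return;
        }
        for (String t : terms) {
            Map<String, Integer> perDoc = postings.get(t);
            perDoc.remove(docId);
            if (perDoc.isEmpty()) {
                postings.remove(t);
            }
        }
    }

    // Returns document id -> frequency for a term, or an empty map if absent.
    Map<String, Integer> lookup(String term) {
        return postings.getOrDefault(term.toLowerCase(Locale.ROOT), Collections.emptyMap());
    }
}
```

Because addOrUpdate only touches the postings of the document being changed, a re-crawled page can be re-indexed without rebuilding the whole index. Persisting the two maps to disk is left out of this sketch.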

Query Processor

Receives and processes user queries.
Supports stem matching.
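
Stem matching means that a query word also matches other forms of the same word, for example "crawling" and "crawls". The sketch below illustrates the idea with a deliberately naive suffix stripper. It is not the Porter stemmer and not necessarily what the project uses; the class StemMatcher and its suffix list are assumptions for illustration only.

```java
import java.util.Locale;

// Hypothetical sketch: naive suffix stripping to compare words by their stems.
class StemMatcher {
    // Checked in order, so longer suffixes are tried before the bare "s".
    private static final String[] SUFFIXES = {"ing", "ed", "es", "s"};

    // Lowercases the word and strips at most one known suffix,
    // keeping at least three characters of the word.
    static String stem(String word) {
        String w = word.toLowerCase(Locale.ROOT);
        for (String suffix : SUFFIXES) {
            if (w.length() > suffix.length() + 2 && w.endsWith(suffix)) {
                return w.substring(0, w.length() - suffix.length());
            }
        }
        return w;
    }

    // Two words match when they reduce to the same stem.
    static boolean matches(String queryWord, String documentWord) {
        return stem(queryWord).equals(stem(documentWord));
    }
}
```

With this sketch, StemMatcher.matches("crawling", "crawls") is true because both words reduce to "crawl". A rule-based stemmer of this kind misses many cases (doubled consonants and irregular forms, for instance), so a full stemming algorithm is the better choice in practice.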

Phrase Searching

Supports phrase searching: a query enclosed in quotation marks is matched as a phrase.
Returns only results that contain the words in the same order as the query.
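
Phrase searching is commonly built on a positional index, which records where each word occurs in a document so that word order can be checked. The following is a minimal sketch of that technique under stated assumptions: the class PhraseSearch, its tokenizer, and its method names are hypothetical and are not taken from the project.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

// Hypothetical sketch: exact phrase matching with a positional index.
class PhraseSearch {
    // document id -> (term -> positions of that term in the document)
    private final Map<String, Map<String, List<Integer>>> index = new LinkedHashMap<>();

    // Splits text into lowercase alphanumeric tokens; quotation marks are dropped.
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        for (String t : text.toLowerCase(Locale.ROOT).split("[^a-z0-9]+")) {
            if (!t.isEmpty()) {
                tokens.add(t);
            }
        }
        return tokens;
    }

    // Records the position of every token of the document.
    void add(String docId, String text) {
        Map<String, List<Integer>> perTerm = new HashMap<>();
        List<String> tokens = tokenize(text);
        for (int i = 0; i < tokens.size(); i++) {
            perTerm.computeIfAbsent(tokens.get(i), k -> new ArrayList<>()).add(i);
        }
        index.put(docId, perTerm);
    }

    // Returns the ids of documents that contain the phrase words consecutively, in order.
    List<String> search(String quotedPhrase) {
        List<String> words = tokenize(quotedPhrase);
        List<String> hits = new ArrayList<>();
        if (words.isEmpty()) {
            return hits;
        }
        for (Map.Entry<String, Map<String, List<Integer>>> doc : index.entrySet()) {
            if (containsPhrase(doc.getValue(), words)) {
                hits.add(doc.getKey());
            }
        }
        return hits;
    }

    // Checks whether word k of the phrase occurs at position start + k for some start.
    private static boolean containsPhrase(Map<String, List<Integer>> perTerm, List<String> words) {
        List<Integer> starts = perTerm.get(words.get(0));
        if (starts == null) {
            return false;
        }
        for (int start : starts) {
            boolean ok = true;
            for (int k = 1; k < words.size() && ok; k++) {
                List<Integer> positions = perTerm.get(words.get(k));
                ok = positions != null && positions.contains(start + k);
            }
            if (ok) {
                return true;
            }
        }
        return false;
    }
}
```

In this sketch, a document containing "a basic search engine" matches the query "search engine" in quotation marks but not "engine search", because the second word must appear exactly one position after the first.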

Ranker

Ranks documents based on relevance and popularity.
Considers various relevance calculation methods.
Uses algorithms such as PageRank to measure popularity.
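
As an illustration of the popularity side, PageRank can be computed by power iteration over the link graph. The sketch below is a generic version of that algorithm, not the project's ranker: the class PageRankSketch, the adjacency-array input format, and the commonly used damping factor of 0.85 are assumptions. How the popularity score is combined with relevance is not specified here and is left out.

```java
import java.util.Arrays;

// Hypothetical sketch: PageRank by power iteration.
class PageRankSketch {
    // outLinks[i] lists the pages that page i links to.
    // Returns one score per page; the scores sum to 1.
    static double[] pageRank(int[][] outLinks, double damping, int iterations) {
        int n = outLinks.length;
        double[] rank = new double[n];
        Arrays.fill(rank, 1.0 / n);

        for (int it = 0; it < iterations; it++) {
            double[] next = new double[n];
            double dangling = 0.0; // rank held by pages without outgoing links

            for (int i = 0; i < n; i++) {
                if (outLinks[i].length == 0) {
                    dangling += rank[i];
                    continue;
                }
                // Each page splits its rank evenly among the pages it links to.
                double share = rank[i] / outLinks[i].length;
                for (int target : outLinks[i]) {
                    next[target] += share;
                }
            }

            for (int j = 0; j < n; j++) {
                // Random jump term, plus damped link rank and an even share of dangling rank.
                next[j] = (1.0 - damping) / n + damping * (next[j] + dangling / n);
            }
            rank = next;
        }
        return rank;
    }
}
```

For a three-page graph where page 0 links to pages 1 and 2, page 1 links to page 2, and page 2 links to page 0, calling pageRank(new int[][] {{1, 2}, {2}, {0}}, 0.85, 50) ranks page 2 highest, since it receives links from both other pages.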
