Search Engine
Description
This project involves the development of a search engine that builds an index from stemmed text files and web pages. It utilizes efficient data structures for search, incorporates a multithreaded architecture to improve performance, and includes a front-end interface for user interaction.
Tech Stack & Tools
Java, Jetty, Terminal CSS
key features
• Inverted Indexing
• Stemming Algorithm
• Exact and Partial Search Support
• Query File Processing
• Multithreaded Crawling
• Custom Thread Pool
• Web Crawling via Seed URL
• Crawl Limit Control
• Command-Line Interface with Modular Flags
• JSON Output for Index, Counts, and Results
special features
• "Feeling Lucky... Punk" Button
• Retro Terminal UI Styling
demo
Source Code
Back to Top