Abstract: As the web continues to grow rapidly, efficient crawling and accurate page relevance computation have become crucial for information retrieval systems. Page relevance computation aims to determine how relevant a web page or document is to a given query or user context. This paper explores techniques for assessing page relevance, including link-based, content-based, and hybrid methods. It also discusses the challenges of relevance computation, such as the dynamic nature of web content, scalability, and the presence of low-quality pages.
DOI: 10.17148/IARJSET.2020.7121