COMP9313 Pyspark Project 3
fFinding Similar News Article Headlines Using Pyspark In this problem, we are still going to use the dataset of Australian news from ABC. Similar news may appear in different years. Your task is to find all similar news article headline pairs across different years. Background: Set similarity self-join Given a collection of records R, a […]
COMP9313 Pyspark Project 3 Read More »