Information Retrieval & Data Mining Assignment 2014

The data set for Section A (Text Retrieval) of the Information Retrieval and Data Mining Assignment 2014 can be downloaded using the links below:

Information Retrieval Assignment Dataset Part 1 (1103) Information Retrieval Assignment Dataset Part 2 (1047)

Note: After unzipping the two files, place the fb396001fr94 and ft folders from Part 2  inside the Documents folder of Part 1.

 

The Panda Platform required for this assignment can be found on Github

The data and workspace for Section B (Distributed Computing) of the Information Retrieval and Data Mining Assignment 2014 can be downloaded using the link below:

MapReduce Assignment Workspace (1526)

Comments are closed.