Wednesday, August 14, 2019

Replication of the Keyword Extraction part of the paper "Without the Clutter of Unimportant Words": Descriptive Keyphrases for Text Visualization

The paper is on vixra at: http://vixra.org/abs/1908.0422
and on arxiv at: https://arxiv.org/abs/1908.07818


The dataset and code associated with the replication can be found at: web.eecs.umich.edu/~lahiri/replication_of_keyword_extraction_part_of_the_paper_by_Chuang_etal_data_and_code.zip (note that a long time has passed since the implementation, and we only have a minimal README at this moment.)

This dataset is on Keyword Extraction (Keyphrase Extraction), based on the SemEval 2010 Task 5 (Keyphrase Extraction) Dataset: https://www.aclweb.org/anthology/S10-1004. We re-annotated the data (144 files) using Amazon Mechanical Turk (MTurk).

Keyword Extraction Dataset
Keyphrase Extraction Dataset
SemEval 2010 Dataset, re-annotated by Amazon Mechanical Turk. - MTurk