Please cite us if you find our paper and the data helpful:

Title: Citation Worthiness of Sentences in Scientific Reports
Authors: Hamed Bonab, Hamed Zamani, Erik Learned-Miller, James Allan
Contact email: bonab@cs.umass.edu

The paper was published at SIGIR 2018.

===== Bibtex =======================
@inproceedings{bonab:citationsigir2018,
  author    = {Bonab, Hamed and Zamani, Hamed and Learned-Miller, Erik and Allan, James},
  title     = {Citation worthiness of sentences in scientific reports},
  booktitle = {Proceedings of the 41st International ACM SIGIR Conference on Research and Development in Information Retrieval},
  series    = {SIGIR '18},
  year      = {2018},
  isbn      = {978-1-4503-5657-2},
  url       = {https://doi.org/10.1145/3209978.3210162},
  doi       = {10.1145/3209978.3210162},
  publisher = {ACM}
}
===================================

This data was prepared from the SEPID corpus (http://pars.ie/lr/sepid-corpus).
All sentence IDs are consistent with the SEPID corpus, so the data can be linked back to it for further research.