5331 positive and 5331 negative processed sentences / snippets. Introduced in Pang/Lee ACL 2005. Released July 2005.
Usage
dataset_sentence_polarity(
dir = NULL,
delete = FALSE,
return_path = FALSE,
clean = FALSE,
manual_download = FALSE
)
Arguments
- dir
Character, path to directory where data will be stored. If
NULL
, user_cache_dir will be used to determine path.- delete
Logical, set
TRUE
to delete dataset.- return_path
Logical, set
TRUE
to return the path of the dataset.- clean
Logical, set
TRUE
to remove intermediate files. This can greatly reduce the size. Defaults to FALSE.- manual_download
Logical, set
TRUE
if you have manually downloaded the file and placed it in the folder designated by running this function withreturn_path = TRUE
.
Value
A tibble with 10,662 rows and 2 variables:
- text
Sentences or snippets
- sentiment
Indicator for sentiment, "neg" for negative and "pos" for positive
Details
Citation info:
This data was first used in Bo Pang and Lillian Lee, ``Seeing stars: Exploiting class relationships for sentiment categorization with respect to rating scales.'', Proceedings of the ACL, 2005.
InProceedings{pang05,
author = {Bo Pang and Lillian Lee},
title = {Seeing stars: Exploiting class relationships for sentiment
categorization with respect to rating scales},
booktitle = {Proceedings of the ACL},
year = 2005
}