研究目的
To propose an efficient way to detect sarcastic tweets on Twitter and study how to use this information to enhance the accuracy of sentiment analysis.
研究成果
The proposed pattern-based approach for detecting sarcasm on Twitter shows promising results, with an accuracy of 83.1% and precision of 91.1%. The study highlights the importance of pattern-based features in sarcasm detection and suggests that combining different sets of features can improve classification performance. Future work could explore using a larger training set to cover more sarcastic patterns and further enhance sentiment analysis.
研究不足
The approach's performance is dependent on the quality and size of the training set. The informal language and noise in tweets can affect the accuracy of part-of-speech tagging. The method may not cover all possible sarcastic patterns due to the limited size of the training set.
1:Experimental Design and Method Selection:
The approach involves extracting features from tweets to classify them as sarcastic or non-sarcastic using machine learning algorithms.
2:Sample Selection and Data Sources:
Tweets were collected using Twitter's streaming API, focusing on those containing the hashtag "#sarcasm" for sarcastic tweets and general tweets for non-sarcastic ones.
3:List of Experimental Equipment and Materials:
Tools used include Apache OpenNLP for NLP tasks, Gate Twitter part-of-speech tagger for tagging tweets, and Weka toolkit for classification.
4:Experimental Procedures and Operational Workflow:
Features were extracted from tweets, including sentiment-related, punctuation-related, syntactic and semantic features, and pattern-related features. These features were then used to train a model to classify tweets.
5:Data Analysis Methods:
The classification was performed using Random Forest, SVM, k-NN, and Maximum Entropy classifiers, with performance evaluated based on accuracy, precision, recall, and F1-score.
独家科研数据包,助您复现前沿成果,加速创新突破
获取完整内容