SentimentAnalysis

所属分类:自然语言处理
开发工具:Java
文件大小:13055KB
下载次数:0
上传日期:2014-07-05 01:39:23
上 传 者sh-1993
说明:  从新闻文章中察觉情绪
(To detect sentiment in the news articles)

文件列表:
Dataset (0, 2014-07-05)
Dataset\allemotionsheadlinecontent.txt (32512828, 2014-07-05)
Final Project Report.pdf (327115, 2014-07-05)
Matlab Codes (0, 2014-07-05)
Matlab Codes\KNN.m (2469, 2014-07-05)
Matlab Codes\SVM_withoutCV.m (586, 2014-07-05)
Matlab Codes\SVMwithCV.m (1303, 2014-07-05)
Matlab Codes\classifySVM.m (338, 2014-07-05)
Matlab Codes\testSVM.m (432, 2014-07-05)
Matlab Codes\trainSVM.m (408, 2014-07-05)
baseline accuracy calculator (0, 2014-07-05)
baseline accuracy calculator\baseline.java (6591, 2014-07-05)
feature vector generator (0, 2014-07-05)
feature vector generator\filereader.java (5770, 2014-07-05)
feature vector generator\idftf svm.txt (10915, 2014-07-05)
feature vector generator\tfidf.java (7713, 2014-07-05)
lexical chain (0, 2014-07-05)
lexical chain\WordNetDemo.java (15240, 2014-07-05)
neural network (0, 2014-07-05)
neural network\NN.m (1421, 2014-07-05)
perceptron (0, 2014-07-05)
perceptron\perceptron.py (3876, 2014-07-05)
web crawler (0, 2014-07-05)
web crawler\Main.java (4452, 2014-07-05)

SentimentAnalysis ================== To detect sentiment in the news articles The project aims as classifying news article based on their sentiments. We have taken multiple categories into account. For example, sad, scary, stupid, wierd etc. We take data from fark.com and train our model to predict the sentiments which are invoked in the corresponding article. It will be beneficial to read the paper attached with the codes. It will help you understand the model. The repositories can be explained as: Dataset: It contains a sample dataset which we took for training our model Matlab: This folder includes the matlab codes used. It has codes for KNearest Neighbor, SVM with cross validation Lexical Chain: It has the code for lexical chain analysis Neural Network: We tried employing neural network with unsuccesful attempt. Will update if any progress made in future Baseline Accuracy calculator: It is used to calculate baseline. The description of our baseline is present in the paper Feature Vector Generator: This code takes the text and converts it into a feature vector. We have used bag or words i.e. unigram, bigram, trigram, we also used TF-IDF and a few more Perceptron: It contains the perceptron code which we used to analyze two emotion categories at a time Web Crawler: This is the code used to extract data from Fark.com:

近期下载者

相关文件


收藏者