ML-Programming-Final-Project

所属分类:人工智能/神经网络/深度学习
开发工具:Jupyter Notebook
文件大小:0KB
下载次数:0
上传日期:2023-08-24 02:09:46
上 传 者sh-1993
说明:  ML编程最终项目,,
(ML-Programming-Final-Project,,)

文件列表:
Final_project_report.ipynb (33401, 2023-08-23)
data/ (0, 2023-08-23)
data/imdb_reviews.csv (61526358, 2023-08-23)
hahnn/ (0, 2023-08-23)
hahnn/cnn-hierarchical-network-for-document-classification/ (0, 2023-08-23)
hahnn/cnn-hierarchical-network-for-document-classification/Hierarchical Attentional Hybrid Neural Networks.pdf (616733, 2023-08-23)
hahnn/cnn-hierarchical-network-for-document-classification/hahnn-for-document-classification.ipynb (224429, 2023-08-23)
hahnn/cnn-hierarchical-network-for-document-classification/imdb_reviews.csv (61526358, 2023-08-23)
hahnn/cnn-hierarchical-network-for-document-classification/make_predictions.ipynb (28051, 2023-08-23)
hahnn/cnn-hierarchical-network-for-document-classification/requirements.txt (6232, 2023-08-23)
hahnn/cnn-hierarchical-network-for-document-classification/track_colab.PNG (29509, 2023-08-23)
hahnn/cnn-hierarchical-network-for-document-classification/tsne.png (112419, 2023-08-23)
src/ (0, 2023-08-23)
src/Movie_Reviews_XLNet.ipynb (200387, 2023-08-23)
src/imdb_reviews.csv (61235312, 2023-08-23)

# ML-Programming-Final-Project Hierarchical Attentional Hybrid NeuralNetworks and XLNet for document classification. ********************************************************************************************************************************************************************************* ## Paper Title and Link : HAHNN: Hierarchical Attentional Hybrid NeuralNetworks for document classification github link : [https://github.com/luisfredgs/cnn-hierarchical-network-for-document-classification](https://github.com/rubaramanan/ML-Programming-Final-Project/blob/master/link) ## Description of Paper ********************************************************************************************************************************************************************************* The paper proposes a novel approach combining convolutional neural networks, gated recurrent units, and attention mechanisms for document classification. The key contribution lies in utilizing convolutional layers to extract meaningful and abstract features through hierarchical representation. ## Context of the Problem ******************************************************************************************************************************************************************************** * The continually increasing number of complex datasets each year necessitates ever improving machine learning methods for robust and accurate categorization of these data. * General documentation classification models not take account of the context. * Generally, deep learning models needs lots of computational power to do the mathematical functions * Users need to have high configuration hardware resources for train a model from scratch to this kind of big size data * So, They proposed an hierarchical attention model based approach for language tasks instead of RNN. ## Implementation ********************************************************************************************************************************************************************************* * Setup the python environment for project * create python environmtn * install packages ```pip pip install -r hahnn\cnn-hierarchical-network-for-document-classification\requirements.txt ``` * Try to replicate the results given in paper on document classification datasets with HAHNN model. [a link](https://github.com/rubaramanan/ML-Programming-Final-Project/blob/master/https://github.com/rubaramanan/ML-Programming-Final-Project/blob/main/hahnn/cnn-hierarchical-network-for-document-classification/hahnn-for-document-classification.ipynb) * choose imdb datset as standard dataset. * To compare the performance of HAHNN model, trained XLNet model on the above-mentioned dataset. [a link](https://github.com/rubaramanan/ML-Programming-Final-Project/blob/master/https://github.com/rubaramanan/ML-Programming-Final-Project/blob/main/src/Movie_Reviews_XLNet.ipynb) ## Results and Conclusion ********************************************************************************************************************************************************************************* For Results each dataset is trained with HAHNN and XLNet model and comparison is present in the form of accuracy, f1-macro and f1-weighted. HAHNN paper only shows accuarcy in their results however I feel f1-score should be a better metric to assess the performance of document classification tasks. * Both the models perform equally well * XLNet model performs well however the execution time was 6+ hours with four epoches, if epoches 8, 13+ hours. |Models|Accuracy| |------|--------| |HAHNN|0.98| |XLNet|0.92|

近期下载者

相关文件


收藏者