Link to GitHub repo
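To develop this project I used the 20 Newsgroups dataset, which is openly available and can be loaded with sklearn's fetch_20newsgroups function. The dataset contains about 18,000 newsgroup posts spread over 20 topics, and those topics are the labels.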
- The raw text was converted into TF-IDF feature vectors using TfidfVectorizer.
- The classifier is a Multinomial Naive Bayes model.
- Model accuracy is 77%.
- Given an input text, the model predicts which newsgroup topic it belongs to.
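
The steps above can be sketched roughly as follows. This is a minimal sketch, not the code from the repo: it assumes the classifier is sklearn's `MultinomialNB`, that the standard train/test split of the dataset is used, and that English stop words are removed. The function names `build_classifier`, `train_on_20newsgroups` and `predict_label` are made up for illustration.

```python
from sklearn.datasets import fetch_20newsgroups
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import make_pipeline


def build_classifier():
    """TF-IDF features feeding a Multinomial Naive Bayes classifier."""
    # stop_words="english" is an assumption; the original settings are not stated.
    return make_pipeline(TfidfVectorizer(stop_words="english"), MultinomialNB())


def train_on_20newsgroups():
    """Fit on the standard train split and score on the standard test split.

    The first call downloads the dataset, so it needs network access.
    """
    train = fetch_20newsgroups(subset="train")
    test = fetch_20newsgroups(subset="test")
    model = build_classifier()
    model.fit(train.data, train.target)
    accuracy = model.score(test.data, test.target)
    # target_names maps the integer labels back to newsgroup names.
    return model, train.target_names, accuracy


def predict_label(model, target_names, text):
    """Return the newsgroup name predicted for a single input text."""
    predicted_index = model.predict([text])[0]
    return target_names[predicted_index]
```

Calling `train_on_20newsgroups()` returns the fitted model, the list of label names and the test accuracy; `predict_label(model, names, "some text")` then gives the predicted topic. The accuracy this sketch reaches depends on the vectorizer and split settings, so it will not necessarily match the 77% reported above.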