Detection-of-Real-or-Fake-News

PROBLEM STATEMENT

Build a model that accurately classifies a piece of news as REAL or FAKE.

Using sklearn, build a TfidfVectorizer on the provided dataset. Then initialize a PassiveAggressiveClassifier and fit the model. Finally, the accuracy score and the confusion matrix show how well the model performs.
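
The steps above can be sketched roughly as follows. This is a minimal illustration, not the code from the notebook or train.py: the function name train_and_evaluate, the 80/20 split, and the settings stop_words="english", max_df=0.7 and max_iter=50 are all assumptions chosen for the example.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import PassiveAggressiveClassifier
from sklearn.metrics import accuracy_score, confusion_matrix
from sklearn.model_selection import train_test_split


def train_and_evaluate(texts, labels, test_size=0.2, seed=7):
    """Fit TF-IDF + PassiveAggressiveClassifier and score it on a held-out split.

    Hypothetical helper: the split ratio and hyperparameters are assumptions.
    """
    x_train, x_test, y_train, y_test = train_test_split(
        texts, labels, test_size=test_size, random_state=seed, stratify=labels
    )

    # Learn the vocabulary and IDF weights on the training texts only.
    vectorizer = TfidfVectorizer(stop_words="english", max_df=0.7)
    tfidf_train = vectorizer.fit_transform(x_train)
    tfidf_test = vectorizer.transform(x_test)

    model = PassiveAggressiveClassifier(max_iter=50, random_state=seed)
    model.fit(tfidf_train, y_train)

    predictions = model.predict(tfidf_test)
    accuracy = accuracy_score(y_test, predictions)
    # Rows are true labels, columns are predicted labels, in the order FAKE, REAL.
    matrix = confusion_matrix(y_test, predictions, labels=["FAKE", "REAL"])
    return vectorizer, model, accuracy, matrix
```

To try it on news.csv, load the file (for example with pandas) and pass the article text and label columns. The column names depend on the dataset and are not stated here.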

Description of all files:

news.zip: Unzip this archive to get news.csv
news.csv: Dataset containing fake and real news
Real_or_Fake_News.ipynb: Jupyter Notebook containing the full explanation and my working
train.py: Run this file to train the model automatically and save the vocabulary and the model.pkl file for later use
Needs to be run only once
Takes the dataset file name as a command-line argument
Usage:

python train.py news.csv
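
As a rough sketch of what a script like this does around the training step, the snippet below reads the dataset path from the command line and pickles the fitted objects. It is not the actual train.py: the helper names parse_dataset_path and save_artifacts are hypothetical, and whether the real script pickles the whole vectorizer or only its vocabulary is an assumption.

```python
import pickle
import sys
from pathlib import Path


def parse_dataset_path(argv):
    """Return the dataset file name given as the first command-line argument."""
    if len(argv) != 2:
        raise SystemExit("usage: python train.py <dataset.csv>")
    return argv[1]


def save_artifacts(vectorizer, model, out_dir="."):
    """Pickle the fitted vectorizer and model so prediction can skip retraining.

    File names follow the repository (tfidf1.pkl, model.pkl); what exactly
    is stored in them is an assumption of this sketch.
    """
    out = Path(out_dir)
    with open(out / "tfidf1.pkl", "wb") as fh:
        pickle.dump(vectorizer, fh)
    with open(out / "model.pkl", "wb") as fh:
        pickle.dump(model, fh)
    return out / "tfidf1.pkl", out / "model.pkl"


if __name__ == "__main__" and len(sys.argv) > 1:
    print("Training on", parse_dataset_path(sys.argv))
```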

predict.py: Run this file as often as you like. It uses the saved model files, so it executes much faster.
P.S. The longer the text, the better the chance of an accurate prediction.

python predict.py
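
Prediction from saved files could look something like the sketch below. It assumes the pickles hold a fitted vectorizer with a transform method and a fitted classifier with a predict method. The helper names load_artifacts and predict_label are hypothetical and are not taken from predict.py.

```python
import pickle
from pathlib import Path


def load_artifacts(model_dir="."):
    """Load the pickled vectorizer and model written by the training step."""
    base = Path(model_dir)
    with open(base / "tfidf1.pkl", "rb") as fh:
        vectorizer = pickle.load(fh)
    with open(base / "model.pkl", "rb") as fh:
        model = pickle.load(fh)
    return vectorizer, model


def predict_label(text, vectorizer, model):
    """Vectorize one piece of text and return the predicted label."""
    features = vectorizer.transform([text])  # transform expects a list of documents
    return model.predict(features)[0]
```

Only load pickle files you created yourself, since unpickling untrusted data can execute arbitrary code.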

Disclaimer: I do not guarantee that text containing sensitive real-life words will not be classified as fake. The model is trained solely on the dataset. No offense is intended.

rishabh-karmakar/Detection-of-Real-or-Fake-News