Misinformation Identification with LLM

Project Overview

This project identifies misinformation in textual data using Retrieval-Augmented Generation (RAG), combining large language models (LLMs) with a custom knowledge retrieval pipeline for accurate, context-aware misinformation detection. By leveraging external knowledge sources, the system enhances the model's ability to distinguish between truthful and misleading content.

Features

RAG-based Model: Combines the retrieval of external knowledge and generation capabilities of LLMs for enhanced misinformation detection.
Real-time Information Retrieval: Uses a custom-built information retrieval system to fetch relevant data for context-based analysis.
High Accuracy: Capable of identifying false claims with a high degree of accuracy by cross-checking statements against factual databases.

Installation

Prerequisites

Python 3.8+

Clone the repository

  git clone https://github.com/JimmyIITR/misInformationIdentificationUsingRAG.git
  cd misInformationIdentificationUsingRAG/usingLangChain

Install the required libraries using pip:
```
pip install -r requirements.txt
```

Usage

Running the Script

Prepare the Dataset: Ensure your dataset (textual data for analysis) is available in the data/ directory.
Run the Misinformation Detection:
```
streamlit run main.py
```

The script will process the input data, query the retrieval system, and use the LLM to generate a classification for misinformation.

Example Input:

"Climate change is a hoax created by scientists to get more funding."

Example Output:

"The statement has been flagged as misinformation based on contextual facts."

Model Architecture

The system is based on a Retrieval-Augmented Generation (RAG) architecture, which involves:

Information Retrieval: The first stage retrieves the most relevant documents or knowledge from a database to provide context.
Text Generation: The second stage uses an LLM to generate a classification or response, based on both the input text and retrieved context.

Documentation

For detailed documentation and the roadmap of our project, please visit the Wiki of this repository.

Video Demonstration

To view or download the video demonstration, click here 🎬

Facing any issues???

Feel free to open an issue. We are glad to help you. ❤️

License

This project is published under the MIT license.

Name	Name	Last commit message	Last commit date
Latest commit Jimmy5467 Update README.md Nov 29, 2024 7b575f5 · Nov 29, 2024 History 15 Commits
DataSet	DataSet	working project	Nov 20, 2024
usingLangChain	usingLangChain	working project	Nov 20, 2024
.DS_Store	.DS_Store	working project	Nov 20, 2024
.env	.env	working project	Nov 20, 2024
.gitignore	.gitignore	working	Nov 20, 2024
README.md	README.md	Update README.md	Nov 29, 2024
Web_scraping.ipynb	Web_scraping.ipynb	working project	Nov 20, 2024
checkAPIValidity.py	checkAPIValidity.py	working project	Nov 20, 2024
dataScrapping	dataScrapping	working project	Nov 20, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Misinformation Identification with LLM

Project Overview

Features

Installation

Prerequisites

Usage

Running the Script

Example Input:

Example Output:

Model Architecture

Documentation

Video Demonstration

Facing any issues???

License

About

Releases

Packages

Languages

Jimmy5467/misInformationIdentificationUsingRAG

Folders and files

Latest commit

History

Repository files navigation

Misinformation Identification with LLM

Project Overview

Features

Installation

Prerequisites

Usage

Running the Script

Example Input:

Example Output:

Model Architecture

Documentation

Video Demonstration

Facing any issues???

License

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages