This project focuses on dialogue summarization with generative AI, with particular emphasis on improving model performance through prompt engineering. It also explores several fine-tuning approaches, including full fine-tuning, Parameter Efficient Fine-Tuning (PEFT), and reinforcement learning with Proximal Policy Optimization (PPO) for generating less-toxic summaries. The main goal is to build efficient, effective dialogue summarization models while mitigating toxic content in their outputs.
This part involves experimenting with generative AI models for dialogue summarization both with and without prompt engineering. The objective is to understand how prompt engineering can enhance the performance of the models in generating accurate summaries.
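For illustration, a minimal sketch of this comparison with FLAN-T5 is shown below; the prompt template is an assumption, not the exact wording used in the notebooks:

```python
# Minimal sketch: raw input vs. instruction prompting with FLAN-T5.
# The prompt template here is illustrative, not the project's exact wording.
from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

model_name = "google/flan-t5-base"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSeq2SeqLM.from_pretrained(model_name)

dialogue = (
    "#Person1#: Have you finished the report?\n"
    "#Person2#: Almost. I'll send it to you tonight."
)

# Without prompt engineering: feed the raw dialogue to the model.
raw_ids = tokenizer(dialogue, return_tensors="pt").input_ids

# With prompt engineering: wrap the dialogue in an explicit instruction.
prompt = f"Summarize the following conversation.\n\n{dialogue}\n\nSummary:"
prompt_ids = tokenizer(prompt, return_tensors="pt").input_ids

for ids in (raw_ids, prompt_ids):
    output = model.generate(ids, max_new_tokens=50)
    print(tokenizer.decode(output[0], skip_special_tokens=True))
```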
Here, we fine-tune the FLAN-T5 model for dialogue summarization. We explore both full fine-tuning and PEFT, using LoRA (Low-Rank Adaptation) for efficient parameter updates. The goal is to optimize model performance while minimizing computational cost.
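A sketch of the LoRA setup with the peft library is shown below; the rank, alpha, and target modules are illustrative assumptions rather than the project's exact configuration:

```python
# Minimal sketch: attaching a LoRA adapter to FLAN-T5 via peft.
# r, lora_alpha, lora_dropout, and target_modules are assumed values.
from peft import LoraConfig, TaskType, get_peft_model
from transformers import AutoModelForSeq2SeqLM

base_model = AutoModelForSeq2SeqLM.from_pretrained("google/flan-t5-base")

lora_config = LoraConfig(
    task_type=TaskType.SEQ_2_SEQ_LM,   # sequence-to-sequence summarization
    r=32,                              # rank of the low-rank update matrices
    lora_alpha=32,                     # scaling factor for the adapter weights
    lora_dropout=0.05,
    target_modules=["q", "v"],         # T5 attention query/value projections
)

peft_model = get_peft_model(base_model, lora_config)
peft_model.print_trainable_parameters()  # only adapter weights are trainable
```

Training then proceeds as usual, but gradients flow only through the small adapter matrices, which is what keeps the memory footprint low.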
In this part, we fine-tune the model with reinforcement learning, specifically Proximal Policy Optimization (PPO). We combine PEFT with Meta AI's RoBERTa-based hate speech model, which serves as the reward model, to generate summaries with reduced toxic content. The objective is to develop models that not only summarize dialogues effectively but also keep harmful language out of their outputs.
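A condensed sketch of a single PPO update with trl is shown below. Using the classifier's "nothate" logit as the reward, along with the hyperparameter values, is an assumption about the reward shaping:

```python
# Condensed sketch of one PPO detoxification step with trl.
# Reward shaping (the "nothate" logit) and hyperparameters are assumptions.
import torch
from transformers import AutoModelForSequenceClassification, AutoTokenizer
from trl import (AutoModelForSeq2SeqLMWithValueHead, PPOConfig, PPOTrainer,
                 create_reference_model)

model_name = "google/flan-t5-base"
tokenizer = AutoTokenizer.from_pretrained(model_name)
ppo_model = AutoModelForSeq2SeqLMWithValueHead.from_pretrained(model_name)
ref_model = create_reference_model(ppo_model)  # frozen copy for the KL penalty

# Reward model: Meta AI's RoBERTa-based hate speech classifier.
toxicity_name = "facebook/roberta-hate-speech-dynabench-r4-target"
toxicity_tokenizer = AutoTokenizer.from_pretrained(toxicity_name)
toxicity_model = AutoModelForSequenceClassification.from_pretrained(toxicity_name)

config = PPOConfig(model_name=model_name, learning_rate=1.41e-5,
                   batch_size=1, mini_batch_size=1)
ppo_trainer = PPOTrainer(config=config, model=ppo_model,
                         ref_model=ref_model, tokenizer=tokenizer)

prompt = ("Summarize the following conversation.\n\n"
          "#Person1#: The service here is terrible!\n\nSummary:")
query = tokenizer(prompt, return_tensors="pt").input_ids[0]
response = ppo_trainer.generate(query, max_new_tokens=40).squeeze()

# Reward: the "nothate" logit for the generated summary (label index 0
# in this classifier); higher means less toxic.
summary_text = tokenizer.decode(response, skip_special_tokens=True)
with torch.no_grad():
    logits = toxicity_model(
        **toxicity_tokenizer(summary_text, return_tensors="pt")).logits
reward = logits[0, 0]

# One PPO optimization step over this (query, response, reward) triple.
stats = ppo_trainer.step([query], [response], [reward])
```

In practice this runs in a loop over batches of dialogues, and the KL penalty against the frozen reference model keeps the policy from drifting into degenerate summaries that merely maximize the reward.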
- trl: Transformer Reinforcement Learning library, used for PPO training
- torch==1.13.1
- torchdata==0.5.1
- transformers==4.27.2
- datasets==2.11.0
- evaluate==0.4.0
- rouge_score==0.1.2
- loralib==0.1.1
- peft==0.3.0
- Description: DialogSum is a large-scale dialogue summarization dataset comprising 13,460 dialogues with manually labeled summaries and topics, plus 100 holdout examples for topic generation. A minimal loading sketch follows the field list below.
- Data Fields:
- dialogue: Text of the dialogue.
- summary: Human-written summary of the dialogue.
- topic: Human-written topic/one-liner for the dialogue.
- id: Unique file ID of the example.
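As a sketch, the dataset can be pulled from the Hugging Face Hub; the knkarthick/dialogsum dataset ID is an assumption about which hosted copy is used:

```python
# Minimal sketch: loading DialogSum from the Hugging Face Hub.
# "knkarthick/dialogsum" is one hosted copy; the exact ID used is an assumption.
from datasets import load_dataset

dataset = load_dataset("knkarthick/dialogsum")
example = dataset["train"][0]
print(example["dialogue"][:200])  # raw multi-turn dialogue text
print(example["summary"])         # human-written reference summary
print(example["topic"])           # one-line topic label
```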
- Model: google/flan-t5-base (base model for summarization and fine-tuning)
- Model: facebook/roberta-hate-speech-dynabench-r4-target (reward model for detoxification; see the toxicity-scoring sketch below)
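This classifier can also be reached through the evaluate library's toxicity measurement, which wraps the same model for scoring; a sketch:

```python
# Sketch: scoring generated summaries for toxicity with evaluate.
# The toxicity measurement wraps the RoBERTa hate speech classifier above.
import evaluate

toxicity = evaluate.load(
    "toxicity",
    "facebook/roberta-hate-speech-dynabench-r4-target",
    module_type="measurement",
)

scores = toxicity.compute(
    predictions=["The meeting was rescheduled to Friday."])
print(scores["toxicity"])  # per-text probability of the "hate" class
```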