This repo contains code and instructions for reproducing the experiments in the paper "Learning to Refine with Fine-Grained Natural Language Feedback". We propose a new method, Detect, Critique and Refine (DCR), for post-hoc editing of document-grounded summaries to make them more factual.
To run end-to-end editing with DCR, use the following Python snippet:
```python
from run_end_to_end_refinement.dcr import DCR

document_instruction = ''  # source document together with the summarization instruction
initial_response = ''      # initial summary to be refined
model = "llama3-ft"        # critique and refinement model: can be any HF model or GPT-4

dcr = DCR(cuda_id=0, model_name=model, path_to_minicheck="/home/mwadhwa/code/MiniCheck/", cache_dir="/data/users/mwadhwa/")
refinement = dcr.refine(source_text=document_instruction, initial_response=initial_response)
print(refinement)
```
Our fine-tuned feedback and refinement models are available on HuggingFace 🤗:
- Critique Model: Llama2-7b-Chat Fine-Tuned / Llama3-8b-Instruct Fine-Tuned
- Refinement Model: Llama2-7b-Chat Fine-Tuned / Llama3-8b-Instruct Fine-Tuned
The fine-tuning data distilled from GPT-4 is available on HuggingFace: https://huggingface.co/datasets/wadhma/dcr_data
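As a sketch of how these artifacts can be loaded, the snippet below pulls the fine-tuning data with the `datasets` library and a fine-tuned checkpoint with `transformers`. The dataset ID comes from the link above; the model repo ID is a placeholder, so substitute the actual critique or refinement model linked above.

```python
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer

# GPT-4-distilled fine-tuning data (critique + refinement examples)
dcr_data = load_dataset("wadhma/dcr_data")
print(dcr_data)  # inspect the available splits and columns

# Placeholder repo ID -- replace with the critique or refinement model linked above.
model_id = "<critique-or-refinement-model-id>"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")
```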
You need to set up the following:
- pip install -r requirements.txt
- Set up MiniCheck here (see the sanity-check sketch below)
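A minimal sanity check for the MiniCheck setup, assuming MiniCheck's published `MiniCheck` scorer interface and the same local clone path passed to `DCR` above (adjust both to your environment):

```python
import sys

# Make the cloned MiniCheck repo importable (same path as path_to_minicheck above).
sys.path.insert(0, "/home/mwadhwa/code/MiniCheck/")

from minicheck.minicheck import MiniCheck  # assumed to match the public MiniCheck repo

scorer = MiniCheck(model_name="flan-t5-large", cache_dir="./ckpts")
labels, probs, _, _ = scorer.score(
    docs=["The cat sat on the mat."],
    claims=["A cat was sitting on a mat."],
)
print(labels, probs)  # 1 = claim supported by the document, 0 = unsupported
```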
We use the following metrics for evaluation:
- AlignScore (here)
- GPT-4 Likert Score on a scale of 1-5 (see the sketch after this list)
- GPT-4 pairwise score
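As an illustration of the GPT-4 Likert evaluation, here is a minimal sketch using the OpenAI Python client; the prompt wording and model name are assumptions for illustration, not the exact prompt used in the paper.

```python
from openai import OpenAI

client = OpenAI()  # expects OPENAI_API_KEY in the environment

def gpt4_likert_score(document: str, summary: str) -> str:
    """Ask GPT-4 for a 1-5 factual-consistency rating (illustrative prompt, not the paper's)."""
    prompt = (
        "Rate the factual consistency of the summary with respect to the document "
        "on a scale of 1 (completely unfaithful) to 5 (fully faithful). "
        "Reply with a single number.\n\n"
        f"Document:\n{document}\n\nSummary:\n{summary}"
    )
    response = client.chat.completions.create(
        model="gpt-4",
        messages=[{"role": "user", "content": prompt}],
        temperature=0,
    )
    return response.choices[0].message.content.strip()
```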