Popular repositories
- DPO_controlled_sentiment_generation (Python): Implementation of the research paper "DPO: Your Language Model is Secretly a Reward Model" (the DPO objective is sketched below).
- DPO_tldr_summarisation: Second experiment (TL;DR summarisation) of the paper "DPO: Your Language Model is Secretly a Reward Model".
- RLHF_Sentiment_Alignment (Python): RLHF implementation of the sentiment alignment experiment from the DPO paper (a sentiment-reward sketch follows below).
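For context, the objective the two DPO repositories implement is a classification-style loss over preference pairs: the policy is trained so that its log-probability ratio against a frozen reference model is higher for the preferred response than for the rejected one. Below is a minimal PyTorch sketch of that loss, not code taken from these repositories; the function name, argument names, and the choice of `beta=0.1` are illustrative assumptions.

```python
import torch
import torch.nn.functional as F

def dpo_loss(policy_chosen_logps, policy_rejected_logps,
             ref_chosen_logps, ref_rejected_logps, beta=0.1):
    """DPO loss over a batch of preference pairs.

    Each argument is a (batch,) tensor of summed log-probabilities of a
    full response under the policy or the frozen reference model.
    `beta` controls how far the policy may drift from the reference.
    """
    # Implicit rewards: scaled log-prob ratios of policy vs. reference
    chosen_rewards = beta * (policy_chosen_logps - ref_chosen_logps)
    rejected_rewards = beta * (policy_rejected_logps - ref_rejected_logps)
    # Logistic loss on the reward margin: push chosen above rejected
    return -F.logsigmoid(chosen_rewards - rejected_rewards).mean()
```

In this formulation the language model itself acts as the reward model (the paper's title), so no separate reward network or RL loop is needed.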
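The RLHF baseline, by contrast, needs an explicit reward signal. For the sentiment alignment task this is typically an off-the-shelf sentiment classifier scoring the generations. Below is a minimal sketch of such a reward function; the checkpoint name `lvwerra/distilbert-imdb` and the label strings are assumptions, not necessarily what RLHF_Sentiment_Alignment uses.

```python
from transformers import pipeline

# Assumed reward model: an IMDB sentiment classifier (hypothetical choice)
sentiment = pipeline("sentiment-analysis", model="lvwerra/distilbert-imdb")

def sentiment_reward(texts):
    """Map generated texts to scalar rewards in [0, 1]:
    the classifier's probability of the POSITIVE class."""
    outputs = sentiment(texts, truncation=True)
    return [
        out["score"] if out["label"] == "POSITIVE" else 1.0 - out["score"]
        for out in outputs
    ]

# Example: more positive text should receive a higher reward
print(sentiment_reward(["The movie was wonderful!", "A dull, lifeless film."]))
```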
