State of art performance after data augmentation techniques upon model saturation in NLP text summarization

Raul · April 8, 2022, 2:58am

I’d like to share a real world example we have reached state of art performance in our main metric was reached using data centricity approaches were introducing in my team’s project at Stanford University NLP with Deep Learning class.
We augmented radiology reports by generating a sort of “paraphrasing with scale” our large corpus by shuffling the reports fields ordering when fine-tuning transformers in additional epochs.

You can watch our short presentation in this video, and if you would like to dig deeper feel free to see our published report in the course webpage. Also feel free to PM directly.

Cheers!

Topic		Replies	Views
Any suggestions for applying Data-centric AI to NLP datasets? AI Discussions ai-discussions , data-centric	1	174	April 8, 2022
My first Data-Centric AI based Publication AI Discussions ai-discussions , data-centric	8	151	March 24, 2023
Data Centric Application for NLP AI Discussions ai-discussions , data-centric	1	263	April 8, 2022
Generative AI - LLMs Applications Building Systems with the ChatGPT API	2	179	June 9, 2023
Natural Language Processeing Live Event! News and Announcements	3	540	November 3, 2021

State of art performance after data augmentation techniques upon model saturation in NLP text summarization

Related topics