Refining LLMs with Reinforcement Learning for Human-Like Text Generation

dc.contributor.author: Harish, A.
dc.contributor.author: Prakash, G.
dc.contributor.author: Nair, R.R.
dc.contributor.author: Iyer, V.B.
dc.contributor.author: Anand Kumar, M.
dc.date.accessioned: 2026-02-06T06:33:50Z
dc.date.issued: 2024
dc.description.abstract: Large Language Models (LLMs) are widely used for text-generation tasks such as dialogue summarization and creative writing. However, the generated text often appears unnatural and can easily be distinguished from human writing. In this paper, we leverage Reinforcement Learning to fine-tune LLMs to produce text that more closely resembles human language. Specifically, we apply the Proximal Policy Optimization (PPO) algorithm to fine-tune a FLAN-T5 model for a dialogue summarization task. © 2024 IEEE.
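The abstract describes fine-tuning with Proximal Policy Optimization. As a rough illustration only (this is not code from the paper), the PPO-Clip surrogate objective that such fine-tuning maximizes can be sketched for a single token/action; the numeric values and function name below are illustrative assumptions:

```python
def ppo_clipped_objective(ratio, advantage, eps=0.2):
    """PPO-Clip surrogate objective for one action.

    ratio     -- pi_new(a|s) / pi_old(a|s), the policy probability ratio
    advantage -- estimated advantage (e.g. reward-model score minus a baseline)
    eps       -- clip range; ratios outside [1-eps, 1+eps] earn no extra gain
    """
    # Clamp the ratio into [1 - eps, 1 + eps].
    clipped = max(min(ratio, 1 + eps), 1 - eps)
    # PPO takes the minimum of the unclipped and clipped terms,
    # which caps how far a single update can move the policy.
    return min(ratio * advantage, clipped * advantage)
```

With a positive advantage and ratio 1.5, the objective is clipped to 1.2 * advantage, so the policy gets no incentive to drift further from the reference model than the clip range allows.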
dc.identifier.citation: Proceedings of CONECCT 2024 - 10th IEEE International Conference on Electronics, Computing and Communication Technologies, 2024
dc.identifier.uri: https://doi.org/10.1109/CONECCT62155.2024.10677038
dc.identifier.uri: https://idr.nitk.ac.in/handle/123456789/28902
dc.publisher: Institute of Electrical and Electronics Engineers Inc.
dc.subject: AI detection
dc.subject: Large Language Models (LLMs)
dc.subject: Low Rank Adaptation (LoRA)
dc.subject: Proximal Policy Optimization (PPO)
dc.title: Refining LLMs with Reinforcement Learning for Human-Like Text Generation
