Automated title generation for research papers: A transformer-based approach

dc.contributor.authorHosseinzadeh, Maryam
dc.contributor.supervisorBaniasadi, Amirali
dc.date.accessioned2024-09-04T22:09:10Z
dc.date.available2024-09-04T22:09:10Z
dc.date.issued2024
dc.degree.departmentDepartment of Electrical and Computer Engineering
dc.degree.levelMaster of Engineering MEng
dc.description.abstractThe task of generating concise, informative, and relevant titles for research papers is a critical but challenging aspect of academic publishing. This project report explores the application of Transformer-based Large Language Models (LLMs) to automatically generate and suggest research paper titles from their abstracts. Building on the foundational Transformer architecture introduced by Vaswani et al., we evaluate the performance of different models, including a custom-built LLM, a pre-trained GPT-2 model, and a fine-tuned version of GPT-2. Through qualitative analysis, we demonstrate that fine-tuning GPT-2 on a specific dataset of research paper abstracts and titles significantly enhances the coherence, relevance, and contextual accuracy of the generated titles. We address challenges such as hallucinations in LLM-generated text and discuss the importance of high-quality datasets and task-specific fine-tuning. This work contributes to the broader understanding of the capabilities and limitations of LLMs in specialized NLP tasks, offering insights for future research and applications in academic publishing.
dc.description.scholarlevelGraduate
dc.identifier.urihttps://hdl.handle.net/1828/20375
dc.language.isoen
dc.rightsAvailable to the World Wide Web
dc.subjecttransformers
dc.subjectlarge language models
dc.subjectnatural language processing
dc.titleAutomated title generation for research papers: A transformer-based approach
dc.typeproject

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Hosseinzadeh_Maryam_MEng_2024.pdf
Size:
217.56 KB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.62 KB
Format:
Item-specific license agreed upon to submission
Description: