This project fine-tunes the Llama 1B model to generate titles for research papers based on their abstracts, using a custom dataset of title-abstract pairs. By combining LoRA (Low-Rank Adaptation) with quantization, the model is optimized for efficient training and inference.
- Project Overview
- Dataset
- Model and Techniques
- Installation
- Usage
- Project Structure
- Results
- Acknowledgments
The aim of this project is to generate accurate and relevant research paper titles by training a language model to understand the abstract and context of each paper. By employing Llama 1B as the base model, this fine-tuning process demonstrates how pre-trained language models can be adapted for specialized NLP tasks such as title generation.
- Dataset: A dataset of research papers containing two columns: `title` and `abstract`.
- Data Preprocessing: The dataset is preprocessed to ensure high-quality input, and tokenization is performed using the Llama tokenizer.
- Model: Llama 1B model by Meta AI, chosen for its balance between performance and efficiency.
- Quantization: Quantization is applied alongside LoRA to make fine-tuning feasible on smaller hardware setups by reducing memory usage.
- Training: The model is fine-tuned using Hugging Face's Trainer API, which simplifies the training loop, handling evaluation metrics and model checkpoints.
- Evaluation: The model is evaluated based on title generation accuracy and loss metrics, which help measure its ability to generalize to unseen abstracts.
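The notebook walks through these steps end to end. As a rough orientation, the sketch below shows one way the pieces can fit together; the `meta-llama/Llama-3.2-1B` checkpoint, 4-bit loading via `bitsandbytes`, the prompt format, and all hyperparameters are illustrative assumptions rather than values taken from the notebook.

```python
# Minimal fine-tuning sketch. Model id, prompt format, and hyperparameters are
# illustrative assumptions, not values copied from the project notebook.
from datasets import load_dataset
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training
from transformers import (
    AutoModelForCausalLM,
    AutoTokenizer,
    BitsAndBytesConfig,
    DataCollatorForLanguageModeling,
    Trainer,
    TrainingArguments,
)

model_id = "meta-llama/Llama-3.2-1B"  # assumed base checkpoint (see Acknowledgments)

tokenizer = AutoTokenizer.from_pretrained(model_id)
tokenizer.pad_token = tokenizer.eos_token  # Llama tokenizers ship without a pad token

# Load the base model with 4-bit quantization to cut memory usage.
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    quantization_config=BitsAndBytesConfig(load_in_4bit=True),
    device_map="auto",
)
model = prepare_model_for_kbit_training(model)

# Attach LoRA adapters so only a small set of low-rank weights is trained.
model = get_peft_model(
    model,
    LoraConfig(r=16, lora_alpha=32, lora_dropout=0.05,
               target_modules=["q_proj", "v_proj"], task_type="CAUSAL_LM"),
)

# Turn each title/abstract pair into an "abstract -> title" training example.
dataset = load_dataset("csv", data_files="data/titles_abstracts.csv")["train"]
splits = dataset.train_test_split(test_size=0.1, seed=42)

def to_features(example):
    text = f"Abstract: {example['abstract']}\nTitle: {example['title']}{tokenizer.eos_token}"
    return tokenizer(text, truncation=True, max_length=512)

tokenized = splits.map(to_features, remove_columns=dataset.column_names)

trainer = Trainer(
    model=model,
    args=TrainingArguments(
        output_dir="llama_1B_lora_finetuned",
        per_device_train_batch_size=4,
        num_train_epochs=3,
        learning_rate=2e-4,
        logging_steps=50,
    ),
    train_dataset=tokenized["train"],
    eval_dataset=tokenized["test"],
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()
print(trainer.evaluate())  # reports eval_loss on the held-out split
```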
To replicate this project, set up the environment by installing the necessary libraries:
```bash
# Clone the repository
git clone https://github.com/your_username/llama-title-generator.git
cd llama-title-generator

# Install dependencies
pip install -r requirements.txt
```
The `requirements.txt` file can be regenerated from your environment with:

```bash
pip freeze > requirements.txt
```
- Data Preparation:
  - Ensure your dataset is structured with `title` and `abstract` columns.
  - Save the dataset as `data/titles_abstracts.csv` (a quick validation snippet is shown after this list).

- Training the Model:
  - Use the Jupyter notebook to load and preprocess the dataset, initialize the model, and start fine-tuning:

    ```bash
    jupyter notebook llm_llama_1b_finetune_generate_title.ipynb
    ```

- Evaluating the Model:
  - After training, evaluate the model on a validation dataset to verify its performance.

- Inference:
  - Use the model to generate titles from new abstracts by running the inference section of the notebook (an illustrative sketch follows this list).
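Before training, it can help to sanity-check the CSV. This is a small hedged example using pandas; the column names come from the Dataset section, and the cleanup step is only a suggestion.

```python
# Quick sanity check of the dataset file expected by the notebook.
import pandas as pd

df = pd.read_csv("data/titles_abstracts.csv")
assert {"title", "abstract"} <= set(df.columns), "CSV must have 'title' and 'abstract' columns"
df = df.dropna(subset=["title", "abstract"])  # drop incomplete rows before training
print(f"{len(df)} usable title-abstract pairs")
```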
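For inference, follow the notebook's own code; the sketch below shows one plausible way to load the saved LoRA adapter from `llama_1B_lora_finetuned/` and generate a title, assuming the same base checkpoint and prompt format as the training sketch above.

```python
# Illustrative inference sketch: load the LoRA adapter and generate a title.
# The base checkpoint and prompt format are assumptions, not notebook code.
import torch
from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer

base_id = "meta-llama/Llama-3.2-1B"      # assumed base checkpoint
adapter_dir = "llama_1B_lora_finetuned"  # directory produced by fine-tuning

tokenizer = AutoTokenizer.from_pretrained(base_id)
model = AutoModelForCausalLM.from_pretrained(base_id, device_map="auto")
model = PeftModel.from_pretrained(model, adapter_dir)
model.eval()

abstract = "We propose a method for ..."  # replace with a real abstract
prompt = f"Abstract: {abstract}\nTitle:"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)

with torch.no_grad():
    output = model.generate(
        **inputs,
        max_new_tokens=32,
        do_sample=False,
        pad_token_id=tokenizer.eos_token_id,
    )

# Decode only the newly generated tokens (everything after the prompt).
title = tokenizer.decode(output[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True)
print(title.strip())
```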
```
.
├── __pycache__/                                # Python bytecode cache directory
├── llama_1B_lora_finetuned/                    # Directory containing the fine-tuned Llama model (LoRA adapter)
├── Hugginface_prasun.py                        # Script for Hugging Face model integration and utilities
├── llm_llama_1b_finetune_generate_title.ipynb  # Jupyter notebook for fine-tuning Llama and generating titles
├── requirements.txt                            # Project dependencies and their versions
└── title_maker.py                              # Core script for title generation functionality
```
The fine-tuned model shows promising results in generating titles that are contextually relevant to the provided abstracts. Further evaluation metrics are saved in the notebook.
- Meta AI's Llama 3.2-1B model (available on Hugging Face) for providing the foundational pre-trained language model used in this project.
- Hugging Face for the Trainer API, which simplifies model training and deployment.
- The LoRA and quantization techniques for memory-efficient training.
- For the dataset:

  ```bibtex
  @misc{acar_arxiver2024,
    author       = {Alican Acar, Alara Dirik, Muhammet Hatipoglu},
    title        = {ArXiver},
    year         = {2024},
    publisher    = {Hugging Face},
    howpublished = {\url{https://huggingface.co/datasets/neuralwork/arxiver}}
  }
  ```