
Function calling Gemma 3

This project demonstrates how to fine-tune the google/gemma-3-1b-it and google/gemma-3-270m-it models for function calling tasks. The process involves preparing the NousResearch/hermes-function-calling-v1 dataset, fine-tuning the models using Low-Rank Adaptation (LoRA), and evaluating their performance on a test set.

You can find a detailed description of the project by reading my Medium article:

Function calling as a toolbox

The project is structured into three main Python scripts:

  • prepare_function_calling_dataset.py: Downloads and preprocesses the dataset, splitting it into training and testing sets (a minimal sketch follows this list).
  • function_calling_train.py: Fine-tunes the Gemma model using the preprocessed training data and LoRA.
  • evaluation.py: Evaluates the fine-tuned model's performance on function calling and helpful exchanges.
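
For orientation, the snippet below sketches what the dataset-preparation step might look like with the Hugging Face datasets library. The subset name, split ratio, and output files here are illustrative assumptions, not necessarily what prepare_function_calling_dataset.py actually does.

    from datasets import load_dataset

    # Download the Hermes function-calling data from the Hugging Face Hub;
    # the "func_calling" subset name is an assumption for illustration
    dataset = load_dataset("NousResearch/hermes-function-calling-v1", "func_calling")

    # Hold out a test set; the 90/10 split and fixed seed are illustrative
    splits = dataset["train"].train_test_split(test_size=0.1, seed=42)
    splits["train"].to_json("train.jsonl")
    splits["test"].to_json("test.jsonl")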

About the Project

This project provides a complete workflow for fine-tuning a large language model for function calling. By following the scripts, you can reproduce the results and adapt the process for your own function calling tasks. The use of LoRA allows for efficient fine-tuning, and the evaluation script provides a clear measure of the model's performance improvement.

Baseline, with the unquantized google/gemma-3-1b-it:

  • Accuracy in function calling (ROUGE-L): ~0.28
  • Match in helpful exchange (ROUGE-N): ~0.29

After fine-tuning:

  • Accuracy in function calling (ROUGE-L): ~0.91
  • Match in helpful exchange (ROUGE-N): ~0.78

Baseline, with the unquantized google/gemma-3-270m-it:

  • Accuracy in function calling (ROUGE-L): ~0.13
  • Match in helpful exchange (ROUGE-N): ~0.48

After fine-tuning:

  • Accuracy in function calling (ROUGE-L): ~0.90
  • Match in helpful exchange (ROUGE-N): ~0.68

Getting Started

To get started with this project, follow these steps:

  1. Clone the repository:

    git clone https://github.com/lmassaron/Gemma-3-Function-Calling.git
    cd Gemma-3-Function-Calling
  2. Install the dependencies:

    pip install -r requirements.txt
  3. Run the scripts:

    • First, prepare the dataset:

      python prepare_function_calling_dataset.py
    • Then, fine-tune the model:

      python function_calling_train.py
    • Finally, evaluate the model:

      python evaluation.py

Model and Training

The base model for this project is google/gemma-3-1b-it or google/gemma-3-270m-it. The model is fine-tuned using Low-Rank Adaptation (LoRA), which is a parameter-efficient fine-tuning technique. The LoRA configuration can be found in the function_calling_train.py script.
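
As a sketch, a LoRA setup with the peft library might look like the following; the rank, alpha, dropout, and target modules are illustrative placeholders, and the actual values are defined in function_calling_train.py.

    from peft import LoraConfig

    # Illustrative hyperparameters only; see function_calling_train.py
    # for the configuration actually used in this project
    lora_config = LoraConfig(
        r=16,                  # rank of the low-rank update matrices
        lora_alpha=32,         # scaling factor for the LoRA update
        lora_dropout=0.05,     # dropout applied to the LoRA layers
        bias="none",
        task_type="CAUSAL_LM",
        target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    )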

The training process uses the trl library and the SFTTrainer to perform supervised fine-tuning. The training arguments, such as the number of epochs, batch size, and learning rate, are also defined in the function_calling_train.py script.
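
A minimal SFTTrainer sketch, assuming a recent version of trl: the epochs, batch size, and learning rate shown here are placeholders rather than the project's settings, and train_dataset stands for the data produced by prepare_function_calling_dataset.py.

    from trl import SFTConfig, SFTTrainer

    # Placeholder training arguments; the real ones are defined in
    # function_calling_train.py
    training_args = SFTConfig(
        output_dir="gemma-3-function-calling",
        num_train_epochs=3,
        per_device_train_batch_size=4,
        learning_rate=2e-4,
    )

    trainer = SFTTrainer(
        model="google/gemma-3-1b-it",  # model id; a preloaded model also works
        args=training_args,
        train_dataset=train_dataset,   # dataset from the preparation step
        peft_config=lora_config,       # LoRA configuration as sketched above
    )
    trainer.train()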

About the Evaluation

The compute_matching_percentage function shows a strong resemblance to a component of the ROUGE-N score, specifically ROUGE-1. ROUGE-N is a metric that measures the overlap of n-grams (contiguous sequences of n words) between a candidate text and a reference text. It is commonly used in tasks like machine translation and text summarization.
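
A minimal sketch of such a unigram-overlap score, assuming whitespace tokenization; the actual compute_matching_percentage in evaluation.py may differ in its details.

    def compute_matching_percentage(candidate: str, reference: str) -> float:
        """ROUGE-1-style score: fraction of reference unigrams that also
        appear in the candidate (illustrative reconstruction)."""
        candidate_tokens = set(candidate.split())
        reference_tokens = reference.split()
        if not reference_tokens:
            return 0.0
        matches = sum(1 for token in reference_tokens if token in candidate_tokens)
        return matches / len(reference_tokens)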

The find_longest_common_sequence_length function is designed to find the length of the longest common contiguous sequence. ROUGE-L, by contrast, is based on the Longest Common Subsequence (LCS), which measures the longest sequence of words appearing in the same order in both the candidate and reference texts, without requiring those words to be contiguous. Because find_longest_common_sequence_length does require contiguity, it is even stricter than the classical ROUGE-L metric.
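
For illustration, the contiguous variant can be computed with the classic longest-common-substring dynamic program over tokens; this is a sketch of the idea, not necessarily the exact code in evaluation.py.

    def find_longest_common_sequence_length(candidate: str, reference: str) -> int:
        """Length, in tokens, of the longest common contiguous sequence,
        found by dynamic programming (illustrative reconstruction)."""
        a, b = candidate.split(), reference.split()
        best = 0
        dp = [0] * (len(b) + 1)  # dp[j]: common run ending at a[i-1], b[j-1]
        for i in range(1, len(a) + 1):
            prev = 0  # value of dp[j-1] from the previous row
            for j in range(1, len(b) + 1):
                current = dp[j]
                dp[j] = prev + 1 if a[i - 1] == b[j - 1] else 0
                best = max(best, dp[j])
                prev = current
        return best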

