
# Agri-LLaVA: Knowledge-Infused Large Multimodal Assistant on Agricultural Pests and Diseases

## Install

```shell
cd Agri-LLaVA
conda create -n agrillava python=3.10 -y
conda activate agrillava
pip install --upgrade pip  # enable PEP 660 support
pip install -e .
pip install -e ".[train]"
pip install flash-attn --no-build-isolation
```

## Data

All of our data can be found on Hugging Face.

- Training data

  Our Agri-400K consists of:

- Evaluation data

## Train

Agri-LLaVA is trained on 8 A800 GPUs; the entire training process takes 11 hours and 20 minutes.

- Stage 1: Agricultural Pre-training for Feature Alignment.

Hyperparameters:

| Hyperparameter | Global Batch Size | Learning rate | Epochs | Max length | Weight decay |
| --- | --- | --- | --- | --- | --- |
| Agri-LLaVA | 256 | 1e-3 | 1 | 2048 | 0 |
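As a sanity check on these settings, the global batch size is normally the product of GPU count, per-device batch size, and gradient-accumulation steps. The per-device and accumulation values below are illustrative assumptions, not values taken from the released training scripts:

```python
# Illustrative arithmetic only: per_device_batch and grad_accum are
# assumed values, not the actual settings used in this repository.
num_gpus = 8            # 8 A800 GPUs, as stated above
per_device_batch = 16   # assumption
grad_accum = 2          # assumption

global_batch = num_gpus * per_device_batch * grad_accum
print(global_batch)     # Stage 1 global batch size
```

Any per-device/accumulation split with the same product yields the same global batch size of 256.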

### Pretrain

Please download the agricultural pests and diseases feature alignment data for pre-training from here.

You can use the script `pretrain.sh` to pre-train.

- Stage 2: Agricultural End-to-End Instruction-tuning.

Hyperparameters:

| Hyperparameter | Global Batch Size | Learning rate | Epochs | Max length | Weight decay |
| --- | --- | --- | --- | --- | --- |
| Agri-LLaVA | 128 | 2e-5 | 3 | 2048 | 0 |

### Visual Instruction Tuning

Please download the agricultural pests and diseases instruction-tuning data for fine-tuning from here.

You can use the following scripts to fine-tune:

## Evaluation

### Agricultural Multimodal Chatbot

- Prepare Data

  Please download the Agri-LLaVA-Chatbot-Bench data for evaluation from here.

- GPT-based Evaluation

1. Generate Agri-LLaVA responses on Agri-LLaVA-Chatbot-Bench.

   ```shell
   python agri_llava/eval/model_vqa.py \
       --model-path /path/to/model_checkpoint \
       --question-file /path/to/agri_llava_chatbot_question.jsonl \
       --image-folder /path/to/image_folder \
       --answers-file /path/to/agri_llava_chatbot_answer_our.jsonl
   ```

2. Evaluate the generated responses. In the command, `agri_llava_chatbot_answer.jsonl` refers to the reference answers generated by language-only GPT-4 based on knowledge.

   ```shell
   python agri_llava/eval/eval_gpt_review_visual.py \
       --question /path/to/agri_llava_chatbot_question.jsonl \
       --answer-list \
           /path/to/agri_llava_chatbot_answer.jsonl \
           /path/to/agri_llava_chatbot_answer_our.jsonl \
       --rule agri_llava/eval/eval_metrics/rule.json \
       --output /path/to/review.jsonl
   ```

3. Summarize the evaluation results.

   ```shell
   python agri_llava/eval/summarize_gpt_review.py
   ```
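The summarize step aggregates the per-question GPT review scores into a relative score for Agri-LLaVA against the GPT-4 reference answers. A minimal sketch of that aggregation, assuming each line of `review.jsonl` carries a score pair in a `tuple` field (the field name and score scale are assumptions modeled on LLaVA-style evaluators, not confirmed by this repository):

```python
import json

# Two example review lines; in practice these are read from review.jsonl.
# The "tuple" field name [reference_score, our_score] is an assumption.
reviews = [
    '{"tuple": [8.0, 7.0]}',
    '{"tuple": [9.0, 8.5]}',
]

scores = [json.loads(line)["tuple"] for line in reviews]
ref_avg = sum(s[0] for s in scores) / len(scores)
ours_avg = sum(s[1] for s in scores) / len(scores)

# Relative score: model average as a percentage of the reference average.
relative = round(100 * ours_avg / ref_avg, 1)
print(relative)
```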

### Agricultural VQA

- Prepare Data

  Please download the Agri-LLaVA-VQA-Bench data for evaluation from here.

- Fine-tuning on the VQA training data

  We use the same scripts to adapt Agri-LLaVA to downstream tasks:

- Evaluating on the VQA testing data

1. Generate Agri-LLaVA responses on Agri-LLaVA-VQA-Bench.

   ```shell
   python agri_llava/eval/model_vqa.py \
       --model-path /path/to/model_checkpoint \
       --question-file /path/to/agri_llava_vqa_test_question.jsonl \
       --image-folder /path/to/image_folder \
       --answers-file /path/to/agri_llava_vqa_answer_our.jsonl
   ```

2. Evaluate the generated responses.

   ```shell
   python agri_llava/eval/run_eval.py \
       --gt /path/to/agri_llava_vqa_test_answer.json \
       --pred /path/to/agri_llava_vqa_answer_our.jsonl
   ```
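The evaluation script compares predictions against the ground-truth file. A minimal sketch of the kind of comparison involved, assuming simple exact-match accuracy keyed by question ID (the metric and the `question_id`/`text` field names are assumptions modeled on common VQA evaluation formats, not the repository's actual implementation):

```python
import json

# Hypothetical records: field names are assumptions for illustration.
gt = {"1": "leaf rust", "2": "aphid"}                 # from the gt .json
preds = [
    '{"question_id": "1", "text": "leaf rust"}',      # from the pred .jsonl
    '{"question_id": "2", "text": "spider mite"}',
]

correct = 0
for line in preds:
    rec = json.loads(line)
    if rec["text"].strip().lower() == gt[rec["question_id"]]:
        correct += 1

accuracy = correct / len(preds)
print(accuracy)  # 0.5
```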

## Acknowledgement

We sincerely thank the LLaVA team for open-sourcing their models and code.
