Hi! This is the official repository for the paper "nvAgent: Automated Data Visualization from Natural Language via Collaborative Agent Workflow".
Natural Language to Visualization (NL2Vis) seeks to convert natural-language descriptions into visual representations of given tables, empowering users to derive insights from large-scale data. Recent advancements in Large Language Models (LLMs) show promise in automating code generation to transform tabular data into accessible visualizations. However, they often struggle with complex queries that require reasoning across multiple tables.
To address this limitation, we propose a collaborative agent workflow, termed nvAgent, for NL2Vis. Specifically, nvAgent comprises three agents: a processor agent for database processing and context filtering, a composer agent for planning visualization generation, and a validator agent for code translation and output verification.
Comprehensive evaluations on the new VisEval benchmark demonstrate that nvAgent consistently surpasses state-of-the-art baselines, achieving a 7.88% improvement in single-table and a 9.23% improvement in multi-table scenarios. Qualitative analyses further highlight that nvAgent maintains nearly a 20% performance margin over previous models, underscoring its capacity to produce high-quality visual representations from complex, heterogeneous data sources.
![nvAgent pipeline](./assets/pipeline.png)
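To make the division of labor concrete, here is a rough Python sketch of the workflow; the function names are illustrative assumptions, not the repo's actual API (see `core/agents.py` for the real classes):

```python
# Illustrative sketch of the three-agent workflow; names are assumptions,
# not the repo's actual API (see core/agents.py for the real classes).
def process(database: str, query: str) -> str:
    """Processor agent: filter the schema down to query-relevant tables/columns."""
    return f"filtered schema of {database} for {query!r}"

def compose(query: str, context: str) -> str:
    """Composer agent: plan the visualization (chart type, axes, transforms)."""
    return f"plan: chart answering {query!r} given [{context}]"

def validate(plan: str, context: str) -> str:
    """Validator agent: translate the plan into chart code and verify the output."""
    return f"# matplotlib code implementing: {plan}"

query, database = "average price per city", "sales.sqlite"
context = process(database, query)
plan = compose(query, context)
code = validate(plan, context)  # the final, verified visualization code
```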
We provide a web interface that demonstrates how to use nvAgent to generate visualizations from natural language descriptions: simply upload `.csv` files and enter your requirements. The interface is implemented in `web_vis`, and here is a demonstration:

![Web interface demo](./assets/tinywow_web_70526330.gif)
This repo is organized as follows:
```
├─core
| ├─agents.py       # defines the three agent classes
| ├─api_config.py   # configures the API key and base
| ├─chat_manager.py # manages the communication between agents
| ├─const.py        # prompt templates
| ├─llm.py          # configures LLM API calls and writes logs
| ├─utils.py        # utility functions
├─web_vis           # the web interface for nvAgent
| ├─core
| ├─templates
| ├─app.py
├─visEval           # the evaluation framework
| ├─check           # the different check aspects
| ├─dataset.py      # generates the dataset path mapping
| ├─evaluate.py     # computes the agent's scores
├─run_evaluate.py   # evaluation script
├─README.md
├─requirements.txt
├─visEval_dataset.zip # the dataset used for evaluation
```
To start with this project, there are several steps you can follow:
- Set up your local environment
- Create a virtual environment for the project. The recommended Python version is 3.9 or higher.
```bash
conda create -n nvagent python=3.9
conda env list
conda activate nvagent
```
- Use the provided `requirements.txt` file to install the necessary dependencies.
```bash
pip install -r requirements.txt
```
Note: if there are any package conflicts, try reinstalling the affected packages individually:

```bash
pip uninstall package_name
pip install package_name
```
- Configure your API and file paths.
  - Edit your API key, API base, and API version in `api_config.py`. (We use the AzureOpenAI API for nvAgent; you can replace it with OpenAI.) A rough sketch of what this file may contain is shown after this list.
  - In `chat_manager.py`, replace `folder = "E:/visEval_dataset"` with your own dataset path.
  - Run `llm.py` to test your API config, and run `chat_manager.py` to test nvAgent. (You can find test examples in `visEval.json` in `visEval_dataset`.)
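For reference, here is a minimal sketch of what an AzureOpenAI configuration in `api_config.py` might look like; the variable names below are illustrative assumptions, so match them to the actual file:

```python
# Hypothetical sketch of core/api_config.py; variable names are assumptions.
API_KEY = "your-api-key-here"                         # Azure OpenAI key
API_BASE = "https://your-resource.openai.azure.com/"  # Azure endpoint
API_VERSION = "2024-02-01"                            # API version string
```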
After you have configured nvAgent correctly, you can run `run_evaluate.py` to obtain the final scores. There are, however, several settings you need to configure before evaluation.
- Vision model:
We implement the evaluation with a vision-language model such as GPT-4o-mini for MLLM-as-a-Judge. Due to the rate limits of the Azure API, we use the OpenAI API for the vision model instead:
```python
vision_model = ChatOpenAI(
    model_name="gpt-4o-mini",
    ...
    base_url="your api base here",
    api_key="your api key here",
)
```
Note: here, we use LangChain to implement the interactions.
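For intuition, a LangChain vision call generally looks like the sketch below. This is not the repo's actual judging prompt, only an illustration of the multimodal message format:

```python
# Minimal sketch of a LangChain vision call; the prompt here is illustrative,
# not the actual judging prompt used by the evaluation framework.
import base64
from langchain_core.messages import HumanMessage

with open("chart.png", "rb") as f:
    image_b64 = base64.b64encode(f.read()).decode()

message = HumanMessage(content=[
    {"type": "text", "text": "Does this chart answer the query? Answer yes or no."},
    {"type": "image_url", "image_url": {"url": f"data:image/png;base64,{image_b64}"}},
])
print(vision_model.invoke([message]).content)  # vision_model from the snippet above
```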
- Others:

```python
folder = "E:/visEval_dataset"  # your dataset path here
library = "matplotlib"  # choose matplotlib or seaborn for visualization
webdriver = Path(r"C:\Program Files\Google\Chrome\Application\chromedriver.exe")  # your chromedriver path here (note the raw string)
log_folder = Path("evaluate_logs")  # set your evaluation results path
dataset = Dataset(Path(folder), "all")  # choose all, single, or multiple for different dataset settings
agent = ChatManager(data_path=folder, log_path="./test_logs.txt")  # set the prompt and response logs path
evaluator = Evaluator(webdriver_path=webdriver, vision_model=vision_model)
```
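Before launching a full run, it can help to fail fast on misconfigured paths. A small, hypothetical pre-flight check (not part of the repo):

```python
# Hypothetical pre-flight check (not part of the repo): verify the configured
# paths before starting a long evaluation run.
from pathlib import Path

assert Path(folder).exists(), f"dataset folder not found: {folder}"
assert webdriver.exists(), f"chromedriver not found: {webdriver}"
log_folder.mkdir(parents=True, exist_ok=True)  # ensure the log folder exists
```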
If you find our work helpful, please cite it as:
We welcome contributions and suggestions!