Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

GRANDE: Gradient-Based Decision Tree Ensembles for Tabular Data | OpenReview #914

Open
1 task
ShellLM opened this issue Aug 22, 2024 · 1 comment
Open
1 task
Labels
Algorithms Sorting, Learning or Classifying. All algorithms go here. Git-Repo Source code repository like gitlab or gh human-verified <INST>NEVER PICK THIS LABEL</INST> MachineLearning ML Models, Training and Inference New-Label Choose this option if the existing labels are insufficient to describe the content accurately Papers Research papers Research personal research notes for a topic software-engineering Best practice for software engineering

Comments

@ShellLM
Copy link
Collaborator

ShellLM commented Aug 22, 2024

GRANDE: Gradient-Based Decision Tree Ensembles for Tabular Data

Snippet

"Despite the success of deep learning for text and image data, tree-based ensemble models are still state-of-the-art for machine learning with heterogeneous tabular data. However, there is a significant need for tabular-specific gradient-based methods due to their high flexibility. In this paper, we propose GRANDE: Gradient-Based Decision Tree Ensembles, a novel approach for learning hard, axis-aligned decision tree ensembles using end-to-end gradient descent. GRANDE is based on a dense representation of tree ensembles, which affords to use backpropagation with a straight-through operator to jointly optimize all model parameters. Our method combines axis-aligned splits, which is a useful inductive bias for tabular data, with the flexibility of gradient-based optimization. Furthermore, we introduce an advanced instance-wise weighting that facilitates learning representations for both, simple and complex relations, within a single model. We conducted an extensive evaluation on a predefined benchmark with 19 classification datasets and demonstrate that our method outperforms existing gradient-boosting and deep learning frameworks on most datasets. The method is available under: https://github.com/s-marton/GRANDE"

Full Content

GRANDE: Gradient-Based Decision Tree Ensembles for Tabular Data

Despite the success of deep learning for text and image data, tree-based ensemble models are still state-of-the-art for machine learning with heterogeneous tabular data. However, there is a significant need for tabular-specific gradient-based methods due to their high flexibility. In this paper, we propose GRANDE: Gradient-Based Decision Tree Ensembles, a novel approach for learning hard, axis-aligned decision tree ensembles using end-to-end gradient descent. GRANDE is based on a dense representation of tree ensembles, which affords to use backpropagation with a straight-through operator to jointly optimize all model parameters. Our method combines axis-aligned splits, which is a useful inductive bias for tabular data, with the flexibility of gradient-based optimization. Furthermore, we introduce an advanced instance-wise weighting that facilitates learning representations for both, simple and complex relations, within a single model. We conducted an extensive evaluation on a predefined benchmark with 19 classification datasets and demonstrate that our method outperforms existing gradient-boosting and deep learning frameworks on most datasets. The method is available under: https://github.com/s-marton/GRANDE

Suggested labels

{'label-name': 'TabularData', 'label-description': 'Research related to machine learning techniques specifically designed for heterogeneous tabular datasets.', 'gh-repo': 's-marton/GRANDE', 'confidence': 92.41}

@ShellLM ShellLM added Algorithms Sorting, Learning or Classifying. All algorithms go here. Git-Repo Source code repository like gitlab or gh MachineLearning ML Models, Training and Inference New-Label Choose this option if the existing labels are insufficient to describe the content accurately Papers Research papers labels Aug 22, 2024
@ShellLM
Copy link
Collaborator Author

ShellLM commented Aug 22, 2024

Related content

#626 similarity score: 0.8

@irthomasthomas irthomasthomas added Research personal research notes for a topic human-verified <INST>NEVER PICK THIS LABEL</INST> software-engineering Best practice for software engineering labels Aug 22, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Algorithms Sorting, Learning or Classifying. All algorithms go here. Git-Repo Source code repository like gitlab or gh human-verified <INST>NEVER PICK THIS LABEL</INST> MachineLearning ML Models, Training and Inference New-Label Choose this option if the existing labels are insufficient to describe the content accurately Papers Research papers Research personal research notes for a topic software-engineering Best practice for software engineering
Projects
None yet
Development

No branches or pull requests

2 participants