Skip to content

๐Ÿฆ An open-source NLP framework that offers high-level wrappers designed for effortless launch, enhanced reproducibility, superior control, and unmatched flexibility for your experiments.

License

Notifications You must be signed in to change notification settings

tamohannes/urartu

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

57 Commits
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

PyPI - Package Version PyPI - Python Version GitHub - License

UrarTU ๐Ÿฆ

Harness the power of UrarTU, a versatile ML framework meticulously designed to provide intuitive abstractions of familiar pipeline components. With a .yaml file-based configuration system, and the added convenience of seamless slurm job submission on clusters, UrarTU takes the complexity out of machine learning, so you can focus on what truly matters! ๐Ÿš€

urartu_schema drawio

Installation

Getting Started with UrarTU is a Breeze! ๐ŸŒ€ Simply do

pip install urartu

Or follow these steps to install from the source:

  • Clone the repository: git clone git@github.com:tamohannes/urartu.git
  • Navigate to the project directory: cd urartu
  • Execute the magic command: pip install -e .

Adding a dash of convenience! โœจ Once you've executed the previous command, you'll also have an alias conjured, granting you easy access to UrarTU from any directory within your operating system:

urartu --help

The Structure

Think of UrarTU as a project insantiator. It simplifies project creation by offering high-level abstractions, yaml-based file configuration, and slurm job management. It also includes essential features for machine learning pipelines, such as dataset reading, model loading, device handling and more.

Urartu features a registry where you can manage your project (a Python module) and execute its actions using the urartu command. To register a project simply run:

urartu register --name=example --path=PATH_TO_EXAMPLE_MODULE

Here, we register a module named example located at PATH_TO_EXAMPLE_MODULE.

To remove a module from the registry, simply run:

urartu unregister --name=example

To see the available modules, use the command urartu -h. Under the launch command, youโ€™ll find a list of all registered modules.

Navigating the UrarTU Architecture

Within UrarTU lies a well-organized structure that simplifies your interaction with machine learning components.

Configs: Tailoring Your Setup

The default configs which shape the way of configs are defined under urartu/config directory:

  • urartu/config/main.yaml: This core configuration file sets the foundation for default settings, covering all available keys within the system.
  • urartu/config/action_config Directory: A designated space for specific action configurations.

Crafting Customizations

Tailoring configurations to your needs is a breeze with UrarTU. You have two flexible options:

  1. Custom Config Files: To simplify configuration adjustments, UrarTU will read the content of your inherited module in configs directory where you can store personalized configuration files. These files seamlessly integrate with Hydra's search path. The directory structure mirrors that of urartu/config. You can define project-specific configurations in specially named files. For instance, an example.yaml file within the configs directory can house all the configurations specific to your 'example' project, with customized settings.

    • Personalized User Configs: To further tailor configurations for individual users, create a directory named configs_{username} at the same level as the configs directory, where {username} represents your operating system username. The beauty of this approach is that there are no additional steps required. Your customizations will smoothly load and override the default configurations, ensuring a seamless and hassle-free experience. โœจ

    The order of precedence for configuration overrides is as follows: urartu/config, configs, configs_{username}, giving priority to user-specific configurations.

  2. CLI Approach: For those who prefer a command-line interface (CLI) approach, UrarTU offers a convenient method. You can enhance your commands with specific key-value pairs directly in the CLI. For example, modifying your working directory path is as simple as:

    urartu action_config=example action_config.workdir=PATH_TO_WORKDIR

Choose the method that suits your workflow best and enjoy the flexibility UrarTU provides for crafting custom configurations.

Actions: Shaping Functionality

Central to UrarTU's architecture is the Action class. This foundational script governs all actions and their behavior. From loading CLI arguments to orchestrating the main function of a chosen action, the action_name parameter plays the pivotal role in this functionality.

Effortless Launch

With UrarTU, launching actions becomes a breeze, offering you two distinctive pathways. ๐Ÿš€

  • Local Marvel: The first route lets you run jobs on your local machine โ€“ the very platform where the script takes flight.
  • Cluster Voyage: The second option invites you to embark on a journey to the slurm cluster. By setting the slurm.use_slurm configuration in config/main.yaml which takes boolean values, you can toggle between these options effortlessly.

Experience the freedom to choose your launch adventure, tailored to your needs and aspirations!

And just like that, you're all set to embark on your machine learning journey with UrarTU! ๐ŸŒŸ If you run into any hiccups along the way or have any suggestions, don't hesitate to open an issue for assistance.

Example Project

For a sample project see Getting Started Guide

Exploring the Experiments

Unveiling Insights with Ease! ๐Ÿ” UrarTU, pairs up with Aim, a remarkable open-source AI metadata tracker designed to be both intuitive and potent. To dive into the wealth of metrics that Aim effortlessly captures, simply follow these steps:

  • Navigate to the directory housing the .aim repository.
  • Execute the command that sparks the magic:
aim up

Behold as your experiments come to life with clarity and depth! Aim brings your data to the forefront, and with it, the power to make informed decisions and chart new territories in the realm of machine learning. ๐Ÿ“ˆ

About

๐Ÿฆ An open-source NLP framework that offers high-level wrappers designed for effortless launch, enhanced reproducibility, superior control, and unmatched flexibility for your experiments.

Topics

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages