biniou is a self-hosted webui for several kinds of GenAI (generative artificial intelligence). You can generate multimedia content with AI and use a chatbot on your own computer, even without a dedicated GPU and with as little as 8GB of RAM. It can work offline (once deployed and the required models downloaded).
GNU/Linux [ OpenSUSE base | RHEL base | Debian base ] • Windows • macOS Intel (experimental) • Docker
Documentation ❓ | Showroom 🖼️
-
🆕 2024-09-22: 🔥 Enhancement of models lists for some modules 🔥 > Handling of local models (manually downloaded .safetensors and .gguf models) has been modified for Stable Diffusion-based modules, the Chatbot module and LoRA models. These models are now listed at the bottom of the models lists, in the "Local models" category, instead of at the top.
-
🆕 2024-09-21: 🔥 This week's updates 🔥 >
- Support for High-end Chatbot model bartowski/Mistral-Small-Instruct-2409-GGUF.
- Support for SDXL Fast models stablediffusionapi/dream-diffusion-lightning, John6666/comradeship-xl-v9a-spo-dpo-flash-sdxl and SDXL anime model GraydientPlatformAPI/sanae-xl to Stable Diffusion-based module.
- Support for SD3 Fast LoRA model ByteDance/Hyper-SD. You can now generate images with SD3 using only 4 steps instead of 20!
- Support for SDXL LoRA model GraydientPlatformAPI/spiderman-sdxl.
- Add support for multiple LoRAs (up to 5) to the LCM module. All eligible modules can now use multiple LoRAs.
- Bugfix in multi-LoRA handling for incompatible LoRA models.
-
🆕 2024-09-14: 🔥 This week's updates 🔥 >
- Support for Chatbot models mradermacher/reflection-llama-3.1-8B-Solshine-Full-GGUF and bartowski/Nemotron-Mini-4B-Instruct-GGUF to Chatbot module.
- Support for SDXL model misri/juggernautXL_juggXIByRundiffusion to Stable Diffusion-based module.
- Add support for multiple LoRAs (up to 5) to Img2img, IP-Adapter and ControlNet modules.
- Bugfixes for metadata in the Photobooth module, an incorrect LoRA models cache dir location, and an immutable safety checker with local safetensors models in image modules.
-
🆕 2024-09-08: 🔥 New Windows installer and multi-LoRA for the Photobooth module 🔥 >
- Thanks to the contributions of @natlamir, Windows users now have access to a faster and more reliable Windows installer. It works for both the executable and .cmd installers.
- Multiple LoRA models are now supported in the Photobooth module. You can now add up to 5 LoRA models to your generation settings.
-
🆕 2024-09-07: 🔥 This week's updates 🔥 >
- Update of Tiny model bartowski/Phi-3.5-mini-instruct-GGUF to Phi-3.5-mini-instruct_Uncensored-GGUF for the Chatbot module.
- Update of SDXL family model RealvisXL to SG161222/RealVisXL_V5.0 and of fast SDXL model SG161222/RealVisXL_V5.0_Lightning in the Stable Diffusion-based module.
- Add support for SDXL model GraydientPlatformAPI/flashback-xl to the Stable Diffusion-based module.
- Add support for Fast SDXL LoRA model GraydientPlatformAPI/lightning-faster-lora to the Stable Diffusion-based module.
- Add support for multiple LoRAs (up to 5!) in the Stable Diffusion module. This will be extended to other eligible modules after further testing. Please note that only the first LoRA model list displays Fast LoRA models, as there is no reason to use several of them.
- Bugfixes for CUDA optimization in the Chatbot and Llava modules, thanks to @natlamir.
- Workaround for installation of biniou on Apple silicon, thanks to @lepicodon.
-
🆕 2024-08-31: 🔥 This week's updates 🔥 >
- Add support for models bartowski/Llama-3.1-Storm-8B-GGUF and bartowski/xLAM-7b-r-GGUF to the Chatbot module.
- Add support for SD 1.5 model Yntec/VisionVision, SDXL anime model stablediffusionapi/anime-journey-v2 and fast SDXL model GraydientPlatformAPI/lustify-lightning to the Stable Diffusion-based module.
- Add support for SDXL LoRA models goofyai/disney_style_xl, goofyai/cyborg_style_xl and goofyai/Leonardo_Ai_Style_Illustration to the Stable Diffusion-based module.
- Add support for a custom install path in install_win.cmd.
- Bugfixes and typo fixes in various modules: thanks to @trolley813, @vincent0408 and @eltociear.
• Features
• Prerequisites
• Installation
    GNU/Linux
        OpenSUSE Leap 15.5 / OpenSUSE Tumbleweed
        Rocky 9.3 / Alma 9.3 / CentOS Stream 9 / Fedora 39
        Debian 12 / Ubuntu 22.04.3 / Ubuntu 24.04 / Linux Mint 21.2
    Windows 10 / Windows 11
    macOS Intel Homebrew install
    Dockerfile
• CUDA support
• How To Use
• Good to know
• Credits
• License
-
Text generation using:
- ✍️ llama-cpp based chatbot module (uses .gguf models)
- 👁️ Llava multimodal chatbot module (uses .gguf models)
- 👁️ Microsoft GIT image captioning module
- 👂 Whisper speech-to-text module
- 👥 nllb translation module (200 languages)
- 📝 Prompt generator (requires 16GB+ RAM for ChatGPT output type)
-
Image generation and modification using:
- 🖼️ Stable Diffusion module
- 🖼️ Kandinsky module (requires 16GB+ RAM)
- 🖼️ Latent Consistency Models module
- 🖼️ Midjourney-mini module
- 🖼️ PixArt-Alpha module
- 🖌️ Stable Diffusion Img2img module
- 🖌️ IP-Adapter module
- 🖼️ Stable Diffusion Image variation module (requires 16GB+ RAM)
- 🖌️ Instruct Pix2Pix module
- 🖌️ MagicMix module
- 🖌️ Stable Diffusion Inpaint module
- 🖌️ Fantasy Studio Paint by Example module (requires 16GB+ RAM)
- 🖌️ Stable Diffusion Outpaint module (requires 16GB+ RAM)
- 🖼️ Stable Diffusion ControlNet module
- 🖼️ Photobooth module
- 🎭 Insight Face faceswapping module
- 🔎 Real ESRGAN upscaler module
- 🔎 GFPGAN face restoration module
-
Audio generation using:
- 🎶 MusicGen module
- 🎶 MusicGen Melody module (requires 16GB+ RAM)
- 🎶 MusicLDM module
- 🔊 Audiogen module (requires 16GB+ RAM)
- 🔊 Harmonai module
- 🗣️ Bark module
-
Video generation and modification using:
- 📼 Modelscope module (requires 16GB+ RAM)
- 📼 Text2Video-Zero module
- 📼 AnimateDiff module (requires 16GB+ RAM)
- 📼 Stable Video Diffusion module (requires 16GB+ RAM)
- 🖌️ Video Instruct-Pix2Pix module (requires 16GB+ RAM)
-
3D object generation using:
- 🧊 Shap-E txt2shape module
- 🧊 Shap-E img2shape module (requires 16GB+ RAM)
-
Other features
- Zeroconf installation through one-click installers or the Windows exe.
- User-friendly: everything required to run biniou is installed automatically, either at install time or at first use.
- WebUI in English, French and Chinese (traditional).
- Easy management through a control panel directly inside the webui: update, restart, shutdown, activate authentication, control network access or share your instance online with a single click.
- Easy management of models through a simple interface.
- Communication between modules: send an output as an input to another module.
- Powered by 🤗 Huggingface and Gradio.
- Cross-platform: GNU/Linux, Windows 10/11 and macOS (experimental, via Homebrew).
- Convenient Dockerfile for cloud instances.
- Generation settings saved as metadata in each content.
- Support for CUDA (see CUDA support).
- Experimental support for ROCm (see here).
- Support for Stable Diffusion SD-1.5, SD-2.1, SD-Turbo, SDXL, SDXL-Turbo, SDXL-Lightning, Hyper-SD, Stable Diffusion 3, LCM, VegaRT, Segmind, Playground-v2, Koala, Pixart-Alpha, Pixart-Sigma, Kandinsky and compatible models, through a built-in model list or standalone .safetensors files.
- Support for LoRA models.
- Support for textual inversion.
- Support for llama-cpp-python optimizations (CUDA, OpenBLAS, OpenCL BLAS, ROCm and Vulkan) through a simple setting (see the sketch after this list).
- Support for Llama/2/3, Mistral, Mixtral and compatible GGUF quantized models, through a built-in model list or standalone .gguf files.
- Easy copy/paste integration for TheBloke GGUF quantized models.
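For context, activating one of these optimizations essentially means reinstalling llama-cpp-python with the matching compilation backend. biniou handles this through its setting; a hand-rolled equivalent would look roughly like the sketch below, using the CMAKE_ARGS flags documented by llama-cpp-python (flag names vary between llama-cpp-python versions, so treat these as an assumption rather than biniou's exact internals):

# Pick ONE backend; each command rebuilds llama-cpp-python with that backend enabled
CMAKE_ARGS="-DGGML_CUDA=on" pip install llama-cpp-python --force-reinstall --no-cache-dir      # CUDA
CMAKE_ARGS="-DGGML_BLAS=ON -DGGML_BLAS_VENDOR=OpenBLAS" pip install llama-cpp-python --force-reinstall --no-cache-dir   # OpenBLAS
CMAKE_ARGS="-DGGML_HIPBLAS=on" pip install llama-cpp-python --force-reinstall --no-cache-dir   # ROCm
CMAKE_ARGS="-DGGML_VULKAN=on" pip install llama-cpp-python --force-reinstall --no-cache-dir    # Vulkan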
-
Minimal hardware:
- 64-bit CPU (AMD64 architecture ONLY)
- 8GB RAM
- Storage requirements:
  - for GNU/Linux: at least 20GB for installation without models.
  - for Windows: at least 30GB for installation without models.
  - for macOS: at least ??GB for installation without models.
- Storage type: HDD
- Internet access (required only for installation and model downloads): unlimited-bandwidth optical fiber internet access
-
Recommended hardware:
- Massively multicore 64-bit CPU (AMD64 architecture ONLY) and a GPU compatible with CUDA or ROCm
- 16GB+ RAM
- Storage requirements:
  - for GNU/Linux: around 200GB for installation including all default models.
  - for Windows: around 200GB for installation including all default models.
  - for macOS: around ??GB for installation including all default models.
- Storage type: NVMe SSD
- Internet access (required only for installation and model downloads): unlimited-bandwidth optical fiber internet access
-
Operating system:
- A 64-bit OS:
  - Debian 12
  - Ubuntu 22.04.3 / 24.04
  - Linux Mint 21.2+ / 22
  - Rocky 9.3
  - Alma 9.3
  - CentOS Stream 9
  - Fedora 39
  - OpenSUSE Leap 15.5
  - OpenSUSE Tumbleweed
  - Windows 10 22H2
  - Windows 11 22H2
  - macOS ???
Note: biniou supports CUDA and ROCm but does not require a dedicated GPU to run. You can install it in a virtual machine.
- Copy/paste and execute the following command in a terminal:
sh <(curl https://raw.githubusercontent.com/Woolverine94/biniou/main/oci-opensuse.sh || wget -O - https://raw.githubusercontent.com/Woolverine94/biniou/main/oci-opensuse.sh)
- Copy/paste and execute the following command in a terminal:
sh <(curl https://raw.githubusercontent.com/Woolverine94/biniou/main/oci-rhel.sh || wget -O - https://raw.githubusercontent.com/Woolverine94/biniou/main/oci-rhel.sh)
- Copy/paste and execute the following command in a terminal:
sh <(curl https://raw.githubusercontent.com/Woolverine94/biniou/main/oci-debian.sh || wget -O - https://raw.githubusercontent.com/Woolverine94/biniou/main/oci-debian.sh)
- Install the prerequisites as root:
apt install git pip python3 python3-venv gcc perl make ffmpeg openssl
- Clone this repository as user:
git clone https://github.com/Woolverine94/biniou.git
- Launch the installer:
cd ./biniou
./install.sh
- (optional, but highly recommended) Install TCMalloc as root to optimize memory management:
apt install google-perftools
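To verify that the TCMalloc library is actually available once the package is installed, you can query the dynamic linker cache; a minimal check, assuming the standard Debian/Ubuntu library naming:
ldconfig -p | grep -i tcmalloc   # should list libtcmalloc.so.* if the install succeeded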
Windows installation has more prerequisites than the GNU/Linux one, and requires the following software (which will be installed automatically):
- Git
- Python 3.11 (and specifically version 3.11)
- OpenSSL
- Visual Studio Build tools
- Windows 10/11 SDK
- Vcredist
- ffmpeg
- ... and all their dependencies.
That's a lot of changes to your operating system, and it could potentially bring unwanted behaviors, depending on which software is already installed on it.
- Download and execute: biniou_netinstall.exe
OR
- Download and execute: install_win.cmd (right-click on the link and select "Save Target/Link as ..." to download)
The whole installation is automated, but Windows UAC will ask for confirmation of each piece of software installed during the "prerequisites" phase. You can avoid this by running the chosen installer as administrator.
To install biniou to a custom directory with install_win.cmd, proceed as follows:
- Download and edit install_win.cmd
- Modify set DEFAULT_BINIOU_DIR="%userprofile%" to set DEFAULT_BINIOU_DIR="E:\datas\somedir" (for example)
- Only use an absolute path (e.g. E:\datas\somedir and not .\datas\somedir)
- Don't add a trailing slash (e.g. E:\datas\somedir and not E:\datas\somedir\)
- Don't add a "biniou" suffix to your path (e.g. not E:\datas\somedir\biniou), as the biniou directory will be created by the git clone command
- Save and launch install_win.cmd
- Install Homebrew for your operating system
- Install the required Homebrew "bottles":
brew install git python3 gcc gcc@11 perl make ffmpeg openssl
- Install python virtualenv:
python3 -m pip install virtualenv
- Clone this repository as user:
git clone https://github.com/Woolverine94/biniou.git
- Launch the installer:
cd ./biniou
./install.sh
These instructions assume that you already have a configured and working Docker environment.
- Create the docker image:
docker build -t biniou https://github.com/Woolverine94/biniou.git
or, for CUDA support:
docker build -t biniou https://raw.githubusercontent.com/Woolverine94/biniou/main/CUDA/Dockerfile
- Launch the container:
docker run -it --restart=always -p 7860:7860 \
-v biniou_outputs:/home/biniou/biniou/outputs \
-v biniou_models:/home/biniou/biniou/models \
-v biniou_cache:/home/biniou/.cache/huggingface \
-v biniou_gfpgan:/home/biniou/biniou/gfpgan \
biniou:latest
or, for CUDA support:
docker run -it --gpus all --restart=always -p 7860:7860 \
-v biniou_outputs:/home/biniou/biniou/outputs \
-v biniou_models:/home/biniou/biniou/models \
-v biniou_cache:/home/biniou/.cache/huggingface \
-v biniou_gfpgan:/home/biniou/biniou/gfpgan \
biniou:latest
- Access the webui at the URL:
https://127.0.0.1:7860 or https://127.0.0.1:7860/?__theme=dark for dark theme (recommended)
... or replace 127.0.0.1 with the IP address of your container
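If you need that container IP address, docker inspect can print it directly; for example:
docker inspect -f '{{range .NetworkSettings.Networks}}{{.IPAddress}}{{end}}' <container_name_or_id>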
Note: to save storage space, the previous container launch commands define common shared volumes for all biniou containers and ensure that the container auto-restarts in case of an OOM crash. Remove the --restart and -v arguments if you don't want these behaviors.
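Conversely, if you prefer an ephemeral container with neither shared volumes nor auto-restart, the launch command reduces to the minimal form below (downloaded models and generated content then stay inside the container):
docker run -it -p 7860:7860 biniou:latest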
biniou is natively CPU-only, to ensure compatibility with a wide range of hardware, but you can easily activate CUDA support through Nvidia CUDA (if you have a functional CUDA 12.1 environment) or AMD ROCm (if you have a functional ROCm 5.6 environment) by selecting the type of optimization to activate (CPU, CUDA or ROCm for Linux) in the WebUI control module.
Currently, all modules except the Chatbot, Llava and faceswap modules can benefit from CUDA optimization.
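Before switching the optimization type, you can sanity-check that your CUDA environment is actually functional. A quick check, assuming biniou's Python virtual environment was created in ./env inside the biniou directory (the venv path is an assumption):
nvidia-smi                       # GPU and driver visibility
. ./env/bin/activate             # enter biniou's venv (path assumed)
python3 -c "import torch; print(torch.cuda.is_available(), torch.version.cuda)"
Note that torch.cuda.is_available() only returns True once a CUDA-enabled PyTorch build is installed, which is what the optimization setting takes care of.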
- Launch by executing from the biniou directory:
- for GNU/Linux:
cd /home/$USER/biniou
./webui.sh
- for Windows:
Double-click webui.cmd in the biniou directory (C:\Users\%username%\biniou\). When asked by the UAC, configure the firewall according to your network type to authorize access to the webui.
Note: the first start can be very slow on Windows 11 (compared to other OSes).
-
Access the webui at the URL:
https://127.0.0.1:7860 or https://127.0.0.1:7860/?__theme=dark for dark theme (recommended)
You can also access biniou from any device (including smartphones) on the same LAN/Wi-Fi network by replacing 127.0.0.1 in the URL with the biniou host's IP address.
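To find that host IP address on GNU/Linux, hostname -I prints every address assigned to the host; use the LAN one (e.g. 192.168.x.x) in the URL:
hostname -I
-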
Quit by using the keyboard shortcut CTRL+C in the Terminal
-
Update this application (biniou + python virtual environment) by using the update options of the WebUI control module.
-
The most frequent cause of crashes is not enough memory on the host. The symptom is biniou closing and returning to/closing the terminal without a specific error message. You can use biniou with 8GB RAM, but at least 16GB is recommended to avoid OOM (out of memory) errors.
-
biniou uses a lot of different AI models, which require a lot of space: if you want to use all the modules in biniou, you will need around 200GB of disk space just for the default model of each module. Models are downloaded on the first run of each module, or when you select a new model in a module and generate content. Models are stored in the /models directory of the biniou installation. Unused models can be deleted to save space.
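To see how much space each downloaded model actually occupies, you can inspect that directory from the biniou installation directory; for example:
du -sh ./models/* | sort -rh   # per-model disk usage, largest first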
-
... consequently, you will need fast internet access to download the models.
-
A backup of every generated content is available inside the /outputs directory of the biniou folder.
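If you want to copy that generated content somewhere safer, a simple rsync one-liner is enough; a sketch, assuming biniou lives in your home directory and /path/to/backup is an example destination:
rsync -av ~/biniou/outputs/ /path/to/backup/biniou-outputs/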
-
biniou natively relies only on the CPU for all operations, and uses a specific CPU-only version of PyTorch. The result is better compatibility with a wide range of hardware, but degraded performance. Depending on your hardware, expect slowness. See here for Nvidia CUDA support and experimental AMD ROCm support (GNU/Linux only).
-
Default settings are selected to permit generation of content on low-end computers, with the best performance/quality ratio. If you have a configuration above the minimal requirements, you can try using other models, increasing media dimensions or duration, or modifying inference parameters and other settings (like token merging for images) to obtain better-quality content.
-
biniou is licensed under GNU GPL3, but each model used in biniou has its own license. Please consult each model's license to know what you can and cannot do with it. For each model, you can find a link to its Huggingface page in the "About" section of the associated module.
-
Don't have too high expectations: biniou is at an early stage of development, and most of the open source software it uses is also under development (some of it still experimental).
-
Every biniou module offers 2 accordion elements, About and Settings:
- About is a quick help feature that describes the module and gives instructions and tips on how to use it.
- Settings is a settings panel specific to the module that lets you configure the generation parameters.
This application uses the following software and technologies:
- 🤗 Huggingface: Diffusers and Transformers libraries and almost all the generative models.
- Gradio: webUI
- llama-cpp-python: python bindings for llama-cpp
- Llava
- BakLLava
- Microsoft GIT: Image2text
- Whisper: speech2text
- nllb translation: language translation
- Stable Diffusion: txt2img, img2img, Image variation, inpaint, ControlNet, Text2Video-Zero, img2vid
- Kandinsky: txt2img
- Latent consistency models: txt2img
- PixArt-Alpha: PixArt-Alpha
- IP-Adapter: IP-Adapter img2img
- Instruct pix2pix: pix2pix
- MagicMix: MagicMix
- Fantasy Studio Paint by Example: paintbyex
- Controlnet Auxiliary models: preview models for ControlNet module
- IP-Adapter FaceID: Adapter model for Photobooth module
- Photomaker: Adapter model for Photobooth module
- Insight Face: faceswapping
- Real ESRGAN: upscaler
- GFPGAN: face restoration
- Audiocraft: musicgen, musicgen melody, audiogen
- MusicLDM: MusicLDM
- Harmonai: harmonai
- Bark: text2speech
- Modelscope text-to-video-synthesis: txt2vid
- AnimateLCM: txt2vid
- Open AI Shap-E: txt2shape, img2shape
- compel: Prompt enhancement for various StableDiffusionPipeline-based modules
- tomesd: Token merging for various StableDiffusionPipeline-based modules
- Python
- PyTorch
- Git
- ffmpeg
... and all their dependencies
GNU General Public License v3.0
GitHub @Woolverine94