-
NVIDIA
- Midwest
- https://www.linkedin.com/in/brandonstuttle/
-
NeMo-Run Public
Forked from NVIDIA/NeMo-Run
A tool to configure, launch and manage your machine learning experiments.
Python Apache License 2.0 Updated Dec 18, 2024
-
DALI Public
Forked from NVIDIA/DALI
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
C++ Apache License 2.0 Updated Jun 17, 2024
-
gtc-2023-SE52140 Public
Developer Breakout - Accelerating Enterprise Workflows With Triton Server and DALI
-
cluster-maintenance Public
General maintenance for my home servers using Ansible
Python Updated Jan 20, 2024
-
facedetect Public
Detects one or more faces in the given image / video using NVIDIA Triton Inference Server
-
T5-TensorRT-LLM Public
T5 model on TensorRT-LLM & Triton Inference Server
-
triton-server-demo Public
A brief hands-on demo of how to use NVIDIA Triton Server for multiple models.
-
llm-continuous-batching-benchmarks Public
Forked from anyscale/llm-continuous-batching-benchmarks
Python Updated Jul 24, 2023
-
kubernetes-jupyterlab Public
A simple configuration for launching JupyterLab on Kubernetes.
Updated Jul 19, 2023
-
nemo-supervised-fine-tuning Public
Supervised Fine-Tuning (SFT) is the process of fine-tuning all of the model's parameters on supervised data of inputs and outputs that teaches the model how to follow user-specified instructions.
Shell Updated Jul 17, 2023
-
nvidia-megatron-on-triton Public
An end-to-end framework for training and deploying LLMs with billions and trillions of parameters. This example uses the publicly available 20 billion GPT variant.
-
terraform-oci-arch-redis Public
Forked from oracle-devrel/terraform-oci-arch-redis
terraform-oci-arch-redis
HCL Universal Permissive License v1.0 Updated Apr 1, 2023
-
terraform-oci-arch-postgresql Public
Forked from oracle-devrel/terraform-oci-arch-postgresql
Terraform module to deploy PostgreSQL on Oracle Cloud Infrastructure (OCI).
HCL Universal Permissive License v1.0 Updated Mar 30, 2023
-
t5-faster-transformer Public
My working repo for following the NVIDIA blog post originally written by Denis Timonin, Bo Yang Hsueh, Dhruv Singal and Vinh Nguyen
-
flair-on-nvidia-triton Public
Scripts to help deploy the Flair ner-english-fast model on Triton Server as a TorchScript model.
Jupyter Notebook Updated Jan 17, 2023
-
nvfuser Public
Testing the new, integrated compilers in PyTorch.
-
riva-speech-skills Public
State-of-the-art models, fully accelerated pipelines, and tools to easily add Speech AI capabilities to real-time applications like virtual assistants, call center agent assist, and video conferenc…
-
layoutlmv3-triton-server Public
An NVIDIA Triton Server workflow for OCR and the LayoutLMv3 Transformer Model
-
quickperf Public
A simple performance collection tool to determine the performance of a GPU-enabled environment.
Dockerfile Updated Aug 31, 2022
-
multi-node-k8s-ml Public
End-to-end deployment for multi-node training using GPU nodes on a Kubernetes cluster.
-
Kubernetes-WireGuard-Server Public
Helm based deployment for a WireGuard server on Kubernetes
-
TimeSeriesPredictionPlatform Public
Helm port of NVIDIA's TimeSeriesPredictionPlatform
-
steganography Public
A simple notebook demonstrating an image manipulation technique called steganography!
Python Updated Dec 28, 2021
-
generative-style-transfer Public
Generative style transfer with TensorFlow 2
Python Updated Dec 20, 2021