Stars
Official inference library for Mistral models
刷算法全靠套路,认准 labuladong 就够了!English version supported! Crack LeetCode, not only how, but also why.
Download scripts for EPIC-KITCHENS
Curated list of papers and resources focused on 3D Gaussian Splatting, intended to keep pace with the anticipated surge of research in the coming months.
[IJCV 2024] LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models
Official implementation of "LucidDreamer: Towards High-Fidelity Text-to-3D Generation via Interval Score Matching"
[CVPR 2024] GaussianEditor: Swift and Controllable 3D Editing with Gaussian Splatting
[ECCV2022] Gen6D: Generalizable Model-Free 6-DoF Object Pose Estimation from RGB Images
The implementation of the paper 'CheckerPose: Progressive Dense Keypoint Localization for Object Pose Estimation with Graph Neural Network' (ICCV2023).
Official implementation of our NeurIPS 2023 paper "Augmenting Language Models with Long-Term Memory".
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
Official repository of ICLR 2022 paper FILM: Following Instructions in Language with Modular Methods
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
🏘️ Scaling Embodied AI by Procedurally Generating Interactive 3D Houses
ALFRED - A Benchmark for Interpreting Grounded Instructions for Everyday Tasks
Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model
Official Task Suite Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts"
Official Algorithm Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts"
A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites
Project Page for "LISA: Reasoning Segmentation via Large Language Model"
An open-source framework for training large multimodal models.
Code for 3D-LLM: Injecting the 3D World into Large Language Models