Skip to content
@FoundationVision

FoundationVision

Hi there 👋

This is FoundationVision official website repo

Popular repositories Loading

  1. VAR VAR Public

    [NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-sim…

    Python 4.3k 314

  2. LlamaGen LlamaGen Public

    Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

    Python 1.3k 56

  3. GLEE GLEE Public

    [CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale

    Python 1.1k 85

  4. Groma Groma Public

    [ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization

    Python 564 61

  5. OmniTokenizer OmniTokenizer Public

    [NeurIPS 2024]OmniTokenizer: one model and one weight for image-video joint tokenization.

    Python 261 7

  6. UniRef UniRef Public

    [ICCV2023] Segment Every Reference Object in Spatial and Temporal Spaces

    Python 235 15

Repositories

Showing 9 of 9 repositories
  • GLEE Public

    [CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale

    FoundationVision/GLEE’s past year of commit activity
    Python 1,083 MIT 85 36 2 Updated Oct 21, 2024
  • VAR Public

    [NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

    FoundationVision/VAR’s past year of commit activity
    Python 4,268 MIT 314 30 0 Updated Oct 6, 2024
  • LlamaGen Public

    Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

    FoundationVision/LlamaGen’s past year of commit activity
    Python 1,323 MIT 56 48 0 Updated Aug 15, 2024
  • OmniTokenizer Public

    [NeurIPS 2024]OmniTokenizer: one model and one weight for image-video joint tokenization.

    FoundationVision/OmniTokenizer’s past year of commit activity
    Python 261 MIT 7 8 0 Updated Jul 9, 2024
  • vaex Public

    🔥stable, simple, state-of-the-art VQVAE toolkit & cookbook

    FoundationVision/vaex’s past year of commit activity
    Python 41 MIT 0 1 0 Updated Jun 23, 2024
  • Groma Public

    [ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization

    FoundationVision/Groma’s past year of commit activity
    Python 564 Apache-2.0 61 7 1 Updated Jun 7, 2024
  • GenerateU Public

    [CVPR2024] Generative Region-Language Pretraining for Open-Ended Object Detection

    FoundationVision/GenerateU’s past year of commit activity
    Python 139 6 12 0 Updated Mar 25, 2024
  • UniRef Public

    [ICCV2023] Segment Every Reference Object in Spatial and Temporal Spaces

    FoundationVision/UniRef’s past year of commit activity
    Python 235 MIT 15 4 0 Updated Jan 10, 2024
  • .github Public
    FoundationVision/.github’s past year of commit activity
    0 0 0 0 Updated Dec 16, 2023

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Python