Popular repositories

  1. Tune-A-Video Public

    [ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation

    Python, 4.2k stars, 385 forks

  2. Awesome-Video-Diffusion Public

    A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

    3.4k stars, 200 forks

  3. Show-1 Public

    Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation

    Python, 1.1k stars, 62 forks

  4. Show-o Public

    Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.

    Python, 1k stars, 44 forks

  5. MotionDirector Public

    [ECCV 2024 Oral] MotionDirector: Motion Customization of Text-to-Video Diffusion Models.

    Python, 841 stars, 50 forks

  6. Image2Paragraph Public

    [A toolbox for fun.] Transform Image into Unique Paragraph with ChatGPT, BLIP2, OFA, GRIT, Segment Anything, ControlNet.

    Python, 789 stars, 53 forks

Repositories

Showing 10 of 71 repositories
  • Awesome-Video-Diffusion Public

    A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

    3,407 200 0 0 Updated Nov 9, 2024
  • LOVA3 Public

    (NeurIPS 2024) Learning to Visual Question Answering, Asking and Assessment

    Python 63 1 0 0 Updated Nov 7, 2024
  • computer_use_ootb Public

    An out-of-the-box (OOTB) version of Anthropic Claude Computer Use

    Python 238 MIT 23 8 2 Updated Nov 7, 2024
  • Awesome-Unified-Multimodal-Models Public

    📖 This is a repository for organizing papers, codes and other resources related to unified multimodal models.

    205 8 0 0 Updated Nov 7, 2024
  • Show-o Public

    Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.

    Python 1,011 Apache-2.0 44 29 0 Updated Nov 5, 2024
  • VideoLISA Public

    [NeurIPS 2024] One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos

    25 0 2 0 Updated Nov 3, 2024
  • ShowUI Public
    2 0 0 0 Updated Oct 31, 2024
  • VisInContext Public

    Official implementation of Leveraging Visual Tokens for Extended Text Contexts in Multi-Modal Learning

    Python 11 1 1 0 Updated Oct 30, 2024
  • Exo2Ego-V Public
    2 0 0 0 Updated Oct 29, 2024
  • Awesome-GUI-Agent Public

    💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.

    178 10 0 0 Updated Oct 27, 2024
