  • HUST(Huazhong University of Science and Technology)
  • Wuhan

pipixin321/README.md

Hi there 👋, I'm Huaxin Zhang

Huaxin Zhang github Google Scholar

I am a Master's student at HUST (Huazhong University of Science and Technology), supervised by Prof. Changxin Gao and Prof. Nong Sang.

🔭 Research-wise, I mainly focus on:

  • Multi-modal Large Language Models
  • Video Understanding, more specifically Weakly-supervised Temporal Action Localization (WSTAL) and Weakly-supervised Video Anomaly Detection (WSVAD).

😄 I am open to:

  • An internship, job, or PhD offer in computer vision or multimodal LLM research and engineering.

📫 Contact me by:

💬 News:

  • 2024-07-01: We released the code and model for "Holmes-VAD: Towards Unbiased and Explainable Video Anomaly Detection via Multi-modal LLM". [project page]
  • 2024-06-10: We released the code and model for "Arcana: Improving Multi-modal Large Language Model through Boosting Vision Capabilities". [project page]
  • 2024-01-29: I started my internship at Baidu VIS, doing research on Multi-modal Large Language Models (MLLMs).
  • 2023-12-09: One paper on point-supervised temporal action localization was accepted to AAAI 2024.

Huaxin's github stats

Pinned

  1. HolmesVAU (Public)

    ✨✨✨Official implementation of "Holmes-VAU: Towards Long-term Video Anomaly Understanding at Any Granularity"

    Python 6

  2. HolmesVAD (Public)

    Official implementation of "Holmes-VAD: Towards Unbiased and Explainable Video Anomaly Detection via Multi-modal LLM"

    Python 88 3

  3. Awesome-Video-MLLMs (Public)

    🔥 🔥 🔥 Awesome MLLMs/Benchmarks for Short/Long/Streaming Video Understanding 📹

    2

  4. Arcana (Public)

    Forked from syp2ysy/Arcana

    Implementation of "Arcana: Improving Multi-modal Large Language Model through Boosting Vision Capabilities"

    Python

  5. HR-Pro (Public)

    [AAAI24] Official implementation of "Point-supervised Temporal Action Localization via Hierarchical Reliability Propagation"

    Python 27 1

  6. GlanceVAD (Public)

    Official implementation of "GlanceVAD: Exploring Glance Supervision for Label-efficient Video Anomaly Detection"

    Python 21