Skip to content

Latest commit

 

History

History
34 lines (23 loc) · 963 Bytes

File metadata and controls

34 lines (23 loc) · 963 Bytes

DLCV Fall 2022

This repository contains my homework submissions for the "Deep Learning for Computer Vision" course at National Taiwan University (NTU), taught by Frank Wang.

Overview

HW1

1. Image Classification

  • Train a CNN model from scratch
  • Try alternative models/methods (e.g., fine-tune a pre-trained model)

2. Image Segmentation

  • VGG16 + FCN32s (baseline model)
  • Improved model

HW2

  • Image Generation

    • Problem 1: GAN [face dataset - CelebA]
    • Problem 2: Diffusion models [digit dataset - MNIST-M]
  • Unsupervised Domain Adaptation (UDA)

    • Problem 3: DANN [digit dataset - MNIST-M, SVHN and USPS]

HW3

  • Problem 1: Zero-shot image classification with CLIP

  • Problem 2: Image Captioning with Vision and Language Model

  • Problem 3: Visualization of Attention in Image Captioning

HW4

  • Problem 1: 3D Novel View Synthesis

  • Problem 2: Self-supervised Pre-training for Image Classification