Research

Generative models

Generative models, such as Generative Adversarial Networks (GANs), are a class of deep learning models designed to generate new, previously unseen data similar to the training data. 

This research enhances GANs by introducing a novel Parametric Mish (PMish) activation function, MMD-GAN repulsive loss incorporated neural architecture search strategy, and an adaptive rank decomposition method for network compression. The study improves GAN performance and stability with Modified MMD-GAN repulsive loss, demonstrating significant results on datasets like CIFAR-10, CIFAR-100, STL-10, and CelebA. Key contributions include the introduction of the PMish activation function, modified MMD-GAN rep loss, training strategy, and efficient model compression using ARD for deployment on resource-constrained devices.

MMD-AdversarialNAS is a framework that utilizes Neural Architecture Search (NAS) and Maximum Mean Discrepancy (MMD) repulsive loss to optimize GAN architectures for image generation while incorporating tensor decomposition to significantly reduce model complexity and storage footprint.

ProGAN PyTorch Implementation [Code]

ProGAN (Progressive Growing of GANs) is a method for training GANs that progressively increases the resolution of generated images, starting from low resolution and growing layers incrementally for better stability and image quality.

Human Action Recognition [Paper][Code]

Human Action Recognition seeks to automatically identify human actions from a series of observations (image frames), typically a video. The advent of Deep Neural Networks has significantly advanced the state-of-the-art in this field. Our research aims to enhance action recognition performance by leveraging data augmentation, specifically by generating synthetic videos using GANs.

Domain Adaptation

Domain adaptation is a technique used to adapt a model trained on one domain (aka, source domain) to another domain (aka, target domain) where the data distribution is different. This research aims to develop domain adaptation techniques.   

SPIE 2022: Unsupervised domain adaptation using feature aligned maximum classifier discrepancy [Paper]

Image Denoising

Image Denoising using Invertible Rescaling Network [Code]

An Invertible Rescaling Network (IRN) is trained to perform single-image denoising.