3DMM-GAN: Multi-Modal Alignment With Adversarial Learning for Compositional 3D Human Image Synthesis
Abstract: Current 3D-aware Generative Adversarial Networks (GANs) struggle to produce high-quality human images due to their limited ability to effectively integrate multi-modal information, ...
Abstract: Multi-label image classification, which involves recognizing multiple objects within a single image, is a fundamental task in computer vision. Recently, Visual-Language Models (VLMs) have ...