Project's information
Project's title | Study on Human Visual Tracking Core Technology based on Multi-modal Deep Neural Network |
Project’s code | QTKR01.01/20-21 |
Research hosting institution | Institute of Information Technology |
Coordinating unit, co-chair | Chosun University, Korea |
Project leader’s name | Dr. Dung Nguyen Duc |
Project duration | 01/06/2020 - 30/06/2022 |
Project’s budget | 200 million VND |
Classify | Fair |
Goal and objectives of the project | Study on Human Visual Tracking Core Technology based on Multi-modal Deep Neural Network. |
Main results | Research: 01 Publication |
Novelty and actuality and scientific meaningfulness of the results | We introduce a novel generative model called Attention ResCUNet-GAN to generate complete facial UV maps, which allows us to synthesize various faces of arbitrary poses and improve pose-invariant face recognition performance. We leverage the residual connections in ResNet, intra-block and extra-block feature fusion in coupled UNets to enhance the generator. The skip connections within each U-Net are amplifed with attention gates, while the contextual feature maps from two U-Nets are fused with trainable scalar weights. We jointly train global and local adversarial losses with identity preserving loss. The experiments show that the proposed Attention ResCUNet-GAN outperforms the original UV-GAN by order of magnitude in terms of both reconstruction metrics and the performance on the pose-invariant face verifcation task. We also developed a method based on the CenterNet model to detect and track people. The model is trained on an extensive dataset combining the benchmarks and self-collected images. |
Products of the project | - Publications: |
Recommendations | That would be much appreciated if the leaders of Vietnam Academy of Science and Technology make consideration, evaluation and acceptance for the research results. |
Images of project | |