The IRMA Community
Newsletters
Research IRM
Click a keyword to search titles using our InfoSci-OnDemand powered search:
|
Multimodal Dance Generation Networks Based on Audio-Visual Analysis
Abstract
3D human dance generation from music is an interesting and challenging task in which the aim is to estimate 3D pose from visual and audio information. Existing methods only use skeleton information to complete this task, which may cause jittering results. In addition, due to lack of appropriate evaluation metrics for this task, it is difficult to evaluate the quality of the generated results. In this paper, the authors explore multi-modality dance generation networks through constructing the correspondence between the visual and the audio cues. Specifically, they propose a 2D prediction module to predict future frames by fusing visual and audio features. Moreover, they propose a 3D conversion module, which is able to generate the 3D skeleton from the 2D skeleton. In addition, some new human dance generation evaluation metrics are proposed to evaluate the quality of the generated results. Experimental results indicate that the proposed modules can meet the requirements of authenticity and diversity.
Related Content
Yasasi Abeysinghe, Bhanuka Mahanama, Gavindya Jayawardena, Yasith Jayawardana, Mohan Sunkara, Andrew T. Duchowski, Vikas Ashok, Sampath Jayarathna.
© 2024.
20 pages.
|
Chengxuan Huang, Evan Brock, Dalei Wu, Yu Liang.
© 2023.
23 pages.
|
Duleep Rathgamage Don, Jonathan Boardman, Sudhashree Sayenju, Ramazan Aygun, Yifan Zhang, Bill Franks, Sereres Johnston, George Lee, Dan Sullivan, Girish Modgil.
© 2023.
17 pages.
|
Wei-An Teng, Su-Ling Yeh, Homer H. Chen.
© 2023.
17 pages.
|
Hemanth Gudaparthi, Prudhviraj Naidu, Nan Niu.
© 2022.
20 pages.
|
Anchen Sun, Yudong Tao, Mei-Ling Shyu, Angela Blizzard, William Andrew Rothenberg, Dainelys Garcia, Jason F. Jent.
© 2022.
19 pages.
|
Suvojit Acharjee, Sheli Sinha Chaudhuri.
© 2022.
16 pages.
|
|
|