Learning a Contextualized Multimodal Embedding for Zero-shot Cooking Video Caption Generation
| Title: | Learning a Contextualized Multimodal Embedding for Zero-shot Cooking Video Caption Generation |
|---|---|
| Authors: | wang, lin; Zhang, Hongyi; wang, xingfu; xiong, yan |
| Source: | Proceedings of the 5th ACM International Conference on Multimedia in Asia. :1-8 |
| Availability: | http://dl.acm.org/doi/10.1145/3595916.3626413 |
| Database: | ACM Full-Text Collection |