CVPR Daily - Friday

Women in Computer Vision

Marcella Cornia is an Associate Professor at the University of Modena, Italy. She has been at this university for almost a decade.

Marcella, tell us about your work.

My research activities are mainly related to vision and language. I work on multimodal learning in general. During my PhD, I worked a lot on image captioning: I developed solutions to automatically describe an image in natural language. Because AI research has changed in the last couple of years, we now mainly focus on multimodal large language models, which are probably the state-of-the-art architectures in the vision and language literature.

Is it true that many vision people switch to language because of this?

Yeah, now probably 60-70% of the computer vision papers are related to multimodal LLMs. Many architectures are now based on language models. Even when we want to generate an image, basically
