CVPR Daily - Friday

15 DAILY CVPR Friday Marcella Cornia we do that by giving an input in natural language sentences. Should I change the name of the magazine to Natural Language News and maybe CVPR should change its name too? No, I don't think so. There are also a lot of problems that are based on computer vision and it is very important that the computer vision community focus on the visual part and the understanding of the visual components. Maybe a serious answer would be: it's not that vision people went to language, it's that vision and language converged in some way. Yeah, yeah, yeah, it's true. So now there is no more a very significant difference between the two fields. Natural language processing and computer vision, we are now very related somehow. How has the very strong wave of AI LLMs in the last couple of years changed your work? Oh, well, when I started a PhD, my research was related to language also at the beginning, but basically, we trained the architecture from scratch, so we didn't have a pretrained architecture, a pre-trained language model as a base of our solutions. Nowadays, many research efforts focus on starting from a pretrained language model and teaching it multimodal capabilities. I think the most significant change is the starting point itself. Also the size of the models changed a lot. The models that we developed at the beginning of my PhD were quite small in the size. Now we have large architectures that are also quite expensive to train and to use. I think that the most important change is the starting point somehow. Also the size of the models changed a lot. The models that we developed at the beginning of my PhD were quite small in the size. Now we have large architectures that are also quite expensive to train and use.

RkJQdWJsaXNoZXIy NTc3NzU=