

Prof. Nakayama has conducted extensive research in both computer vision and natural language processing. His expertise lies in multimodal deep learning, particularly in transparently linking various types of data, such as images and text, to perform recognition, understanding, and generation. Recently, he has been actively working on the development and application of multimodal large language models. To date, he has published over 40 papers at top-tier international conferences, such as ICLR and AAAI (Machine Learning/AI), CVPR, ICCV, and ECCV (Computer Vision), and ACL, EMNLP, and NAACL (Natural Language Processing), all ranked as CORE A*/A. In addition, he has served in key roles such as Area Chair, Senior Area Chair, and Editor for the review systems of these top conferences and journals, establishing himself as a leading researcher in the field both domestically and internationally. Below, we will discuss some of his research outcomes that are particularly relevant to this project.
We are part of the University of Tokyo’s Graduate School of Information Science and Technology, Department of Creative Informatics and focuses on computer networks and cyber-physical systems
Address
4F, I-REF building, Graduate School of Information Science and Technology, The University of Tokyo, 1-1-1, Yayoi, Bunkyo-ku, Tokyo, 113-8657 Japan
Room 91B1, Bld 2 of Engineering Department, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo 113-8656, Japan
Mail: