讲座 | 三维场景的标签高效学习和细粒度感知

文摘   2024-08-07 11:21   广东  

报告人:Yuexin Ma

主持人:Ruizhen Hu

日期:2024.08.09

时间:9:30am

地点:深圳大学致真楼801


Abstract



Recent advancements in foundation models have spurred AI development to new heights. However, constructing 3D vision foundation models is challenging due to the high cost of acquiring and annotating 3D data. 


To tackle this, we've approached the problem from two angles. First, we've developed label-efficient learning algorithms tailored for 3D scenes. These algorithms excel in unsupervised learning, domain adaptation, label-free learning, and open vocabulary learning tasks. Second, we've focused on fine-grained understanding for human-centric scenes. We proposed several large-scale datasets and benchmarks for understanding dense crowds, human-object interactions, and human motions. These efforts are significant for building 3D vision foundation models and are crucial for applications like autonomous driving, service robots, and human-robot collaboration.



Bio



Yuexin Ma is an Assistant Professor in SIST, ShanghaiTech University. She received the PhD degree from the University of Hong Kong. Her research interests include computer vision and artificial intelligence. Particularly, her current research focuses on 3D scene understanding, multi-modal learning, autonomous driving, embodied AI, etc. 


She has published more than 60 papers on top journals and conferences, including Science Robotics, TPAMI, CVPR, ICCV, ECCV, AAAI, SIGGRAPH, etc., which have obtained more than 3000 citations. Her first-author paper had been awarded as one of the most influential AAAI-19 papers. More information can be found in her homepage (http://yuexinma.me/ ).



深圳大学可视计算研究中心
Visual Computing Research Center
----------------------------------
https://vcc.tech


中心以计算机图形学、计算机视觉、可视化、机器人、人工智能、人机交互为学科基础,致力促进多个学科的深入交叉与集成创新,重点推进大规模静动态数据获取与优化融合、多尺度几何建模与图像处理、可视内容生成与仿真渲染、复杂场景重建与识别理解、三维移动协同感知与人机交互、智能模拟学习与强化认知、海量信息可视化与可视分析等方面的科学研究。

📫
转载及合作:szuvcc@gmail.com

深圳大学可视计算研究中心
深圳大学可视计算研究中心致力于大力提升可视计算科学研究与高等教育水平,以计算机图形学、计算机视觉、人机交互、机器学习、机器人、可视化和可视分析为学科基础,促进多个学科的深入交叉和集成创新。详见官网: vcc.tech
 最新文章