DSA研讨会 | Exploring Trustworthy Foundation Models under Imperf...

文摘 2024-08-30 18:33 广东

DSA Thrust Seminar

Exploring Trustworthy Foundation Models under Imperfect Data

Abstract

In the current landscape of machine learning, it is crucial to build trustworthy foundation models that can operate under imperfect conditions, since most real-world data, such as unexpected inputs, image artifacts, and adversarial inputs, are easily noisy. These models need to possess human-like capabilities to learn and reason in uncertainty. In this talk, I will focus on three recent research advancements, each shedding light on the reliability, robustness, and safety in this field. Specifically, the reliability will be explored through the enhancement of vision-language models by introducing negative labels, which effectively detect out-of-distribution samples. Meanwhile, robustness will be explored through our investigation into image interpolation using diffusion models, addressing the challenge of information loss to ensure consistency and quality of generated content. Then, safety will be highlighted by our study on hypnotizing large language models, DeepInception, which leverages the creation of a novel nested scenario to induce adaptive jailbreak behaviors, revealing vulnerabilities during interactive model engagement. Furthermore, I will introduce the newly established Trustworthy Machine Learning and Reasoning (TMLR) Group at Hong Kong Baptist University.

Speaker

Prof

HAN

Bo Han is an Assistant Professor in Machine Learning at Hong Kong Baptist University and a BAIHO Visiting Scientist at RIKEN AIP, where his research focuses on machine learning, deep learning, foundation models and their applications. He was a Visiting Research Scholar at MBZUAI MLD, a Visiting Faculty Researcher at Microsoft Research and Alibaba DAMO Academy, and a Postdoc Fellow at RIKEN AIP. He has co-authored two machine learning monographs by MIT Press and Springer Nature. He has served as Senior Area Chair of NeurIPS, and Area Chairs of NeurIPS, ICML and ICLR. He has also served as Action Editors of IEEE TPAMI, MLJ and JAIR, and Editorial Board Members of JMLR and MLJ. He received Outstanding Paper Award at NeurIPS, Most Influential Paper at NeurIPS, Notable Area Chair at NeurIPS, Outstanding Area Chair at ICLR, and Outstanding Associate Editor at IEEE TNNLS. He received the RGC Early CAREER Scheme, NSFC General Program, IJCAI Early Career Spotlight, RIKEN BAIHO Award, Dean's Award for Outstanding Achievement, Microsoft Research StarTrack Program, and Faculty Research Awards from ByteDance, Baidu, Alibaba and Tencent.

Seminar Info

Time: Sep 4, Wed, 11:00-11:50 (UTC+8)

Venue: E4-102

更多数据科学与分析学域资讯，请见官网：

http://dsa.hkust-gz.edu.cn/

PhD项目咨询邮箱：dsarpg@hkust-gz.edu.cn

MPhil项目咨询邮箱：rbmadmit@hkust-gz.edu.cn

MSc项目咨询邮箱：mscdcai@hkust-gz.edu.cn

http://mp.weixin.qq.com/s?__biz=MzkyNDQxNTYyOA==&mid=2247490243&idx=1&sn=7e1249d54626c77ffb277f5b9f45177e

港科大广州 I 数据科学与分析

香港科技大学（广州）信息枢纽数据科学与分析学域官方公众平台 Data Science and Analytics Thrust-Information Hub- HKUST(GZ)

数据科学与分析学域谢泽柯教授获2024年度CCF-百度松果基金支持

DSA研讨会 | Trustworthy Online Learning for Networked Systems

香港科技大学（广州）超算队招新！

硕博宣讲| 2024硕博招生宣讲西安、兰州专场

DSA研讨会 | Generative principal component analysis and fast...

DSA研讨会 | Spatial Audio, Spatial Audio-Visual and Visual Learning

硕博宣讲 | 10月14-17日硕博士招生师生见面会西安、兰州专场

DSA研讨会 | Machine Learning for Real-Time Constrained...

DSA研讨会 | Towards Commercial Wi-Fi Integrated Sensing and...

活动回顾 | 数据科学与分析学域学生聚会 DSA Gathering

祝贺！数据科学与分析学域8位教授入选全球前2%顶尖科学家榜单

DSA研讨会 | Strategic Waiting for Disruption Forecasts in Cross-...

香港科技大学（广州）诚邀全球英才加盟

硕博宣讲| 信息枢纽招生宣讲上交&同济专场

DSA研讨会 | Interpretable Graph Neural Networks: From Robust GNN...

活动报名 | DSA Gathering-Stories, Connections and Fun!

PG招生简章丨港科大（广州）信息枢纽2025-26年硕博项目招生简章

硕博宣讲| 信息枢纽宣讲电子科技大&川大专场

论文回顾 | 港科广数据科学与分析学域24篇论文入选国际学术会议VLDB 2024 & KDD 2024

2024MGPIC大赛火热报名中！9月19日再次走进港科大（广州）线下宣讲，敬请期待！

报名启动 | CyberC 2024 大数据竞赛Big-Big Data Analytics Competition

DSA研讨会 | GOFA: A Generative One-For-All Model for Joint Graph...

硕博宣讲| 信息枢纽宣讲北航北邮专场

报名开启！IDEA研究院编程语言MoonBit全球编程创新挑战赛启动

DSA研讨会 | Exploring Trustworthy Foundation Models under Imperf...

Welcome to DSA！数据科学与分析学域博士迎新会精彩回顾

DSA学域本科专业选课小贴士

DSA研讨会 | Towards privacy-preserving distributed...

覆盖本科到博士所有阶段！数据科学与分析学域2024秋季课程抢先看！

DSA研讨会 | Efficient Deep Neural Architecture Design and Training

DSA研讨会 | Harnessing LLMs for Practical NL2SQL...

DSA研讨会 | Efficient Deep Neural Architecture Design and Training

论文回顾 | 港科广数据科学与分析学域12篇论文入选国际学术会议ICML 2024&ACL 2024

DSA学域计算机社团XCPC集训队招新进行中

数据科学与分析学域谢泽柯教授获“豆包大模型基金”支持

师资介绍 | 唐国明数据科学与分析学域助理教授

数据科学与分析学域骆昱宇教授获2024年度CCF-华为胡杨林基金数据库专项支持

数据科学与分析学域梁宇轩教授获2024年度CCF-滴滴盖亚学者科研基金项目支持

数据科学与分析学域梁宇轩教授入选2024年度CCF-腾讯犀牛鸟基金

活动回顾|2024博士夏令营数据科学与分析学域学术活动回顾

活动回顾 | 7月DSA学术沙龙成功举办

活动回顾 | DSA学域举办本科生科研项目进展分享会

活动报名 | DSA Salon-Research Milestones and Notable Progress

DSA研讨会 | Scalable Algorithms for Random-Walk Probability...

行政招聘 | Officer (Student Affairs, Publicity...)

KDD2024 | GCOPE：港科广数据科学与分析学域李佳教授团队联合港中文提出首个跨域图预训练框架

师资介绍 | 芦尚奇数据科学与分析学域助理教授

@高考生，想报考港科大（广州）的你，看这篇就够！

分类

时事

民生

政务

教育

文化

科技

财富

体娱

健康

情感

旅行

百科

职场

楼市

企业

乐活

学术

汽车

时尚

创业

美食

幽默

美体

文摘

原创标签

时事社会财经军事教育体育科技汽车科学房产搞笑综艺明星音乐动漫游戏时尚健康旅游美食生活摄影宠物职场育儿情感小说曲艺文化历史三农文学娱乐电影视频图片新闻宗教电视剧纪录片广告创意壁纸头像心灵鸡汤星座命理教育培训艺术文化金融财经健康医疗美妆时尚餐饮美食母婴育儿社会新闻工业农业时事政治星座占卜幽默笑话独立短篇连载作品文化历史科技互联网

发布位置

广东北京山东江苏河南浙江山西福建河北上海四川陕西湖南安徽湖北内蒙古江西云南广西甘肃辽宁黑龙江贵州新疆重庆吉林天津海南青海宁夏西藏香港澳门台湾美国加拿大澳大利亚日本新加坡英国西班牙新西兰韩国泰国法国德国意大利缅甸菲律宾马来西亚越南荷兰柬埔寨俄罗斯巴西智利卢森堡芬兰瑞典比利时瑞士土耳其斐济挪威朝鲜尼日利亚阿根廷匈牙利爱尔兰印度老挝葡萄牙乌克兰印度尼西亚哈萨克斯坦塔吉克斯坦希腊南非蒙古奥地利肯尼亚加纳丹麦津巴布韦埃及坦桑尼亚捷克阿联酋安哥拉