信管·讲座 | Hash functions bridging the gap from theory to practice

教育 2024-12-05 20:15 上海

Time：Tuesday, Dec.10, 10:00--11:00
Venue: Room 308, ITCS

1. 主讲人介绍

Mikkel Thorup (born 1965) has a D.Phil. from Oxford University from 1993. From 1993 to 1998 he was at the University of Copenhagen. From 1998 to 2013 he was at AT&T Labs-Research. Since 2013 he has been back as Professor at the University of Copenhagen. He is currently a VILLUM Investigator heading Center for Basic Algorithms Research Copenhagen (BARC).Mikkel Thorup is a Fellow of the ACM and of AT&T, and a Member of the Royal Danish Academy of Sciences and Letters. He is co-winner of the 2011 MAA Robbins Award in mathematics and winner of the 2015 Villum Kann Rasmussen Award for Technical and Scientific Research, which is Denmark's biggest individual prize for research. More recently he was co-winner of the 2021 AMS-MOS Fulkerson Prize and an ACM STOC 20-year test of time award. His main work is in algorithms and data structures, where he has worked on both upper and lower bounds. Recently one of his main focusses has been on hash functions unifying theory and practice. Mikkel prefers to seek his mathematical inspiration in nature, combining the quest with his hobbies of bird watching and mushroom picking.

2. 讲座介绍

Title: Hash functions bridging the gap from theory to practice

Abstract: Randomized algorithms are often enjoyed for their simplicity, but the hash functions employed to yield the desired probabilistic guarantees are often too complicated to be practical. Hash functions are used everywhere in computing, e.g., hash tables, sketching, dimensionality reduction, sampling, and estimation. Many of these applications are relevant to Machine Learning, where we are often interested in similarity between high dimensional objects. Reducing the dimensionality is key to efficient processing. Abstractly, we like to think of hashing as fully-random hashing, assigning independent hash values to every possible key, but essentially this requires us to store the hash values for all keys, which is unrealistic for most key universes, e.g., 64-bit keys. In practice we have to settle for implementable hash functions, and often practitioners settle for implementations that are too simple in that the algorithms ends up working only for sufficiently random input. However, the real world is full of structured/non-random input. The issue is severe, for simplistic hash functions will often work very well in tests with random input. Moreover, the issue is often that error events that should never happen in practice, happen with way too high probability. This does not show in a few test, but will show up over time when you put the system into production. Over the last decade there has been major developments in simple to implement tabulation based hash functions offering strong theoretical guarantees, so as to support fundamental properties such as Chernoff bounds, Sparse Johnson-Lindenstrauss transforms, and fully-random hashing on a given set w.h.p. etc. I will discuss some of the principles of these developments and offer insights on how far we can bridge from theory (assuming fully-random hash functions) to practice (needing something that can actually implemented efficiently).

编审：唐志皓江波

上财信息

上海财经大学信息管理与工程学院官方新媒体平台，用于学院各类信息发布，欢迎关注！

“数智赋能”教学研讨系列活动 |人工智能时代的教学思考和实践

“数智赋能”教学研讨系列活动 | 用户中心化方法构建可信推荐系统

信息人的故事·窦露 | 行远自迩，登高博见

信管·讲座 | Security and Privacy of AI-based systems and...

信管·讲座 | Neural-Network Mixed Logit Choice Model...

2025年全国硕士研究生招生考试上海财经大学考点（代码：3109）考前提醒（一）

信管·讲座 | Active Learning of General Halfspaces: Label Queries..

信管·讲座 | Discrete Choice Modeling and Assortment Optimization...

青春信息 | 活动预告 · 寻找合伙人，沉浸体验商场沉浮

信管·讲座 | Towards Trustworthy and Responsible Large...

青春信息·十大歌手 | 信息管理与工程学院第一届十佳歌手决赛活动回顾

2025研考生请注意！11日开通《准考证》下载

信管·讲座 | Towards Robust and Efficient Large-Scale Stochastic

信管·新闻 | 我院获得2025年CCF中国数字金融大会承办权

逐梦数海智驭未来｜2022级大数据2班获评校“文明班级”提名奖

“智慧未来，引领出行” | 信息管理与工程学院学生党支部走进蔚来汽车

2024年校级文明班级|信息管理与工程学院2022级数据科学与大数据技术1班获评校“文明班级”

信管·喜报丨上财MEM学子在第八届上海市工程管理创新大赛中荣获一等奖

青春信息 | 信管学院十佳歌手决赛即将来袭

信管·讲座 | Hash functions bridging the gap from theory to practice

信息先锋·党章知识竞赛 | 活动回顾：七十五载风雨路，砥砺前行谱华章

信管·讲座 | Screening with Limited Information: A Dual Perspective

信管·讲座 | Incorporating LLMs for Effective and Efficient...

信管·讲座预告 | 【武东大讲坛第2期】证券公司大模型实践与证券行业探索前瞻

追寻三曾里足迹传承红色薪火 | 蒲公英先锋党支部赴中共三大后中央局机关历史纪念馆开展主题党课

经验分享•师生面对面 | 星灯冉冉，师生益谈—江敏祺老师座谈会回顾

青春信息·四大名著巡礼 | 纵谋三国叱风云活动回顾

育人于微服务于行 | “倾听·一站式”学生社区活动成功举办

“数智赋能”教研室活动系列 | 面向计算社会科学的教学设计

青春信息·冬日来信 | 时光信笺，冬日来信活动预告

信管·喜报 | 2021级信息管理与信息系统班团支部荣获2024年上海高校活力团支部

蓝色信息 | 蓝色信息创业创新基金期中汇报暨第九期立项答辩会成功举行

访企拓岗 | 信息管理与工程学院师生赴中国银行参访

信息人的故事·张俊 | 力学不倦，以信息管理赋能机械制造

经验分享•进博会︱进博潮头志愿影，服务浪尖青春行

论文指引 | 2024-2025学年第二学期MEM学位论文答辩工作指引

“数智赋能”教学研讨系列活动 | 数字财经的融合路径

信息先锋·党章知识竞赛 | 活动预告：七十五载风雨路，砥砺前行谱华章

听见心声唱响青春 | 信管学院十佳歌手大赛即将震撼开启

信管·讲座 | Building Generalizable Sequential Decision-Making...

经验分享•师生面对面 | 星灯冉冉，师生益谈——江敏祺老师座谈会预告

青春信息·四大名著巡礼｜西游×三国·破关克难突重围活动回顾

科技启航梦想｜N.O.P.E机器人协会与财大附小机器人互动体验课活动圆满结束

信息战报·校运会｜信息学子志千里，运动健儿梦今朝

信息学联 | 受聘大会：新力加盟启征程，继往开来谱新篇

青春信息·四大名著巡礼 | 三国·纵谋三国叱风云

信管·论文指引 | 2024-2025学年第二学期学术型硕士学位论文答辩工作指引

信管·论文指引 | 2024-2025学年第二学期博士学位论文答辩工作指引

信息先锋·BBWALK11.0 | “博学慎思，笃行明志”活动总结

分类

时事

民生

政务

教育

文化

科技

财富

体娱

健康

情感

旅行

百科

职场

楼市

企业

乐活

学术

汽车

时尚

创业

美食

幽默

美体

文摘

原创标签

时事社会财经军事教育体育科技汽车科学房产搞笑综艺明星音乐动漫游戏时尚健康旅游美食生活摄影宠物职场育儿情感小说曲艺文化历史三农文学娱乐电影视频图片新闻宗教电视剧纪录片广告创意壁纸头像心灵鸡汤星座命理教育培训艺术文化金融财经健康医疗美妆时尚餐饮美食母婴育儿社会新闻工业农业时事政治星座占卜幽默笑话独立短篇连载作品文化历史科技互联网

发布位置

广东北京山东江苏河南浙江山西福建河北上海四川陕西湖南安徽湖北内蒙古江西云南广西甘肃辽宁黑龙江贵州新疆重庆吉林天津海南青海宁夏西藏香港澳门台湾美国加拿大澳大利亚日本新加坡英国西班牙新西兰韩国泰国法国德国意大利缅甸菲律宾马来西亚越南荷兰柬埔寨俄罗斯巴西智利卢森堡芬兰瑞典比利时瑞士土耳其斐济挪威朝鲜尼日利亚阿根廷匈牙利爱尔兰印度老挝葡萄牙乌克兰印度尼西亚哈萨克斯坦塔吉克斯坦希腊南非蒙古奥地利肯尼亚加纳丹麦津巴布韦埃及坦桑尼亚捷克阿联酋安哥拉