Contents of Issue 12, 2024 | SCIENCE CHINA Information Sciences

Digest · Science · 2024-12-28 14:39 · Beijing
   
Vol. 67, No. 12, 2024

Special Topic: Large Multimodal Models 



How far are we to GPT-4V? Closing the gap to commercial multimodal models with open-source suites 

Zhe CHEN, Weiyun WANG, Hao TIAN, Shenglong YE, Zhangwei GAO, Erfei CUI, Wenwen TONG, Kongzhi HU, Jiapeng LUO, Zheng MA, Ji MA, Jiaqi WANG, Xiaoyi DONG, Hang YAN, Hewei GUO, Conghui HE, Botian SHI, Zhenjiang JIN, Chao XU, Bin WANG, Xingjian WEI, Wei LI, Wenjian ZHANG, Bo ZHANG, Pinlong CAI, Licheng WEN, Xiangchao YAN, Min DOU, Lewei LU, Xizhou ZHU, Tong LU, Dahua LIN, Yu QIAO, Jifeng DAI & Wenhai WANG

Sci China Inf Sci, 2024, 67(12):220101

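Joint team of Dahua LIN, Yu QIAO, and Jifeng DAI | How far are we from GPT-4V? Using open-source suites to close the gap with commercial multimodal models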


OCRBench: on the hidden mystery of OCR in large multimodal models 

Yuliang LIU, Zhang LI, Mingxin HUANG, Biao YANG, Wenwen YU, Chunyuan LI, Xu-Cheng YIN, Cheng-Lin LIU, Lianwen JIN & Xiang BAI

Sci China Inf Sci, 2024, 67(12):220102

OCRBench: the hidden mystery of OCR in large multimodal models


MMInstruct: a high-quality multi-modal instruction tuning dataset with extensive diversity

Yangzhou LIU, Yue CAO, Zhangwei GAO, Weiyun WANG, Zhe CHEN, Wenhai WANG, Hao TIAN, Lewei LU, Xizhou ZHU, Tong LU, Yu QIAO & Jifeng DAI

Sci China Inf Sci, 2024, 67(12):220103

970,000 instructions across 24 domains! MMInstruct: a high-quality multimodal instruction tuning dataset with extensive diversity


Woodpecker: hallucination correction for multimodal large language models

Shukang YIN, Chaoyou FU, Sirui ZHAO, Tong XU, Hao WANG, Dianbo SUI, Yunhang SHEN, Ke LI, Xing SUN & Enhong CHEN

Sci China Inf Sci, 2024, 67(12):220105

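Enhong CHEN's team at the University of Science and Technology of China | Woodpecker: a hallucination mitigation method for multimodal large language models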


DocPedia: unleashing the power of large multimodal model in the frequency domain for versatile document understanding 

Hao FENG, Qi LIU, Hao LIU, Jingqun TANG, Wengang ZHOU, Houqiang LI & Can HUANG

Sci China Inf Sci, 2024, 67(12):220106

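Joint team of Houqiang LI (University of Science and Technology of China) and ByteDance | DocPedia: a high-resolution large multimodal model for documents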


Modality-experts coordinated adaptation for large multimodal models 

Yan ZHANG, Zhong JI, Yanwei PANG, Jungong HAN & Xuelong LI

Sci China Inf Sci, 2024, 67(12):220107

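Yan ZHANG, Zhong JI, Yanwei PANG, Jungong HAN, Xuelong LI | A parameter-efficient fine-tuning method for large multimodal models based on modality-expert coordination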


COMET: “cone of experience” enhanced large multimodal model for mathematical problem generation

Sannyuya LIU, Jintian FENG, Zongkai YANG, Yawei LUO, Qian WAN, Xiaoxuan SHEN & Jianwen SUN

Sci China Inf Sci, 2024, 67(12):220108

Zongkai YANG's team at Central China Normal University | COMET: an education-domain multimodal model for mathematical problem generation


ChemDFM-X: towards large multimodal model for chemistry 

Zihan ZHAO, Bo CHEN, Jingpiao LI, Lu CHEN, Liyang WEN, Pengyu WANG, Zichen ZHU, Danyang ZHANG, Yansi LI, Zhongyang DAI, Xin CHEN & Kai YU

Sci China Inf Sci, 2024, 67(12):220109

Joint team of Shanghai Jiao Tong University and Suzhou Laboratory | ChemDFM-X: a cross-modal large model for chemistry and materials


REVIEW



Physical layer signal processing for XR communications and systems

Yongpeng WU, Mai XU, Guangtao ZHAI & Wenjun ZHANG

Sci China Inf Sci, 2024, 67(12):221301

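Yongpeng WU (Shanghai Jiao Tong University), Mai XU (Beihang University), et al. | Physical layer signal processing for XR communications and systems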


RESEARCH PAPER



MuxFlow: efficient GPU sharing in production-level clusters with more than 10000 GPUs 

Xuanzhe LIU, Yihao ZHAO, Shufan LIU, Xiang LI, Yibo ZHU, Xin LIU & Xin JIN

Sci China Inf Sci, 2024, 67(12):222101

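Xuanzhe LIU, Xin JIN, et al. (Peking University) | An efficient GPU sharing system for deep learning clusters with more than ten thousand GPUs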


An efficient binary programming method for black-box optimization and its application in processor design

Xiaoliang LV, Qiaozhu ZHAI, Jianchen HU, Yuhang ZHU, Jinhui LIU & Xiaohong GUAN

Sci China Inf Sci, 2024, 67(12):222102

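Team of Academician Xiaohong GUAN | A black-box optimization method based on 0-1 integer programming and its application in processor design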


A novel memetic algorithm for distributed shape formation of swarm robots with both acceleration and velocity constraints 

Yun QU, Bin XIN, Qinqin WANG, Ruocheng LI & Zhaofeng DU

Sci China Inf Sci, 2024, 67(12):222201

Bin XIN's team at Beijing Institute of Technology | A memetic algorithm for distributed shape formation of swarm robots


Graph-geometric message passing via a graph convolution transformer for FKP regression 

Huizhi ZHU, Wenxia XU, Jian HUANG & Baocheng YU

Sci China Inf Sci, 2024, 67(12):222202

Huizhi ZHU, Wenxia XU, Jian HUANG, et al. | A forward kinematics method for parallel robots based on a graph convolution Transformer


Time-varying formation tracking control of high-order multi-agent systems with multiple leaders and multiplicative noise

Ruru JIA, Xiaofeng ZONG & Qing WANG

Sci China Inf Sci, 2024, 67(12):222203

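Ruru JIA, Xiaofeng ZONG, Qing WANG | Time-varying formation tracking control of high-order multi-agent systems with multiple leaders and multiplicative noise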


Controllability of descriptor multi-agent systems with signed networks

Yu SHEN, Yongqiang GUAN & Ye TIAN

Sci China Inf Sci, 2024, 67(12):222204

Yu SHEN, Yongqiang GUAN, Ye TIAN | Controllability of descriptor multi-agent systems over signed networks


A distributed decomposition algorithm for solving large-scale mixed integer programming problem

Fangzheng TIAN, Hongzhe LIU & Wenwu YU

Sci China Inf Sci, 2024, 67(12):222205

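Wenwu YU's team at Southeast University | A distributed algorithm framework for solving large-scale mixed integer programming problems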


Learning continuous network emerging dynamics from scarce observations via data-adaptive stochastic processes

Jiaxu CUI, Qipeng WANG, Bingyi SUN, Jiming LIU & Bo YANG

Sci China Inf Sci, 2024, 67(12):222206


Neural Liénard system: learning periodic manipulation skills through dynamical systems

Haoyu ZHANG, Long CHENG, Yu ZHANG & Yifan WANG

Sci China Inf Sci, 2024, 67(12):222207

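Long CHENG's team at the Institute of Automation, Chinese Academy of Sciences | Dynamical-system-assisted robot learning of periodic manipulation skills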


Optimization methods rooted in optimal control 

Huanshui ZHANG, Hongxia WANG, Yeming XU & Ziyuan GUO

Sci China Inf Sci, 2024, 67(12):222208

Huanshui ZHANG's team at Shandong University of Science and Technology | Optimization methods based on optimal control


Ultra-low power IGZO optoelectronic synaptic transistors for neuromorphic computing 

Li ZHU, Sixian LI, Junchen LIN, Yuanfeng ZHAO, Xiang WAN, Huabin SUN, Shancheng YAN, Yong XU, Zhihao YU, Chee Leong TAN & Gang HE

Sci China Inf Sci, 2024, 67(12):222401

Li ZHU, Chee Leong TAN, Gang HE, et al. | Ultra-low power IGZO optoelectronic synaptic transistors for neuromorphic computing


Physical origin of planar linear dichroism in van der Waals semiconductors using main group elements 

Qiang GAO, Yali YU, Kaiyao XIN, Ziqi ZHOU, Hui-Xiong DENG, Lin LI, Xiaojie TANG, Congxin XIA, Duan-Yang LIU, Jian-Bai XIA, Jun KANG & Zhongming WEI

Sci China Inf Sci, 2024, 67(12):222402


A 3D MCAM architecture based on flash memory enabling binary neural network computing for edge AI

Maoying BAI, Shuhao WU, Hai WANG, Hua WANG, Yang FENG, Yueran QI, Chengcheng WANG, Zheng CHAI, Tai MIN, Jixuan WU, Xuepeng ZHAN & Jiezhi CHEN

Sci China Inf Sci, 2024, 67(12):222403

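Jiezhi CHEN et al. (Shandong University) | Design of a multifunctional flash-memory CAM compute-in-memory cell for highly stable hardware neural networks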


Lattice-based access authentication scheme for quantum communication networks 

Min WANG & Gui-Lu LONG

Sci China Inf Sci, 2024, 67(12):222501

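Gui-Lu LONG's group at the Beijing Academy of Quantum Information Sciences | A lattice-cryptography-based access authentication scheme for quantum communication networks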


MOOP



Ground-to-air wireless coverage extension for 6G: a triangular prism structure-based approach 

Junyu LIU, Min SHENG, Jiandong LI, Xuhui CHEN & Chenxi ZHAO

Sci China Inf Sci, 2024, 67(12):224301


LETTER



Multi-dimensional ability diagnosis for machine learning algorithms 

Qi LIU, Zheng GONG, Zhenya HUANG, Chuanren LIU, Hengshu ZHU, Zhi LI, Enhong CHEN & Hui XIONG

Sci China Inf Sci, 2024, 67(12):229101


DcnnGrasp: towards accurate grasp pattern recognition with adaptive regularizer learning

Xiaoqin ZHANG, Ziwei HUANG, Jingjing ZHENG, Shuo WANG & Xianta JIANG

Sci China Inf Sci, 2024, 67(12):229102

Xiaoqin ZHANG, Ziwei HUANG, Jingjing ZHENG, et al. | DcnnGrasp: grasp pattern recognition with adaptive learning


Mean-square prescribed finite-time output consensus of high-order linear multi-agent systems

Qingpeng LIANG, Deqing HUANG, Lei MA, Jiangping HU & Yanzhi WU

Sci China Inf Sci, 2024, 67(12):229201


Controllability of neighborhood Corona product networks 

Bo LIU, Xuan LI, Qiang ZHANG, Junjie HUANG & Housheng SU

Sci China Inf Sci, 2024, 67(12):229202


Game-based computation offloading and resource allocation in stochastic geometry-modeling vehicular networks

Jianjie YANG, Zhijian LIN, Yingyang CHEN, Xiaoqiang LU & Yi FANG

Sci China Inf Sci, 2024, 67(12):229301

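Jianjie YANG, Zhijian LIN, Yingyang CHEN, Xiaoqiang LU, Yi FANG | Game-theoretic computation offloading and resource allocation in vehicular networks modeled with stochastic geometry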


What is the optimal inter-site distance in multi-BS cooperative sensing?

Zhichu REN, Yiming YU, Hong REN, Cunhua PAN & Jiangzhou WANG

Sci China Inf Sci, 2024, 67(12):229302

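Zhichu REN, Hong REN, Cunhua PAN, et al. (Southeast University) | Optimal inter-site distance planning for multi-BS cooperative sensing systems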


Beamforming prediction based on the multireward DQN framework for UAV-RIS-assisted THz communication systems

Yuewei WU, Peng XU, Yi LV, Dongming WANG, Feifei GAO & Jiangzhou WANG

Sci China Inf Sci, 2024, 67(12):229303

Beamforming prediction for UAV-RIS-assisted terahertz communication systems based on a multi-reward DQN framework


Unreliability normalization weighted bit-flipping algorithms of LDPC decoding for ReRAM systems

Qike PANG, Zheng MA & Xiaohu TANG

Sci China Inf Sci, 2024, 67(12):229304

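Qike PANG, Zheng MA, Xiaohu TANG (Southwest Jiaotong University) | LDPC hard-decision decoding algorithms for next-generation non-volatile memory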


A high consistency ramp circuit design method for negative feedback adaptive adjustment mechanism applied to large area array CMOS image sensors 

Zhongjie GUO, Lin LI, Ruiming XU, Suiyang LIU, Ningmei YU, Yuan YANG & Longsheng WU

Sci China Inf Sci, 2024, 67(12):229401

Adaptive negative-feedback ramp circuit design for ultra-large area array CMOS sensors


High-voltage quasi-vertical GaN-on-Si Schottky barrier diode with edge termination structure of optimized multi-level N ion implantation 

Qingyuan CHANG, Bin HOU, Ling YANG, Mei WU, Meng ZHANG, Hao LU, Fuchun JIA, Xuerui NIU, Chunzhou SHI, Jiale DU, Mao JIA, Qian YU, Shiming LI, Youjun ZHU, Xiaohua MA & Yue HAO

Sci China Inf Sci, 2024, 67(12):229402


A 10-kHz 12–16-bit reconfigurable zoom ADC with pole optimization technique and floating current-starved amplifier

Zhangming ZHU, Jiajun SONG & Yuhua LIANG

Sci China Inf Sci, 2024, 67(12):229403

Zhangming ZHU's group at Xidian University | Design of a resolution-configurable zoom ADC using a pole optimization technique




