Special Topic: Large Multimodal Models
How far are we to GPT-4V? Closing the gap to commercial multimodal models with open-source suites
Zhe CHEN, Weiyun WANG, Hao TIAN, Shenglong YE, Zhangwei GAO, Erfei CUI, Wenwen TONG, Kongzhi HU, Jiapeng LUO, Zheng MA, Ji MA, Jiaqi WANG, Xiaoyi DONG, Hang YAN, Hewei GUO, Conghui HE, Botian SHI, Zhenjiang JIN, Chao XU, Bin WANG, Xingjian WEI, Wei LI, Wenjian ZHANG, Bo ZHANG, Pinlong CAI, Licheng WEN, Xiangchao YAN, Min DOU, Lewei LU, Xizhou ZHU, Tong LU, Dahua LIN, Yu QIAO, Jifeng DAI & Wenhai WANG
Sci China Inf Sci, 2024, 67(12):220101
林达华,乔宇,代季峰联合团队 | 我们距离GPT-4V还有多远?使用开源套件弥合与商用多模态模型的差距
OCRBench: on the hidden mystery of OCR in large multimodal models
Yuliang LIU, Zhang LI, Mingxin HUANG, Biao YANG, Wenwen YU, Chunyuan LI, Xu-Cheng YIN, Cheng-Lin LIU, Lianwen JIN & Xiang BAI
Sci China Inf Sci, 2024, 67(12):220102
MMInstruct: a high-quality multi-modal instruction tuning dataset with extensive diversity
Yangzhou LIU, Yue CAO, Zhangwei GAO, Weiyun WANG, Zhe CHEN, Wenhai WANG, Hao TIAN, Lewei LU, Xizhou ZHU, Tong LU, Yu QIAO & Jifeng DAI
Sci China Inf Sci, 2024, 67(12):220103
24个领域97万条指令!MMInstruct:具备丰富多样性的高质量多模态指令调优数据集
Woodpecker: hallucination correction for multimodal large language models
Shukang YIN, Chaoyou FU, Sirui ZHAO, Tong XU, Hao WANG, Dianbo SUI, Yunhang SHEN, Ke LI, Xing SUN & Enhong CHEN
Sci China Inf Sci, 2024, 67(12):220105
中科大陈恩红团队 | Woodpecker: 多模态大语言模型的幻觉缓解方法
DocPedia: unleashing the power of large multimodal model in the frequency domain for versatile document understanding
Hao FENG, Qi LIU, Hao LIU, Jingqun TANG, Wengang ZHOU, Houqiang LI & Can HUANG
Sci China Inf Sci, 2024, 67(12):220106
中科大李厚强&字节跳动联合团队 | DocPedia:高分辨率多模态文档大模型
Modality-experts coordinated adaptation for large multimodal models
Yan ZHANG, Zhong JI, Yanwei PANG, Jungong HAN & Xuelong LI
Sci China Inf Sci, 2024, 67(12):220107
张晏,冀中,庞彦伟,韩军功,李学龙 | 模态专家协调的多模态大模型参数高效微调方法
COMET: “cone of experience” enhanced large multimodal model for mathematical problem generation
Sannyuya LIU, Jintian FENG, Zongkai YANG, Yawei LUO, Qian WAN, Xiaoxuan SHEN & Jianwen SUN
Sci China Inf Sci, 2024, 67(12):220108
华中师范大学杨宗凯团队 | COMET:用于数学题目生成的教育领域多模态模型
ChemDFM-X: towards large multimodal model for chemistry
Zihan ZHAO, Bo CHEN, Jingpiao LI, Lu CHEN, Liyang WEN, Pengyu WANG, Zichen ZHU, Danyang ZHANG, Yansi LI, Zhongyang DAI, Xin CHEN & Kai YU
Sci China Inf Sci, 2024, 67(12):220109
上海交通大学&苏州实验室联合团队 | ChemDFM-X:跨模态化学材料大模型
REVIEW
Physical layer signal processing for XR communications and systems
Yongpeng WU, Mai XU, Guangtao ZHAI & Wenjun ZHANG
Sci China Inf Sci, 2024, 67(12):221301
RESEARCH PAPER
MuxFlow: efficient GPU sharing in production-level clusters with more than 10000 GPUs
Xuanzhe LIU, Yihao ZHAO, Shufan LIU, Xiang LI, Yibo ZHU, Xin LIU & Xin JIN
Sci China Inf Sci, 2024, 67(12):222101
北京大学刘譞哲 金鑫等 | 万卡深度学习集群中的高效GPU共享系统
An efficient binary programming method for black-box optimization and its application in processor design
Xiaoliang LV, Qiaozhu ZHAI, Jianchen HU, Yuhang ZHU, Jinhui LIU & Xiaohong GUAN
Sci China Inf Sci, 2024, 67(12):222102
管晓宏院士团队 | 基于 0-1 整数规划的黑盒优化方法及其在处理器设计中的应用
A novel memetic algorithm for distributed shape formation of swarm robots with both acceleration and velocity constraints
Yun QU, Bin XIN, Qinqin WANG, Ruocheng LI & Zhaofeng DU
Sci China Inf Sci, 2024, 67(12):222201
北理工辛斌团队 | 集群机器人分布式编队构型生成的文化基因算法
Graph-geometric message passing via a graph convolution transformer for FKP regression
Huizhi ZHU, Wenxia XU, Jian HUANG & Baocheng YU
Sci China Inf Sci, 2024, 67(12):222202
朱辉志,徐文霞,黄剑等 | 基于图卷积Transformer的并联机器人运动学正解方法
Time-varying formation tracking control of high-order multi-agent systems with multiple leaders and multiplicative noise
Ruru JIA, Xiaofeng ZONG & Qing WANG
Sci China Inf Sci, 2024, 67(12):222203
贾茹茹,宗小峰,王庆 | 具有多领导者和乘性噪声的高阶多智能体系统时变编队跟踪控制
Controllability of descriptor multi-agent systems with signed networks
Yu SHEN, Yongqiang GUAN & Ye TIAN
Sci China Inf Sci, 2024, 67(12):222204
A distributed decomposition algorithm for solving large-scale mixed integer programming problem
Fangzheng TIAN, Hongzhe LIU & Wenwu YU
Sci China Inf Sci, 2024, 67(12):222205
东南大学虞文武团队 | 求解大规模混合整数规划问题的分布式算法架构
Learning continuous network emerging dynamics from scarce observations via data-adaptive stochastic processes
Jiaxu CUI, Qipeng WANG, Bingyi SUN, Jiming LIU & Bo YANG
Sci China Inf Sci, 2024, 67(12):222206
Neural Liénard system: learning periodic manipulation skills through dynamical systems
Haoyu ZHANG, Long CHENG, Yu ZHANG & Yifan WANG
Sci China Inf Sci, 2024, 67(12):222207
中国科学院自动化所程龙团队 | 动态系统辅助机器人学习周期性操作技能
Optimization methods rooted in optimal control
Huanshui ZHANG, Hongxia WANG, Yeming XU & Ziyuan GUO
Sci China Inf Sci, 2024, 67(12):222208
Ultra-low power IGZO optoelectronic synaptic transistors for neuromorphic computing
Li ZHU, Sixian LI, Junchen LIN, Yuanfeng ZHAO, Xiang WAN, Huabin SUN, Shancheng YAN, Yong XU, Zhihao YU, Chee Leong TAN & Gang HE
Sci China Inf Sci, 2024, 67(12):222401
朱力,Chee Leong TAN,何刚等 | 用于神经形态计算的超低功耗IGZO光电突触晶体管
Physical origin of planar linear dichroism in van der Waals semiconductors using main group elements
Qiang GAO, Yali YU, Kaiyao XIN, Ziqi ZHOU, Hui-Xiong DENG, Lin LI, Xiaojie TANG, Congxin XIA, Duan-Yang LIU, Jian-Bai XIA, Jun KANG & Zhongming WEI
Sci China Inf Sci, 2024, 67(12):222402
A 3D MCAM architecture based on flash memory enabling binary neural network computing for edge AI
Maoying BAI, Shuhao WU, Hai WANG, Hua WANG, Yang FENG, Yueran QI, Chengcheng WANG, Zheng CHAI, Tai MIN, Jixuan WU, Xuepeng ZHAN & Jiezhi CHEN
Sci China Inf Sci, 2024, 67(12):222403
山东大学陈杰智等 | 面向高稳定硬件神经网络的多功能闪存CAM存算单元设计
Lattice-based access authentication scheme for quantum communication networks
Min WANG & Gui-Lu LONG
Sci China Inf Sci, 2024, 67(12):222501
北京量子研究院龙桂鲁课题组 | 基于格密码的量子通信网络接入认证方案
MOOP
Ground-to-air wireless coverage extension for 6G: a triangular prism structure-based approach
Junyu LIU, Min SHENG, Jiandong LI, Xuhui CHEN & Chenxi ZHAO
Sci China Inf Sci, 2024, 67(12):224301
LETTER
Multi-dimensional ability diagnosis for machine learning algorithms
Qi LIU, Zheng GONG, Zhenya HUANG, Chuanren LIU, Hengshu ZHU, Zhi LI, Enhong CHEN & Hui XIONG
Sci China Inf Sci, 2024, 67(12):229101
DcnnGrasp: towards accurate grasp pattern recognition with adaptive regularizer learning
Xiaoqin ZHANG, Ziwei HUANG, Jingjing ZHENG, Shuo WANG & Xianta JIANG
Sci China Inf Sci, 2024, 67(12):229102
张笑钦,黄自玮,郑晶晶等 | DcnnGrasp:采用自适应学习方式的抓取手势识别
Mean-square prescribed finite-time output consensus of high-order linear multi-agent systems
Qingpeng LIANG, Deqing HUANG, Lei MA, Jiangping HU & Yanzhi WU
Sci China Inf Sci, 2024, 67(12):229201
Controllability of neighborhood Corona product networks
Bo LIU, Xuan LI, Qiang ZHANG, Junjie HUANG & Housheng SU
Sci China Inf Sci, 2024, 67(12):229202
Game-based computation offloading and resource allocation in stochastic geometry-modeling vehicular networks
Jianjie YANG, Zhijian LIN, Yingyang CHEN, Xiaoqiang LU & Yi FANG
Sci China Inf Sci, 2024, 67(12):229301
杨剑杰, 林志坚, 陈颖玚, 卢孝强, 方毅 | 随机几何建模的车联网中基于博弈论的计算卸载和资源分配
What is the optimal inter-site distance in multi-BS cooperative sensing?
Zhichu REN, Yiming YU, Hong REN, Cunhua PAN & Jiangzhou WANG
Sci China Inf Sci, 2024, 67(12):229302
东南大学任之初, 任红, 潘存华等 | 多基站协作感知系统的最优站间距规划
Beamforming prediction based on the multireward DQN framework for UAV-RIS-assisted THz communication systems
Yuewei WU, Peng XU, Yi LV, Dongming WANG, Feifei GAO & Jiangzhou WANG
Sci China Inf Sci, 2024, 67(12):229303
基于多奖励DQN框架的UAV-RIS辅助太赫兹通信系统波束赋形预测
Unreliability normalization weighted bit-flipping algorithms of LDPC decoding for ReRAM systems
Qike PANG, Zheng MA & Xiaohu TANG
Sci China Inf Sci, 2024, 67(12):229304
西南交通大学庞琦珂, 马征, 唐小虎 | 新一代非易失性存储器下的LDPC硬译码算法
A high consistency ramp circuit design method for negative feedback adaptive adjustment mechanism applied to large area array CMOS image sensors
Zhongjie GUO, Lin LI, Ruiming XU, Suiyang LIU, Ningmei YU, Yuan YANG & Longsheng WU
Sci China Inf Sci, 2024, 67(12):229401
High-voltage quasi-vertical GaN-on-Si Schottky barrier diode with edge termination structure of optimized multi-level N ion implantation
Qingyuan CHANG, Bin HOU, Ling YANG, Mei WU, Meng ZHANG, Hao LU, Fuchun JIA, Xuerui NIU, Chunzhou SHI,
Jiale DU, Mao JIA, Qian YU, Shiming LI, Youjun ZHU, Xiaohua MA & Yue HAO
Sci China Inf Sci, 2024, 67(12):229402
A 10-kHz 12–16-bit reconfigurable zoom ADC with pole optimization technique and floating current-starved amplifier
Zhangming ZHU, Jiajun SONG & Yuhua LIANG
Sci China Inf Sci, 2024, 67(12):229403
西电朱樟明课题组 | 使用极点优化技术的精度可配置Zoom型模数转换器设计
标识Springer限时免费下载
点击“阅读全文”, 免费下载全文