MicrobiomeStatPlot | Faceted sorted stacked bar plot tutorial

Academic 2024-10-22 07:02 Guangdong

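Introduction to the faceted sorted stacked bar plot

Microbiome studies often involve several groups, each containing multiple samples. A faceted sorted stacked bar plot shows the taxonomic composition of every sample, with one facet per group and taxa ordered from high to low abundance, so that trends in community composition are easy to follow.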
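
As a minimal sketch of the idea, assuming a small taxa-by-samples count table and sample names that encode the group as a letter prefix (both made up here for illustration), the two orderings behind the plot can be computed in base R:

```r
# Toy count table (hypothetical values): rows are taxa, columns are samples.
# matrix() fills column by column, so each triple below is one sample.
counts <- matrix(
  c(50, 30, 20,   # A1
    10, 60, 30,   # A2
    70, 20, 10,   # B1
    40, 40, 20),  # B2
  nrow = 3,
  dimnames = list(c("TaxonX", "TaxonY", "TaxonZ"),
                  c("A1", "A2", "B1", "B2"))
)

# Relative abundance: divide each column (sample) by its total.
rel <- sweep(counts, 2, colSums(counts), "/")

# Order taxa from most to least abundant across all samples.
taxa_order <- names(sort(rowSums(rel), decreasing = TRUE))

# Order samples by the abundance of the most abundant taxon.
top_taxon <- taxa_order[1]
sample_order <- names(sort(rel[top_taxon, ], decreasing = TRUE))

# Assumed convention: the group is the sample name with digits removed.
group <- gsub("[0-9]", "", sample_order)
samples_by_facet <- split(sample_order, group)
print(samples_by_facet)
```

Each element of `samples_by_facet` lists the samples of one facet in the order the bars would be drawn.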

Tags: #Microbiome data analysis #MicrobiomeStatPlot #Faceted sorted stack bar plot #R visualization

Authors: First draft: Defeng Bai (白德凤); Proofreading: Ma Chuang (马闯) and Jiani Xun (荀佳妮); Text tutorial: Defeng Bai (白德凤)

Source code and test data:

https://github.com/YongxinLiu/MicrobiomeStatPlot/ (project directory 3.Visualization_and_interpretation/Faceted_sorted_stacked_plot)

Alternatively, reply "MicrobiomeStatPlot" to the official account to receive them.

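Application example

This example comes from a 2024 Nature Communications paper by the group of Ertao Wang (王二涛) at the CAS Center for Excellence in Molecular Plant Sciences. It uses a grouped sorted stacked bar plot to show the phylum-level relative abundance of microbes in each sample. The paper is titled: Dynamic root microbiome sustains soybean productivity under unbalanced fertilization. https://doi.org/10.1038/s41467-024-45925-5

Figure 3 | Bacterial load and composition in bulk soil, rhizosphere, and endosphere based on 16S rRNA sequencing data.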

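Results

Assembly of the root-associated microbiome during plant development

The bacterial load of the rhizosphere and endosphere increased as the plants developed. Abundance was comparable over days 1-14 (5.8 × 10^9 copies/g in the rhizosphere, 1.5 × 10^8 copies/g in the endosphere), then rose gradually and peaked on day 72 (2.3 × 10^10 copies/g in the rhizosphere, 4.7 × 10^9 copies/g in the endosphere) (Figure 3, Supplementary Figure 6). Fertilization treatment significantly affected bacterial load (Supplementary Table 5). Compared with the control, the -P treatment consistently reduced the microbial load in the rhizosphere, especially at later developmental stages: on days 42, 60, and 72 the bacterial load was 54%, 61%, and 75% lower, respectively (P < 0.05, Figure 3B). In contrast, changes in endosphere bacterial load between the control and -P treatments across developmental stages were small and inconsistent (Figure 3C).

Root-associated bacteria mainly belonged to Proteobacteria, Actinobacteria, and Bacteroidetes. Actinobacteria dominated at early stages and Proteobacteria at later stages, especially in the endosphere (Figure 3, Supplementary Figure 6). Although the relative abundance of Actinobacteria declined gradually as the plants developed (from 33.4% and 68.3% on day 1 to 18.0% and 32.6% on day 72 in the rhizosphere and endosphere, respectively), their absolute abundance increased 2.1-fold and 18.8-fold in the rhizosphere and endosphere, respectively (Figures 3 and 4). The Actinobacteria load supported by the plant therefore kept increasing even as their relative abundance fell.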
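
A falling relative abundance and a rising absolute abundance can coexist when the total load grows faster than the share shrinks. The short calculation below is only a sketch under one assumption that is not stated in the text: absolute abundance equals relative abundance multiplied by total bacterial load. The input numbers are the ones quoted above; the variable names are illustrative.

```r
# Values quoted in the paragraph above.
rel_day1  <- c(rhizosphere = 0.334, endosphere = 0.683)  # relative abundance, day 1
rel_day72 <- c(rhizosphere = 0.180, endosphere = 0.326)  # relative abundance, day 72
abs_fold  <- c(rhizosphere = 2.1,   endosphere = 18.8)   # fold change in absolute abundance

# Assumption: absolute = relative x total load, hence
# total-load fold change = absolute fold change / relative fold change.
rel_fold <- rel_day72 / rel_day1
implied_load_fold <- abs_fold / rel_fold

print(round(rel_fold, 2))
print(round(implied_load_fold, 1))
```

Under this assumption the total load would need to rise roughly 4-fold in the rhizosphere and roughly 39-fold in the endosphere, which is the same order of magnitude as the ratios of the day-72 to early-stage loads quoted earlier (about 4 and about 31).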

Faceted sorted stacked bar plot in R

Package installation

# Install R packages from CRAN if they are not already installed, then load them.
# plyr is listed before dplyr so that dplyr functions such as mutate() are not masked.
# cowplot is included because it is used for the combined figure at the end.
p_list = c("ggplot2", "reshape2", "ggprism", "patchwork", "plyr", "dplyr", "cowplot")
for (p in p_list) {
  if (!requireNamespace(p, quietly = TRUE)) {
    install.packages(p)
  }
  suppressWarnings(suppressMessages(
    library(p, character.only = TRUE, quietly = TRUE, warn.conflicts = FALSE)
  ))
}
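
Before installing anything, it can be useful to see which packages are actually missing. The helper below is a hypothetical convenience function, not part of MicrobiomeStatPlot; it only relies on base R's `requireNamespace()`.

```r
# Hypothetical helper: return the entries of `pkgs` that are not installed.
# requireNamespace() returns FALSE (instead of raising an error) when a
# package cannot be found.
missing_packages <- function(pkgs) {
  installed <- vapply(pkgs, requireNamespace, logical(1), quietly = TRUE)
  pkgs[!installed]
}

p_list <- c("ggplot2", "reshape2", "ggprism", "patchwork", "plyr", "dplyr")
to_install <- missing_packages(p_list)
if (length(to_install) > 0) {
  message("Not installed: ", paste(to_install, collapse = ", "))
} else {
  message("All packages are available.")
}
```

Passing `to_install` to `install.packages()` would then install only what is missing.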

Worked example

The phylum level is used as the example here; the code can be adapted to other taxonomic levels as needed.
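
Adapting the workflow to another taxonomic level mostly means repeating the "keep the top N taxa, pool the rest" step on a different table. One way to make that step reusable is sketched below. `collapse_top_n()` is a hypothetical helper and the genus table is made-up toy data; unlike the tutorial code, it keeps taxa in descending order and places the pooled row last.

```r
# Hypothetical helper: keep the n most abundant taxa and sum the rest
# into a single "others" row. `abund` is a taxa x samples numeric matrix.
collapse_top_n <- function(abund, n = 5) {
  abund <- abund[order(-rowSums(abund)), , drop = FALSE]
  if (nrow(abund) <= n) {
    return(abund)  # nothing to pool
  }
  top <- abund[seq_len(n), , drop = FALSE]
  others <- colSums(abund[(n + 1):nrow(abund), , drop = FALSE])
  rbind(top, others = others)
}

# Toy relative abundances for four genera in two samples (made-up values).
genus <- matrix(
  c(0.50, 0.30, 0.15, 0.05,   # sample S1
    0.40, 0.20, 0.30, 0.10),  # sample S2
  nrow = 4,
  dimnames = list(c("G1", "G2", "G3", "G4"), c("S1", "S2"))
)

top2 <- collapse_top_n(genus, n = 2)
print(top2)
```

Because the pooled row absorbs everything outside the top N, each sample still sums to 1 after collapsing.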

# Load data
data <- read.table(file = "data/phylum_data.txt", sep = "\t", header = TRUE, check.names = FALSE)
# Sample metadata (not used below; groups are derived from the sample names)
design <- read.table(file = "data/metadata.txt", sep = "\t", header = TRUE, row.names = 1)

# Create the output directory if it does not exist
dir.create("results", showWarnings = FALSE)

# Sum abundances per phylum so that each phylum appears only once
data <- aggregate(. ~ Phylum, data = data, sum)
rownames(data) = data$Phylum
data = data[, -1]

# Calculate relative abundance within each sample
data = apply(data, 2, function(x) x / sum(x))

# Sort phyla by total relative abundance (descending)
mean_sort = data[order(-rowSums(data)), ]
mean_sort = as.data.frame(mean_sort)
# Sort samples by the abundance of the most abundant phylum (descending)
mean_sort2 = t(mean_sort)
mean_sort2 = mean_sort2[order(-mean_sort2[, 1]), ]
mean_sort3 = t(mean_sort2)
mean_sort3 = as.data.frame(mean_sort3)

# Keep the top 5 phyla and pool the remaining ones into "others"
other = colSums(mean_sort3[6:dim(mean_sort3)[1], ])
mean_sort3 = mean_sort3[(6 - 1):1, ]
mean_sort3 = rbind(other, mean_sort3)
rownames(mean_sort3)[1] = c("others")
mean_sort3 = as.data.frame(mean_sort3)

# Add taxonomy and reshape to long format
mean_sort3$tax = rownames(mean_sort3)
data_all = as.data.frame(melt(mean_sort3, id.vars = c("tax")))
# Derive the group from the sample name by removing digits (e.g. "A1" becomes "A")
data_all$group = data_all$variable
data_all$group = as.character(data_all$group)
data_all$group = gsub("[0-9]", "", data_all$group)

# Set the order of the groups
levels(as.factor(data_all$group))
#> [1] "A" "B" "C"
data_all2 = data_all %>%
  mutate(group = ordered(group, levels = c("A", "B", "C")))

# Colour palettes (pal_facet for plot 1, pal_split for plots 2 and 3)
pal_facet = c("#d2da93", "#5196d5", "#00ceff", "#ff630d", "#35978b",
              "#e5acd7", "#77aecd", "#ec8181", "#dfc6a5", "#e50719",
              "#d27e43", "#8a4984", "#fe5094", "#8d342e", "#f94e54",
              "#ffad00", "#36999d", "#00fc8d", "#b64aa0", "#9b82e1")
pal_split = c("#e5acd7", "#00ceff", "#ff630d", "#35978b", "#d2da93",
              "#5196d5", "#77aecd", "#ec8181", "#dfc6a5", "#e50719",
              "#d27e43", "#8a4984", "#fe5094", "#8d342e", "#f94e54",
              "#ffad00", "#36999d", "#00fc8d", "#b64aa0", "#9b82e1")

# Plot 1: stacked bars faceted by group with facet_grid
p01 = ggplot(data_all2, aes(x = factor(variable, levels = unique(variable)),
                            y = value, fill = factor(tax, levels = unique(tax)))) +
  geom_bar(stat = "identity", position = "stack", width = 1) +
  scale_y_continuous(labels = scales::percent, expand = c(0, 0)) +
  guides(fill = guide_legend(title = "Phylum")) +
  facet_grid(~ group, scales = "free_x", switch = "x") +
  xlab("Groups") + ylab("Percentage (%)") +
  # Apply the complete theme first so the adjustments below are not overwritten
  theme_classic() +
  theme(strip.background = element_blank(),
        axis.text.x = element_blank(),
        axis.ticks.x = element_blank(),
        axis.title.x = element_blank(),
        legend.key.size = unit(0.4, "cm")) +
  scale_fill_manual(values = pal_facet)
ggsave("results/Phylum_top5_1.pdf", p01, width = 89 * 1.5, height = 50 * 1.5, units = "mm")
#p01

# Build one subplot per group; only the first subplot keeps its y axis and legend
make_group_plots <- function(dat, palette) {
  plots <- lapply(split(dat, dat$group), function(df) {
    group_name <- as.character(unique(df$group))
    ggplot(df, aes(x = factor(variable, levels = unique(df$variable)),
                   y = value, fill = factor(tax, levels = unique(df$tax)))) +
      geom_bar(stat = "identity", position = "stack", width = 1) +
      scale_y_continuous(labels = scales::percent, expand = c(0, 0)) +
      theme_classic() +
      labs(x = group_name, y = NULL) +
      scale_fill_manual(values = palette) +
      guides(fill = guide_legend(title = "Phylum"))  # make sure the legend exists
  })
  # Remove all y-axis elements and the legend from every facet except the first
  for (i in seq_along(plots)[-1]) {
    plots[[i]] <- plots[[i]] +
      theme(axis.text.y = element_blank(),
            axis.text.x = element_blank(),
            axis.ticks.y = element_blank(),
            axis.ticks.x = element_blank(),
            axis.title.y = element_blank(),
            axis.line.y = element_blank(),
            legend.position = "none")
  }
  # Keep the y-axis label and the legend for the first facet
  plots[[1]] <- plots[[1]] +
    ylab("Percentage (%)") +
    theme(axis.text.x = element_blank(),
          axis.ticks.x = element_blank(),
          legend.position = "right",
          legend.justification = c("left", "top"))
  plots
}

# The width of each facet is proportional to its number of samples
sample_counts <- table(data_all2$group)
relative_widths <- as.numeric(sample_counts / sum(sample_counts))

# Plot 2: facet widths follow sample numbers, legend on the far right
plots <- make_group_plots(data_all2, pal_split)
p02 <- wrap_plots(plots, nrow = 1) +
  plot_layout(widths = relative_widths, guides = "collect") &
  theme(axis.title.y = element_text(size = 10),
        plot.margin = unit(c(0.05, 0.05, 0.05, 0.05), "cm"))  # narrow the gaps between facets
ggsave("results/Phylum_top5_2.pdf", p02, width = 139 * 1.5, height = 60 * 1.5, units = "mm")

# Plot 3: facet widths follow sample numbers, shared legend at the top
plots <- make_group_plots(data_all2, pal_split)
p03 <- wrap_plots(plots, nrow = 1) +
  plot_layout(widths = relative_widths, guides = "collect") &
  theme(legend.position = "top",
        legend.justification = "center",
        legend.direction = "horizontal",
        legend.key.size = unit(0.3, "cm"),
        legend.text = element_text(size = 8),
        legend.spacing.x = unit(0.1, "cm"),
        axis.title.y = element_text(size = 10),
        plot.margin = unit(c(0.05, 0.05, 0.05, 0.05), "cm"))
ggsave("results/Phylum_fungi_top5_3.pdf", p03, width = 139 * 1.5, height = 80 * 1.5, units = "mm")

# Combine the three plots into one figure
library(cowplot)
width = 89
height = 59
p0 = plot_grid(p01, p02, p03, labels = c("A", "B", "C"), ncol = 2)
ggsave("results/multigroup_faceted_sorted_stack_bar_plot_all.pdf", p0, width = width * 2, height = height * 1.7, units = "mm")
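
The facet widths in the code above are computed from row counts of the long-format table. That equals the share of samples per group only if every sample contributes the same number of taxon rows. The sketch below uses a made-up long table shaped like `data_all2` (columns `variable`, `tax`, `value`, `group`) to check that assumption and to confirm that each stacked bar sums to 1; all values are hypothetical.

```r
# Toy long-format table (hypothetical values): 3 samples in A, 1 in B, 2 in C,
# each with two taxon rows.
long <- data.frame(
  variable = rep(c("A1", "A2", "A3", "B1", "C1", "C2"), each = 2),
  tax      = rep(c("TaxonX", "others"), times = 6),
  value    = c(0.6, 0.4, 0.7, 0.3, 0.5, 0.5, 0.8, 0.2, 0.9, 0.1, 0.4, 0.6)
)
long$group <- gsub("[0-9]", "", long$variable)

# Widths from the number of distinct samples per group.
n_samples <- tapply(long$variable, long$group, function(x) length(unique(x)))
widths_by_sample <- as.numeric(n_samples / sum(n_samples))

# Widths from row counts, as in the tutorial code.
row_counts <- table(long$group)
widths_by_row <- as.numeric(row_counts / sum(row_counts))

# Both approaches agree when every sample has the same number of rows.
print(all.equal(widths_by_sample, widths_by_row))

# Each sample's stacked bar should add up to 1 (100%).
stack_totals <- tapply(long$value, long$variable, sum)
print(stack_totals)
```

If samples had differing numbers of taxon rows, the sample-based count would be the safer input for `plot_layout(widths = ...)`.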

If you use this script, please cite:

Yong-Xin Liu, Lei Chen, Tengfei Ma, Xiaofang Li, Maosheng Zheng, Xin Zhou, Liang Chen, Xubo Qian, Jiao Xi, Hongye Lu, Huiluo Cao, Xiaoya Ma, Bian Bian, Pengfan Zhang, Jiqiu Wu, Ren-You Gan, Baolei Jia, Linyang Sun, Zhicheng Ju, Yunyun Gao, Tao Wen, Tong Chen. 2023. EasyAmplicon: An easy-to-use, open-source, reproducible, and community-based pipeline for amplicon data analysis in microbiome research. iMeta 2: e83. https://doi.org/10.1002/imt2.83
