「微生物组」基于LorMe包进行微生物组数据分析—Alpha多样性分析！！！

文摘科学 2024-10-08 09:05 宁夏

为了避免各位错过最新的推文教程，强烈建议大家将“科研后花园”设置为“星标”！

1、安装并加载LorMe包：

###安装LorMe包install.packages("LorMe") ##加载MicrobiomeStat包library(LorMe) # Lightening One-Code Resolving Microbial Ecology Solutionlibrary(ggplot2) # Create Elegant Data Visualisations Using the Grammar of Graphics

2、加载数据（这里使用LorMe包自带数据集）并对分析所用的数据进行封装：

##加载示例数据集data(testotu) # Load the standard Qiime output feature tablehead(testotu) # First column -ID, last column -taxonomic annotation, others - the feature table.feature_table<- testotu[, -c(1,22)]tax_anno<- testotu[, c(1,22)]
#创建分组数据groupinformation <- data.frame(  group = c(rep("a", 10), rep("b", 10)),   factor1 = rnorm(10),   factor2 = rnorm(mean = 100, 10),   subject = factor(c(1:10, 1:10)),   group2 = c(rep("e", 5), rep("f", 5), rep("e", 5), rep("f", 5)))
###对数据进行封装test_object <- tax_summary(  groupfile = groupinformation,   inputtable = feature_table,  reads = TRUE,  taxonomytable = tax_anno)
###对一些默认设置进行配置my_col=color_scheme("Plan1")my_order=c("b","a")my_facet_order=c("e","f")test_object_plan2 <- object_config(  taxobj = test_object,  treat_location = 1,  rep_location = 4,  facet_location = 5,   subject_location = NULL,   treat_col = my_col,  treat_order = my_order,  facet_order = my_facet_order)

# Taxonomy tidying...# Genarating tables...# Done# # Contained elements:#   [1] "Groupfile"       "Base"            "Base_percent"   # [4] "Base_taxonomy"   "Phylum"          "Phylum_percent" # [7] "Phylum_taxonomy" "Genus"           "Genus_percent"  # [10] "Genus_taxonomy"  "parameters"     # Warning message:#   Expected 7 pieces. Missing pieces filled with `NA` in 261 rows [4,#                                                                   6, 9, 12, 18, 20, 24, 32, 37, 43, 44, 45, 46, 58, 59, 68, 70, 71,#                                                                   72, 76, ...].

3、Alpha多样性分析：

1）默认基础分析：

Alpha_results<- Alpha_diversity_calculator(taxobj = test_object_plan2,                                            taxlevel = "Base"  #analysis level)

###Distribution hypothesis####Normality Test (Shapiro-Wilk): Failed (P =  0.005623099 )Equal Variance Test (Brown-Forsythe):  Passed  (P =  0.404 )  Treatment_Name  N   Median       Q1       Q31              a 10 5.432989 4.492731 5.1242262              b 10 5.395290 4.973662 5.331129

###Dependent Variable: Indexvalue ####
###Mann-Whitney Rank Sum Test#### 
###Statistics#### 
Mann-Whitney U Statistic=  42  
n(small)=  10 , n(big)=  10 P_estimate =  0.5707504  ,P_exact =  0.5453497 
###Conclusion####The difference in the median values between the two groups is not great enough to exclude the possibility that the difference is due to random sampling variability; there is not a statistically significant difference  (P =  0.5453497 ).
Analysis finished###Distribution hypothesis####Normality Test (Shapiro-Wilk): Failed (P =  0.006954524 )Equal Variance Test (Brown-Forsythe):  Passed  (P =  0.264 )  Treatment_Name  N Median     Q1    Q31              a 10  626.5 336.25 522.42              b 10  608.0 516.00 586.9

###Dependent Variable: Indexvalue ####
###Mann-Whitney Rank Sum Test#### 
###Statistics#### 
Mann-Whitney U Statistic=  44  
n(small)=  10 , n(big)=  10 P_estimate =  0.6774705  ,P_exact =  0.6500246 
###Conclusion####The difference in the median values between the two groups is not great enough to exclude the possibility that the difference is due to random sampling variability; there is not a statistically significant difference  (P =  0.6500246 ).
Analysis finished###Distribution hypothesis####Normality Test (Shapiro-Wilk): Failed (P =  0.02277682 )Equal Variance Test (Brown-Forsythe):  Passed  (P =  0.906 )  Treatment_Name  N    Median        Q1        Q31              a 10 0.8415442 0.7893959 0.82665272              b 10 0.8431299 0.8036199 0.8368023

###Dependent Variable: Indexvalue ####
###Mann-Whitney Rank Sum Test#### 
###Statistics#### 
Mann-Whitney U Statistic=  42  
n(small)=  10 , n(big)=  10 P_estimate =  0.5707504  ,P_exact =  0.5453497 
###Conclusion####The difference in the median values between the two groups is not great enough to exclude the possibility that the difference is due to random sampling variability; there is not a statistically significant difference  (P =  0.5453497 ).
Analysis finished###Distribution hypothesis####Normality Test (Shapiro-Wilk): Failed (P =  0.0004472301 )Equal Variance Test (Brown-Forsythe):  Passed  (P =  0.921 )  Treatment_Name  N    Median        Q1        Q31              a 10 0.9909291 0.9793478 0.98653302              b 10 0.9903503 0.9799472 0.9865132

###Dependent Variable: Indexvalue ####
###Mann-Whitney Rank Sum Test#### 
###Statistics#### 
Mann-Whitney U Statistic=  50  
n(small)=  10 , n(big)=  10 P_estimate =  1  ,P_exact =  1 
###Conclusion####The difference in the median values between the two groups is not great enough to exclude the possibility that the difference is due to random sampling variability; there is not a statistically significant difference  (P =  1 ).
Analysis finished###Distribution hypothesis####Normality Test (Shapiro-Wilk): Failed (P =  0.01048326 )Equal Variance Test (Brown-Forsythe):  Passed  (P =  0.215 )  Treatment_Name  N  Median       Q1       Q31              a 10 718.085 414.0291 607.20042              b 10 693.228 652.6971 679.5795

###Dependent Variable: Indexvalue ####
###Mann-Whitney Rank Sum Test#### 
###Statistics#### 
Mann-Whitney U Statistic=  47  
n(small)=  10 , n(big)=  10 P_estimate =  0.8501067  ,P_exact =  0.8205958 
###Conclusion####The difference in the median values between the two groups is not great enough to exclude the possibility that the difference is due to random sampling variability; there is not a statistically significant difference  (P =  0.8205958 ).
Analysis finished###Distribution hypothesis####Normality Test (Shapiro-Wilk): Failed (P =  0.0108276 )Equal Variance Test (Brown-Forsythe):  Passed  (P =  0.237 )  Treatment_Name  N   Median       Q1       Q31              a 10 712.4917 423.9344 600.29882              b 10 671.8403 616.9682 658.7892

###Dependent Variable: Indexvalue ####
###Mann-Whitney Rank Sum Test#### 
###Statistics#### 
Mann-Whitney U Statistic=  53  
n(small)=  10 , n(big)=  10 P_estimate =  0.8501067  ,P_exact =  0.8205958 
###Conclusion####The difference in the median values between the two groups is not great enough to exclude the possibility that the difference is due to random sampling variability; there is not a statistically significant difference  (P =  0.8205958 ).
Analysis finished

2）在不同分类水平进行Alpha分析（以属水平为例）：

Alpha_results2<- Alpha_diversity_calculator(taxobj = test_object_plan2,                                             taxlevel = "Genus",                                            prefix = "Genus"#"Domain","Phylum","Class","Order","Family","Genus","Species","Base")

###Distribution hypothesis####Normality Test (Shapiro-Wilk): Failed (P =  0.002079162 )Equal Variance Test (Brown-Forsythe):  Passed  (P =  0.356 )  Treatment_Name  N   Median       Q1       Q31              a 10 4.774252 4.130925 4.5414992              b 10 4.754462 4.511624 4.690952

###Dependent Variable: Indexvalue ####
###Mann-Whitney Rank Sum Test#### 
###Statistics#### 
Mann-Whitney U Statistic=  43  
n(small)=  10 , n(big)=  10 P_estimate =  0.6231762  ,P_exact =  0.5967012 
###Conclusion####The difference in the median values between the two groups is not great enough to exclude the possibility that the difference is due to random sampling variability; there is not a statistically significant difference  (P =  0.5967012 ).
Analysis finished###Distribution hypothesis####Normality Test (Shapiro-Wilk): Failed (P =  0.0127267 )Equal Variance Test (Brown-Forsythe):  Passed  (P =  0.12 )  Treatment_Name  N Median     Q1    Q31              a 10  285.5 196.75 253.72              b 10  281.0 257.50 275.5

###Dependent Variable: Indexvalue ####
###Mann-Whitney Rank Sum Test#### 
###Statistics#### 
Mann-Whitney U Statistic=  52  
n(small)=  10 , n(big)=  10 P_estimate =  0.9095864  ,P_exact =  0.8796494 
###Conclusion####The difference in the median values between the two groups is not great enough to exclude the possibility that the difference is due to random sampling variability; there is not a statistically significant difference  (P =  0.8796494 ).
Analysis finished###Distribution hypothesis####Normality Test (Shapiro-Wilk): Failed (P =  0.002801331 )Equal Variance Test (Brown-Forsythe):  Passed  (P =  0.845 )  Treatment_Name  N    Median        Q1        Q31              a 10 0.8443819 0.7964615 0.82386312              b 10 0.8451148 0.8118400 0.8349486

###Dependent Variable: Indexvalue ####
###Mann-Whitney Rank Sum Test#### 
###Statistics#### 
Mann-Whitney U Statistic=  40  
n(small)=  10 , n(big)=  10 P_estimate =  0.4726756  ,P_exact =  0.4496918 
###Conclusion####The difference in the median values between the two groups is not great enough to exclude the possibility that the difference is due to random sampling variability; there is not a statistically significant difference  (P =  0.4496918 ).
Analysis finished###Distribution hypothesis####Normality Test (Shapiro-Wilk): Failed (P =  0.0003992957 )Equal Variance Test (Brown-Forsythe):  Passed  (P =  0.955 )  Treatment_Name  N    Median        Q1        Q31              a 10 0.9852937 0.9742226 0.98104972              b 10 0.9853026 0.9751303 0.9807535

###Dependent Variable: Indexvalue ####
###Mann-Whitney Rank Sum Test#### 
###Statistics#### 
Mann-Whitney U Statistic=  49  
n(small)=  10 , n(big)=  10 P_estimate =  0.96985  ,P_exact =  0.939743 
###Conclusion####The difference in the median values between the two groups is not great enough to exclude the possibility that the difference is due to random sampling variability; there is not a statistically significant difference  (P =  0.939743 ).
Analysis finished###Distribution hypothesis####Normality Test (Shapiro-Wilk): Failed (P =  0.02471859 )Equal Variance Test (Brown-Forsythe):  Passed  (P =  0.161 )  Treatment_Name  N   Median       Q1       Q31              a 10 310.6667 245.1957 283.15032              b 10 300.8500 294.2031 297.7674

###Dependent Variable: Indexvalue ####
###Mann-Whitney Rank Sum Test#### 
###Statistics#### 
Mann-Whitney U Statistic=  54  
n(small)=  10 , n(big)=  10 P_estimate =  0.7913368  ,P_exact =  0.7623688 
###Conclusion####The difference in the median values between the two groups is not great enough to exclude the possibility that the difference is due to random sampling variability; there is not a statistically significant difference  (P =  0.7623688 ).
Analysis finished###Distribution hypothesis####Normality Test (Shapiro-Wilk): Failed (P =  0.02658585 )Equal Variance Test (Brown-Forsythe):  Passed  (P =  0.163 )  Treatment_Name  N   Median       Q1       Q31              a 10 303.4108 250.0425 279.37952              b 10 294.1081 279.7710 291.9993

###Dependent Variable: Indexvalue ####
###Mann-Whitney Rank Sum Test#### 
###Statistics#### 
Mann-Whitney U Statistic=  56  
n(small)=  10 , n(big)=  10 P_estimate =  0.677585  ,P_exact =  0.6501474 
###Conclusion####The difference in the median values between the two groups is not great enough to exclude the possibility that the difference is due to random sampling variability; there is not a statistically significant difference  (P =  0.6501474 ).
Analysis finished

3）可视化：

#柱状图Alpha_results$plotlist$Plotobj_Shannon$Barplot#箱线图Alpha_results$plotlist$Plotobj_Shannon$Boxplot#小提琴图Alpha_results$plotlist$Plotobj_Shannon$Violinplot

4）差异检验：

Alpha_results$plotlist$Plotobj_Shannon$Statistics

5）提取计算结果并自定义可视化：

####提取计算结果并基于ggplot2包自定义绘图library(ggsignif) # Significance Brackets for 'ggplot2'data_Alpha <- Alpha_results$alphaframe#Shannonp1 <- ggplot(data_Alpha[data_Alpha$Indexname=="Shannon",],       aes(group2, Indexvalue, fill = group2))+  geom_boxplot(width=0.8,outlier.color =NA)+  geom_jitter(aes(fill = group2), shape = 21, color = "black",size = 2.5,width = 0.2)+  theme_bw() +   theme(axis.title = element_text(size = 18),        panel.grid = element_blank(),        axis.text = element_text( size = 13),        legend.position = "none")+  scale_y_continuous(expand = c(0,0),limits = c(4,6.5))+  labs(x=NULL,y="Shannon")+  geom_signif(comparisons = list(c("e","f")),              map_signif_level = T, #使用*号显示显著性              test = "t.test",#检验方法              textsize = 4,#字号大小              y_position = c(6.2),#横线位置              tip_length = c(0),#横线下方线条的长度              size=0.8,color="black")#线条颜色及粗细#Simpsonp2 <- ggplot(data_Alpha[data_Alpha$Indexname=="Simpson",],       aes(group2, Indexvalue, fill = group2))+  geom_boxplot(width=0.8,outlier.color =NA)+  geom_jitter(aes(fill = group2), shape = 21, color = "black",size = 2.5,width = 0.2)+  theme_bw() +   theme(axis.title = element_text(size = 18),        panel.grid = element_blank(),        axis.text = element_text( size = 13),        legend.position = "none")+  scale_y_continuous(expand = c(0,0),limits = c(0.72,0.95))+  labs(x=NULL,y="Simpson")+  geom_signif(comparisons = list(c("e","f")),              map_signif_level = T, #使用*号显示显著性              test = "t.test",#检验方法              textsize = 4,#字号大小              y_position = c(0.92),#横线位置              tip_length = c(0),#横线下方线条的长度              size=0.8,color="black")#线条颜色及粗细#ACEp3 <- ggplot(data_Alpha[data_Alpha$Indexname=="ACE",],       aes(group2, Indexvalue, fill = group2))+  geom_boxplot(width=0.8,outlier.color =NA)+  geom_jitter(aes(fill = group2), shape = 21, color = "black",size = 2.5,width = 0.2)+  theme_bw() +   theme(axis.title = element_text(size = 18),        panel.grid = element_blank(),        axis.text = element_text( size = 13),        legend.position = "none")+  scale_y_continuous(expand = c(0,0),limits = c(290,880))+  labs(x=NULL,y="ACE")+  geom_signif(comparisons = list(c("e","f")),              map_signif_level = T, #使用*号显示显著性              test = "t.test",#检验方法              textsize = 4,#字号大小              y_position = c(800),#横线位置              tip_length = c(0),#横线下方线条的长度              size=0.8,color="black")#线条颜色及粗细#Chaop4 <- ggplot(data_Alpha[data_Alpha$Indexname=="Chao",],       aes(group2, Indexvalue, fill = group2))+  geom_boxplot(width=0.8,outlier.color =NA)+  geom_jitter(aes(fill = group2), shape = 21, color = "black",size = 2.5,width = 0.2)+  theme_bw() +   theme(axis.title = element_text(size = 18),        panel.grid = element_blank(),        axis.text = element_text( size = 13),        legend.position = "none")+  scale_y_continuous(expand = c(0,0),limits = c(290,880))+  labs(x=NULL,y="Chao")+  geom_signif(comparisons = list(c("e","f")),              map_signif_level = T, #使用*号显示显著性              test = "t.test",#检验方法              textsize = 4,#字号大小              y_position = c(800),#横线位置              tip_length = c(0),#横线下方线条的长度              size=0.8,color="black")#线条颜色及粗细library(patchwork)p1+p2+p3+p4

更多内容大家请阅读LorMe包的帮助文档！

参考：LorMe包帮助文档

微生物组系列推文合集：

扩增子测序数据分析还不会？小编整理的全套R语言代码助您轻松解决问题！（更新版）

「微生物组」基于microeco包进行共现性网络分析！！！

「微生物组」基于microeco包进行lefsef分析及随机森林分析！！！

「微生物组」Stamp结果图复现——分组差异图的绘制！！！

「微生物组」基于microeco包进行环境数据关联分析—以RDA分析为例！！！

「微生物组」基于microeco包进行共享/独特OTU计算及可视化！！！

「微生物组」基于microeco包进行Alpha多样性分析！！！

「微生物组」基于microeco包进行微生物群落结构组成分析！！！

R可视化——Mantel test分析及可视化

基于R语言的微生物群落组成多样性分析——α多样性及其可视化

基于R语言的微生物群落组成多样性分析—β多样性之PCoA分析

基于R语言的微生物群落组成多样性分析—β多样性之组间差异性检验

基于R语言的微生物群落组成多样性分析——物种丰度计算及可视化

基于R语言的微生物群落组成多样性分析——物种丰度可视化之热图（Heatmap）

基于R语言的微生物群落组成多样性分析——物种丰度可视化之弦图（Chord Diagram）

基于R语言的微生物群落组成多样性分析——共线性网络分析

基于R语言的微生物群落组成多样性分析——RDA分析

基于R语言的微生物群落组成多样性分析——CCA分析

基于R语言的微生物群落组成多样性分析——PCA分析

基于R语言的微生物群落组成多样性分析——聚类分析

基于LEfSe分析进行微生物物种差异及关联分析

基于R语言的微生物群落组成分析—差异OTU筛选及展示

基于R语言的微生物群落组成多样性分析——NMDS分析

基于R语言的微生物群落组成分析—α多样性指数的正态分布检验及差异性标注（）

R可视化——基于MicrobiomeStat包进行微生物组数据分析（更新版）！！！

基于R语言进行扩增子数据的UMAP分析！

R可视化——共现性网络分析第二弹

R可视化——基于MicrobiotaProcess包进行物种群落结构组成分析！

R可视化——基于MicrobiotaProcess包进行物种差异分析！

基于microeco包进行PCoA分析！！！

基于MicrobiotaProcess包进行PCoA分析！

R可视化——基于MicrobiotaProcess包进行Alpha多样性分析！

PS: 以上内容是小编个人学习代码笔记分享，仅供参考学习，欢迎大家一起交流学习。

R绘图模板合集（包括附带注释的源码、测试数据及绘制效果图）可在公众号后台菜单栏查看具体获取方式！绘图模板合集效果图展示：

http://mp.weixin.qq.com/s?__biz=MzkzODM4NzQ1NQ==&mid=2247495795&idx=1&sn=1390c1ad1062bfbf061898384240e6c3

科研后花园

专注于R语言绘图及数据分析！

最新文章

独自一人，怒发顶会！！

「R绘图模板」并列柱状图+组内差异字母标记+组间差异字母标记！！！

真心建议！还没有课题的都进来看看！

如何看懂宏基因组分析结果，做个性化分析，深入挖掘数据，高效发文章？

「微生物组」基于LorMe包进行微生物组数据分析—网络分析！！！

为啥本科生都能发顶会，而博士一篇都没有？

聪明医生选择智能专病库！ 9月已有 800+ 主任下单~ 比Excel更高效、更精准！

「R绘图模板」中国地图+省份子图+采样点放大效果展示！！！

诚征男友！98年妹纸，北大本科，人大硕士，部委公务员，偶尔被夸好看，江浙沪独女

「微生物组」基于LorMe包进行微生物组数据分析—物种差异分析！！！

2024年学术志团队新作重磅上市！

科研绘图必不可少的工具确定不了解一下吗？

一篇SCI学校奖励8万！靠SCI论文奠定学术圈地位！

「微生物组」基于LorMe包进行微生物组数据分析—微生物群落结构组成分析！！！

「R绘图模板」世界地图+采样点标记+子地图放大效果!!!

发一篇顶会真不难

「微生物组」基于LorMe包进行微生物组数据分析—Beta多样性分析！！！

扩增子测序数据分析还不会？小编整理的全套R语言代码助您轻松解决问题！（更新版）

专补大模型短板的RAG入门与实战书来了！

发一篇顶会真不难

「R绘图模板」跟着Cell学绘图—双分组倾斜火山图！！！

一文教你自己做宏基因组数据分析！

「R绘图模板」组合型热图+分组差异图等在多组学技术中的重要体现，生信必备!

「微生物组」基于LorMe包进行微生物组数据分析—Alpha多样性分析！！！

扩增子测序数据分析还不会？小编整理的全套R语言代码助您轻松解决问题！（更新版）

2025年最全申博攻略！

「R绘图技巧」基于ggplot2包绘制的图形设置坐标轴以百分比或者对数样式显示！！！

「R绘图模板」并列箱线图+散点+气泡图+分组+显著性！！！

跟着Nature学绘图！数据可视化+1比1完美复现！生信小白也能“逆袭！”

「图书推荐」保姆级R语言绘图教程，带你轻轻松松入门R语言绘图！！！

「R绘图模板」气泡图+非等宽分面+分组！！！

跟着GPT做科研，让绘图与数据更有魅力，纯数据分析与挖掘…

「R绘图模板」箱线图+散点+均值连线+显著性的箱上箱下标注！！！

招博士！加入我们！

「R绘图模板」环形多组火山图+标签！！！

「微生物组」基于microeco包进行共现性网络分析！！！

扩增子测序数据分析还不会？小编整理的全套R语言代码助您轻松解决问题！（更新版）

OpenAI发布最强模型o1，大模型杀疯了！

发SCI真不难！

「R绘图模板」组合图系列—相对丰度柱状堆积图+单组柱状图+显著性！！！

数据分析 | 微生物宏基因组数据组装、分箱等高级分析如何可视化？

「R绘图模板」气泡图+分组+标签+拟合曲线！！！

豆瓣9.6，多语言版本全球发行，这本书为啥这么受欢迎？

「R绘图模板」多变量热图+r值+p值+其他变量！！！

找到一个快速发论文的方法

「R绘图模板」用于两组比较的多图形并列组合图-小提琴图+箱线图+散点图！！！

跟着Nature Communications学绘图，让数据更有魅力，纯数据分析与挖掘！

的确可以封神了！原来用GPT科研绘图、数据处理这么简单...

2024年8月份绘图模板教程一览

「R绘图模板」跟着Science学绘图—世界地图+采样点标记气泡图+分布频率注释侧图！！！

分类

时事

民生

政务

教育

文化

科技

财富

体娱

健康

情感

旅行

百科

职场

楼市

企业

乐活

学术

汽车

时尚

创业

美食

幽默

美体

文摘

原创标签

时事社会财经军事教育体育科技汽车科学房产搞笑综艺明星音乐动漫游戏时尚健康旅游美食生活摄影宠物职场育儿情感小说曲艺文化历史三农文学娱乐电影视频图片新闻宗教电视剧纪录片广告创意壁纸头像心灵鸡汤星座命理教育培训艺术文化金融财经健康医疗美妆时尚餐饮美食母婴育儿社会新闻工业农业时事政治星座占卜幽默笑话独立短篇连载作品文化历史科技互联网

发布位置

广东北京山东江苏河南浙江山西福建河北上海四川陕西湖南安徽湖北内蒙古江西云南广西甘肃辽宁黑龙江贵州新疆重庆吉林天津海南青海宁夏西藏香港澳门台湾美国加拿大澳大利亚日本新加坡英国西班牙新西兰韩国泰国法国德国意大利缅甸菲律宾马来西亚越南荷兰柬埔寨俄罗斯巴西智利卢森堡芬兰瑞典比利时瑞士土耳其斐济挪威朝鲜尼日利亚阿根廷匈牙利爱尔兰印度老挝葡萄牙乌克兰印度尼西亚哈萨克斯坦塔吉克斯坦希腊南非蒙古奥地利肯尼亚加纳丹麦津巴布韦埃及坦桑尼亚捷克阿联酋安哥拉