Two-Group Mean Tests: Theoretical Basis and Sample Size Estimation in PASS

2024-03-09 19:34 Shanghai

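Sample size for comparing two means is a well-worn topic. To help readers understand hypothesis testing under different data distributions, we revisit the basic concepts here and work through examples to reinforce them.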

Superiority

1. Two-Sample T-Tests for Superiority by a Margin Assuming Equal Variance

1.1 The Statistical Hypotheses

1.2 Superiority by a Margin Tests

1.3 Two-Sample Equal-Variance T-Test Statistic

1.4 Computing the Power

2. Two-Sample T-Tests for Superiority by a Margin Allowing Unequal Variance

2.1 Two-Sample Unequal-Variance T-Test (Welch’s T-Test) Statistic

2.2 Computing the Power

3. Mann-Whitney U or Wilcoxon Rank-Sum Tests for Superiority by a Margin

3.1 Mann-Whitney U or Wilcoxon Rank-Sum Test Statistic

3.2 Computing the Power

4. Superiority by a Margin Tests for the Ratio of Two Means (Log-Normal Data)

4.1 Superiority Testing Using Ratios

4.2 Log-Transformation

4.3 Coefficient of Variation

4.4 Example 1 – Finding Power

A company has developed a drug for treating rheumatism and wants to show that it is superior to the standard drug by a certain amount. Responses following either treatment are known to follow a log normal distribution. A parallel-group design will be used, and the logged data will be analyzed with a two-sample t test. Researchers have decided to set the margin of superiority at 0.20. Past experience leads the researchers to set the COV to 1.50. The significance level is 0.025. The power will be computed assuming that the true ratio is either 1.30 or 1.40. Sample sizes between 100 and 1000 will be included in the analysis.

Step 1: Enter the parameters

Step 2: Results
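As a rough cross-check of the power values that PASS reports for this example, the calculation can be sketched with a normal approximation on the log scale. This is not PASS's own algorithm (PASS works with the t distribution), so the numbers will differ slightly. The function name, the equal group sizes, and the particular sample sizes printed are assumptions made for illustration.

```python
from math import log, sqrt
from statistics import NormalDist


def approx_power_ratio_superiority(n_per_group, true_ratio,
                                   margin=0.20, cov=1.50, alpha=0.025):
    """Normal-approximation power of a one-sided superiority test on logged data.

    Assumes equal group sizes and that higher responses are better.
    """
    sm_log = log(1.0 + margin)            # superiority bound on the log scale
    delta = log(true_ratio)               # true log ratio of the means
    sigma = sqrt(log(cov ** 2 + 1.0))     # SD of the logged data
    se = sigma * sqrt(2.0 / n_per_group)  # SE of the difference in log means
    z_crit = NormalDist().inv_cdf(1.0 - alpha)
    return NormalDist().cdf((delta - sm_log) / se - z_crit)


# Illustrative sample sizes between 100 and 1000 per group.
for ratio in (1.30, 1.40):
    for n in (100, 400, 700, 1000):
        print(ratio, n, round(approx_power_ratio_superiority(n, ratio), 4))
```

Power rises with the sample size and with the distance between the true ratio and the superiority bound 1 + 0.20, which is the pattern to look for in the PASS output.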

4.5 Example 2 – Validation using Another Procedure

We validate the result with the Two-Sample T-Tests for Superiority by a Margin Assuming Equal Variance procedure, after converting the inputs to the log scale:

$SM' = \ln(1 + SM) = \ln(1.2) = 0.182322$

$\delta = \ln(R_1) = \ln(1.3) = 0.262364$

$\sigma = \sqrt{\ln(COV^2 + 1)} = \sqrt{\ln(1.5^2 + 1)} = 1.085659$
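The three log-scale inputs can be reproduced in a few lines. Note that the standard deviation of the logged data is the square root of ln(COV² + 1). The variable names below are chosen for illustration only.

```python
from math import log, sqrt

margin = 0.20   # superiority margin on the ratio scale (SM)
ratio = 1.30    # assumed true ratio of the means (R1)
cov = 1.50      # coefficient of variation on the original scale

sm_log = log(1.0 + margin)          # margin on the log scale
delta = log(ratio)                  # true difference of the log means
sigma = sqrt(log(cov ** 2 + 1.0))   # SD of the logged data

print(f"{sm_log:.6f} {delta:.6f} {sigma:.6f}")
# prints: 0.182322 0.262364 1.085659
```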

Step 1: Enter the parameters

Step 2: Results

The power agrees with the value highlighted in red in Example 1.

5. Superiority by a Margin Tests for the Ratio of Two Means (Normal Data)

5.1 Superiority Testing Using Ratios

5.2 Coefficient of Variation

5.3 Power Calculation

5.4 Tests

5.4.1 Equal Variances T-Test

5.4.2 Unequal Variances Large Sample Z-Test

5.4.3 Unequal Variances Satterthwaite T-Test

5.4.4 Unequal Variances Delta Method Z-Test

Non-inferiority

This is similar to superiority; only the direction of the margin differs.

Tests for a Difference

The two-sample t-test is commonly used in this situation. When the variances of the two groups are unequal, Welch’s unequal-variance t-test is often used. When the data are not normally distributed, the Mann-Whitney U (or Wilcoxon Rank-Sum) test may be used.
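To make the three options concrete, here is a minimal sketch of the statistics themselves: the pooled (equal-variance) t statistic, Welch's t statistic with its Satterthwaite degrees of freedom, and the Mann-Whitney U statistic. The function names are illustrative, and p-values are left out because they need a t or U reference distribution.

```python
from math import sqrt
from statistics import mean, variance


def pooled_t(x, y):
    """Equal-variance two-sample t statistic and its degrees of freedom."""
    n1, n2 = len(x), len(y)
    sp2 = ((n1 - 1) * variance(x) + (n2 - 1) * variance(y)) / (n1 + n2 - 2)
    t = (mean(x) - mean(y)) / sqrt(sp2 * (1 / n1 + 1 / n2))
    return t, n1 + n2 - 2


def welch_t(x, y):
    """Welch's unequal-variance t statistic with Satterthwaite degrees of freedom."""
    n1, n2 = len(x), len(y)
    v1, v2 = variance(x) / n1, variance(y) / n2
    t = (mean(x) - mean(y)) / sqrt(v1 + v2)
    df = (v1 + v2) ** 2 / (v1 ** 2 / (n1 - 1) + v2 ** 2 / (n2 - 1))
    return t, df


def mann_whitney_u(x, y):
    """U statistic for x: number of pairs with x_i > y_j, ties counted as 1/2."""
    return sum((xi > yj) + 0.5 * (xi == yj) for xi in x for yj in y)


x = [1.0, 2.0, 3.0, 4.0]
y = [2.0, 4.0, 6.0, 8.0]
print(pooled_t(x, y), welch_t(x, y), mann_whitney_u(x, y))
```

With equal group sizes the two t statistics coincide and only the degrees of freedom differ; Welch's are smaller when the sample variances are unequal.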

1. Technical Details

2. Generating Random Distributions

3. Test Statistics

3.1 Two-Sample T-Test with Equal Variances

3.2 Welch’s Unequal-Variance T-Test

3.3 Trimmed T-Test with Equal Variances

3.4 Trimmed T-Test with Unequal Variances

3.5 Mann-Whitney U or Wilcoxon Rank-Sum Test

4. Standard Deviations

Specify the distribution that represents the type of data you expect from your study.

The possible distributions are Beta, Exponential, Gamma, Gumbel, Laplace, Logistic, Lognormal, Normal, Poisson, TukeyGH, Uniform, Weibull.

All these distributions can be specified with a mean and standard deviation. The Beta and TukeyGH distributions each require two additional parameters. The Exponential and Poisson distributions require only the mean to be specified since the standard deviation can be computed from the mean.
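The reason the Exponential and Poisson distributions need only a mean is that their standard deviation is a function of it: for the Exponential the SD equals the mean, and for the Poisson it is the square root of the mean. A small sketch, with a hypothetical helper name:

```python
from math import sqrt


def implied_sd(distribution, mean):
    """Standard deviation implied by the mean for one-parameter distributions."""
    if distribution == "exponential":
        return mean          # SD of an Exponential equals its mean
    if distribution == "poisson":
        return sqrt(mean)    # variance of a Poisson equals its mean
    raise ValueError("the SD must be specified separately for this distribution")


print(implied_sd("exponential", 2.5), implied_sd("poisson", 9.0))
```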

Normal vs. TukeyGH

Tukey's distribution can be used to generate values that are nearly Normal, with departure from normality controlled by entering skewness (G) and kurtosis (H) parameters. Tukey's distribution with G = H = 0 is equivalent to the Normal distribution.
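A common way to generate such values is the g-and-h transformation of a standard normal variate, sketched below. How PASS rescales the result to a requested mean and standard deviation is not described here, so this sketch stops at the unscaled transformation; the function name is an assumption.

```python
import random
from math import exp


def tukey_gh(z, g=0.0, h=0.0):
    """Transform a standard normal value z with skewness g and tail weight h."""
    core = z if g == 0 else (exp(g * z) - 1.0) / g   # g controls the skewness
    return core * exp(h * z * z / 2.0)               # h controls the tail heaviness


rng = random.Random(1)
normal_like = [tukey_gh(rng.gauss(0.0, 1.0)) for _ in range(5)]        # G = H = 0
skewed = [tukey_gh(rng.gauss(0.0, 1.0), g=0.5) for _ in range(5)]      # right-skewed
print(normal_like)
print(skewed)
```

With G = H = 0 the function returns z unchanged, which is the Normal case mentioned above.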

5. Example 1 – Power at Various Sample Sizes

Researchers are planning a parallel-group experiment to test whether the difference in response to a certain drug is zero. The researchers will use a two-sided t-test with an alpha level of 0.05. They want to compare the power at sample sizes of 50, 100, and 200 when the shift in the means is 0.6 from drug 1 to drug 2. They assume that the data are normally distributed with a standard deviation of 2. Since this is an exploratory analysis, they set the number of simulation iterations to 2000.

Step 1: Enter the parameters

Step 2: Results
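The simulation idea behind this example can be sketched outside PASS: draw two normal samples, compute the equal-variance t statistic, and count how often it exceeds the critical value. The sketch below assumes the sample sizes are per group and uses the normal critical value as a large-sample stand-in for the t critical value, so its results will only approximate what PASS reports. The function name and seed are illustrative.

```python
import random
from math import sqrt
from statistics import NormalDist


def mean_and_variance(values):
    """Sample mean and unbiased sample variance."""
    m = sum(values) / len(values)
    return m, sum((v - m) ** 2 for v in values) / (len(values) - 1)


def simulated_power(n_per_group, shift=0.6, sd=2.0, alpha=0.05,
                    iterations=2000, seed=12345):
    """Share of simulated trials in which the two-sided test rejects."""
    rng = random.Random(seed)
    # Assumption: normal quantile used in place of the t quantile (df >= 98 here).
    z_crit = NormalDist().inv_cdf(1.0 - alpha / 2.0)
    rejections = 0
    for _ in range(iterations):
        x = [rng.gauss(0.0, sd) for _ in range(n_per_group)]
        y = [rng.gauss(shift, sd) for _ in range(n_per_group)]
        mx, vx = mean_and_variance(x)
        my, vy = mean_and_variance(y)
        sp2 = (vx + vy) / 2.0   # pooled variance for equal group sizes
        t = (my - mx) / sqrt(sp2 * 2.0 / n_per_group)
        if abs(t) > z_crit:
            rejections += 1
    return rejections / iterations


for n in (50, 100, 200):
    print(n, simulated_power(n))
```

Setting shift=0 in the same function gives an estimate of the type I error rate, which is how a simulated alpha can be obtained.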

6. Example 2 – Comparative Results

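Step 1: Enter the parameters. Because this example compares the test methods, set the comparative option to checked; PASS then outputs the comparison tables and plots.

Step 2: Results

The table and plots above show that Welch's test has the highest power at the small, medium, and large sample sizes. The type I error is smallest for the t-test and Welch's test at the small and large sample sizes, and smallest for the trimmed tests at the medium sample size.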

7. Example 3 – Selecting a Test Statistic when the Data Contain Outliers

The two-sample t-test is known to be robust to the violation of some assumptions, but it is susceptible to inaccuracy when the data contain outliers. This example will investigate the impact of outliers on the power and precision of the five test statistics available in PASS.

A mixture of two normal distributions will be used to randomly generate outliers. The mixture will draw 95% of the data from a normal distribution with a mean of 0 and a standard deviation of 1. The other 5% of the data will come from a normal distribution with a mean of 0 and a standard deviation that ranges from 1 to 10.
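The mixture described above is straightforward to sketch: each value comes from the outlier component with probability 0.05 and from the main component otherwise. Because both components have mean 0, the variance of the mixture is the weighted sum of the component variances. The function names and seed below are illustrative.

```python
import random
from math import sqrt


def contaminated_normal(rng, sd_main=1.0, sd_outlier=10.0, outlier_fraction=0.05):
    """Draw one value from the two-component normal mixture (both means are 0)."""
    sd = sd_outlier if rng.random() < outlier_fraction else sd_main
    return rng.gauss(0.0, sd)


def mixture_sd(sd_main=1.0, sd_outlier=10.0, outlier_fraction=0.05):
    """Overall SD of the mixture when both components share the same mean."""
    return sqrt((1.0 - outlier_fraction) * sd_main ** 2
                + outlier_fraction * sd_outlier ** 2)


rng = random.Random(2024)
sample = [contaminated_normal(rng) for _ in range(10)]
print(sample)
for outlier_sd in (1.0, 5.0, 10.0):
    print(outlier_sd, round(mixture_sd(sd_outlier=outlier_sd), 3))
```

Even a 5% share of outliers inflates the overall standard deviation noticeably, which is what erodes the power of the t-test in this example.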

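Step 1: Enter the parameters as highlighted.

Step 2: Results

The first row gives the results for the standard case in which the two standard deviations (S and A) are equal.

Note that in this case the power of the t-test is slightly higher than that of the other tests. As the standard deviation increases (A equal to 5, then 10), the power of the trimmed tests and the Mann-Whitney test remains high, but the power of the t-test falls from 88% to 43%. In addition, alpha stays the same for the trimmed and nonparametric tests, whereas the alpha of the t-test becomes very conservative.

The conclusion of this simulation is that if outliers are a possibility, the nonparametric test or the trimmed test should be used.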

8. Example 4 – Selecting a Test Statistic when the Data are Skewed

The two-sample t-test is known to be robust to the violation of some assumptions, but it is susceptible to inaccuracy when the underlying distributions are skewed. This example will investigate the impact of skewness on the power and precision of the five test statistics available in PASS.

Tukey’s lambda distribution will be used because it allows the amount of skewness to be gradually increased.

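Step 1: Enter the parameters

Step 2: Results

The first row gives the results for the standard case with no skewness (G = 0). Note that in this case the power of the t-test is slightly higher than that of the other tests. As the skewness increases (G equal to 0.5, then 0.9), the power of the trimmed tests and the Mann-Whitney test increases, while the power of the t-test stays roughly the same. In addition, alpha stays the same for all of the tests.

The conclusion of this simulation is that if the data are skewed, you gain power by using the nonparametric or trimmed tests.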

Take home message

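1. The two-sample t-test is the usual choice. When the variances of the two groups are unequal, Welch's unequal-variance t-test is often used. When the data are not normally distributed, the Mann-Whitney U (or Wilcoxon Rank-Sum) test may be used.

2. If outliers are a possibility, the nonparametric test or the trimmed test should be used.

3. If the data are skewed, you gain power by using the nonparametric or trimmed tests.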

References

PASS documentation

http://mp.weixin.qq.com/s?__biz=MzU3NzY1MzgxOQ==&mid=2247490729&idx=1&sn=25b624c5ddf8a2074f62edac86c14448

Epidemiology and Health Statistics

Pivot data exchange platform: weekly posts on clinical trial design, conduct, statistics, and related topics.
