ゼロ・インフレーションされたカウント・データに対する均質性のロバストなスコア検定

/ /

日本語AIでPubMedを検索

PubMedの提供する医学論文データベースを日本語で検索できます。AI(Deep Learning)を活用した機械翻訳エンジンにより、精度高く日本語へ翻訳された論文をご参照いただけます。

Stat Methods Med Res.2020 Jul;:962280220937324. doi: 10.1177/0962280220937324.Epub 2020-07-10.

ゼロ・インフレーションされたカウント・データに対する均質性のロバストなスコア検定

A robust score test of homogeneity for zero-inflated count data.

Wei-Wen Hsu
David Todem
Nadeesha R Mawella
KyungMann Kim
Richard R Rosenkranz

PMID: 32646310 DOI: 10.1177/0962280220937324.

抄録

ゼロ・インフレーション・モデルの多くの応用では、スコア検定が、これらのモデルによって暗示される母集団の不均質性がデータと一致しているかどうかを評価するためによく使用される。スコア検定を使用するための最も頻繁に引用される正当化は、帰無仮説の下での推定だけを必要とするということです。この推定は帰無仮説と一致するもっともらしいモデルを指定することを伴うので、検定の手順はモデルの不特定化の下で信頼性の低い推論を導く可能性がある。本論文では、ある種のモデルの誤仕様に対してロバストな、ゼロインフレモデルの同質性のスコア検定を提案する。実践的な環境では真のモデルは未知であるため、我々の提案は、モデル仕様の不確実性を考慮してモデルにランダム性の層を課す混合モデルの一般的な枠組みの下で開発されている。我々は、このアプローチをゼロインフレーションしたポアソンモデルのクラスで例証する。ここでは、ランダム項がポアソン平均に課され、平均モデルから欠落した共変量や関数形の指定ミスを調整する。この例では、帰無仮説の下で、よく指定された非ランダム平均モデルのパワーは失われるものの、結果として得られるゼロインフレのスコア検定は、すべてのレベルで経験的な大きさを維持することをシミュレーションを通して示す。ガールスカウトの若年層の健康増進活動の頻度と都市部の子供たちの虫歯指数を用いて、提案した検定方法のロバスト性を説明する。

In many applications of zero-inflated models, score tests are often used to evaluate whether the population heterogeneity as implied by these models is consistent with the data. The most frequently cited justification for using score tests is that they only require estimation under the null hypothesis. Because this estimation involves specifying a plausible model consistent with the null hypothesis, the testing procedure could lead to unreliable inferences under model misspecification. In this paper, we propose a score test of homogeneity for zero-inflated models that is robust against certain model misspecifications. Due to the true model being unknown in practical settings, our proposal is developed under a general framework of mixture models for which a layer of randomness is imposed on the model to account for uncertainty in the model specification. We exemplify this approach on the class of zero-inflated Poisson models, where a random term is imposed on the Poisson mean to adjust for relevant covariates missing from the mean model or a misspecified functional form. For this example, we show through simulations that the resulting score test of zero inflation maintains its empirical size at all levels, albeit a loss of power for the well-specified non-random mean model under the null. Frequencies of health promotion activities among young Girl Scouts and dental caries indices among inner-city children are used to illustrate the robustness of the proposed testing procedure.