Serial publication
Analytical references
Author: Park, Junyong; Ghosh, Jayanta K.
Title: Persistence of plug-in rule in classification of high dimensional multivariate binary data
Pages/Collation: pp. 3687-3705, 19 pp.
Journal of Statistical Planning and Inference, Vol. 137, No. 11, November 2007

Abstract
In this paper, we consider the classification problem when the predictors are multivariate binary random variables. Variables are modeled as independent, but not necessarily identical, Bernoulli. A triangular array for parameters, (p11(n),…,p1d(n), p21(n),…,p2d(n)), is assumed to allow parameters to change and the number of the variables, d, to increase for adopting more flexible models as the sample size, n, increases. Our results are obtained under moderate assumptions on the triangular array of the probability vectors. We use maximum likelihood estimators for the parameters and plug them into the Bayes classifier. This is a plug-in classifier, a sort of objective Bayes rule. It is shown in Wilbur et al. [2002. Variable selection in high-dimensional multivariate binary data with application to the analysis of microbial DNA fingerprints. Biometrics 58, 378–386] via simulations that the plug-in rule classifies quite well even when the assumption of independence is violated. The main interest in this paper is in the complex case of d/n^α → c for some α > 0 and c > 0, for which very little is known. Using linearity of the plug-in rule, we show its persistence, a generalization of the notion of consistency, when the variance of the plug-in rule or a quantity measuring signal to noise ratio is divergent; otherwise we show there exists an example of non-persistence of the plug-in rule. In case of non-persistence, we introduce the notion of sparsity and overcome non-persistence by selecting a subset of the variables. This shows why a variable selection procedure may be effective especially for contemporary practical problems with high dimensional data [Wilbur et al., 2002. Variable selection in high-dimensional multivariate binary data with application to the analysis of microbial DNA fingerprints. Biometrics 58, 378–386].
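
The plug-in rule described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' code. It assumes two classes with equal prior probabilities, and it clips the estimated probabilities away from 0 and 1 so the logarithms stay finite; both choices, along with the function names `fit_bernoulli_mle`, `plug_in_rule`, and `classify`, are assumptions made here for illustration. The sketch shows why the rule is linear in the binary predictors: under independence, the log-likelihood ratio is an intercept plus a weighted sum of the x_j.

```python
import math


def fit_bernoulli_mle(rows, eps=1e-3):
    """Per-variable MLE of P(X_j = 1) from a list of 0/1 rows.

    The clipping into [eps, 1 - eps] is an assumption of this sketch,
    added only to keep the logarithms below finite.
    """
    n = len(rows)
    d = len(rows[0])
    p = [sum(r[j] for r in rows) / n for j in range(d)]
    return [min(max(pj, eps), 1.0 - eps) for pj in p]


def plug_in_rule(p1, p2):
    """Return (weights, intercept) of the log-likelihood ratio.

    For independent Bernoulli variables the log ratio of the class-1
    likelihood to the class-2 likelihood is
        intercept + sum_j weights[j] * x[j],
    which is linear in x.
    """
    weights = [
        math.log(a / b) - math.log((1.0 - a) / (1.0 - b))
        for a, b in zip(p1, p2)
    ]
    intercept = sum(math.log((1.0 - a) / (1.0 - b)) for a, b in zip(p1, p2))
    return weights, intercept


def classify(x, weights, intercept):
    """Assign class 1 if the plug-in score is positive, else class 2.

    Equal class priors are assumed, so no prior term is added.
    """
    score = intercept + sum(w * xj for w, xj in zip(weights, x))
    return 1 if score > 0 else 2


# Toy training data (made up for illustration only).
class1 = [[1, 1, 0], [1, 0, 0], [1, 1, 0], [1, 1, 1]]
class2 = [[0, 0, 1], [0, 1, 1], [0, 0, 1], [0, 0, 0]]

p1_hat = fit_bernoulli_mle(class1)
p2_hat = fit_bernoulli_mle(class2)
weights, intercept = plug_in_rule(p1_hat, p2_hat)

label_a = classify([1, 1, 0], weights, intercept)
label_b = classify([0, 0, 1], weights, intercept)
```

Variable selection, as mentioned at the end of the abstract, would amount to keeping only a subset of the entries of `weights` before scoring; how that subset is chosen is not specified in this record and is not sketched here.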


UCLA - Biblioteca de Ciencias y Tecnologia Felix Morales Bueno
