• Title/Summary/Keyword: non-parametric regression model

ASYMPTOTIC NORMALITY OF ESTIMATOR IN NON-PARAMETRIC MODEL UNDER CENSORED SAMPLES

  • Niu, Si-Li;Li, Qian-Ru
    • Journal of the Korean Mathematical Society
    • /
    • v.44 no.3
    • /
    • pp.525-539
    • /
    • 2007
  • Consider the regression model $Y_i = g(x_i) + e_i$ for $i = 1, 2, \ldots, n$, where (1) the $x_i$ are fixed design points, (2) the $e_i$ are independent random errors with mean zero, and (3) $g(\cdot)$ is an unknown regression function defined on $[0, 1]$. When the $Y_i$ are randomly censored, we discuss the asymptotic normality of the weighted kernel estimators of $g$ when the censoring distribution function is known or unknown.
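
For orientation, one common way to build a weighted kernel estimator from censored responses is sketched below. This is an assumption for illustration, not necessarily the construction used in the paper: the censoring times $T_i$, their distribution function $G$, the indicators $\delta_i$, and the kernel weights $W_{ni}(x)$ are notation introduced here. The identity in the second line assumes that $T_i$ is independent of $Y_i$, that $G$ is continuous, and that $1 - G(Y_i) > 0$ almost surely.

```latex
\begin{align*}
Z_i &= \min(Y_i, T_i), \qquad \delta_i = \mathbf{1}\{Y_i \le T_i\}, \\
\mathbb{E}\!\left[\frac{\delta_i Z_i}{1 - G(Z_i)}\right]
  &= \mathbb{E}\!\left[\frac{Y_i \,\Pr(T_i \ge Y_i \mid Y_i)}{1 - G(Y_i)}\right]
   = \mathbb{E}[Y_i] = g(x_i), \\
\hat{g}_n(x) &= \sum_{i=1}^{n} W_{ni}(x)\, \frac{\delta_i Z_i}{1 - G(Z_i)}.
\end{align*}
```

When $G$ is unknown, a natural variant replaces it with an estimate such as the Kaplan-Meier estimator of the censoring distribution.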

Application of machine learning models for estimating house price (단독주택가격 추정을 위한 기계학습 모형의 응용)

  • Lee, Chang Ro;Park, Key Ho
    • Journal of the Korean Geographical Society
    • /
    • v.51 no.2
    • /
    • pp.219-233
    • /
    • 2016
  • In the social sciences, statistical models are used almost exclusively for causal explanation, and explanatory modeling has been the mainstream until now. In contrast, predictive modeling has been rare in these fields. Hence, we focus on constructing a predictive non-parametric model instead of an explanatory model. Gangnam-gu, Seoul was chosen as the study area, and we collected sales data for single-family houses sold between 2011 and 2014. We applied non-parametric models from the machine learning field, including the generalized additive model (GAM), random forest, multivariate adaptive regression splines (MARS), and support vector machines (SVM). More recently developed models such as MARS and SVM were found to be superior in predictive power for house price estimation. Finally, spatial autocorrelation was additionally accounted for in the non-parametric models, and the results showed that their predictive power was enhanced further. We hope that this study will prompt the methodology for property price estimation to be extended from traditional parametric models to non-parametric ones.

Note on response dimension reduction for multivariate regression

  • Yoo, Jae Keun
    • Communications for Statistical Applications and Methods
    • /
    • v.26 no.5
    • /
    • pp.519-526
    • /
    • 2019
  • Response dimension reduction in the sufficient dimension reduction (SDR) context was largely ignored until Yoo and Cook (Computational Statistics and Data Analysis, 53, 334-343, 2008) established a theory for it and developed an estimation approach. Recent research in SDR shows that a semi-parametric approach can outperform conventional non-parametric SDR methods. Yoo (Statistics: A Journal of Theoretical and Applied Statistics, 52, 409-425, 2018) developed a semi-parametric approach to response reduction in the context of Yoo and Cook (2008), and Yoo (Journal of the Korean Statistical Society, 2019) completed the semi-parametric approach by proposing an unstructured method. This paper discusses the three versions of the semi-parametric approach theoretically and provides remarks that can be useful for statistical practitioners. Numerical instability can also be avoided by presenting the results for an orthogonal transformation of the response variables.

The Rank Transform Method in Nonparametric Fuzzy Regression Model

  • Choi, Seung-Hoe;Lee, Myung-Sook
    • Journal of the Korean Data and Information Science Society
    • /
    • v.15 no.3
    • /
    • pp.617-624
    • /
    • 2004
  • This article introduces the rank of fuzzy numbers and the fuzzy rank transformation method in order to analyse the non-parametric fuzzy regression model, which cannot be described by a specific functional form, with crisp data as the independent variables and fuzzy data as the dependent variables. The effectiveness of the fuzzy rank transformation method is compared with that of other methods through numerical examples.

Kernel Regression with Correlation Coefficient Weighted Distance (상관계수 가중법을 이용한 커널회귀 방법)

  • Shin, Ho-Cheol;Park, Moon-Ghu;Lee, Jae-Yong;You, Skin
    • Proceedings of the KIEE Conference
    • /
    • 2006.10c
    • /
    • pp.588-590
    • /
    • 2006
  • Recently, many on-line approaches to instrument channel surveillance (drift monitoring and fault detection) have been reported worldwide. The on-line monitoring (OLM) method evaluates instrument channel performance by assessing its consistency with other plant indications through parametric or non-parametric models. The heart of an OLM system is the model that estimates the true process parameter value from the individual measurements. The model calculates this estimate as a function of other plant measurements, and the estimate can be used to identify small sensor drifts that would require the sensor to be manually calibrated or replaced. This paper describes an improvement of auto-associative kernel regression that introduces a correlation coefficient weighting on the kernel distances. The prediction performance of the developed method is compared with that of conventional auto-associative kernel regression.
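
The idea of weighting kernel distances by correlation can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' algorithm: the function names `pearson` and `aakr_predict`, the Gaussian kernel, and the specific choice of using the absolute Pearson correlation with the target signal as the per-signal distance weight are all hypothetical.

```python
import math


def pearson(a, b):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    return cov / math.sqrt(var_a * var_b)


def aakr_predict(memory, query, bandwidth):
    """Auto-associative kernel regression with correlation-weighted distances.

    memory    : list of historical (fault-free) observation vectors
    query     : current observation vector, same length as each memory row
    bandwidth : Gaussian kernel bandwidth (assumed tuning parameter)

    Returns an estimate of the "true" value of every signal in `query`.
    """
    p = len(query)
    columns = [[row[j] for row in memory] for j in range(p)]
    estimate = []
    for t in range(p):
        # Assumed weighting: signals strongly correlated with the target
        # signal t contribute more to the distance.
        weights = [abs(pearson(columns[j], columns[t])) for j in range(p)]
        kernels = []
        for row in memory:
            d2 = sum(weights[j] * (row[j] - query[j]) ** 2 for j in range(p))
            kernels.append(math.exp(-d2 / (2.0 * bandwidth ** 2)))
        total = sum(kernels)
        # Kernel-weighted average of the stored values of signal t.
        estimate.append(sum(k * row[t] for k, row in zip(kernels, memory)) / total)
    return estimate
```

In a monitoring setting, the residual between `query` and the returned estimate would be the quantity inspected for drift. The sketch assumes no signal is constant across the memory (otherwise the correlation is undefined) and that at least one memory vector lies close enough to the query for the kernel sum to be non-zero.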

Finite-Sample, Small-Dispersion Asymptotic Optimality of the Non-Linear Least Squares Estimator

  • So, Beong-Soo
    • Journal of the Korean Statistical Society
    • /
    • v.24 no.2
    • /
    • pp.303-312
    • /
    • 1995
  • We consider the following type of general semi-parametric non-linear regression model: $y_i = f_i(\theta) + \epsilon_i$, $i = 1, \ldots, n$, where $\{f_i(\cdot)\}$ represents the set of non-linear functions of the unknown parameter vector $\theta' = (\theta_1, \ldots, \theta_p)$ and $\{\epsilon_i\}$ represents the set of measurement errors with unknown distribution. Under a suitable finite-sample, small-dispersion asymptotic framework, we derive a general lower bound for the asymptotic mean squared error (AMSE) matrix of the Gauss-consistent estimator of $\theta$. We then prove the fundamental result that the general non-linear least squares estimator (NLSE) is an optimal estimator within the class of all regular Gauss-consistent estimators, irrespective of the type of distribution of the measurement errors.

Efficient Prediction in the Semi-parametric Non-linear Mixed effect Model

  • So, Beong-Soo
    • Journal of the Korean Statistical Society
    • /
    • v.28 no.2
    • /
    • pp.225-234
    • /
    • 1999
  • We consider the following semi-parametric non-linear mixed effect regression model: $y_i = f(x_i; \beta) + \sigma\mu(x_i) + \sigma\varepsilon_i$, $i = 1, \ldots, n$, and $y^* = f(x; \beta) + \sigma\mu(x)$, where $y' = (y_1, \ldots, y_n)$ is a vector of $n$ observations, $y^*$ is an unobserved new random variable of interest, $f(x; \beta)$ represents the fixed effect of known functional form containing the unknown parameter vector $\beta' = (\beta_1, \ldots, \beta_p)$, $\mu(x)$ is a random function with mean zero and known covariance function $r(\cdot, \cdot)$, $\varepsilon' = (\varepsilon_1, \ldots, \varepsilon_n)$ is the set of uncorrelated measurement errors with zero mean and unit variance, and $\sigma$ is an unknown dispersion (scale) parameter. On the basis of a finite-sample, small-dispersion asymptotic framework, we derive an absolute lower bound for the asymptotic mean squared error of prediction (AMSEP) of the regular-consistent non-linear predictors of the new random variable of interest $y^*$. We then construct an optimal predictor of $y^*$ which attains the lower bound irrespective of the types of distributions of the random effect $\mu(\cdot)$ and the measurement errors $\varepsilon$.

The Evaluation of Relative Management Efficiency of Automobile Companies Using Non-parametric Approach (비모수 검정을 활용한 자동차 기업의 상대적 경영 효율성 평가)

  • Ha, Gui Ryong;Choi, Suk Bong
    • Knowledge Management Research
    • /
    • v.15 no.2
    • /
    • pp.147-164
    • /
    • 2014
  • This paper investigates the efficiency of automobile firms using several non-parametric approaches. First, using Data Envelopment Analysis (DEA), the paper investigates the critical factors that determine the relative efficiency of management performance in automobile companies. Second, we examine how firm size affects differences in this efficiency using the Kruskal-Wallis test. Third, using the Mann-Whitney test, we investigate how the efficiency differs according to the presence or absence of technological innovation activity. Finally, the paper explores the relationship between technological innovation and management efficiency using a logistic regression model. The findings provide practical information that helps inefficient automobile firms find benchmarking firms and a strategic position for improving their efficiency. The results also provide theoretical and methodological implications for those who explore factors affecting management efficiency. The limitations of the study and future research directions are discussed.

Bias corrected non-response estimation using nonparametric function estimation of super population model (선형 응답률 모형에서 초모집단 모형의 비모수적 함수 추정을 이용한 무응답 편향 보정 추정)

  • Sim, Joo-Yong;Shin, Key-Il
    • The Korean Journal of Applied Statistics
    • /
    • v.34 no.6
    • /
    • pp.923-936
    • /
    • 2021
  • Non-response occurs frequently in sample surveys, and various methods have been developed to deal with it appropriately. In particular, the bias caused by non-ignorable non-response greatly reduces the accuracy of estimation and makes non-response processing difficult. Recently, Chung and Shin (2017, 2020) proposed an estimator that improves the accuracy of estimation using a parametric super-population model and a response rate model. In this study, we propose a bias-corrected non-response mean estimator that uses a nonparametric function generalizing the form of the parametric super-population model. We confirmed the superiority of the proposed estimator through simulation studies.

Nonparametric Kernel Regression Model for Rating Curve (수위-유량곡선을 위한 비매개 변수적 Kernel 회귀모형)

  • Moon, Young-Il;Cho, Sung-Jin;Chun, Si-Young
    • Journal of Korea Water Resources Association
    • /
    • v.36 no.6
    • /
    • pp.1025-1033
    • /
    • 2003
  • In hydrology, as in other fields, scientists and engineers relate one variable to two or more other variables for purposes of prediction, optimization, and control. Statistical methods have been developed to establish such relationships. Regression is the most commonly used statistical technique in hydrology, for example for the relationship between the monitored stage and the corresponding discharge (the rating curve). Regression methods expressed as mathematical equations with parameters are called parametric methods. Sometimes, however, the determination of the parameters is complicated and uncertain. Many non-parametric regression methods, which do not have such parameters, have been proposed and studied. The most popular of these is the kernel regression method. Kernel regression offers a way of estimating the regression function without specifying a parametric model. This paper compares several bandwidth selection methods based on least squares and cross-validation.
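
To make the last two sentences concrete, the sketch below shows a Nadaraya-Watson kernel regression estimator with bandwidth selection by leave-one-out least-squares cross-validation. It is an illustrative assumption rather than the paper's procedure: the Gaussian kernel, the function names, and the candidate-grid search are choices made here, and the paper may compare other selectors.

```python
import math


def gaussian_kernel(u):
    """Unnormalised Gaussian kernel; the constant cancels in the ratio below."""
    return math.exp(-0.5 * u * u)


def nw_estimate(x0, xs, ys, h, skip=None):
    """Nadaraya-Watson estimate of the regression function at x0.

    xs, ys : observed predictor and response values (e.g. stage and discharge)
    h      : bandwidth
    skip   : optional index of an observation to leave out
    """
    num = den = 0.0
    for i, (x, y) in enumerate(zip(xs, ys)):
        if i == skip:
            continue
        k = gaussian_kernel((x0 - x) / h)
        num += k * y
        den += k
    return num / den


def loo_cv_score(xs, ys, h):
    """Mean squared leave-one-out prediction error for bandwidth h."""
    n = len(xs)
    return sum(
        (ys[i] - nw_estimate(xs[i], xs, ys, h, skip=i)) ** 2 for i in range(n)
    ) / n


def select_bandwidth(xs, ys, candidates):
    """Pick the candidate bandwidth with the smallest cross-validation score."""
    return min(candidates, key=lambda h: loo_cv_score(xs, ys, h))
```

A rating curve would be fitted by passing stage values as `xs` and discharges as `ys`. Very small bandwidths relative to the spacing of `xs` can make every kernel weight underflow to zero, so the candidate grid should be chosen with the data spacing in mind.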