• KAFM
  • Contact us
  • E-Submission
ABOUT
ARTICLE CATEGORY
BROWSE ARTICLES
AUTHOR INFORMATION

Articles

Commentary

Comments on Statistical Issues in January 2014

Korean Journal of Family Medicine 2014;35(1):42-43.
Published online: January 23, 2014

Department of Biostatistics, The Catholic University of Korea College of Medicine, Seoul, Korea.

Copyright © 2014 The Korean Academy of Family Medicine

This is an open-access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/) which permits unrestricted noncommercial use, distribution, and reproduction in any medium, provided the original work is properly cited.

  • 4,219 Views
  • 23 Download
  • 2 Web of Science
prev next
In this section, we suggest some guidelines for reporting multiple linear regression analyses, which appeared in the article titled, "Factors associated with serum levels of carcinoembryonic antigen in healthy non-smokers", by No et al.1) published in November 2013.
In statistics, the multiple linear regression analysis is an approach to model the relationship between a response variable and several explanatory variables. Typically, a researcher will collect data on several potential explanatory variables, determine which variables are most strongly associated with the response variable, and then incorporate these variables into a mathematical model (a regression equation). The purpose of multiple linear regression analysis, then, is to identify which combination of variables best predicts the response variable.
Here, we suggest some guidelines for reporting a multiple linear regression analysis.
1. State How Each Assumption Was Met and Checked
A statement that the assumptions were verified is all that need be included. The assumptions of a multiple linear regression are as follows: 1) The relationship between each explanatory variable and response variable is linear; 2) The distributions of response variables have equal variances at each value of each explanatory variable; 3) Each response variable value is independent of one another for each value of each explanatory variable; and 4) The response variable has a normal distribution at each value of each explanatory variable. Sometimes, data that violate the assumptions can be adjusted (for example, with data transformation) to meet the required assumptions. If such adjustments are made, it should be noted.
2. Specify How the Final Results Were Derived
Describe the process of selecting the best combination of explanatory variables when the variable selection methods such as stepwise, forward, or backward selection, were used. State whether explanatory variables were assessed for multi-collinearity and tested for interaction.
3. Report the Multiple Linear Regression Equation or Summarize the Equation in a Table
An example for reporting a multiple linear regression with three explanatory variables is presented in Table 1.2)
4. Report the Coefficient of Multiple Determination (R2)
The coefficient of multiple determination (R2) indicates how much of the variation in the response variable is explained by the explanatory variables included in the model. An upper-case R should be used.

No potential conflict of interest relevant to this article was reported.

  • 1. No JI, Yang JY, Hyun HJ, Yeon CS, Choi HJ. Factors associated with serum levels of carcinoembryonic antigen in healthy non-smokers. Korean J Fam Med 2013;34:413-419. PMID: 24340163.
  • 2. Lang TA, Secic M. How to report statistics in medicine: annotated guidelines for authors, editors, and reviewers. Philadelphia (PA): American College of Physicians; 1997.
Table 1
Sample table for reporting a multiple linear regression analysis

Where intercept = point at which the regression line crosses the Y axis; value of the response variable when all explanatory variables are set to zero; no clinical interpretation; do not use the term 'constant' which appeared in SPSS, X1 to X3 = names of the three explanatory variables, Coefficient = regression coefficients; "unstandardized coefficient" in SPSS, SE = standard errors; estimated precision of the coefficients, 95% CI = 95% confidence intervals for the coefficients, P = variables X2 and X3 are statistically significant predictors of the response variable.

kjfm-35-42-i001.jpg

Figure & Data

References

    Citations

    Citations to this article as recorded by  

      Download Citation

      Download a citation file in RIS format that can be imported by all major citation management software, including EndNote, ProCite, RefWorks, and Reference Manager.

      Format:

      Include:

      Comments on Statistical Issues in January 2014
      Korean J Fam Med. 2014;35(1):42-43.   Published online January 23, 2014
      Download Citation
      Download a citation file in RIS format that can be imported by all major citation management software, including EndNote, ProCite, RefWorks, and Reference Manager.

      Format:
      • RIS — For EndNote, ProCite, RefWorks, and most other reference management software
      • BibTeX — For JabRef, BibDesk, and other BibTeX-specific software
      Include:
      • Citation for the content below
      Comments on Statistical Issues in January 2014
      Korean J Fam Med. 2014;35(1):42-43.   Published online January 23, 2014
      Close
      Comments on Statistical Issues in January 2014
      Comments on Statistical Issues in January 2014

      Sample table for reporting a multiple linear regression analysis

      Where intercept = point at which the regression line crosses the Y axis; value of the response variable when all explanatory variables are set to zero; no clinical interpretation; do not use the term 'constant' which appeared in SPSS, X1 to X3 = names of the three explanatory variables, Coefficient = regression coefficients; "unstandardized coefficient" in SPSS, SE = standard errors; estimated precision of the coefficients, 95% CI = 95% confidence intervals for the coefficients, P = variables X2 and X3 are statistically significant predictors of the response variable.

      Table 1 Sample table for reporting a multiple linear regression analysis

      Where intercept = point at which the regression line crosses the Y axis; value of the response variable when all explanatory variables are set to zero; no clinical interpretation; do not use the term 'constant' which appeared in SPSS, X1 to X3 = names of the three explanatory variables, Coefficient = regression coefficients; "unstandardized coefficient" in SPSS, SE = standard errors; estimated precision of the coefficients, 95% CI = 95% confidence intervals for the coefficients, P = variables X2 and X3 are statistically significant predictors of the response variable.

      TOP