A Little Bit of Everything in Biostatistics for Health Science Students

Copyright and Licensing

Textbook

A Little Bit of Everything in Biostatistics for Health Science Students

Copyright © 2026 Payton Yau, Suhirthakumar Puvanendran and Jarunee Nualyong.

This textbook is published under a Creative Commons Attribution–NonCommercial–ShareAlike 4.0 International (CC BY-NC-SA 4.0) licence.

You are free to:

  • Share — copy and redistribute the material in any medium or format

  • Adapt — remix, transform, and build upon the material

Under the following terms:

  • Attribution — You must give appropriate credit to the authors and cite the original work.

  • NonCommercial — You may not use the material for commercial purposes.

  • ShareAlike — If you remix or adapt the material, you must distribute your contributions under the same licence.

Full licence text: https://creativecommons.org/licenses/by-nc-sa/4.0/


Teaching Dataset

1. The Framingham Heart Study (Observational)

The Framingham Heart Study teaching dataset (framingham_teaching.csv) is derived from data provided by the National Heart, Lung, and Blood Institute (NHLBI) of the National Institutes of Health, under the Framingham Heart Study (Contract No. N01-HC-25195).

“This is a teaching dataset. Specific methods were employed to ensure an anonymous dataset that protects patient confidentiality; therefore, this dataset is inappropriate for publication purposes.” — NHLBI/BioLINCC

This teaching subset is provided for educational use only. It is not appropriate for research publication or clinical decision-making. Any analyses presented in this textbook are for illustrative purposes only and do not constitute valid epidemiological findings.

Citation for the dataset:

National Heart, Lung, and Blood Institute. Framingham Heart Study Teaching Dataset. Bethesda, MD: National Heart, Lung, and Blood Institute; provided for educational use under the NHLBI teaching dataset programme (N01-HC-25195).

2. The Anorexia Clinical Trial (Experimental)

The anorexia dataset is an experimental dataset used here to teach how the effectiveness of psychological interventions is evaluated. It is distributed with the MASS package for R and was originally published in A Handbook of Small Data Sets.

Like the Framingham subset, this dataset is used here for teaching purposes only. Although it is based on real clinical outcomes, students should treat the results as an exercise in biostatistical methods, not as clinical guidance for treating eating disorders.

Citation for the dataset:

Hand, D. J., Daly, F., Lunn, A. D., McConway, K. J. and Ostrowski, E. (1994) A Handbook of Small Data Sets. London: Chapman and Hall.


Figures

All figures in this textbook were generated programmatically by the authors using Python (matplotlib) and are released under CC BY-NC-SA 4.0.


Software

This book uses two free, open-source statistical software packages:

  • PSPP — GNU PSPP, free software released under the GNU General Public Licence (GPL). https://www.gnu.org/software/pspp/

  • R — The R Project for Statistical Computing, released under the GNU General Public Licence (version 2 or version 3). https://www.r-project.org/


Source files for this textbook are maintained at the GitHub repository listed on the How to Contribute page.


By Payton Yau, Suhirthakumar Puvanendran and Jarunee Nualyong

© Copyright 2026.