Statistical data processing templates according to GOST in JupyterLab and MS Excel

A student once told me how he had passed a test on a software product and then, a year later, could not use it. He told me this and shrugged. It reminded me that the goal of any course is for students to be able to solve their own problems after the course ends. In the spring I teach statistics to doctors, and I came up with templates for processing your own data using methods from national standards. Here is the result.

Replace the data with your own and customize the design to suit your goals

Using the height and weight of 5000 men and 5000 women as an example, the MS Excel and JupyterLab templates build histograms and box-and-whisker plots, calculate the mean, variance, and standard deviation, give an interval estimate of the population mean, and estimate the probability of falling into a given interval. The MS Excel template additionally calculates the correlation coefficient and assesses its significance. Analysis of variance (ANOVA) was added, based on an arbitrary problem from a textbook.
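As a rough illustration of the calculations listed above, here is a minimal Python sketch that uses only the standard library. It is not the code from the templates. The data are synthetic, generated with assumed parameters rather than taken from the 5000-row dataset, and the function names `describe`, `prob_in_interval`, and `pearson_r` are hypothetical. The confidence interval uses a normal quantile as a large-sample approximation of the Student quantile, and the correlation step, which this post mentions only for the MS Excel template, is shown in Python purely for illustration.

```python
import math
import random
from statistics import NormalDist, mean, stdev, variance

# Synthetic stand-in data (assumed parameters); replace with your own columns.
random.seed(0)
heights = [random.gauss(175.0, 7.0) for _ in range(5000)]                # cm
weights = [0.9 * h - 80.0 + random.gauss(0.0, 8.0) for h in heights]    # kg


def describe(sample, confidence=0.95):
    """Mean, sample variance, standard deviation, and an interval estimate of the mean."""
    n = len(sample)
    m = mean(sample)
    var = variance(sample, m)          # sample variance, n - 1 in the denominator
    sd = math.sqrt(var)
    # Normal quantile as a large-n approximation of the Student quantile.
    z = NormalDist().inv_cdf(0.5 + confidence / 2)
    half_width = z * sd / math.sqrt(n)
    return {"n": n, "mean": m, "variance": var, "sd": sd,
            "ci": (m - half_width, m + half_width)}


def prob_in_interval(sample, low, high):
    """Estimate P(low <= X <= high): observed share and a fitted normal model."""
    empirical = sum(low <= x <= high for x in sample) / len(sample)
    model = NormalDist(mean(sample), stdev(sample))
    return empirical, model.cdf(high) - model.cdf(low)


def pearson_r(xs, ys):
    """Pearson correlation coefficient and its t statistic (n - 2 degrees of freedom)."""
    n = len(xs)
    mx, my = mean(xs), mean(ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    r = sxy / math.sqrt(sxx * syy)
    t = r * math.sqrt((n - 2) / (1 - r * r))
    return r, t


stats = describe(heights)
print("mean, variance, sd:", stats["mean"], stats["variance"], stats["sd"])
print("95% interval for the mean:", stats["ci"])
print("P(170 <= height <= 180):", prob_in_interval(heights, 170.0, 180.0))
print("r and t statistic:", pearson_r(heights, weights))
```

To try it on real measurements, replace the `heights` and `weights` lists with your own values. For small samples, take the quantile from the Student distribution instead of the normal one.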

Commands and formulas are commented in detail. Even without knowing Excel or Python, and without consulting reference books, you can replace the data, change the layout of the report, and restyle the charts.

Methods, terms, and definitions differ between statistics textbooks. Therefore, wherever possible, the templates use terms and methods from GOST R 50779.10-2000 “Statistical methods. Probability and fundamentals of statistics. Terms and definitions”, GOST R ISO 3534-1-2019 “Statistical methods. Vocabulary and symbols. Part 1. General statistical terms and terms used in probability theory”, and GOST R 50779.21-2004 “Statistical methods. Rules for determining and methods for calculating statistical characteristics from sample data”.

You can download everything here: https://disk.yandex.ru/d/bg6ORywD3bZBxA

The current version of the MS Excel template is “20210608 mystat.xlsx”. To run the JupyterLab template “20210401-7_mystat.ipynb”, follow the instructions in the “What to do.txt” file.

The templates were made not for students but together with students during the semester. They are a process, not a finished result, and that process will continue next semester. If you find an inaccuracy or cannot find the method you need, report it, or modify the template and send it to me. I will update the version and add your name or nickname to the “Contributors” field.

If you want to modify the template and post it somewhere else, I suggest adding yourself to the list of authors and citing both the authors of the previous version and this post.
