# Statistical data processing templates according to GOST in JupyterLab and MS Exsel

Once a student told how he passed a test on a software product, and in the next year he could not use it. He told me and shrugged. And I remembered that the goal of any course is to solve my own problems after the course. In the spring, I keep statistics from doctors and came up with templates for processing my own data using methods from national standards. I share the result.

Using the example of the height and weight values ​​of 5000 men and 5000 women in MS Exsel and Jupiter Lab, histograms and boxes with a mustache were built, the mean, variance, standard deviation were calculated, an interval estimate of the general average was given, and an estimate of the probability of falling into a given interval was given. In MS Exsel, the correlation coefficient is additionally calculated and its significance is assessed. Analysis of variance ANOVA was added based on an arbitrary problem from the textbook.

Commands and formulas are commented in detail. Without knowing the Excel and Python tools, and without going into reference books, you can replace data, change the form of the report and the design of graphs.

Methods, terms and definitions in textbooks on statistics differ, therefore, whenever possible, terms and methods from GOST R 50779.10-2000 “Statistical methods. Probability and fundamentals of statistics. Terms and definitions”, GOST R ISO 3534-1-2019 “Statistical methods. Vocabulary and conventions. Part 1. General statistical terms and terms used in probability theory “, and GOST R 50779.21-2004” Statistical methods. Rules for determining and methods for calculating statistical characteristics from sample data “.