Showing 1 - 10 of 781
Somers' D and Kendall's tau-a are parameters behind rank or nonparametric statistics, interpreted as differences between proportions. Given two bivariate data pairs (X1, Y1) and (X2, Y2), Kendall’s tau-a parameter tau-XY is the difference between the probability that the two X–Y pairs are...
Persistent link: https://www.econbiz.de/10005074240
Most Stata users make their living producing results in a form accessible to end users. Most of these end users cannot immediately understand Stata logs. However, they can understand tables (in paper, PDF, HTML, spreadsheet, or word processor documents) and plots (produced by using Stata or...
Persistent link: https://www.econbiz.de/10005074329
A resultsset is a Stata dataset created as output by a Stata program. It can be used as input to other Stata programs, which may in turn output the results as publication-ready plots or tables. Programs that create resultssets include xcontract, xcollapse, parmest, parmby and descsave. Stata...
Persistent link: https://www.econbiz.de/10005101308
This contribution is based on my programs bspline and frencurv, which are used to generate bases for Schoenberg B-splines and splines parameterized by their values at reference points on the X-axis (presented in STB-57 as insert sg151). The program frencurv ("French curve") makes it possible for...
Persistent link: https://www.econbiz.de/10005102727
Statisticians make their living mostly by producing confidence intervals and p-values. However, those supplied in the Stata log are not in any fit state to be delivered to the end user, who usually at least wants them tabulated and formatted, and may appreciate them even more if they are plotted...
Persistent link: https://www.econbiz.de/10005028078
Scientists often have good reasons for wanting to calculate multiple confidence intervals and/or p-values, especially when scanning a genome. However, if we do this, then the probability of not observing at least one "significant" difference tends to fall, even if all null hypotheses are true. A...
Persistent link: https://www.econbiz.de/10005028082
In October 2012 HarvardX, through edX, offered its first two online courses. One of these was called PH207X: Health in Numbers. The course covered biostatistics and epidemiology at an introductory level and lasted 12 weeks. 60,000 students later we had exposed more students to those disciplines...
Persistent link: https://www.econbiz.de/10010888633
binscatter is a newly released program that produces binned scatterplots, which provide a non-parametric estimate of a conditional expectation function. This presentation will describe the features of binscatter, and explore its versatile applications. Those applications include: observing the...
Persistent link: https://www.econbiz.de/10010888634
Many estimation problems focus on classification of cases (into bins) with tools that aim to identify cases using only a small subset of all possible questions. These tools can be used in diagnoses of disease, identification of advanced or failing students using tests, or classification into...
Persistent link: https://www.econbiz.de/10010888635
The last three years have seen explosive growth in the variety and sophistication of interactive online graphics. These are mostly implemented in the web language JavaScript, with the D3 (Data Driven Documents) library being the most popular and flexible at present. Leaflet is a mapping library...
Persistent link: https://www.econbiz.de/10010888636