Sections of the StatPages.net web site
{{ x.Name }}

Free Statistical Software


This page contains links to free software packages that you can download and install on your computer for stand-alone (offline, non-Internet) computing. They are listed below, under the following general headings:

General Packages: No package does everything, but these programs support a wide variety of statistical analyses.
Completely Free... can be freely downloaded and used in their fully-functional mode (no strings attached) Free, but... "demonstration" or "student versions" of commercial packages; can be freely downloaded, but are usually restricted or limited in some way.

Gnumeric -- a high-powered spreadsheet with better statistical features than Excel. Has 60 extra functions, basic support for financial derivatives (Black Scholes) and telecommunication engineering, advanced statistical analysis, extensive random number generation, linear and non-linear solvers, implicit intersection, implicit iteration, goal seek, and Monte Carlo simulation tools. It is not yet available for Windows

InVivoStat is a free to use, statistical Windows program which uses R as its statistics engine. It is designed specifically for scientists conducting animal experiment. The user interface is both easy to use and intuitive to the non-statistician.
The main modules is Summary statistics, Single and Repeated Measures Parametric analysis, P-value analysis and Power analysis.
Additional modules includes Dose/Response, Paired T-test, Chi-square and Fisher's test, Correction, Survival and regression analysis.

jamovi -- a new "3rd generation" statistical spreadsheet based on R
It has a very nice user interface making is easy to create or load and edit data including computed variables and apply statistical analysis.
For now there is only a few functions available compared to R's great many packages, but the setup makes it relatively easy for skilled users to add new functions.
Thus it's expected that more functions will be available from the online library in the near future.
New modules on meta-analysis, mixed linear effects models, general linear models, robust statistics been added recently. It's contributed to the scientific community by the people behind JASP.
It's free and opensource.
Jamovi is available for download for Windows (Vista and above), macOS (Mavericks and above) and Ubuntu (Xenial and Yakkety).

JASP -- is described by the authors as a "low-fat alternative to SPSS", and "Bayesian statistics made accessible". Provides a user-friendly interface to many of the commonly-used statistical analyses -- descriptive statistics, plots, t tests, Levene’s Test, ANOVA, ANCOVA, contingency tables, Pearson and Spearman correlation, Kendall’s Tau-B, and linear regression. For many of these analyses, the JASP also provides the closest corresponding Bayesian equivalent, implemented in a way that will be understandable to people not familiar with Bayesian concepts and terminology. Windows only.

KyPlot -- is a Windows application and offers you an integrated environment for data analysis and visualization. There is two types of windows: "Spread" and "Figure". A "Spread" window contains Excel-like spreadsheets, supporting calculation with cell formulae. It provides a wide range of computing methods such as mathematical optimization, spectral, time-series and wavelet analysis. The program also offers a variety of statistical procedures including parametric and nonparametric tests and multivariate analysis. In a “Figure” window, you can create graphs and draw schemes and present them as a slide show.

MicrOsiris -- a comprehensive statistical and data management package for Windows, derived from the OSIRIS IV package developed at the University of Michigan. It was developed for serious survey analysis using moderate to large data sets. Main features: handles any size data set; has Excel data entry; imports/exports SPSS, SAS, and Stats datasets; reads ICPSR (OSIRIS) and UNESCO (IDAMS) datasets; data mining techniques for market analysis (SEARCH --very fast for large datasets); interactive decision tree for selecting appropriate tests; database maniuplation (dictionaries, sorting, merging, consistency checking, recoding, transforming) extensive statistics (univariate, staccerplot, cross-tabs, ANOVA/MANOVA, log-linear, correlation/regressionMCA, MNA, binary segmentation, cluster, factor, MINISSA, item analysis, survival analysis, internal consistency); online, web-enabled users manual; requires only 6MB RAM; uses 12MB disk, including manual. Fully-functional version is free; the authors would appreciate a small donation to support ongoing development and distribution.

OpenStat -- is a general stats package for all Windows versions (including Win 7 and Win 8) and for Linux systems (under Wine), developed by Bill Miller of Iowa State U, with a very broad range of data manipulation and analysis capabilities and an SPSS-like user interface. Bill also has provided an excellent User Manual as an Adobe Acrobat file. In addition, there is a free Pascal program, manual, sample data and source code for LazStats which contains programs similar to OpenStat.
Bill Miller has closed maintainance and development. The software is made available for your convenience as is until further notice.

PSPP -- a free replacement for SPSS (although at this time it implements only a small fraction of SPSS's analyses). But it's free, and will never "expire".  It replicates the "look and feel" of SPSS very closely, and even reads native SPSS syntax and files!  Some other features...

  • Supports over 1 billion cases and over 1 billion variables.
  • Choice of terminal or graphical user interface; Choice of text, postscript or html output formats.
  • Inter-operates with Gnumeric, OpenOffice.Org and other free software.
  • Easy data import from spreadsheets, text files and database sources.
  • Fast statistical procedures, even on very large data sets.
  • No license fees; no expiration period; no unethical “end user license agreements”.
  • Fully indexed user manual.
  • Cross platform; Runs on many different computers and many different operating systems.

SOFA (Statistics Open For All) -- an innovative statistics, analysis, and reporting program. Available for Windows, Mac and Linux systems. Has an emphasis on ease of use, learn as you go, and beautiful output. Check out list of features.

SalStat-2  -- a multi-platform, easy-to-use statistical system that provides data management (importing, editing, pivot tables), statistical calculations (descriptive summaries, probability functions, chi-square, t-tests, 1-way ANOVA, regression, correlation, non-parametric tests, Six-Sigma), and graphs (bar, line, scatter, area, histogram, box&whisker, stem, adaptive, ternary scatter, normal probability, quality control). Last updated in 2013.

ViSta -- a Visual Statistics program for Win3.1, Win 95/NT, Mac and Unix, featuring a Structured Desktop, with features designed to structure and assist the statistical analyst. Last updated in 2014.

OpenEpi -- OpenEpi is a free, web-based, open source, operating-system-independent series of programs for use in public health and medicine, providing a number of epidemiologic and statistical tools. Version 2 (4/25/2007) has a new interface that presents results without using pop-up windows, and has better installation methods so that it can be run without an internet connection. Version 2.2 (2007/11/09) lets users run the software in English, French, Spanish, or Italian. Last updated in 2013.

Statist -- a compact, portable program that provides most basic statistical capabilities: data manipulation (recoding, transforming, selecting), descriptive stats (including histograms, box&whisker plots), correlation & regression, and the common significance tests (chi-square, t-test, etc.). Written in C (source available); runs on Unix/Linux, Windows, Mac, among others. Last updated in 2006.

Tanagra -- a free (open-source) data-mining package, which supports the standard "stream diagram" paradigm used by most data-mining systems. Contains components for Data source (tab-delimited text), Visualization (grid, scatterplots), Descriptive statistics (cross-tab, ANOVA, correlation), Instance selection (sampling, stratified), Feature selection and construction, Regression (multiple linear), Factorial analysis (principal components, multiple correspondence), Clustering (kMeans, SOM, LVQ, HAC), Supervised learning (logistic regr., k-NN, multi-layer perceptron, prototype-NN, ID3, discriminant analysis, naive Bayes, radial basis function), Meta-spv learning (instance Spv, arcing, boosting, bagging), Learning assessment (train-test, cross-validation), and Association (Agrawal a-priori). Last updated in 2013.

Dap -- a statistics and graphics package developed by Susan Bassein for Unix and Linux systems, with commonly-needed data management, analysis, and graphics (univariate statistics, correlations and regression, ANOVA, categorical data analysis, logistic regression, and nonparametric analyses). Provides some of the core functionality of SAS, and is able to read and run many (but not all) SAS program files. Dap is freely distributed under a GNU-style "copyleft". Last updated in 2014. Not clear whether it's available in Windows.

Dataplot -- (Unix, Linux, PC-DOS, Windows) for scientific visualization, statistical analysis, and non-linear modeling. Has extensive mathematical and graphical capabilities. Closely integrated with the NIST/SEMATECH Engineering Statistics Handbook.

WINKS (Windows KWIKSTAT) -- a full-featured, easy-to-use stats package with statistics (means, standard deviations, medians, etc.), histograms, t-tests, correlation, chi-square, regression, nonparametrics, analysis of variance (ANOVA), probability, QC plots, cpk, graphs, life tables, time series, crosstabs, and more. Works on Windows XP (as well as Windows 2000, NT, 98, ME and 95.) Comes in Basic and Professional editions. Evaluation version available for download. Last updated in 2015

SISA -- Simple Interactive Statistical Analysis from Daan Uitenbroek. An excellent collection of individual Windows and DOS modules for several statistical calculations, including some analyses not readily available elsewhere. Some examples are: Distributions - analysis of discrete single dimension distributions. Tables - analysis of tables with up to 2*7 and 3*3 cells.

SAS on Demand -- SAS Studio. Anyone can sign up to access this web based version of SAS.

Also, check out R, described in the Programming Languages section below.

curios.IT -- data exploration software especially suited for multivariate data analysis (visualizing up to 13 variables simultaneously using 3d models). Commercial but free for up to 250 records.

Develve -- stats package for fast and easy interpretation of experimental data in science and R&D in a technical environment. Everything is directly accessible and  results are directly visible, with no hidden menus; e.g.:  graphs are easily scrollable, and when clicked, a bigger version pops up. Results for group comparisons directly indicate the significance of the difference in average and variation, and if the sample size is sufficiently large. Has a basic mode for statistical testing, and a design-of-experiments mode.

Explorer -- A data exploration / graphing / analysis program with a very elegant drag-and-drop interface.  Accepts data from text files, Excel spreadsheet, MySQL databases, and copy/pasted from the clipboard. Provides over a dozen kinds of plots and diagrams, basic statistical summaries, significance tests (chi-square, t, ANOVA ) and more advanced analyses (factorial, principal components, discriminant, variance, linear regression). Executable programs can be downloaded for Windows and Mac OSX. Written in JavaScript, so it can also be run in any modern browser.

Statext-- Provides a nice assortment of basic statistical tests, with text output (and text-based graphics). Capabilities include: rearrange, transpose, tabulate and count data; random sample; basic descriptives; text-plots for dot, box-and-whiskers, stem-and-leaf, histogram, scatterplot; find z-values, confidence interval for means, t-tests (one and two group, and paired; one- and two-way ANOVA; Pearson, Spearman and Kendall correlation; ;inear regression, Chi-square goodness-of-fit test and independence tests; sign test, Mann-Whitney U and Kruskal-Wallis H tests, probability tables (z, t, Chi-square, F, U); random number generator; Central Limit Theorem, Chi-square distribution. The site says "Statext is built on .NET Framework, it needs .NET Framework 2.0 or later. If you do not have .NET Framework in your computer system, Statext does not work."

PAST -- an easy-to-use data analysis package aimed at paleontology including a large selection of common statistical, plotting and modelling functions: a spreadsheet-type data entry form, graphing, curve fitting, significance tests (F, t, permutation t, Chi-squared w. permutation test, Kolmogorov-Smirnov, Mann-Whitney, Shapiro-Wilk, Spearman's Rho and Kendall's Tau tests, correlation, covariance, contingency tables, one-way ANOVA, Kruskal-Wallis test), diversity and similarity indices & profiles, abundance model fitting, multivariate statistics, time series analysis, geometrical analysis, parsimony analysis (cladistics), and biostratigraphy.

AM -- a free package for analyzing data from complex samples, especially large-scale assessments, as well as non-assessment survey data. Has sophisticated stats, easy drag & drop interface, and integrated help system that explains the statistics as well as how to use the system. Can estimate models via marginal maximum likelihood (MML), which defines a probability distribution over the proficiency scale. Also analyzes "plausible values" used in programs like NAEP. Automatically provides appropriate standard errors for complex samples via Taylor-series approximation, jackknife & other replication techniques.

Also, check out Ox, described in the Programming Languages section below.

Data Desk -- first released in 1986, is one of the oldest Mac programs still actively developed. The modern versions (for Mac OS X and Windows computers) are available for a 30 day trail.

Statistics Manually -- an Andriod Mobile App. This app contains a large collection of formulas of statistical methods common in the social sciences as well as the statistical tables needed to interpret your test results. These tables not just cover the "standard" z-, t-, chi²-, F- and binomial distribution, but exact values for the Hartley's Fmax-test, the Mann-Whitney-U-test and Wilcoxon's signed ranked test. You will find the formulas of these tests as well.

MaxStat -- statistics & graphics program (for Windows). Easy to use, modern interface. Specify an analysis in three simple steps within a single dialog. Two versions are available:
  *  Lite (free) has descriptive statistics, hypothesis testing (t-tests, chi-square, 1-way ANOVA with post-hocs, non-parametric tests), distribution testing, linear regression,  correlation, many basic types of graphs; limited to 254 rows and 12 columns of data, no import/export.
  *  Professional  (free 30-day trial) advanced analyses (unlimited rows & columns, import/export, many more analyses (2-way ANOVA, logistic and non-linear regression, principal-components analysis and multidimensional scaling, time series, power/sample-size, factorial design), more graph types, word-processing and formatting of results.


SYSTAT -- powerful statistical software ranging from the most elementary descriptive statistics to very advanced statistical methodology. Novices can work with its friendly and simple menu-dialog; statistically-savvy users can use its intuitive command language. Carry out very comprehensive analysis of univariate and multivariate data based on linear, general linear, and mixed linear models; carry out different types of robust regression analysis when your data are not suitable for conventional multiple regression analysis;compute partial least-squares regression;design experiments, carry out power analysis, do probability calculations on many distributions and fit them to data; perform matrix computations. Provides Time Series, Survival Analysis, Response Surface Optimization, Spatial Statistics, Test Item Analysis, Cluster Analysis, Classification and Regression Trees, Correspondence Analysis, Multidimensional Scaling, Conjoint Analysis, Quality Analysis, Path Analysis, etc. A 30-day evaluation version is available for free download.

Wizard -- (for Macintosh only), provides all basic statistical tests, emphasizes easy-to-understand, graphically-based results. Imports data from text files, spreadsheets, SQL databases,  and R workspaces (the Pro version can also import SAS, Stata and SPSS files). The free trial versions (standard or Pro versions) never expire; they don't provide p values, or export/save.

StudyResult -- (30-day free trial) General statistics package for: paired & unpaired t-test, one-way ANOVA, Fisher's exact , McNemar's, Chi2, Chi2 homogeneity , life table & survival analysis, Wilcoxon rank-sum & signed-rank, sign test, bioequivalence testing, correlation & regression coefficient tests. Special features for interpreting summary data found in publications (p-values & conf. intervals from summary statistics, converts p-values to CI's & vice versa, what observed results are needed to get a significant result, estimates from publications needed for sample size calculations). Includes equivalence- and non-inferiority testing for most tests.

STATGRAPHICS Plus (for Windows) -- over 250 statistical analyses: regression, probit, enhanced logistic, factor effects plots, automatic forecasting, matrix plots, outlier identification, general linear models (random and mixed), multiple regression with automatic Cochrane-Orcutt and Box-Cox procedures, Levene's, Friedman's, Dixon's and Grubb's tests, Durbin-Watson p-values and 1-variable bootstrap estimates, enhanced 3D charts. For Six Sigma work: gage linearity and accuracy analysis, multi-vari charts, life data regression for reliability analysis and accelerated life-testing, long-term and short-term capability assessment estimates. Two free downloads are available: full-function but limited-time(30 days), and unlimited-time but limited-function (no Save, no Print, not all analyses).

NCSS (Statistical Analysis System), PASS (Power and Sample Size) for Windows. Free 7-day evaluation versions.

MiniTab -- a powerful, full-featured MS Windows package, with good coverage of industrial / quality control analyses. The free Version 12 Demo expires after 30 days.

Prism Performs basic biostatistics, fits curves and creates publication quality scientific graphs in one complete package (Mac and Windows). Windows demo is fully-functional for 30 days, then disables printing, saving and exporting; Mac demo always disables these functions.

CoStat -- an easy-to-use program for data manipulation and statistical analysis, from CoHort Software. Use a spreadsheet with any number of columns and rows of data: floating point, integer, date, time, degrees, text, etc. Import ASCII, Excel, MatLab, S+, SAS, Genstat, Fortran, and others. Has ANOVA, multiple comparisons of means, correlation, descriptive statistics, analysis of frequency data, miscellaneous tests of hypotheses, nonparametric tests, regression (curve fitting), statistical tables, and utilities. Has an auto-recorder and macro programming language. Callable from the command line, batch files, shell scripts, pipes, and other programs; can be used as the statistics engine for web applications. Free time-limited demo available.

Subset Packages: Each of these programs deals with a specific area of statistics (such as power analysis or mulitvariate analysis), or carries out a specific test or computation.

G*Power 3 -- a very general Power Analysis program for Windows and Macintosh. Performs exact analysis for 6 types of correlation tests, 3 types of bivariate regression tests, 1-group and 2-group comparison of means tests (parametric and non-parametric), 4 types of multiple regression tests, logistic regression, poisson regression, ordinary and repeated-measures ANOVAs, ANCOVAs, MANOVAs, multivariate T2 and MANOVAs, 8 types of tests of proportions (McNemar, Fisher, etc.), 1-group and 2-group variance tests, and completely generic tests involving the binomial, normal, t, chi-square, and F distributions. Computes power, sample sizes, alpha, beta, and alpha/beta ratios. Has a comprehensive web-based tutorial and reference manual.

Factor  -- a comprehensive factor analysis program. Provides univariate and multivariate descriptive statistics of input variables (mean, variance, skewness, kurtosis), Var charts for ordinal variables, dispersion matrices (user defined , covariance, pearson correlation, polychoric correlation matrix with optional Ridge estimates). Uses MAP, PA (Parallel Analysis), and PA - MBS (with marginally bootstrapped samples) to determine the number of factors/components to be retained. Performs the following factor and component analyses: PCA, ULS (with Heywood correction), EML, MRFA, Schmid-Leiman second-order solution, and Factor scores. Rotation methods: Quartimax, ,Varimax , Weighted Varimax, Orthomin , Direct Oblimin, Weighted Oblimin, Promax, Promaj , Promin, and Simplimax. Indices used in the analysis: dispersion matrix  tests (determinant, Bartlett's, Kaiser-Meyer-Olkin), goodness of fit: Chi-Square ,non-normed fit index, comparative fit index, goodness of fit index, adjusted GFI, RMS error of approx, and estimated non-centrality parameter (NCP), reliabilities of rotated components , simplicity indices: Bentler’s, and loading simplicity index. Provides mean, variance and histogram of fitted and standardized residuals, and automatic detection of large standardized residuals.

KEYFINDER -- a menu-driven interactive program for generating, randomizing and tabulating blocked and/or fractional-replicate factorial designs in completely general situations. It can generate blocked and/or fractional-replicate designs with user-specified confounding and aliasing properties. KEYFINDER runs on all versions of Windows. You can download the Version 3.3 Overview document, in PDF format, here.
To obtain a free copy of the program and manual, send an e-mail to the custodians: Statistics-Chemometrics@shell.com
Contact the author Peter Zemroch.

Weka -- a collection of machine learning algorithms for data mining tasks, implemented in Java. Can be executed from a command-line environment, or from a graphical interface, or can either be called from your own Java code. Weka contains tools for data pre-processing, classification, regression, clustering, association rules, and visualization, and is well-suited for developing new machine learning schemes.

StatCalc -- a PC calculator that computes table values and other statistics for 34 probability distributions. Also includes some nonparametric table values, tolerance factors, and bivariate normal distribution. A help file is provided for each distribution.

Scientific Calculator - ScienCalc program contains high-performance arithmetic, trigonometric, hyperbolic and transcendental calculation routines. All the function routines therein map directly to Intel 80387 FPU floating point machine instructions.

MorePower -- another well-implemented power/sample-size calculator for any ANOVA design, for 1- and 2-sample t-tests, and for 1- and 2-sample binomial testing (sign test, chi-square test).

EqPlot -- Equation graph plotter program plots 2D graphs from equations. The application comprises algebraic, trigonometric, hyperbolic and transcendental functions.

PCP (Pattern Classification Program) -- a machine-learning program for supervised classification of patterns (vectors of measurements). PCP implements: Fisher's linear discriminant, dimensionality reduction using SVD, PCA, feature subset selection, Bayes error estimation, parametric classifiers (linear and quadratic), L-S (pseudo-inverse) linear discriminant, k-Nearest Neighbor, neural networks (Multi-Layer Perceptron), SVM, model selection for SVM, cross-validation, and bagging (committee) classification. Supports interactive (keyboard-driven menus) and batch processing. Last updated in 2006, and the website says it's no longer being developed.

WINPEPI -- a suite of programs for epidemiologists. An augmented Windows version (Aug. 2016) is available for download here. They were written to accompany the book Computer Programs for Epidemiologic Analyses: PEPI v. 4.0, by Abramson and Gahlinger, which is available for purchase. The currently available WINPEPI programs are:

TETRAD (from the TETRAD Project at CMU) -- a free program for creating, simulating data from, estimating, testing, predicting with, and searching for causal/statistical models of categorical (or ordinal) data and to linear models ("structural equation models') with a Normal probability distribution, and to a very limited class of time series models. Provides sophisticated methods in a friendly interface. It performs many of the functions in commercial programs such as Netica, Hugin, LISREL, EQS and other programs, and many discovery functions these commercial programs do not perform. TETRAD is limited to models The TETRAD programs describe causal models in three distinct parts or stages: a picture, representing a directed graph specifying hypothetical causal relations among the variables; a specification of the family of probability distributions and kinds of parameters associated with the graphical model; and a specification of the numerical values of those parameters.

EpiData -- a comprehensive yet simple tool for documented data entry. Overall frequency tables (codebook) and listing of data included, but no statistical analysis tools.

Calculate sample size required for a given confidence interval, or confidence interval for a given sample size. Can handle finite populations. Online calculator also available.

Biomapper  -- a kit of GIS and statistical tools designed to build habitat suitability (HS) models and maps for any kind of animal or plant. Deals with: preparing ecogeographical maps for use as input for ENFA (e.g. computing frequency of occurrence map, standardisation, masking, etc.); Exploring and comparing them by mean of descriptive statistics (distribution analysis, etc.); Computing the Ecological Niche Factor Analysis and exploring its output; and Computing and evaluating a Habitat Suitability map

ROC Curves -- a set of downloadable programs and Excel spreadsheets to calculate and graph various kinds of ROC (Receiver Operator Characteristic) curves.

ADE-4 -- multivariate analysis and graphical display software package for Mac andWin 95/NT. Includes component analysis and correspondence analysis, spatial data analysis methods (analogous to Moran and Geary indices), discriminant analysis and within/between groups analyses, many linear regression methods including lowess and polynomial regression, multiple and PLS (partial least squares) regression and orthogonal (principal component) regression, projection methods like principal component analysis on instrumental variables, canonical correspondence analysis and many other variants, coinertia analysis and the RLQ method, and several three-way table (k-table) analysis methods. Graphical displays include an automatic collection of elementary graphics corresponding to groups of rows or to columns in the data table, automatic k-table graphics and geographical mapping options, searching, zooming, selection of points, and display of data values on factor maps. Simple and homogeneous user interface.

Weibull Trend Toolkit -- Fits a Weibull distribution function (like a normal distribution, but more flexible) to a set of data points by matching the skewness of the data. (Windows)

BUGS -- Bayesian inference Using Gibbs Sampling. Software for the Bayesian analysis of complex statistical models using Markov chain Monte Carlo (MCMC) methods. Command-line interface versions available for major computer platform; a Windows version, WinBUGS, supports a graphical user interface, on-line monitoring and convergence diagnostics.

GUIDE GUIDE Classification and Regression Trees and Forests. GUIDE is a multi-purpose machine learning algorithm for constructing classification and regression trees. It is designed and maintained by Wei-Yin Loh at the University of Wisconsin, Madison. GUIDE stands for Generalized, Unbiased, Interaction Detection and Estimation.

AMELIA -- A program for substituting reasonable values for missing data (called "imputation")

Completely Free... can be freely downloaded and used in their fully-functional mode (no strings attached) Free, but... "demonstration" or "student versions" of commercial packages; can be freely downloaded, but are usually restricted or limited in some way.

gretl -- An econometric package. It has some simple routines and menus, but it is also programmable for more sophisticated analyses. Its routines are pretty powerful.

Data Preparator -- handles the "pre-processing" chores of getting a data file ready for analysis... data cleaning, discretization, numeration, scaling, attribute selection, missing values, outliers, statistics, visualization, balancing, sampling, row selection, and several other tasks, with a user-friendly graphical interface. It's written in Java, and runs on Windows, Mac OS/X, and Linux. The free demo has all features enabled, and will handle up to 200 cases.

Statistics Problem Solver -- tutoring software that not only solves statistical problems, but also generates step-by-step solutions in order to help students understand how to solve statistical problems. Includes: Histograms, Binomial, Poisson, Exponential, Continuous, Group Comparison and Test of Significance. Two other similar tutorial programs are available from the same company:  A&G Grapher -- for plotting any 2D or 3D equation typed into the program. Graphs can be customized in color, scale, resolution, etc., and can be exported or copied-and-pasted into other application. Also calculates slope, area under the curve, tracing and matrix transformation. Calculus Problem Solver -- differentiates any arbitrary equation and outputs the result, providing detailed step-by-step solutions in a tutorial-like format. Can also initiate an interactive quiz in which you can solve differentiation while the computer corrects your solutions.

ZeroRejects -- Implements the "Six Sigma" statistical process control methodology developed by Motorola. The alpha and beta version are freely downloadable. (Win 95/98/NT).

StatCalc (30-day free trial download) -- a handy desk-top tool and instructional aid that transforms from a standard calculator to a collection of modules that calculate statistics, graph distributions, and provide statistical help with definitions, formulas, and interpretation. (Windows) Free version is limited to 1 survey, 10 questions, 25 total responses.

WinSPC (30-day free trial)-- statistical process control software to:

  • collect quality data from devices, shop-floor machines, data sources, other software systems, or via keyboard;
  • monitor plant-wide operations from a single screen, and initiate corrective actions for out-of-control processes (trigger alarm, send email, page an operator, or shut down an out-of-control machine);
  • perform statistical analysis to solve problems, optimize processes, and create quality reports.

Power and Precision -- (30-day free trial download) An "industrial strength" program for calculating power, sample size, and attainable precision for: t-tests and z-tests (1-group, 2-group, paired, unpaired), Proportions (1-group, 2-group Chi Square or Fisher Exact, McNemar, KxC, Sign test), Correlations (1-group and 2-group), ANOVA / ANCOVA (1-, 2-, or 3-way), Multiple hierarchical linear regression (covariates / main effects, interactions, dummy-coded variables, polynomial), R2 increments, sequential sets), Logistic regression (1 or 2 continuous variables, 5-level categorical), Survival analysis (accrual options , constant or variable hazard, attrition), Equivalence tests (proportions or means).

StudySize (free demo subset and 30-day trial versions) -- for power / precision / sample size calculations, hypothesis testing, point estimation and confidence interval calculations. Includes equivalence- and non-inferiority testing for most tests, Monte Carlo simulation for small samples; group sequential interim analyses . Tests: Wilcoxon, Mann-Whitney, Sign; Student t, 1-way ANOVA, Fisher's exact, McNemar's,Chi2, Life table, Log-rank,Bioequivalence, Correlation and Regression coefficients. Point estimates & confidence intervals for means, medians, SDs, location & scale params in normal, log-normal, exponential, binomial, hypergeometric, Poisson distributions, and more.

Design-Ease and Design-Expert -- two programs from Stat-Ease that specialize in the design of experiments. Full-function 45-day evaluation copies of both programs are available for download.

Curve-fitting & Modeling:
Completely Free... can be freely downloaded and used in their fully-functional mode (no strings attached) Free, but... "demonstration" or "student versions" of commercial packages; can be freely downloaded, but are usually restricted or limited in some way.
EasyReg (Easy Regression Analysis), by Herman J. Bierens. Incredibly powerful and multi-featured program for data manipulation and analysis. Designed for econometrics, but useful in many other disciplines as well. For Win 98/98/NT4.

Also see RegressIt in the Excel Spreadsheets and Add-ins section.

gretl -- a cross-platform (Linux, Windows, Mac, etc.) package for econometric analysis. Has an intuitive interface (English, French, Italian & Spanish). Supports a wide variety of least-squares based estimators, including two-stage & nonlinear least squares, augmented Dickey-Fuller test, Chow test for structural stability, Vector Autoregressions, ARMA estimation. Creates output modelss as LaTeX files, in tabular or equation format. Has an integrated scripting language: enter commands either via the gui or via script, command loop structure for Monte Carlo simulations and iterative estimation procedures, GUI controller for fine-tuning Gnuplot graphs, Link to GNU R for further data analysis. Reads own format XML data files, Comma Separated Values files, Excel and Gnumeric worksheets, BOX1 files, own format binary databases (allowing mixed data frequencies and series lengths) and RATS 4 databases. Includes a sample US macro database. See also the gretl data page.

mle - Maximum Likelihood Estimation -- a simple programming language for building and estimating parameters of likelihood models. Originally designed for survival models, but the language has evolved into a general-purpose tool for building and estimating  general likelihood models. Available for Windows and Linux; also provides User Manual, Reference Manual, and Quick Reference Card.

DEMETRA+ -- user-friendly interface to TRAMO/SEATS and X-13-ARIMA .

Joinpoint Trend Analysis Software (from the National Cancer Institute) -- for the analysis of trends using joinpoint models (where several different lines are connected together at the "joinpoints."). Takes trend data (e.g cancer rates) and fits the simplest joinpoint model that the data allow, using a Monte Carlo Permutation method. Models may incorporate estimated variation for each point (e.g. when the responses are age adjusted rates) or use a Poisson model of variation. In addition, the models may also be linear on the log of the response (e.g. for calculating annual percentage rate change). The software also allows viewing one graph for each joinpoint model, from the model with the minimum number of joinpoints to the model with maximum number of joinpoints.

CurveExpert -- comprehensive curve fitting system for Windows. Handles linear regression models, nonlinear regression models, interpolation, or splines. Over 30 models built-in; custom user-defined regression models. Full-featured graphing capability. Supports an automated process that compares your data to each model to choose the best curve. 30-day evaluation of shareware package.

DTREG generates classification and regression decision trees. It uses V-fold cross-valication with pruning to generate the optimal size tree, and it uses surrogate splitters to handle missing data. A free demonstration copy is available for download.

NLREG performs general nonlinear regression. NLREG will fit a general function, whose form you specify, to a set of data values. A free demonstration copy is available for download.

Origin -- technical graphics and data analysis software for Windows. Includes 3D and contour plotting, FFT filtering; works closely with Excel. 30 evaluation.

Biostatistics and Epidemiology:
Completely Free... can be freely downloaded and used in their fully-functional mode (no strings attached) Free, but... "demonstration" or "student versions" of commercial packages; can be freely downloaded, but are usually restricted or limited in some way.
OpenEpi Version 2.2.1 -- OpenEpi is a free, web-based, open source, operating-system-independent series of programs for use in public health and medicine, providing a number of epidemiologic and statistical tools. It is written in JavaScript and HTML and operates similar to a calculator. OpenEpi can be thought of as an important companion to Epi Info, EpiData, SAS, SPSS, and Stata.

M.D. Anderson Statistical Software Library-- A large collection of free statistical software (almost 70 programs!) from the Biostatistics and Applied Mathematics department of the M.D. Anderson Cancer Center. Software is distributed in the form of program source files and/or self-extracting archives of executable programs for Windows, Mac, Unix/Linux environments.

Lifetables -- Windows program for Mortality Analysis for Demography and Epidemiology. The program will calculate the life expectancy, including all intermediary statistics, variance an confidence interval for the life expectancy, Potential Gains in Life Expectancy (PGLE), Years of Potential Life Lost (YPLL) and Lifetime Years of Potential Life Lost (LYPLL). YPLL can be calculated adjusted for competing causes of mortality and both YPLL and LYPL can also be discounted. Two populations can be compared using direct and indirect standardization, the SMR and CMF and by comparing two lifetables. Confidence intervals and statistical test are provided. There is an extensive helpfile in which everything is explained. Lifetables is listed in the Downloads section of the QuantitativeSkills web site.

Sample Size for Microarray Experiments -- compute how many samples needed for a microarray experiment to find genes that are differentially expressed between two kinds of samples (e.g.: cancer vs. normal tissue), by performing separate gene-by-gene t-tests. You specify how many genes you're looking at, how many false positives you are willing to accept, how large a difference you want to be able to detect (as the fold difference between the two kinds of samples), the power of the test (% of differentially expressed genes likely to be detected by the experiment), and an estimate of the logarithmic SD of the gene intensities.

MIX (Meta-analysis with Interactive eXplanations) -- a statistical add-in for Excel 2000 or later (Windows only). Ideal for learning meta-analysis (reproduces the data, calculations, and graphs of virtually all data sets from the most authoritative meta-analysis books, and lets you analyze your own data "by the book"). Handles datasets with dichotomous & continuous outcomes; calculates Risk Diff, RR, OR, Mean Diff, Hedges's g, Cohen's d; performs standard & cumulative meta-analysis with CI ,z & p; fixed and random effects modeling; Cochran's Q with p-value; Higgins's I2 and H with CI; and publication bias tests: Rank correlation (tau-b) test with z & p, Egger's and Macaskill's regression tests with CI, and Trim-and-Fill. Generates numerous plots: tandard and cumulative forest, p-value function, four funnel types, several funnel regression types, exclusion sensitivity, Galbraith, L'Abbe, Baujat, modeling sensitivity, and Trim-and-Fill.

EWOC - Escalation With Overdose Control -- a Bayesian method for selecting dose levels in Phase I Clinical Trials while controlling the probability of exceeding the maximum tolerated dose. This is a stand-alone Windows (95 through XP) program that receives information about dose-limiting toxicities (DLTs) observed at some starting dose, and calculates the doses to be administered next. DLT information obtained at each dosing level guides the calculation of the next dose level. (For some strange reason, the EWOC download web site does not work properly with the FireFox web browser; but it does work with MS Internet Explorer.)

STPLAN -- Performs power, sample size, and related calculations needed to plan studies. Covers a wide variety of situations, including studies whose outcomes involve the Binomial, Poisson, Normal, and log-normal distributions, or are survival times or correlation coefficients. Available for MS-DOS and Mac; also as Fortran and C source code.

Epi Info -- Public domain statistical software for epidemiology developed by Centers for Disease Control and Prevention (CDC) in Atlanta, Georgia (USA). Epi Info has been in existence for over 20 years and is currently available for Microsoft Windows. The program allows for data entry and analysis. Within the analysis module, analytic routines include t-tests, ANOVA, nonparametric statistics, cross tabulations and stratification with estimates of odds ratios, risk ratios, and risk differences, logistic regression (conditional and unconditional), survival analysis (Kaplan Meier and Cox proportional hazard), and analysis of complex survey data. The software is in the public domain, free, and can be downloaded from http://www.cdc.gov/epiinfo. Limited support is available.

Free Public Health & Epidemiology Software -- written by Mark Myatt)and others:

PAMCOMP (Person-years And Mortality COMputation Program) -- a free Windows 95/98/NT application for calculating person-years and standardised mortality ratios (SMRs). The calculation of person-years allows flexible stratification by sex, and self-defined and unrestricted calendar periods and age groups, and can lag person-years to account for latency periods.The SMR computation includes calculation of 90%, 95%, and 99% confidence intervals. Has filters for ASCII, dBase, Excel, Access, Paradox to import cohort and reference data and to export distributions of person-years and deaths.

ARIMA -- a seasonal adjustment program for PC and Unix, developed by the Census Bureau.

DEMETRA+ -- (Win 9x/NT) a user-friendly interface to the seasonal adjustment methods TRAMO/SEATS and X-12-ARIMA . Developed by Eurostat to facilitate the application of these modern time series techniques to large-scale sets of time series and in the explicit consideration of the needs of production units in statistical institutes. Client/server architecture can access various kinds of databases and files. Contains two main modules: seasonal adjustment and trend estimation with an automated procedure (e.g. for unexperienced users or for large-scale sets of time series), and with a user-friendly procedure for detailed analysis of single time series.

Meta-analysis 5.3 -- Free DOS statistics software for meta-analysis. Probably still the most frequently used meta-analysis software in the world. Can select the analysis of exact p values or effect sizes (d or r, with a cluster size option). Can plot a stem-and-leaf display of correlation coefficients. A utility menu is provided that allows various transformations and preliminary computations that are typically required before the final meta-analysis can be performed.

Surveys, Testing, and Measurement:

StatPac Survey Software -- to design andimplement surveys, and to acquire, manage and analyze data from surveys. Supports multiply data types and question formats, multi-language spell-checking, large files (2,000 variables & 10,000,000 cases, basic statistics (crosstab & banner tables) & graphics, automatic coding of text responses, and data import / export capabilities. Optional Web Survey Module and Advanced Statistics Module (curve fitting, multiple regression, logistic regression, factor, analysis of variance, discriminant function, cluster, and canonical correlation). A demo version is available (limited to 35 cases).

NewMDSX -- software for Multidimensional Scaling (MDS), a term that refers to a family of models where the structure in a set of data is represented graphically by the relationships between a set of points in a space. MDS can be used on a variety of data, using different models and allowing different assumptions about the level of measurement. This site offers a free month trial of the Windows version; a completely free copy of constituent programs, notes, documentation, test Input& Output in MS-DOS; a not-for-profit full Windows copy priced at cost; and a site with a range of data, cross-reference, & information.

Completely Free... can be freely downloaded and used in their fully-functional mode (no strings attached) Free, but... "demonstration" or "student versions" of commercial packages; can be freely downloaded, but are usually restricted or limited in some way.
CCOUNT -- a package for market research data cleaning, manipulation, cross tabulation and data analysis. Similar to, and uses the same syntax as, SPSS-MR "Quantum", a well known commercial package for processing market research data. Available for Windows, Linux, and SunOS. C++ source code also available, under the GNU General Public Licerse.

GGUM2004 (Item Response Theory Models for Unfolding) -- a Windows-based program that estimates parameters in the generalized graded unfolding model (GGUM; Roberts, Donoghue, & Laughlin, 2000). Has a user-friendly interface to prepare command files, run the core estimation program, and display results. Allows different questionnaire items to have varying numbers of response categories (useful when sparse responses require recoding into fewer response categories. Handles sporadically missing responses. Provides item fit statistics and diagnostic graphics of performance.

Rasch Measurement Software -- deals with the various nuances of constructing optimal rating scales from a number of (usually) dichotomous measurements, such as responses to questions in a survey or test. Several free student/demo software packages are available. These may be freely downloaded, used, and distributed, and they do not expire. They are:

  • BIGSTEPS -- a DOS-based precursor to the Windows-based WINSTEPS Rasch measurements program.
  • MINISTEP -- a free evaluation/student version of WINSTEPS. It has complete WINSTEPS functionality, but is limited to 25 items and 100 persons (cases).
  • MINIFAC -- a free evaluation/student version of FACETS (Many-Facet Rasch Analysis). Contains all features except limited to 2,000 data points (responses).

CSPro (Census and Survey Processing System) -- a public-domain software package for entering, tabulating and mapping census and survey data.

IMPS (Integrated Microcomputer Processing System) -- performs the major tasks in survey and census data processing: data entry, data editing, tabulation, data dissemination, statistical analysis and data capture control. (from CDC)

Stats 2.0 -- Windows program for several commonly-needed statistical functions for marketing researchers: random numbers;sample sizes needed for surveys; mean, standard deviation, standard error and range for keyboard-entered data; standard error of a proportion; significance testing between two percentages from independent samples; significance between two percentages from dependent samples; significance testing between two averages from independent samples; contingency table analysis (i.e., Chi-Square)

SABRE -- for the statistical analysis of multi-process random effect response data. Responses can be binary, ordinal, count and linear recurrent events; response sequences can be of different types. Such multi-process data is common in many research areas, e.g. the analysis of work and life histories. Sabre has been used intensively on many longitudinal datasets surveys either with recurrent information collected over time or with a clustered sampling scheme. 

WISC-III Profile Calculator for Macintosh and Windows -- uses generalized distance method to determine if the subtest profile of a single case is multivariately unusual or common in comparison to subtest clusters found in the WISC-III standardization sample. (Mac, 360K; Win anticipated in September)

DEMETRA+ -- user-friendly interface to TRAMO/SEATS and X-12-ARIMA .

Excel Spreadsheets and Add-Ins:
Completely Free... can be freely downloaded and used in their fully-functional mode (no strings attached) Free, but... "demonstration" or "student versions" of commercial packages; can be freely downloaded, but are usually restricted or limited in some way.
This Excel spreadsheet converts confidence intervals to p values, and this PDF file explains it's background and use.


RegressIt - An Excel add-in for teaching and applied work.  Performs multivariate descriptive analysis and ordinary linear regression.  .  Creates presentation-quality charts in native editable Excel format, intelligently formatted tables, high quality scatterplot matrices, parallel time series plots of many variables, summary statistics, and correlation matrices.   Easily explore variations on models, apply nonlinear and time transformations to variables, test model assumptions, and  generate out-of-sample forecasts.   Includes audit trail and organization features: output stored in separate worksheets, date/time/username worksheet stamps, unique model names in all table and chart titles, and an additional worksheet with side-by-side comparisons of models.

SimulAr -- Provides a very elegant point-and-click graphical interface that makes it easy to generate random variables (correlated or uncorrelated) from twenty different distributions, run Monte-Carlo simulations, and generate extensive tabulations and elegant graphical displays of the results.

EZAnalyze -- enhances Excel (Mac and PC) by adding "point and click" functionality for analyzing data and creating graphs (no formula entry required). Does all basic "descriptive statistics" (mean, median, standard deviation, and range), and "disaggregates" data (breaks it down by categories), with results shown as tables or disaggregation graphs". Advanced features: correlation; one-sample, independent samples, and paired samples t-tests; chi square; and single factor ANOVA.
Update Available! The latest version can create z-scores, percentile ranks, and random numbers as new variables; has repeated-measures ANOVA; does simple post hoc tests for single factor and repeated-measures ANOVA; can graph multiple variables on a single graph, and can add error bars for +/- 2 SD’s; adds the sum function to the disaggregate and descriptive statistics functions, and the mode function to descriptive stats; adds delete sheets; adds English & Spanish language options, and works better in international environments; incorporates various bug fixes; and contains an updated user manual.

EZ-R Stats -- supports a variety of analytical techniques, such as: Benford's law, univariate stats, cross-tabs, histograms. Also supports databases such as mySQL, SQLite, MS-Access, MS-SQL. Simplifies the analysis of large volumes of data, enhances audit planning by better characterizing data, identifies potential audit exceptions and facilitates reporting and analysis. This language is a Computer Assisted Audit Technique (CAAT) in support of COSO, SAS 78, SAS 99 and analysis required by Sarbanes-Oxley.

Marko Lucijanic's Excel spreadsheet to perform Log Rank test on survival data, and his article.

SSC-Stat -- an Excel add-in designed to strengthen those areas where the spreadsheet package is already strong, principally in the areas of data management, graphics and descriptive statistics. SSC-Stat is especially useful for datasets in which there are columns indicating different groups. Menu features within SSC-Stat can:

  • help users manipulate their data (stacking, unstacking columns, 2-way unstacking, lookups, generating factors, etc.);
  • generate good graphs (X-Y Scatter Plot, Category-Value Plot, Boxplot, Normal Probability Plot, Density Estimate), that can be edited and polished like any other Excel graph ;
  • provide basic statistical analysis (descriptive statistics, summary statistics, 1- and 2-sample t tests, 1- and 2-sample tests of proportion).

22 Distribution Functions -- There is one spreadsheet for each of the following distribution functions: Beta, Binomial, Chi-Square, Discrete Uniform, Gamma, Geometric, Hypergeometric, Multivariate Hypergeometric, Laplace, Logistic, Multinomial, Negative, Binomial, Normal, Bivariate Normal, Log-normal, Pareto, Poisson, Rectangular, Snedecor F, Student-t, Triangular, Weibull. Each spreadsheet gives a graph of the distribution, along with the value of various parameters, for whatever shape and scale parameters you specify. You can also download a file containing all 22 spreadsheets.

Sample-size calculator for cluster randomized controlled trials, which are used when the outcomes are not completely independent of each other. This independence assumption is violated in cluster randomized trials because subjects within any one cluster are more likely to respond in a similar manner. A measure of this similarity is known as the intra-correlation coefficient (ICC). Because of the lack of independence, sample sizes have to be increased. This web site contains two tools to aid the design of cluster trials – a database of ICCs and a sample size calculator (along with instruction manuals).

DAG_Stat -- calculates an enormous number of quantities from a 2 -by-2 table:

  • for diagnostic tests: sensitivity, sensitivity of a random test given the observed prevalence and test level., sensitivity quality index, specificity, specificity of a random test, specificity quality index, efficiency (the correct classification rate), efficiency of a random test, quality index, Youden's index, the predictive value of positive test, predictive value of a positive random test, predictive value of negative test, predictive. value of a negative random test, likelihood ratio of a positive and negative tests, the odds ratio, false positive and false negative rates, prevalence observed in the sample and test level (proportion of subjects classified as 'positive.'
  • for interrater agreement: Cohen's Kappa, observed agreement, chance agreement, agreement about positive and negative cases, Byrt's bias index, Byrt's prevalence asymmetry index, bias adjusted Kappa, prevalence & bias adjusted Kappa. DAG_Stat also calculates Dice's index, Yule's Q (Gamma), Phi, Scott's agreement index, the tetrachoric correlation coefficient, Goodman & Kruskal's tau, Lambda, the Uncertainty Coefficient, Pearson's Chi Square (with and without Yates' correction), the likelihood ratio Chi Square, McNemar's Test, (with and without Yates' correction).

MIX (Meta-analysis with Interactive eXplanations) -- a statistical add-in for Excel 2000 or later (Windows only). Ideal for learning meta-analysis (reproduces the data, calculations, and graphs of virtually all data sets from the most authoritative meta-analysis books, and lets you analyze your own data "by the book"). Handles datasets with dichotomous & continuous outcomes; calculates Risk Diff, RR, OR, Mean Diff, Hedges's g, Cohen's d; performs standard & cumulative meta-analysis with CI ,z & p; fixed and random effects modeling; Cochran's Q with p-value; Higgins's I2 and H with CI; and publication bias tests: Rank correlation (tau-b) test with z & p, Egger's and Macaskill's regression tests with CI, and Trim-and-Fill. Generates numerous plots: tandard and cumulative forest, p-value function, four funnel types, several funnel regression types, exclusion sensitivity, Galbraith, L'Abbe, Baujat, modeling sensitivity, and Trim-and-Fill.

OZGRID -- contains over 4000 pages (and growing) of information on Excel and VBA for Excel. Many add-on's are for sale, but there is also an enormous amount of totally free content: downloads, a free 24/7 question and answer support forum for MS Office, a free Excel monthly newsletter full of detailed tips, tricks, hacks and more for Excel and VBA.

Exact confidence intervals for samples from the Binomial and Poisson distributions -- an Excel spreadsheet with several built-in functions for calculating probabilities and confidence intervals. (42k long).

BiPlot -- by Ilya Lipkovich and Eric P. Smith, of Virginia Tech. A user-friendly add-in for Excel to draw a biplot display (a graph of row and column markers from data that forms a two-way table) based on results from principal components analysis, correspondence analysis, canonical discriminant analysis, metric multidimensional scaling, redundancy analysis, canonical correlation analysis or canonical correspondence analysis. Allows for a variety of transformations of the data prior to the singular value decomposition and scaling of the markers following the decomposition.

  • Self-made Sampling Plans -- Examine the OC curves for your own custom sampling plans. Use either binomial or hypergeometric calculations. Now be able to explain the "valid statistical rationale" of the sampling plans you already use.
  • Sequential Sampling Plans -- Provides an analysis and planning tool for sample sizes in situations where lots undergo sequential inspections (e.g., 1st by Manufacturing, 2nd by QC, and finally by QA).

Lifetable -- does a full abridged current life table analysis to obtain the life expectancy of a population. Furthermore, one can calculate Potential Gains in Life Expectancy (PGLE) after removing cause k, considering competing causes of death; the (Premature) Years of Potential Life Lost (YPLL), this is the number of person years added to the total number of person years lived in a population if cause of death k would be removed; the Standardized Mortality Ratio (SMR), standardized numbers per 100,000 and the Comparative Mortality Figure (CMF) can also be calculated. From the Downloads section of the QuantitativeSkills web site.

-

Intracorrelation -- does intra correlation calculations for dichotomous or binary yes/no type outcome variables according to two different methods proposed for the single cluster one by Fleiss and another one by Bennett et.al. A third spreadsheet concerns a method for two clusters by Donner and Klar. You will have to insert your own data by overwriting the tables in the second (total number of positive responses) and third (total number of negative responses) or fourth column (total number). From the Downloads section of the QuantitativeSkills web site.

XLStatistics -- a set of Excel (ver 5+) workbooks for statistical analysis of data. A step-by-step guide to data analysis with separate workbooks for handling data with different numbers and types of variables. Contains most standard analyses, analyses using only summary data, power / sample size , nonparametrics, curve fitting , non-linear regression, analysis for 2x2 tables. XLStatistics is not an Excel add-in and all the working and code is visible. A free version for analysis of 1- and 2-variable data is available.

Analyse-it -- includes over 30 parametric & non-parametric statistical functions, including multiple linear regression analysis, ANOVA, & chi-square statistics. A separate specialized package for clinical method evaluation provides NCCLS and IFCC procedures for accuracy & imprecision.

XLSTAT -- an Excel add-in for PC and MAC that holds more than 200 statistical features including data visualization, multivariate data analysis, modeling, machine learning, statistical tests as well as field-oriented solutions: features for sensory data analysis (preference mapping), time series analysis (forecasting), marketing (conjoint analysis, PLS structural equation modeling), biostatistics (survival analysis, OMICs data analysis) and more. It proposes a free 30-day trial of all features as well as a free version.

Programming Languages and Subroutine Libraries:
Completely Free... can be freely downloaded and used in their fully-functional mode (no strings attached) Free, but... "demonstration" or "student versions" of commercial packages; can be freely downloaded, but are usually restricted or limited in some way.
MuPAD -- a very powerful and general computerized algebra system, developed at the University of Paderborn, now distributed by SciFace Software. In the same category as Mathematica and Maple, it does numerical calculations, symbolic manipulation (algebra, differentiation & integration), graphing, and programming. A free "lite" (but still very powerful) version for PC and Mac can be downloaded.

Statistics101 -- executes programs written in the easy-to-learn Resampling Stats statistical simulation language. You write a short, simple program in the language, describing the process behind a probability or statistics problem. Statistics101 then executes your Resampling Stats model thousands of times, each time with different random numbers or samples, keeping track of the results. When the program completes, you have your answer. Runs on Windows, Mac, Lunux -- any system that supports Java.

R -- a programming language and environment for statistical computing and graphics. Similar to S or S-plus (will run most S code unchanged). Available for Windows, various Unix flavors (including Linux), NextStep and Mac. Provides a wide variety of statistical (linear and nonlinear modelling, classical statistical tests, time-series analysis, classification, clustering, ...) and graphical techniques, and is highly extensible. Well-designed publication-quality plots can be produced, including mathematical symbols and formulae where needed. The R environment includes:

  • an effective data handling and storage facility,
  • a suite of operators for calculations on arrays, in particular matrices,
  • a large, coherent, integrated collection of intermediate tools for data analysis,
  • graphical facilities for data analysis and display either on-screen or on hardcopy, and
  • a well-developed, simple and effective programming language which includes conditionals, loops, user-defined recursive functions and input and output facilities.

RStudio -– is a set of integrated tools designed to help you be more productive with R. It includes a console, syntax-highlighting editor that supports direct code execution, as well as tools for plotting, history, debugging and workspace management.
Available in open source and commercial editions and runs on the desktop (Windows, Mac, and Linux) or in a browser connected to RStudio Server or RStudio Server Pro (Debian/Ubuntu, RedHat/CentOS, and SUSE Linux).

    Integrated development environment
  • Access RStudio locally
  • Syntax highlighting, code completion, and smart indentation
  • Execute R code directly from the source editor
  • Quickly jump to function definitions
  • Easily manage multiple working directories using projects
  • Integrated R help and documentation
  • Interactive debugger to diagnose and fix errors quickly
  • Extensive package development tools
    RStudio Server
  • Access via a web browser
  • Move computation closer to the data
  • Scale compute and RAM centrally
    Shiny
  • A web application framework for R. Turn your analyses into interactive web applications

ILNumerics -- a numerical library for .NET that turns C# into a 1st class mathematical language. It offers both scientists and software developers convenient syntax (similar to Matlab), toolboxes for statistical functions and machine learning, high performance, wide platform support and 2D and 3D visualization features. There's a free "Community" edition and a pay-for "Professional" edition. Both have the same features and capabilities; they differ in how you would re-distribute them in your own software products.

Zelig -- an add-on for R that can estimate, help interpret, and present the results of a large range of statistical methods. It translates hard-to-interpret coefficients into quantities of interest; combines multiply imputed data sets to deal with missing data; automates bootstrapping for all models; uses sophisticated nonparametric matching commands which improve parametric procedures; allows one-line commands to run analyses in all designated strata; automates the creation of replication data files so that you (or anyone else) can replicate the results of your analyses (hence satisfying the replication standard); makes it easy to evaluate counterfactuals; and allows conditional population and superpopulation inferences. It includes many specific methods, based on likelihood, frequentist, Bayesian, robust Bayesian, and nonparametric theories of inference. Zelig comes with detailed, self-contained documentation that minimizes startup costs for Zelig and R, automates graphics and summaries for all models, and, with only three simple commands required, generally makes the power of R accessible for all users. Zelig also works well for teaching, and is designed so that scholars can use the same program with students that they use for their research.

Apophenia -- a statistics library for C. It provides functions on the same level as those of the typical stats package (OLS, probit, singular value decomposition, &c.) but doesn't tie the user to an ad hoc language or environment.

Octave -- a high-level mathematical programming language, similar to MATLAB, for numerical computations -- solving common numerical linear algebra problems, finding the roots of nonlinear equations, integrating ordinary functions, manipulating polynomials, and integrating ordinary differential and differential-algebraic equations. It is easily extensible and customizable via user-defined functions written in Octave's own language, or using dynamically loaded modules written in C++, C, Fortran, or other languages. Runs under Linux and Windows.

J -- a modern, high-level, general-purpose, high-performance programming language. Runs on Windows, Unix, Mac, and PocketPC handhelds. J runs both as a GUI and in a console (command line). Much like APL, but uses "conventional" symbols, rather than APL's a specialized character set. J is particularly strong in the mathematical, statistical, and logical analysis of arrays of data. J systems have:

  • an integrated development environment
  • standard libraries, utilities, and packages
  • a form designer for your application forms
  • an event-driven graphical user interface to your application
  • interfaces with other programming languages and applications
  • integrated 2d and 3d graphics
  • memory mapped files for high performance data applications

mle - Maximum Likelihood Estimation -- a simple programming language for building and estimating parameters of likelihood models. Originally designed for survival models, but the language has evolved into a general-purpose tool for building and estimating  general likelihood models. Available for Windows and Linux; also provides User Manual, Reference Manual, and Quick Reference Card.

Mx  -- a matrix algebra interpreter and numerical optimizer for exploration of matrix algebra. Many built-in fit fuctions for structural equation modeling and other statistical modeling. Has fitting fuctions like those in LISREL, LISCOMP, EQS and CALIS, along with facilities for maximum likelihood estimation of parameters from missing data structures, under normal theory. Users can easily specify complex 'nonstandard' models, define their own fit functions, and perform optimization subject to linear and nonlinear equality or boundary constraints.

JDB -- Relational Database and Elementary Statistics for a Unix environment. Useful for manipulating experimental data (joining files, cleaning data, reformatting for input into other programs). Computes basic statistics (mean, std. dev., confidence intervals, quartiles, n-tiles, percentiles, histograms, correlations, z-scores, t-scores.

MacANOVA -- comprehensive statistical package for the Mac and PC/Windows. MacAnova has macros which are used just like functions. Several macros are built in, and three files of additional macros (general, time series, design of experiments) are distributed with MacAnova. Like S, MacAnova is a programming language with for and while loops, if, else, elseif, break, and a full range of operations including bit manipulation.

Lisp-Stat  -- an extensible statistical computing environment for data analysis, statistical instruction and research, and  for exploring the use of dynamic graphical methods. Based on an extended subset of Common Lisp, performs element-wise operations on lists and vectors, and adds a variety of basic statistical and linear algebra functions. Graphics system is object-oriented, and can be customized and adapted. Supports linear and nonlinear regression models and generalized linear models. Runs on Mac, X-window (UNIX), and MS Windows

Resampling Stats -- a different approach to learning statistics and performing statistical analyses, using simulation with random numbers instead of complex mathematics. 30-day trial version available for Win 95/NT.

O-Matrix -- an extensive matrix manipulation system (for Windows) with lots of statistical capability. The "Light" version can be freely downloaded and tried for 30 days. Some capabilities include:

  • Matrix Functions: determinant, eigenvalues and eigenvectors, systems of equations
  • Statistics: minimum, maximum, mean, median, standard deviation, linear regression, correlation, covariance, sorting, t-distributions, f-distributions, probability, normal distributions, population simulations, Kolmogorov-Smirnov Test
  • Optimization: linear & nonlinear least squares, with and without box constraints and with or without derivatives, quadratic and general nonlinear programming, linear complementarity problems
  • Random Simulations: uniform and normal random number generators, auto-regressive process simulation
  • Special Functions: error, gamma, incomplete beta, Y and J Bessel
  • Also: quadrature, differential equations, Fourier analysis, spectral estimation, convolution, FFT, Interpolation, filtering, Kalman-Bucy filtering, wWavelets: Haar and Daubechies transforms, polynomials, and general functions (trig, hyp, inv trig &  hyp, exp, log, roots, forward & backward difference approximations to the derivatives of vector-valued functions

Also provides extensive plotting capabilities, with multiple windows, axis scaling & labeling, titling, free-form text, selectable fonts. Plots exportable to word processors, spreadsheets, etc. Plot Types: line, contour, surface, mesh, bar, stair, polar, vector, error bar, smith charts, and histogram; line plots can contain unlimited points per curve and hundreds of curves per plot; two- and three-dimensional plotting is supported which provides additional flexibility with contours and surface plots; multiple colors, markers, and line types.

OxMetrics -- an object-oriented matrix programming language with a comprehensive mathematical and statistical function library. Matrices can be used directly in expressions, for example to multiply two matrices, or to invert a matrix. The major features of Ox are its speed, extensive library, and well-designed syntax, which leads to programs which are easier to maintain. Versions of Ox are available for many platforms. The "Console" version can be freely downloaded for academic and research use; the "Professional" version must be purchased.

Scripts and Macros:
Completely Free... can be freely downloaded and used in their fully-functional mode (no strings attached) Free, but... "demonstration" or "student versions" of commercial packages; can be freely downloaded, but are usually restricted or limited in some way.

Link-King -- a SAS program to detect duplicate entries in a file, or to link matching records in two files, based on criteria like names (first, middle, last, maiden, nickname), date of birth, gender, and social security number. A graphical interface, a “Link King for Knaves” feature, and a powerful interface for manually reviewing uncertain matches make it easy to use. It features both probabilistic and deterministic record linkage algorithms, phonetic name matching (NYSIIS and Soundex), and many other features for dealing with "mushy matches".


Miscellaneous:
Completely Free... can be freely downloaded and used in their fully-functional mode (no strings attached) Free, but... "demonstration" or "student versions" of commercial packages; can be freely downloaded, but are usually restricted or limited in some way.

IND -- Creation and manipulation of decision trees from data.  For supervised classification and prediction in artificial intelligence and statistical pattern recognition. A tree is "grown" from data using a recursive partitioning algorithm to create a tree which (hopefully) has good prediction of classes on new data. IND improves on standard algorithms and introduces Bayesian and MML methods, producing more accurate class probability estimates that are important in applications like diagnosis. For UNIX systems. Currently available only in beta-test mode, and only to US citizens.

MANET -- ("Missings Are Now Equally Treated") Macintosh software for interactive graphics tools for data sets with missing values (last updated in 2000). Generates missing values chart, histograms & barcharts, boxplots & dotplots, scatterplots, mosaic plots, polygon plots, highlighted boxplots, interactive trellis displays, traces, context-sensitive interrogation, cues, redframing, selection sequences.

StudioLine Photo Basic -- Photo editing software from H&M Software. Add descriptions to images, re-size photos for efficient e-mail transmission, print high-quality copies, display slide-shows, publish web-galleries, safe-keep images on CD or DVD. Version 2.2 has a new user interface, dual-monitor support, increased speed and other technical improvements. SmartUpdate feature checks for new versions. Has a web-board for user-to-user help.

WAFO -- Wave Analysis for Fatigue and Oceanography. A toolbox of Matlab (ver. 5.x / 6.x, for Windows & Unix) routines for statistical analysis and simulation of random waves and random loads. Tools are provided for analysis of measured data with routines for estimation of parameters in statistical distributions, estimation of spectra, plotting in probability papers, etc. Has routines for theoretical distributions of characteristic wave parameters from observed or theoretical power spectra of the sea. Another part is related to statistical analysis of fatigue. The theoretical density of rainflow cycles can be computed from parameters of random loads. Has routines is included for modelling of switching loads (hidden Markov models). Also contains general statistical tools.

Graphing Calculator 3D -- Wide range of plotting capabilities. Plot equations and data points (Excel, CSV). Available for Windows, Mac or Linux computers.

CoPlot 6.2 -- for publication-quality 2D and 3D scientific graphs (from data and equations), maps, and technical drawings. From CoHort Software. Creates precise technical drawings using drawing objects, genetic maps, field maps, flow charts, apparatus diagrams, circuit diagrams, chemical structures, etc. Text in drawing objects and graphs can include HTML-like text formatting tags and over 1000 special characters. Supports animated graphs. Exports graphs to .eps, .gif, .jpg, .pdf, .png, .svg, .wmf, and others. Has an auto-recorder and macro programming language. Invoke CoPlot from the command line, batch files, shell scripts, pipes, and other programs. Can be used as a graphics server program on a web site. Free time-limited demo version available.

Other Links to Collections of Free Software:


Please send all suggestions for new pages, notifications for dead links, and other requests in an e-mail to statpages.org@gmail.com