Provided by: mlpack-bin_3.2.2-3_amd64 bug

NAME

       mlpack_preprocess_describe - descriptive statistics

SYNOPSIS

        mlpack_preprocess_describe -i string [-d int] [-P bool] [-p int] [-r bool] [-V bool] [-w int] [-h -v]

DESCRIPTION

       This  utility  takes  a  dataset  and  prints  out  the  descriptive  statistics of the data. Descriptive
       statistics is the  discipline  of  quantitatively  describing  the  main  features  of  a  collection  of
       information,  or  the quantitative description itself. The program does not modify the original file, but
       instead prints out the statistics to the console. The printed result will look like a table.

       Optionally, width and precision of the output can be adjusted by a user  using  the  '--width  (-w)'  and
       '--precision  (-p)'  parameters.  A user can also select a specific dimension to analyze if there are too
       many dimensions. The ’--population (-P)' parameter can be specified when the dataset should be considered
       as a population. Otherwise, the dataset will be considered as a sample.

       So,  a  simple  example  where we want to print out statistical facts about the dataset 'X.csv' using the
       default settings, we could run

       $ mlpack_preprocess_describe --input_file X.csv --verbose

       If we want to customize the width to 10 and precision to 5 and consider the dataset as a  population,  we
       could run

       $ mlpack_preprocess_describe --input_file X.csv --width 10 --precision 5 --verbose

REQUIRED INPUT OPTIONS

       --input_file (-i) [string]
              Matrix containing data,

OPTIONAL INPUT OPTIONS

       --dimension (-d) [int]
              Dimension of the data. Use this to specify a dimension Default value 0.

       --help (-h) [bool]
              Default help info.

       --info [string]
              Print help on a specific option. Default value ''.

       --population (-P) [bool]
              If  specified,  the  program  will calculate statistics assuming the dataset is the population. By
              default, the program will assume the dataset as a sample.

       --precision (-p) [int]
              Precision of the output statistics. Default value 4.

       --row_major (-r) [bool]
              If specified, the program will calculate statistics across rows, not  across  columns.   (Remember
              that in mlpack, a column represents a point, so this option is generally not necessary.)

       --verbose (-v) [bool]
              Display informational messages and the full list of parameters and timers at the end of execution.

       --version (-V) [bool]
              Display the version of mlpack.

       --width (-w) [int]
              Width of the output table. Default value 8.

ADDITIONAL INFORMATION

       For  further  information,  including  relevant  papers, citations, and theory, consult the documentation
       found at http://www.mlpack.org or included with your distribution of mlpack.