Ubuntu Manpage: mlpack_lars

Provided by: mlpack-bin_4.1.0-1ubuntu1_amd64

NAME

       mlpack_lars - lars

SYNOPSIS

        mlpack_lars [-i unknown] [-m unknown] [-l double] [-L double] [-r unknown] [-t unknown] [-c bool] [-V bool] [-M unknown] [-o unknown] [-h -v]

DESCRIPTION

       An  implementation of LARS: Least Angle Regression (Stagewise/laSso). This is a stage-wise
       homotopy-based   algorithm   for   L1-regularized   linear    regression    (LASSO)    and
       L1+L2-regularized linear regression (Elastic Net).

       This  program  is  able to train a LARS/LASSO/Elastic Net model or load a model from file,
       output regression predictions for a test set, and save the trained model to  a  file.  The
       LARS algorithm is described in more detail below:

       Let X be a matrix where each row is a point and each column is a dimension, and let y be a
       vector of targets.

       The Elastic Net problem is to solve

         min_beta 0.5 || X * beta - y ||_2^2 + lambda_1 ||beta||_1 +
           0.5 lambda_2 ||beta||_2^2

       If lambda1 > 0 and lambda2 = 0, the problem is the LASSO.  If lambda1 > 0 and lambda2 > 0,
       the  problem  is  the  Elastic  Net.  If lambda1 = 0 and lambda2 > 0, the problem is ridge
       regression.  If lambda1 =  0  and  lambda2  =  0,  the  problem  is  unregularized  linear
       regression.

       For  efficiency reasons, it is not recommended to use this algorithm with ’--lambda1 (-l)'
       =  0.  In  that  case,  use  the  'linear_regression'  program,  which   implements   both
       unregularized linear regression and ridge regression.

       To  train  a  LARS/LASSO/Elastic  Net model, the '--input_file (-i)' and ’--responses_file
       (-r)'  parameters  must  be  given.  The   '--lambda1   (-l)',   ’--lambda2   (-L)',   and
       '--use_cholesky  (-c)'  parameters  control  the  training options. A trained model can be
       saved with the '--output_model_file (-M)'.  If no training is desired at all, a model  can
       be passed via the ’--input_model_file (-m)' parameter.

       The  program  can also provide predictions for test data using either the trained model or
       the given input model. Test points can be specified with the ’--test_file (-t)' parameter.
       Predicted  responses  to  the test points can be saved with the '--output_predictions_file
       (-o)' output parameter.

       For example, the following command trains a model on the  data  'data.csv'  and  responses
       'responses.csv'  with lambda1 set to 0.4 and lambda2 set to 0 (so, LASSO is being solved),
       and then the model is saved to 'lasso_model.bin':

       $ mlpack_lars --input_file data.csv --responses_file responses.csv --lambda1 0.4 --lambda2
       0 --output_model_file lasso_model.bin

       The  following  command  uses the 'lasso_model.bin' to provide predicted responses for the
       data 'test.csv' and save those responses to ’test_predictions.csv':

       $     mlpack_lars     --input_model_file     lasso_model.bin     --test_file      test.csv
       --output_predictions_file test_predictions.csv

OPTIONAL INPUT OPTIONS

       --help (-h) [bool]
              Default help info.

       --info [string]
              Print help on a specific option. Default value ''.

       --input_file (-i) [unknown]
              Matrix of covariates (X).

       --input_model_file (-m) [unknown]
              Trained LARS model to use.

       --lambda1 (-l) [double]
              Regularization parameter for l1-norm penalty.  Default value 0.

       --lambda2 (-L) [double]
              Regularization parameter for l2-norm penalty.  Default value 0.

       --responses_file (-r) [unknown]
              Matrix of responses/observations (y).

       --test_file (-t) [unknown]
              Matrix containing points to regress on (test points).

       --use_cholesky (-c) [bool]
              Use  Cholesky decomposition during computation rather than explicitly computing the
              full Gram matrix.

       --verbose (-v) [bool]
              Display informational messages and the full list of parameters and  timers  at  the
              end of execution.

       --version (-V) [bool]
              Display the version of mlpack.

OPTIONAL OUTPUT OPTIONS

       --output_model_file (-M) [unknown]
              Output LARS model.

       --output_predictions_file (-o) [unknown]
              If  --test_file  is  specified,  this file is where the predicted responses will be
              saved.

ADDITIONAL INFORMATION

       For further information, including relevant papers, citations,  and  theory,  consult  the
       documentation found at http://www.mlpack.org or included with your distribution of mlpack.