xenial (1) mlpack_lsh.1.gz

Provided by: mlpack-bin_2.0.1-1_amd64 bug

NAME

       mlpack_lsh - all k-approximate-nearest-neighbor search with lsh

SYNOPSIS

        mlpack_lsh [-h] [-v] [-B int] [-d string] [-H double] [-m string] [-k int] [-n string] [-M string] [-K int] [-q string] [-r string] [-S int] [-s int] [-L int] -V

DESCRIPTION

       This  program  will  calculate  the  k  approximate-nearest-neighbors  of a set of points using locality-
       sensitive hashing. You may specify a separate set of  reference  points  and  query  points,  or  just  a
       reference set which will be used as both the reference and query set.

       For  example, the following will return 5 neighbors from the data for each point in 'input.csv' and store
       the distances in 'distances.csv' and the neighbors in the file 'neighbors.csv':

       $ lsh -k 5 -r input.csv -d distances.csv -n neighbors.csv

       The output files are organized such that row i and column j in the neighbors output file  corresponds  to
       the  index  of  the  point  in the reference set which is the i'th nearest neighbor from the point in the
       query set with index j.  Row i and column j in the distances output  file  corresponds  to  the  distance
       between those two points.

       Because this is approximate-nearest-neighbors search, results may be different from run to run. Thus, the
       --seed option can be specified to set the random seed.

OPTIONS

       --bucket_size (-B) [int]
              The size of a bucket in the second level hash.  Default value 500.  --distances_file (-d) [string]
              File to output distances into. Default value ’'.

       --hash_width (-H) [double]
              The  hash  width  for  the first-level hashing in the LSH preprocessing. By default, the LSH class
              automatically estimates a hash width for its use. Default value 0.

       --help (-h)
              Default help info.

       --info [string]
              Get help on a specific module or option.  Default value ''.  --input_model_file (-m) [string] File
              to load LSH model from. (Cannot be specified with --reference_file.) Default value ’'.

       --k (-k) [int]
              Number  of  nearest  neighbors  to  find. Default value 0.  --neighbors_file (-n) [string] File to
              output neighbors into. Default value ’'.  --output_model_file (-M) [string] File to save LSH model
              to. Default value ''.

       --projections (-K) [int]
              The number of hash functions for each table Default value 10.

       --query_file (-q) [string]
              File  containing  query points (optional).  Default value ''.  --reference_file (-r) [string] File
              containing the reference dataset. Default value ''.  --second_hash_size (-S) [int] The size of the
              second level hash table.  Default value 99901.

       --seed (-s) [int]
              Random seed. If 0, 'std::time(NULL)' is used.  Default value 0.

       --tables (-L) [int]
              The number of hash tables to be used. Default value 30.

       --verbose (-v)
              Display informational messages and the full list of parameters and timers at the end of execution.

       --version (-V)
              Display the version of mlpack.

ADDITIONAL INFORMATION

ADDITIONAL INFORMATION

       For  further  information,  including  relevant  papers,  citations, and theory, For further information,
       including   relevant   papers,   citations,   and   theory,   consult   the   documentation   found    at
       http://www.mlpack.org  or  included with your consult the documentation found at http://www.mlpack.org or
       included with your DISTRIBUTION OF MLPACK.  DISTRIBUTION OF MLPACK.

                                                                                                   mlpack_lsh(1)