Ubuntu Manpage: qlc - Query Interface to Mnesia, ETS, Dets, etc

name
description
overview
syntax
evaluation
join
common options
getting started
implementing a qlc table
key equality
data types
exports
see also

Provided by: erlang-manpages_16.b.3-dfsg-1ubuntu2.2_all

NAME

       qlc - Query Interface to Mnesia, ETS, Dets, etc

DESCRIPTION

       The  qlc  module provides a query interface to Mnesia, ETS, Dets and other data structures that implement
       an iterator style traversal of objects.

OVERVIEW

The qlc module implements a query interface to QLC tables. Typical QLC tables are ETS, Dets, and Mnesia
tables. There is also support for user defined tables, see the Implementing a QLC table section. A query
is stated using Query List Comprehensions (QLCs). The answers to a query are determined by data in QLC
tables that fulfill the constraints expressed by the QLCs of the query. QLCs are similar to ordinary list
comprehensions as described in the Erlang Reference Manual and Programming Examples except that variables
introduced in patterns cannot be used in list expressions. In fact, in the absence of optimizations and
options such as cache and unique (see below), every QLC free of QLC tables evaluates to the same list of
answers as the identical ordinary list comprehension.

While ordinary list comprehensions evaluate to lists, calling qlc:q/1,2 returns a Query Handle. To
obtain all the answers to a query, qlc:eval/1,2 should be called with the query handle as first argument.
Query handles are essentially functional objects ("funs") created in the module calling q/1,2. As the
funs refer to the module's code, one should be careful not to keep query handles too long if the module's
code is to be replaced. Code replacement is described in the Erlang Reference Manual. The list of answers
can also be traversed in chunks by use of a Query Cursor. Query cursors are created by calling
qlc:cursor/1,2 with a query handle as first argument. Query cursors are essentially Erlang processes. One
answer at a time is sent from the query cursor process to the process that created the cursor.

SYNTAX

       Syntactically QLCs have the same parts as ordinary list comprehensions:

       [Expression || Qualifier1, Qualifier2, ...]

       Expression (the template) is an arbitrary Erlang expression. Qualifiers are either filters or generators.
       Filters  are  Erlang  expressions  returning  bool(). Generators have the form Pattern <- ListExpression,
       where ListExpression is an expression evaluating to a query handle or a list. Query handles are  returned
       from      qlc:table/2,      qlc:append/1,2,     qlc:sort/1,2,     qlc:keysort/2,3,     qlc:q/1,2,     and
       qlc:string_to_handle/1,2,3.

EVALUATION

       The evaluation of a query handle begins by the inspection of options and the  collection  of  information
       about  tables.  As  a  result  qualifiers  are  modified  during  the  optimization  phase. Next all list
       expressions are evaluated. If a cursor has been created evaluation takes place in the cursor process. For
       those list expressions that are QLCs, the list expressions of the QLCs' generators are evaluated as well.
       One has to be careful if list expressions have side effects since the order in which list expressions are
       evaluated  is unspecified. Finally the answers are found by evaluating the qualifiers from left to right,
       backtracking when some filter returns false, or collecting the template when all filters return true.

       Filters that do not return bool() but fail are handled differently depending  on  their  syntax:  if  the
       filter is a guard it returns false, otherwise the query evaluation fails. This behavior makes it possible
       for the qlc module to do some optimizations without affecting the meaning of a query. For  example,  when
       testing  some  position  of  a  table and one or more constants for equality, only the objects with equal
       values are candidates for further evaluation. The other objects are guaranteed to make the filter  return
       false,  but  never  fail.  The (small) set of candidate objects can often be found by looking up some key
       values of the table or by traversing the table using a match specification. It is necessary to place  the
       guard  filters  immediately  after  the  table's  generator,  otherwise the candidate objects will not be
       restricted to a small set. The reason is that objects that could make the query evaluation fail must  not
       be excluded by looking up a key or running a match specification.

JOIN

The qlc module supports fast join of two query handles. Fast join is possible if some position P1 of one
query handler and some position P2 of another query handler are tested for equality. Two fast join
methods have been implemented:

* Lookup join traverses all objects of one query handle and finds objects of the other handle (a QLC
table) such that the values at P1 and P2 match or compare equal. The qlc module does not create any
indices but looks up values using the key position and the indexed positions of the QLC table.

* Merge join sorts the objects of each query handle if necessary and filters out objects where the
values at P1 and P2 do not compare equal. If there are many objects with the same value of P2 a
temporary file will be used for the equivalence classes.

The qlc module warns at compile time if a QLC combines query handles in such a way that more than one
join is possible. In other words, there is no query planner that can choose a good order between possible
join operations. It is up to the user to order the joins by introducing query handles.

The join is to be expressed as a guard filter. The filter must be placed immediately after the two joined
generators, possibly after guard filters that use variables from no other generators but the two joined
generators. The qlc module inspects the operands of =:=/2, ==/2, is_record/2, element/2, and logical
operators (and/2, or/2, andalso/2, orelse/2, xor/2) when determining which joins to consider.

COMMON OPTIONS

       The following options are accepted by cursor/2, eval/2, fold/4, and info/2:

         * {cache_all, Cache} where Cache is equal to ets or list adds a {cache, Cache}  option  to  every  list
           expression  of the query except tables and lists. Default is {cache_all, no}. The option cache_all is
           equivalent to {cache_all, ets}.

         * {max_list_size, MaxListSize} where MaxListSize is the size in bytes of terms on the external  format.
           If  the  accumulated  size  of  collected  objects exceeds MaxListSize the objects are written onto a
           temporary file. This option is used by the {cache, list} option as well as by the merge join  method.
           Default is 512*1024 bytes.

         * {tmpdir_usage,  TmpFileUsage} determines the action taken when qlc is about to create temporary files
           on the directory set by the tmpdir option. If the value is not_allowed an error  tuple  is  returned,
           otherwise  temporary  files  are  created  as  needed. Default is allowed which means that no further
           action is taken. The values info_msg, warning_msg, and error_msg mean  that  the  function  with  the
           corresponding  name in the module error_logger is called for printing some information (currently the
           stacktrace).

         * {tmpdir, TempDirectory} sets the directory used by merge join for temporary files and by the  {cache,
           list}  option. The option also overrides the tmpdir option of keysort/3 and sort/2. The default value
           is "" which means that the directory returned by file:get_cwd() is used.

         * {unique_all, true} adds a {unique, true} option to every list expression of  the  query.  Default  is
           {unique_all, false}. The option unique_all is equivalent to {unique_all, true}.

GETTING STARTED

As already mentioned queries are stated in the list comprehension syntax as described in the Erlang
Reference Manual. In the following some familiarity with list comprehensions is assumed. There are
examples in Programming Examples that can get you started. It should be stressed that list comprehensions
do not add any computational power to the language; anything that can be done with list comprehensions
can also be done without them. But they add a syntax for expressing simple search problems which is
compact and clear once you get used to it.

Many list comprehension expressions can be evaluated by the qlc module. Exceptions are expressions such
that variables introduced in patterns (or filters) are used in some generator later in the list
comprehension. As an example consider an implementation of lists:append(L): [X ||Y <- L, X <- Y]. Y is
introduced in the first generator and used in the second. The ordinary list comprehension is normally to
be preferred when there is a choice as to which to use. One difference is that qlc:eval/1,2 collects
answers in a list which is finally reversed, while list comprehensions collect answers on the stack which
is finally unwound.

What the qlc module primarily adds to list comprehensions is that data can be read from QLC tables in
small chunks. A QLC table is created by calling qlc:table/2. Usually qlc:table/2 is not called directly
from the query but via an interface function of some data structure. There are a few examples of such
functions in Erlang/OTP: mnesia:table/1,2, ets:table/1,2, and dets:table/1,2. For a given data structure
there can be several functions that create QLC tables, but common for all these functions is that they
return a query handle created by qlc:table/2. Using the QLC tables provided by OTP is probably sufficient
in most cases, but for the more advanced user the section Implementing a QLC table describes the
implementation of a function calling qlc:table/2.

Besides qlc:table/2 there are other functions that return query handles. They might not be used as often
as tables, but are useful from time to time. qlc:append traverses objects from several tables or lists
after each other. If, for instance, you want to traverse all answers to a query QH and then finish off by
a term {finished}, you can do that by calling qlc:append(QH, [{finished}]). append first returns all
objects of QH, then {finished}. If there is one tuple {finished} among the answers to QH it will be
returned twice from append.

As another example, consider concatenating the answers to two queries QH1 and QH2 while removing all
duplicates. The means to accomplish this is to use the unique option:

qlc:q([X || X <- qlc:append(QH1, QH2)], {unique, true})

The cost is substantial: every returned answer will be stored in an ETS table. Before returning an answer
it is looked up in the ETS table to check if it has already been returned. Without the unique options all
answers to QH1 would be returned followed by all answers to QH2. The unique options keeps the order
between the remaining answers.

If the order of the answers is not important there is the alternative to sort the answers uniquely:

qlc:sort(qlc:q([X || X <- qlc:append(QH1, QH2)], {unique, true})).

This query also removes duplicates but the answers will be sorted. If there are many answers temporary
files will be used. Note that in order to get the first unique answer all answers have to be found and
sorted. Both alternatives find duplicates by comparing answers, that is, if A1 and A2 are answers found
in that order, then A2 is a removed if A1 == A2.

To return just a few answers cursors can be used. The following code returns no more than five answers
using an ETS table for storing the unique answers:

C = qlc:cursor(qlc:q([X || X <- qlc:append(QH1, QH2)],{unique,true})),
R = qlc:next_answers(C, 5),
ok = qlc:delete_cursor(C),
R.

Query list comprehensions are convenient for stating constraints on data from two or more tables. An
example that does a natural join on two query handles on position 2:

qlc:q([{X1,X2,X3,Y1} ||
{X1,X2,X3} <- QH1,
{Y1,Y2} <- QH2,
X2 =:= Y2])

The qlc module will evaluate this differently depending on the query handles QH1 and QH2. If, for
example, X2 is matched against the key of a QLC table the lookup join method will traverse the objects of
QH2 while looking up key values in the table. On the other hand, if neither X2 nor Y2 is matched against
the key or an indexed position of a QLC table, the merge join method will make sure that QH1 and QH2 are
both sorted on position 2 and next do the join by traversing the objects one by one.

The join option can be used to force the qlc module to use a certain join method. For the rest of this
section it is assumed that the excessively slow join method called "nested loop" has been chosen:

qlc:q([{X1,X2,X3,Y1} ||
{X1,X2,X3} <- QH1,
{Y1,Y2} <- QH2,
X2 =:= Y2],
{join, nested_loop})

In this case the filter will be applied to every possible pair of answers to QH1 and QH2, one at a time.
If there are M answers to QH1 and N answers to QH2 the filter will be run M*N times.

If QH2 is a call to the function for gb_trees as defined in the Implementing a QLC table section,
gb_table:table/1, the iterator for the gb-tree will be initiated for each answer to QH1 after which the
objects of the gb-tree will be returned one by one. This is probably the most efficient way of traversing
the table in that case since it takes minimal computational power to get the following object. But if QH2
is not a table but a more complicated QLC, it can be more efficient use some RAM memory for collecting
the answers in a cache, particularly if there are only a few answers. It must then be assumed that
evaluating QH2 has no side effects so that the meaning of the query does not change if QH2 is evaluated
only once. One way of caching the answers is to evaluate QH2 first of all and substitute the list of
answers for QH2 in the query. Another way is to use the cache option. It is stated like this:

QH2' = qlc:q([X || X <- QH2], {cache, ets})

or just

QH2' = qlc:q([X || X <- QH2], cache)

The effect of the cache option is that when the generator QH2' is run the first time every answer is
stored in an ETS table. When next answer of QH1 is tried, answers to QH2' are copied from the ETS table
which is very fast. As for the unique option the cost is a possibly substantial amount of RAM memory. The
{cache, list} option offers the possibility to store the answers in a list on the process heap. While
this has the potential of being faster than ETS tables since there is no need to copy answers from the
table it can often result in slower evaluation due to more garbage collections of the process' heap as
well as increased RAM memory consumption due to larger heaps. Another drawback with cache lists is that
if the size of the list exceeds a limit a temporary file will be used. Reading the answers from a file is
very much slower than copying them from an ETS table. But if the available RAM memory is scarce setting
the limit to some low value is an alternative.

There is an option cache_all that can be set to ets or list when evaluating a query. It adds a cache or
{cache, list} option to every list expression except QLC tables and lists on all levels of the query.
This can be used for testing if caching would improve efficiency at all. If the answer is yes further
testing is needed to pinpoint the generators that should be cached.

IMPLEMENTING A QLC TABLE

       As  an  example of how to use the qlc:table/2 function the implementation of a QLC table for the gb_trees
       module is given:

       -module(gb_table).

       -export([table/1]).

       table(T) ->
           TF = fun() -> qlc_next(gb_trees:next(gb_trees:iterator(T))) end,
           InfoFun = fun(num_of_objects) -> gb_trees:size(T);
                        (keypos) -> 1;
                        (is_sorted_key) -> true;
                        (is_unique_objects) -> true;
                        (_) -> undefined
                     end,
           LookupFun =
               fun(1, Ks) ->
                       lists:flatmap(fun(K) ->
                                             case gb_trees:lookup(K, T) of
                                                 {value, V} -> [{K,V}];
                                                 none -> []
                                             end
                                     end, Ks)
               end,
           FormatFun =
               fun({all, NElements, ElementFun}) ->
                       ValsS = io_lib:format("gb_trees:from_orddict(~w)",
                                             [gb_nodes(T, NElements, ElementFun)]),
                       io_lib:format("gb_table:table(~s)", [ValsS]);
                  ({lookup, 1, KeyValues, _NElements, ElementFun}) ->
                       ValsS = io_lib:format("gb_trees:from_orddict(~w)",
                                             [gb_nodes(T, infinity, ElementFun)]),
                       io_lib:format("lists:flatmap(fun(K) -> "
                                     "case gb_trees:lookup(K, ~s) of "
                                     "{value, V} -> [{K,V}];none -> [] end "
                                     "end, ~w)",
                                     [ValsS, [ElementFun(KV) || KV <- KeyValues]])
               end,
           qlc:table(TF, [{info_fun, InfoFun}, {format_fun, FormatFun},
                          {lookup_fun, LookupFun},{key_equality,'=='}]).

       qlc_next({X, V, S}) ->
           [{X,V} | fun() -> qlc_next(gb_trees:next(S)) end];
       qlc_next(none) ->
           [].

       gb_nodes(T, infinity, ElementFun) ->
           gb_nodes(T, -1, ElementFun);
       gb_nodes(T, NElements, ElementFun) ->
           gb_iter(gb_trees:iterator(T), NElements, ElementFun).

       gb_iter(_I, 0, _EFun) ->
           '...';
       gb_iter(I0, N, EFun) ->
           case gb_trees:next(I0) of
               {X, V, I} ->
                   [EFun({X,V}) | gb_iter(I, N-1, EFun)];
               none ->
                   []
           end.

       TF is the traversal function. The qlc module requires that there is a way of traversing  all  objects  of
       the  data  structure;  in gb_trees there is an iterator function suitable for that purpose. Note that for
       each object returned a new fun is created. As long as the list is not terminated by [] it is assumed that
       the  tail  of  the  list is a nullary function and that calling the function returns further objects (and
       functions).

       The lookup function is optional. It is assumed that the lookup function always finds values  much  faster
       than  it  would take to traverse the table. The first argument is the position of the key. Since qlc_next
       returns the objects as {Key, Value} pairs the position is 1. Note that the lookup function should  return
       {Key, Value} pairs, just as the traversal function does.

       The  format  function  is  also optional. It is called by qlc:info to give feedback at runtime of how the
       query will be evaluated. One should try to give as good feedback as possible  without  showing  too  much
       details.  In the example at most 7 objects of the table are shown. The format function handles two cases:
       all means that all objects of the table will be traversed; {lookup, 1, KeyValues} means that  the  lookup
       function will be used for looking up key values.

       Whether the whole table will be traversed or just some keys looked up depends on how the query is stated.
       If the query has the form

       qlc:q([T || P <- LE, F])

       and P is a tuple, the qlc module analyzes P and F in compile time to find positions of the tuple  P  that
       are tested for equality to constants. If such a position at runtime turns out to be the key position, the
       lookup function can be used, otherwise all objects of the table have to be  traversed.  It  is  the  info
       function  InfoFun that returns the key position. There can be indexed positions as well, also returned by
       the info function. An index is an extra table that makes lookup on some position fast.  Mnesia  maintains
       indices  upon  request,  thereby  introducing so called secondary keys. The qlc module prefers to look up
       objects using the key before secondary keys regardless of the number of constants to look up.

KEY EQUALITY

       In Erlang there are two operators for testing term  equality,  namely  ==/2  and  =:=/2.  The  difference
       between  them  is  all  about  the  integers  that  can  be represented by floats. For instance, 2 == 2.0
       evaluates to true while 2 =:= 2.0 evaluates to false. Normally this is a minor issue, but the qlc  module
       cannot ignore the difference, which affects the user's choice of operators in QLCs.

       If the qlc module can find out at compile time that some constant is free of integers, it does not matter
       which one of ==/2 or =:=/2 is used:

       1> E1 = ets:new(t, [set]), % uses =:=/2 for key equality
       Q1 = qlc:q([K ||
       {K} <- ets:table(E1),
       K == 2.71 orelse K == a]),
       io:format("~s~n", [qlc:info(Q1)]).
       ets:match_spec_run(lists:flatmap(fun(V) ->
                                               ets:lookup(20493, V)
                                        end,
                                        [a,2.71]),
                          ets:match_spec_compile([{{'$1'},[],['$1']}]))

       In the example the ==/2 operator has been handled exactly as =:=/2 would have been handled. On the  other
       hand,  if  it  cannot  be determined at compile time that some constant is free of integers and the table
       uses =:=/2 when comparing keys for equality (see the option key_equality), the qlc module will not try to
       look up the constant. The reason is that there is in the general case no upper limit on the number of key
       values that can compare equal to such a constant; every combination of integers  and  floats  has  to  be
       looked up:

       2> E2 = ets:new(t, [set]),
       true = ets:insert(E2, [{{2,2},a},{{2,2.0},b},{{2.0,2},c}]),
       F2 = fun(I) ->
       qlc:q([V || {K,V} <- ets:table(E2), K == I])
       end,
       Q2 = F2({2,2}),
       io:format("~s~n", [qlc:info(Q2)]).
       ets:table(53264,
                 [{traverse,
                   {select,[{{'$1','$2'},[{'==','$1',{const,{2,2}}}],['$2']}]}}])
       3> lists:sort(qlc:e(Q2)).
       [a,b,c]

       Looking up just {2,2} would not return b and c.

       If  the  table  uses  ==/2  when  comparing  keys  for equality, the qlc module will look up the constant
       regardless of which operator is used in the QLC. However, ==/2 is to be preferred:

       4> E3 = ets:new(t, [ordered_set]), % uses ==/2 for key equality
       true = ets:insert(E3, [{{2,2.0},b}]),
       F3 = fun(I) ->
       qlc:q([V || {K,V} <- ets:table(E3), K == I])
       end,
       Q3 = F3({2,2}),
       io:format("~s~n", [qlc:info(Q3)]).
       ets:match_spec_run(ets:lookup(86033, {2,2}),
                          ets:match_spec_compile([{{'$1','$2'},[],['$2']}]))
       5> qlc:e(Q3).
       [b]

       Lookup join is handled analogously to lookup of constants in a table: if the join operator  is  ==/2  and
       the  table  where constants are to be looked up uses =:=/2 when testing keys for equality, the qlc module
       will not consider lookup join for that table.

DATA TYPES

       abstract_expr() = erl_parse:abstract_expr()

              Parse trees for Erlang expression, see the abstract format documentation in the ERTS User's Guide.

       answer() = term()

       answers() = [answer()]

       cache() = ets | list | no

       match_expression() = ets:match_spec()

              Match specification, see the match specification  documentation  in  the  ERTS  User's  Guide  and
              ms_transform(3erl).

       no_files() = integer() >= 1

              Actually an integer > 1.

       key_pos() = integer() >= 1 | [integer() >= 1]

       max_list_size() = integer() >= 0

       order() = ascending | descending | order_fun()

       order_fun() = fun((term(), term()) -> boolean())

       query_cursor()

              A query cursor.

       query_handle()

              A query handle.

       query_handle_or_list() = query_handle() | list()

       query_list_comprehension() = term()

              A literal query list comprehension.

       spawn_options() = default | [proc_lib:spawn_option()]

       sort_options() = [sort_option()] | sort_option()

       sort_option() = {compressed, boolean()}
                     | {no_files, no_files()}
                     | {order, order()}
                     | {size, integer() >= 1}
                     | {tmpdir, tmp_directory()}
                     | {unique, boolean()}

              See file_sorter(3erl).

       tmp_directory() = [] | file:name()

       tmp_file_usage() = allowed
                        | not_allowed
                        | info_msg
                        | warning_msg
                        | error_msg

EXPORTS

       append(QHL) -> QH

              Types:

                 QHL = [query_handle_or_list()]
                 QH = query_handle()

              Returns  a query handle. When evaluating the query handle QH all answers to the first query handle
              in QHL are returned followed by all answers to the rest of the query handles in QHL.

       append(QH1, QH2) -> QH3

              Types:

                 QH1 = QH2 = query_handle_or_list()
                 QH3 = query_handle()

              Returns a query handle. When evaluating the query handle QH3  all  answers  to  QH1  are  returned
              followed by all answers to QH2.

              append(QH1, QH2) is equivalent to append([QH1, QH2]).

       cursor(QH) -> Cursor

       cursor(QH, Options) -> Cursor

              Types:

                 QH = query_handle_or_list()
                 Options = [Option] | Option
                 Option = {cache_all, cache()}
                        | cache_all
                        | {max_list_size, max_list_size()}
                        | {spawn_options, spawn_options()}
                        | {tmpdir_usage, tmp_file_usage()}
                        | {tmpdir, tmp_directory()}
                        | {unique_all, boolean()}
                        | unique_all
                 Cursor = query_cursor()

              Creates  a query cursor and makes the calling process the owner of the cursor. The cursor is to be
              used as argument to next_answers/1,2 and (eventually) delete_cursor/1. Calls  erlang:spawn_opt  to
              spawn  and  link  a  process  which  will  evaluate  the  query  handle.  The  value of the option
              spawn_options is used as last argument when calling spawn_opt. The default value is [link].

              1> QH = qlc:q([{X,Y} || X <- [a,b], Y <- [1,2]]),
              QC = qlc:cursor(QH),
              qlc:next_answers(QC, 1).
              [{a,1}]
              2> qlc:next_answers(QC, 1).
              [{a,2}]
              3> qlc:next_answers(QC, all_remaining).
              [{b,1},{b,2}]
              4> qlc:delete_cursor(QC).
              ok

              cursor(QH) is equivalent to cursor(QH, []).

       delete_cursor(QueryCursor) -> ok

              Types:

                 QueryCursor = query_cursor()

              Deletes a query cursor. Only the owner of the cursor can delete the cursor.

       eval(QH) -> Answers | Error

       eval(QH, Options) -> Answers | Error

       e(QH) -> Answers | Error

       e(QH, Options) -> Answers | Error

              Types:

                 QH = query_handle_or_list()
                 Options = [Option] | Option
                 Option = {cache_all, cache()}
                        | cache_all
                        | {max_list_size, max_list_size()}
                        | {tmpdir_usage, tmp_file_usage()}
                        | {tmpdir, tmp_directory()}
                        | {unique_all, boolean()}
                        | unique_all
                 Answers = answers()
                 Error = {error, module(), Reason}
                 Reason = file_sorter:reason()

              Evaluates a query handle in the calling process and collects all answers in a list.

              1> QH = qlc:q([{X,Y} || X <- [a,b], Y <- [1,2]]),
              qlc:eval(QH).
              [{a,1},{a,2},{b,1},{b,2}]

              eval(QH) is equivalent to eval(QH, []).

       fold(Function, Acc0, QH) -> Acc1 | Error

       fold(Function, Acc0, QH, Options) -> Acc1 | Error

              Types:

                 QH = query_handle_or_list()
                 Function = fun((answer(), AccIn) -> AccOut)
                 Acc0 = Acc1 = AccIn = AccOut = term()
                 Options = [Option] | Option
                 Option = {cache_all, cache()}
                        | cache_all
                        | {max_list_size, max_list_size()}
                        | {tmpdir_usage, tmp_file_usage()}
                        | {tmpdir, tmp_directory()}
                        | {unique_all, boolean()}
                        | unique_all
                 Error = {error, module(), Reason}
                 Reason = file_sorter:reason()

              Calls Function on successive answers to the query handle together with an  extra  argument  AccIn.
              The query handle and the function are evaluated in the calling process. Function must return a new
              accumulator which is passed to the next call. Acc0 is returned if there  are  no  answers  to  the
              query handle.

              1> QH = [1,2,3,4,5,6],
              qlc:fold(fun(X, Sum) -> X + Sum end, 0, QH).
              21

              fold(Function, Acc0, QH) is equivalent to fold(Function, Acc0, QH, []).

       format_error(Error) -> Chars

              Types:

                 Error = {error, module(), term()}
                 Chars = io_lib:chars()

              Returns a descriptive string in English of an error tuple returned by some of the functions of the
              qlc module or the parse transform. This function is mainly used by the compiler invoking the parse
              transform.

       info(QH) -> Info

       info(QH, Options) -> Info

              Types:

                 QH = query_handle_or_list()
                 Options = [Option] | Option
                 Option = EvalOption | ReturnOption
                 EvalOption = {cache_all, cache()}
                            | cache_all
                            | {max_list_size, max_list_size()}
                            | {tmpdir_usage, tmp_file_usage()}
                            | {tmpdir, tmp_directory()}
                            | {unique_all, boolean()}
                            | unique_all
                 ReturnOption = {depth, Depth}
                              | {flat, boolean()}
                              | {format, Format}
                              | {n_elements, NElements}
                 Depth = infinity | integer() >= 0
                 Format = abstract_code | string
                 NElements = infinity | integer() >= 1
                 Info = abstract_expr() | string()

              Returns  information  about  a  query  handle.  The  information describes the simplifications and
              optimizations that are the results of  preparing  the  query  for  evaluation.  This  function  is
              probably useful mostly during debugging.

              The  information  has  the form of an Erlang expression where QLCs most likely occur. Depending on
              the format functions of mentioned QLC tables it may not be absolutely accurate.

              The default is to return a sequence of QLCs in a block, but if the option {flat, false} is  given,
              one  single  QLC  is  returned.  The  default  is  to  return a string, but if the option {format,
              abstract_code} is given, abstract code is returned instead. In the abstract code port identifiers,
              references,  and  pids are represented by strings. The default is to return all elements in lists,
              but if the {n_elements, NElements} option  is  given,  only  a  limited  number  of  elements  are
              returned.  The  default  is  to  show  all of objects and match specifications, but if the {depth,
              Depth} option is given, parts of terms below a certain depth are replaced by '...'.

              1> QH = qlc:q([{X,Y} || X <- [x,y], Y <- [a,b]]),
              io:format("~s~n", [qlc:info(QH, unique_all)]).
              begin
                  V1 =
                      qlc:q([
                             SQV ||
                                 SQV <- [x,y]
                            ],
                            [{unique,true}]),
                  V2 =
                      qlc:q([
                             SQV ||
                                 SQV <- [a,b]
                            ],
                            [{unique,true}]),
                  qlc:q([
                         {X,Y} ||
                             X <- V1,
                             Y <- V2
                        ],
                        [{unique,true}])
              end

              In this example two simple QLCs have been inserted just to hold the {unique, true} option.

              1> E1 = ets:new(e1, []),
              E2 = ets:new(e2, []),
              true = ets:insert(E1, [{1,a},{2,b}]),
              true = ets:insert(E2, [{a,1},{b,2}]),
              Q = qlc:q([{X,Z,W} ||
              {X, Z} <- ets:table(E1),
              {W, Y} <- ets:table(E2),
              X =:= Y]),
              io:format("~s~n", [qlc:info(Q)]).
              begin
                  V1 =
                      qlc:q([
                             P0 ||
                                 P0 = {W,Y} <- ets:table(17)
                            ]),
                  V2 =
                      qlc:q([
                             [G1|G2] ||
                                 G2 <- V1,
                                 G1 <- ets:table(16),
                                 element(2, G1) =:= element(1, G2)
                            ],
                            [{join,lookup}]),
                  qlc:q([
                         {X,Z,W} ||
                             [{X,Z}|{W,Y}] <- V2
                        ])
              end

              In this example the query list comprehension V2 has been inserted to show  the  joined  generators
              and  the join method chosen. A convention is used for lookup join: the first generator (G2) is the
              one traversed, the second one (G1) is the table where constants are looked up.

              info(QH) is equivalent to info(QH, []).

       keysort(KeyPos, QH1) -> QH2

       keysort(KeyPos, QH1, SortOptions) -> QH2

              Types:

                 KeyPos = key_pos()
                 SortOptions = sort_options()
                 QH1 = query_handle_or_list()
                 QH2 = query_handle()

              Returns a query handle. When evaluating the query handle QH2 the answers to the query  handle  QH1
              are sorted by file_sorter:keysort/4 according to the options.

              The  sorter  will  use temporary files only if QH1 does not evaluate to a list and the size of the
              binary representation of the answers exceeds Size bytes, where Size  is  the  value  of  the  size
              option.

              keysort(KeyPos, QH1) is equivalent to keysort(KeyPos, QH1, []).

       next_answers(QueryCursor) -> Answers | Error

       next_answers(QueryCursor, NumberOfAnswers) -> Answers | Error

              Types:

                 QueryCursor = query_cursor()
                 Answers = answers()
                 NumberOfAnswers = all_remaining | integer() >= 1
                 Error = {error, module(), Reason}
                 Reason = file_sorter:reason()

              Returns  some or all of the remaining answers to a query cursor. Only the owner of QueryCursor can
              retrieve answers.

              The optional argument NumberOfAnswersdetermines  the  maximum  number  of  answers  returned.  The
              default value is 10. If less than the requested number of answers is returned, subsequent calls to
              next_answers will return [].

       q(QLC) -> QH

       q(QLC, Options) -> QH

              Types:

                 QH = query_handle()
                 Options = [Option] | Option
                 Option = {max_lookup, MaxLookup}
                        | {cache, cache()}
                        | cache
                        | {join, Join}
                        | {lookup, Lookup}
                        | {unique, boolean()}
                        | unique
                 MaxLookup = integer() >= 0 | infinity
                 Join = any | lookup | merge | nested_loop
                 Lookup = boolean() | any
                 QLC = query_list_comprehension()

              Returns a query handle for a query list comprehension. The query list comprehension  must  be  the
              first  argument to qlc:q/1,2 or it will be evaluated as an ordinary list comprehension. It is also
              necessary to add the line

              -include_lib("stdlib/include/qlc.hrl").

              to the source file. This causes a  parse  transform  to  substitute  a  fun  for  the  query  list
              comprehension. The (compiled) fun will be called when the query handle is evaluated.

              When  calling  qlc:q/1,2  from  the Erlang shell the parse transform is automatically called. When
              this happens the fun substituted for the query list comprehension is  not  compiled  but  will  be
              evaluated  by  erl_eval(3erl).  This  is  also  true  when  expressions  are evaluated by means of
              file:eval/1,2 or in the debugger.

              To be very explicit, this will not work:

              ...
              A = [X || {X} <- [{1},{2}]],
              QH = qlc:q(A),
              ...

              The variable A will be bound to the  evaluated  value  of  the  list  comprehension  ([1,2]).  The
              compiler complains with an error message ("argument is not a query list comprehension"); the shell
              process stops with a badarg reason.

              q(QLC) is equivalent to q(QLC, []).

              The {cache, ets} option can be used to cache the  answers  to  a  query  list  comprehension.  The
              answers  are stored in one ETS table for each cached query list comprehension. When a cached query
              list comprehension is evaluated again, answers are fetched from  the  table  without  any  further
              computations.  As  a  consequence, when all answers to a cached query list comprehension have been
              found, the ETS tables used for caching answers to the query list comprehension's qualifiers can be
              emptied. The option cache is equivalent to {cache, ets}.

              The  {cache, list} option can be used to cache the answers to a query list comprehension just like
              {cache, ets}. The difference is that the answers are kept in a list (on the process heap). If  the
              answers would occupy more than a certain amount of RAM memory a temporary file is used for storing
              the answers. The option max_list_size sets the limit in bytes and the temporary file is put on the
              directory set by the tmpdir option.

              The  cache option has no effect if it is known that the query list comprehension will be evaluated
              at most once. This is always true for the top-most query list comprehension and also for the  list
              expression  of  the  first  generator  in  a list of qualifiers. Note that in the presence of side
              effects in filters or callback functions the answers to query list comprehensions can be  affected
              by the cache option.

              The  {unique,  true} option can be used to remove duplicate answers to a query list comprehension.
              The unique answers are stored in one ETS table for each query list  comprehension.  The  table  is
              emptied every time it is known that there are no more answers to the query list comprehension. The
              option unique is equivalent to {unique, true}. If the unique option is combined with  the  {cache,
              ets}  option,  two  ETS tables are used, but the full answers are stored in one table only. If the
              unique option is combined with the {cache,  list}  option  the  answers  are  sorted  twice  using
              keysort/3; once to remove duplicates, and once to restore the order.

              The cache and unique options apply not only to the query list comprehension itself but also to the
              results of looking up constants, running match specifications, and joining handles.

              1> Q = qlc:q([{A,X,Z,W} ||
              A <- [a,b,c],
              {X,Z} <- [{a,1},{b,4},{c,6}],
              {W,Y} <- [{2,a},{3,b},{4,c}],
              X =:= Y],
              {cache, list}),
              io:format("~s~n", [qlc:info(Q)]).
              begin
                  V1 =
                      qlc:q([
                             P0 ||
                                 P0 = {X,Z} <-
                                     qlc:keysort(1, [{a,1},{b,4},{c,6}], [])
                            ]),
                  V2 =
                      qlc:q([
                             P0 ||
                                 P0 = {W,Y} <-
                                     qlc:keysort(2, [{2,a},{3,b},{4,c}], [])
                            ]),
                  V3 =
                      qlc:q([
                             [G1|G2] ||
                                 G1 <- V1,
                                 G2 <- V2,
                                 element(1, G1) == element(2, G2)
                            ],
                            [{join,merge},{cache,list}]),
                  qlc:q([
                         {A,X,Z,W} ||
                             A <- [a,b,c],
                             [{X,Z}|{W,Y}] <- V3,
                             X =:= Y
                        ])
              end

              In this example the cached results of the merge join are traversed for each value of A. Note  that
              without  the cache option the join would have been carried out three times, once for each value of
              A

              sort/1,2 and keysort/2,3 can also be used for caching answers and for  removing  duplicates.  When
              sorting  answers  are cached in a list, possibly stored on a temporary file, and no ETS tables are
              used.

              Sometimes (see qlc:table/2 below) traversal of tables can be done by looking up key values,  which
              is  assumed to be fast. Under certain (rare) circumstances it could happen that there are too many
              key values to look up. The {max_lookup, MaxLookup} option can then be used to limit the number  of
              lookups:  if  more  than  MaxLookup  lookups  would  be required no lookups are done but the table
              traversed instead. The default value is infinity which means that there is no limit on the  number
              of keys to look up.

              1> T = gb_trees:empty(),
              QH = qlc:q([X || {{X,Y},_} <- gb_table:table(T),
              ((X == 1) or (X == 2)) andalso
              ((Y == a) or (Y == b) or (Y == c))]),
              io:format("~s~n", [qlc:info(QH)]).
              ets:match_spec_run(
                     lists:flatmap(fun(K) ->
                                          case
                                              gb_trees:lookup(K,
                                                              gb_trees:from_orddict([]))
                                          of
                                              {value,V} ->
                                                  [{K,V}];
                                              none ->
                                                  []
                                          end
                                   end,
                                   [{1,a},{1,b},{1,c},{2,a},{2,b},{2,c}]),
                     ets:match_spec_compile([{{{'$1','$2'},'_'},[],['$1']}]))

              In  this example using the gb_table module from the Implementing a QLC table section there are six
              keys to look up: {1,a}, {1,b}, {1,c}, {2,a}, {2,b}, and {2,c}. The reason is that the two elements
              of the key {X, Y} are compared separately.

              The {lookup, true} option can be used to ensure that the qlc module will look up constants in some
              QLC table. If there are more than one QLC table among the generators' list expressions,  constants
              have to be looked up in at least one of the tables. The evaluation of the query fails if there are
              no constants to look up. This option is useful in situations when  it  would  be  unacceptable  to
              traverse  all  objects in some table. Setting the lookup option to false ensures that no constants
              will be looked up ({max_lookup, 0} has the same effect). The default value is any which means that
              constants will be looked up whenever possible.

              The  {join,  Join}  option  can  be used to ensure that a certain join method will be used: {join,
              lookup} invokes the lookup join method; {join, merge} invokes the merge join  method;  and  {join,
              nested_loop}  invokes  the  method  of  matching  every pair of objects from two handles. The last
              method is mostly very slow. The evaluation of the query fails if the qlc module cannot  carry  out
              the  chosen  join  method. The default value is any which means that some fast join method will be
              used if possible.

       sort(QH1) -> QH2

       sort(QH1, SortOptions) -> QH2

              Types:

                 SortOptions = sort_options()
                 QH1 = query_handle_or_list()
                 QH2 = query_handle()

              Returns a query handle. When evaluating the query handle QH2 the answers to the query  handle  QH1
              are sorted by file_sorter:sort/3 according to the options.

              The  sorter  will  use temporary files only if QH1 does not evaluate to a list and the size of the
              binary representation of the answers exceeds Size bytes, where Size  is  the  value  of  the  size
              option.

              sort(QH1) is equivalent to sort(QH1, []).

       string_to_handle(QueryString) -> QH | Error

       string_to_handle(QueryString, Options) -> QH | Error

       string_to_handle(QueryString, Options, Bindings) -> QH | Error

              Types:

                 QueryString = string()
                 Options = [Option] | Option
                 Option = {max_lookup, MaxLookup}
                        | {cache, cache()}
                        | cache
                        | {join, Join}
                        | {lookup, Lookup}
                        | {unique, boolean()}
                        | unique
                 MaxLookup = integer() >= 0 | infinity
                 Join = any | lookup | merge | nested_loop
                 Lookup = boolean() | any
                 Bindings = erl_eval:binding_struct()
                 QH = query_handle()
                 Error = {error, module(), Reason}
                 Reason = erl_parse:error_info() | erl_scan:error_info()

              A  string  version  of  qlc:q/1,2. When the query handle is evaluated the fun created by the parse
              transform is interpreted by erl_eval(3erl). The query string  is  to  be  one  single  query  list
              comprehension terminated by a period.

              1> L = [1,2,3],
              Bs = erl_eval:add_binding('L', L, erl_eval:new_bindings()),
              QH = qlc:string_to_handle("[X+1 || X <- L].", [], Bs),
              qlc:eval(QH).
              [2,3,4]

              string_to_handle(QueryString) is equivalent to string_to_handle(QueryString, []).

              string_to_handle(QueryString,  Options)  is  equivalent  to string_to_handle(QueryString, Options,
              erl_eval:new_bindings()).

              This function is probably useful mostly when called from outside of Erlang, for  instance  from  a
              driver written in C.

       table(TraverseFun, Options) -> QH

              Types:

                 TraverseFun = TraverseFun0 | TraverseFun1
                 TraverseFun0 = fun(() -> TraverseResult)
                 TraverseFun1 = fun((match_expression()) -> TraverseResult)
                 TraverseResult = Objects | term()
                 Objects = [] | [term() | ObjectList]
                 ObjectList = TraverseFun0 | Objects
                 Options = [Option] | Option
                 Option = {format_fun, FormatFun}
                        | {info_fun, InfoFun}
                        | {lookup_fun, LookupFun}
                        | {parent_fun, ParentFun}
                        | {post_fun, PostFun}
                        | {pre_fun, PreFun}
                        | {key_equality, KeyComparison}
                 FormatFun = undefined | fun((SelectedObjects) -> FormatedTable)
                 SelectedObjects = all
                                 | {all, NElements, DepthFun}
                                 | {match_spec, match_expression()}
                                 | {lookup, Position, Keys}
                                 | {lookup, Position, Keys, NElements, DepthFun}
                 NElements = infinity | integer() >= 1
                 DepthFun = fun((term()) -> term())
                 FormatedTable = {Mod, Fun, Args} | abstract_expr() | string()
                 InfoFun = undefined | fun((InfoTag) -> InfoValue)
                 InfoTag = indices | is_unique_objects | keypos | num_of_objects
                 InfoValue = undefined | term()
                 LookupFun = undefined | fun((Position, Keys) -> LookupResult)
                 LookupResult = [term()] | term()
                 ParentFun = undefined | fun(() -> ParentFunValue)
                 PostFun = undefined | fun(() -> term())
                 PreFun = undefined | fun((PreArgs) -> term())
                 PreArgs = [PreArg]
                 PreArg = {parent_value, ParentFunValue} | {stop_fun, StopFun}
                 ParentFunValue = undefined | term()
                 StopFun = undefined | fun(() -> term())
                 KeyComparison = '=:=' | '=='
                 Position = integer() >= 1
                 Keys = [term()]
                 Mod = Fun = atom()
                 Args = [term()]
                 QH = query_handle()

              Returns  a  query  handle for a QLC table. In Erlang/OTP there is support for ETS, Dets and Mnesia
              tables, but it is also possible to turn many other data structures into QLC  tables.  The  way  to
              accomplish this is to let function(s) in the module implementing the data structure create a query
              handle by calling qlc:table/2. The different ways to traverse the table as well as  properties  of
              the table are handled by callback functions provided as options to qlc:table/2.

              The  callback  function  TraverseFun  is  used for traversing the table. It is to return a list of
              objects terminated by either [] or a nullary fun to be used for traversing the not  yet  traversed
              objects  of  the  table.  Any  other  return  value  is immediately returned as value of the query
              evaluation. Unary TraverseFuns are  to  accept  a  match  specification  as  argument.  The  match
              specification  is created by the parse transform by analyzing the pattern of the generator calling
              qlc:table/2 and filters using variables introduced in the pattern. If the parse  transform  cannot
              find  a match specification equivalent to the pattern and filters, TraverseFun will be called with
              a match specification returning every object. Modules that can utilize  match  specifications  for
              optimized traversal of tables should call qlc:table/2 with a unary TraverseFun while other modules
              can provide a nullary TraverseFun. ets:table/2 is an example of the  former;  gb_table:table/1  in
              the Implementing a QLC table section is an example of the latter.

              PreFun  is  a  unary  callback function that is called once before the table is read for the first
              time. If the call fails, the query evaluation fails.  Similarly,  the  nullary  callback  function
              PostFun  is  called  once  after  the  table  was last read. The return value, which is caught, is
              ignored. If PreFun has been called for a table, PostFun is guaranteed to be called for that table,
              even if the evaluation of the query fails for some reason. The order in which pre (post) functions
              for different tables are evaluated is not specified. Other table  access  than  reading,  such  as
              calling InfoFun, is assumed to be OK at any time. The argument PreArgs is a list of tagged values.
              Currently there are two tags, parent_value and stop_fun, used by Mnesia for managing transactions.
              The  value  of  parent_value  is  the  value  returned  by  ParentFun, or undefined if there is no
              ParentFun. ParentFun is called once just before the call of PreFun in the context of  the  process
              calling  eval,  fold, or cursor. The value of stop_fun is a nullary fun that deletes the cursor if
              called from the parent, or undefined if there is no cursor.

              The binary callback function LookupFun is used for looking up objects  in  the  table.  The  first
              argument  Position  is  the  key position or an indexed position and the second argument Keys is a
              sorted list of unique values. The return value is to be a list of all objects (tuples)  such  that
              the  element  at  Position  is a member of Keys. Any other return value is immediately returned as
              value of the query evaluation. LookupFun is called instead of traversing the table  if  the  parse
              transform  at compile time can find out that the filters match and compare the element at Position
              in such a way that only Keys need to be looked up in order to find all potential answers. The  key
              position   is   obtained   by  calling  InfoFun(keypos)  and  the  indexed  positions  by  calling
              InfoFun(indices). If the key position can be used for lookup it is always  chosen,  otherwise  the
              indexed  position  requiring  the least number of lookups is chosen. If there is a tie between two
              indexed positions the one occurring first in the list returned by  InfoFun  is  chosen.  Positions
              requiring more than max_lookup lookups are ignored.

              The  unary callback function InfoFun is to return information about the table. undefined should be
              returned if the value of some tag is unknown:

                * indices. Returns a list of indexed positions, a list of positive integers.

                * is_unique_objects. Returns true if the objects returned by TraverseFun are unique.

                * keypos. Returns the position of the table's key, a positive integer.

                * is_sorted_key. Returns true if the objects returned by TraverseFun are sorted on the key.

                * num_of_objects. Returns the number of objects in the table, a non-negative integer.

              The unary callback function FormatFun is used by qlc:info/1,2 for displaying the call that created
              the  table's  query  handle.  The default value, undefined, means that info/1,2 displays a call to
              '$MOD':'$FUN'/0. It is up to FormatFun to present the selected objects of the table in a  suitable
              way.  However, if a character list is chosen for presentation it must be an Erlang expression that
              can be scanned and parsed (a trailing dot will be added by qlc:info though). FormatFun  is  called
              with  an  argument  that describes the selected objects based on optimizations done as a result of
              analyzing the filters of the QLC where the call to qlc:table/2 occurs. The possible values of  the
              argument are:

                * {lookup, Position, Keys, NElements, DepthFun}. LookupFun is used for looking up objects in the
                  table.

                * {match_spec, MatchExpression}. No way of finding all possible answers by looking up  keys  was
                  found,  but the filters could be transformed into a match specification. All answers are found
                  by calling TraverseFun(MatchExpression).

                * {all, NElements, DepthFun}. No optimization was found.  A  match  specification  matching  all
                  objects will be used if TraverseFun is unary.

              NElements  is  the value of the info/1,2 option n_elements, and DepthFun is a function that can be
              used for limiting the size of terms; calling DepthFun(Term) substitutes '...' for  parts  of  Term
              below  the  depth  specified  by  the info/1,2 option depth. If calling FormatFun with an argument
              including NElements and DepthFun fails, FormatFun is called once again with an argument  excluding
              NElements and DepthFun ({lookup, Position, Keys} or all).

              The  value of key_equality is to be '=:=' if the table considers two keys equal if they match, and
              to be '==' if two keys are equal if they compare equal. The default is '=:='.

              See ets(3erl), dets(3erl) and mnesia(3erl) for the various  options  recognized  by  table/1,2  in
              respective module.