Provided by: htcondor_8.6.8~dfsg.1-2ubuntu1_amd64
Name
condor_dagman_metrics_reporter - Report the statistics of a DAGMan run to a central HTTP server
Synopsis
condor_dagman_metrics_reporter [-s] [-u URL] [-t maxtime] -f /path/to/metrics/file
Description
condor_dagman_metrics_reporter anonymously reports metrics from a DAGMan workflow to a central server. The reporting of workflow metrics is only enabled for DAGMan workflows run under Pegasus; metrics reporting has been requested by Pegasus' funding sources. See http://pegasus.isi.edu/wms/docs/latest/funding_citing_usage.php#usage_statistics and https://confluence.pegasus.isi.edu/display/pegasus/DAGMan+Metrics+Reporting for the requirements to collect this data.

The data sent to the server is in JSON format. Here is an example of what is sent:

    {
      "client":"condor_dagman",
      "version":"8.1.0",
      "planner":"/lfs1/devel/Pegasus/pegasus/bin/pegasus-plan",
      "planner_version":"4.3.0cvs",
      "type":"metrics",
      "wf_uuid":"htcondor-test-job_dagman_metrics-A-subdag",
      "root_wf_uuid":"htcondor-test-job_dagman_metrics-A",
      "start_time":1375313459.603,
      "end_time":1375313491.498,
      "duration":31.895,
      "exitcode":1,
      "dagman_id":"26",
      "parent_dagman_id":"11",
      "rescue_dag_number":0,
      "jobs":4,
      "jobs_failed":1,
      "jobs_succeeded":3,
      "dag_jobs":0,
      "dag_jobs_failed":0,
      "dag_jobs_succeeded":0,
      "total_jobs":4,
      "total_jobs_run":4,
      "total_job_time":0.000,
      "dag_status":2
    }

Metrics are sent only if the condor_dagman process has PEGASUS_METRICS set to True in its environment, and the CONDOR_DEVELOPERS configuration variable does not have the value NONE.

Ordinarily, this program will be run by condor_dagman, and users do not need to interact with it.

This program uses the following environment variables:

PEGASUS_USER_METRICS_DEFAULT_SERVER
    The URL of the default server to which to send the data. It defaults to http://metrics.pegasus.isi.edu/metrics. It can be overridden at the command line with the -u option.

PEGASUS_USER_METRICS_SERVER
    A comma separated list of URLs of servers that will receive the data, in addition to the default server.

The -f argument specifies the metrics file to be sent to the HTTP server.
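The enabling conditions and the server selection described above can be sketched in Python. This is an illustration only, not the program's actual implementation: the function names metrics_enabled and resolve_servers are hypothetical, the case-insensitive comparison of True and NONE is an assumption, and the CONDOR_DEVELOPERS value is passed in as a plain string because it is a configuration variable rather than an environment variable.

```python
# Default server URL as documented in this manual page.
DEFAULT_SERVER = "http://metrics.pegasus.isi.edu/metrics"


def metrics_enabled(env, condor_developers):
    """Return True when both documented conditions for sending metrics hold.

    env: mapping of environment variables of the condor_dagman process.
    condor_developers: value of the CONDOR_DEVELOPERS configuration variable.
    Assumption: both comparisons ignore case and surrounding whitespace.
    """
    pegasus_flag = env.get("PEGASUS_METRICS", "").strip().lower()
    developers = condor_developers.strip().upper()
    return pegasus_flag == "true" and developers != "NONE"


def resolve_servers(env, url_override=None):
    """Return the list of servers that would receive the data.

    The default server comes from -u (url_override) if given, otherwise from
    PEGASUS_USER_METRICS_DEFAULT_SERVER, otherwise from the built-in default.
    Servers in PEGASUS_USER_METRICS_SERVER are added after the default one.
    """
    default = url_override or env.get(
        "PEGASUS_USER_METRICS_DEFAULT_SERVER", DEFAULT_SERVER
    )
    servers = [default]
    for url in env.get("PEGASUS_USER_METRICS_SERVER", "").split(","):
        url = url.strip()
        if url:  # skip empty entries such as a trailing comma
            servers.append(url)
    return servers
```

Called with a copy of os.environ, resolve_servers returns the default server first, followed by any additional servers in the order they are listed.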
Options
-s
    Sleep for a random number of seconds between 1 and 10, before attempting to send data. This option is used to space out the reporting from any sub-DAGs when a DAG is removed.

-u URL
    Overrides setting of the environment variable PEGASUS_USER_METRICS_DEFAULT_SERVER. This option is unused by condor_dagman; it is for testing by developers.

-t maxtime
    A maximum time in seconds that defaults to 100 seconds, setting a limit on the amount of time this program will wait for communication from the server. A setting of zero will result in a single attempt per server. condor_dagman retrieves this value from the DAGMAN_PEGASUS_REPORT_TIMEOUT configuration variable.

-f metrics_file
    The name of the file containing the metrics values to be reported.
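For developers testing the reporter by hand, the options above can be assembled into an argument list as in the following Python sketch. The helper build_reporter_command and its parameter names are hypothetical; only the flags -s, -u, -t and -f come from this manual page.

```python
def build_reporter_command(
    metrics_file,
    sleep_first=False,
    url=None,
    max_time=None,
    program="condor_dagman_metrics_reporter",
):
    """Build an argv list for the reporter from the documented options."""
    argv = [program]
    if sleep_first:
        argv.append("-s")  # random 1 to 10 second sleep before sending
    if url is not None:
        argv += ["-u", url]  # overrides PEGASUS_USER_METRICS_DEFAULT_SERVER
    if max_time is not None:
        argv += ["-t", str(max_time)]  # 0 means a single attempt per server
    argv += ["-f", metrics_file]  # the metrics file is always required
    return argv
```

Returning a list rather than a single string lets the result be passed to subprocess.run without shell quoting concerns.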
Exit Status
condor_dagman_metrics_reporter will exit with a status value of 0 (zero) upon success, and it will exit with a value of 1 (one) upon failure.
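A wrapper script can act on this exit status. The Python sketch below uses a hypothetical helper, run_and_check, that runs any command and treats a status of 0 as success. Because the reporter may not be installed where the sketch is tried, the usage lines substitute stand-in commands that exit with 0 and 1.

```python
import subprocess
import sys


def run_and_check(argv):
    """Run argv and return True if it exited with status 0, False otherwise."""
    completed = subprocess.run(argv, check=False)
    return completed.returncode == 0


# Stand-in commands that mimic the documented success and failure statuses.
succeeded = run_and_check([sys.executable, "-c", "raise SystemExit(0)"])
failed = run_and_check([sys.executable, "-c", "raise SystemExit(1)"])
```

In real use, argv would be the reporter's command line, for example one ending in -f followed by the path of the metrics file.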
Author
Center for High Throughput Computing, University of Wisconsin–Madison
Copyright
Copyright © 1990-2016 Center for High Throughput Computing, Computer Sciences Department, University of Wisconsin-Madison, Madison, WI. All Rights Reserved. Licensed under the Apache License, Version 2.0.

January 2020 condor_dagman_metrics_repor(1)