Customizing how Operator Metering generates reports is done using a custom resource called a ReportQuery
.
These ReportQuery
resources control the SQL queries that can be used to produce a report.
When writing a report you can specify the query it will use by setting the spec.query
field to metadata.name
of any ReportQuery
in the reporting-operator's namespace.
query
: A SQL SELECT statement. This SQL statement supports go templates and provides additional custom functions specific to Operator Metering (defined in the templating section below).columns
: A list of columns that match the schema of the results of the query. The order of these columns must match the order of the columns returned by the SELECT statement. Columns have 3 fields,name
,type
, andunit
. Each field is covered in more detail below.name
: This is the name of the column returned in theSELECT
statement.type
: This is the Presto column type.unit
: Unit refers to the unit of measurement of the column.tableHidden
: Takes a boolean, when true, hides the column from report results depending on the format and endpoint. See api docs for details.
inputs
: A list of inputs this report query accepts to control its behavior. For more in depth details, see the query inputs section.name
: The name used to refer to the input in theReport
orScheduledReport
spec.inputs
and within the queries template variables (see below).required
: A boolean indicating if this input is required for the query to run. Defaults to false.type
: An optional type indicating what data type this input takes. Available options arestring
,time
, andint
,ReportDataSource
,ReportQuery
, andReport
. If left empty, it defaults tostring
. For more details, see the query input types section.default
: An optional default value to use if unspecified.
Because much of the type of analysis being done depends on user-input, and because we want to enable users to re-use queries with copying & pasting things around, Operator Metering supports the go templating language to dynamically generate the SQL statements contained within the spec.query
field of ReportQuery
.
For example, when generating a report, the query needs to know what time range to consider when producing a report, so this is information is exposed within the template context as variables you can use and reference with various template functions.
Most of these functions are for referring to other resources such as ReportDataSources
or ReportQueries
as either tables, views, or sub-queries, and for formatting various types for use within the SQL query.
Report
: This object has two fields,ReportingStart
andReportingEnd
which are the value of thespec.reportingStart
andspec.reportingEnd
for aReport
. For aReport
with aspec.schedule
set, the values map to the specific period being collected when theReport
runs.ReportingStart
: A time.Time object that is generally used to filter the results of aSELECT
query using aWHERE
clause.ReportingEnd
: A time.Time object that is generally used to filter the results of aSELECT
query using aWHERE
clause. Built-in queries select datapoints matchingReportingStart <= timestamp > ReportingEnd
.Inputs
: This is amap[string]interface{}
of inputs passed in via the Report'sspec.inputs
. The values type is based on the report queries input definition type, and defaults to string unless the input's name isReportingStart
orReportingEnd
, in which case it's converted to a time.Time automatically.
Within the template context, the .Report.Inputs
template variable contains a map where each key is the name of an input defined by the ReportQuery's spec.inputs
.
The value of an input depends on a if the user specified a value in their Report
or ReportDataSource
, or if there is a default value defined.
- Depending on what is referencing the ReportQuery: The value comes from a
Report
'sspec.inputs
or from aReportDataSource
'sspec.reportQueryView.inputs
. - If not provided, it will use the
default
value as defined the ReportQuery'sspec.inputs
, if one exists. - Otherwise: the zero value for the type, according to Go's zero value rules.
Each input can have a different type
, which determines how the input should be processed.
Available options are string
, time
, and int
, ReportDataSource
, ReportQuery
, and Report
.
If left empty, it defaults to varchar
.
For each of these types, the behavior varies:
string
: A string value is passed through as a Go string.time
: A string value is parsed as an RFC3339 timestamp. Within the template context, the variable with be a Go time.Time object.int
: An int value is passed through as a Go int.ReportDataSource
: A string value referencing the name of a ReportDataSource within the same namespace as the query. When this query is referenced by a Report or ReportDataSource, allReportDataSource
inputs are validated by checking that all the ReportDataSources specified exist.ReportQuery
: A string value referencing the name of a ReportQuery within the same namespace as the query. When this query is referenced by a Report or ReportDataSource, allReportQuery
inputs are validated by checking that all the ReportQueries specified exist.Report
: A string value referencing the name of a Report within the same namespace as the query. When this query is referenced by a Report or ReportDataSource, allReport
inputs are validated by checking that all the Reports specified exist.
Below is an example of a ReportQuery spec.inputs
input definitions configuration.
inputs:
- name: a_string_input
type: string
- name: a_int_input
type: int
- name: a_time_input
type: time
- name: a_datasource_input
type: ReportDataSource
- name: a_reportquery_input
type: ReportQuery
- name: a_report_input
type: Report
Next is an example of specifying input values for the definitions above that might be specified in a Report
's spec.inputs
or from a ReportDataSource
's spec.reportQueryView.inputs
:
inputs:
- name: a_string_input
value: "helloworld"
- name: a_int_input
value: 47
- name: a_time_input
value: '2019-10-09T00:00:00Z'
- name: a_datasource_input
value: "pod-usage-cpu-cores" # must be the name of a ReportDataSource that exists in the namespace of the resource.
- name: a_reportquery_input
value: "namespace-cpu-usage" # must be the name of a ReportQuery that exists in the namespace of the resource.
- name: a_report_input
value: "namespace-cpu-usage-hourly" # must be the name of a Report that exists in the namespace of the resource.
Below is a list of the available template functions and descriptions on what they do.
dataSourceTableName
: Takes a one argument, a string referencing aReportDataSource
by name, and outputs a string which is the corresponding table name of theReportDataSource
specified.reportTableName
: Takes a one argument, a string referencing aReport
by name, and outputs a string which is the corresponding table name of theReport
specified.renderReportQuery
: Takes two arguments, a string referencing aReportQuery
by name, the template context (usually this is just.
in the template), and returns a string containing the specifiedReportQuery
in its rendered form, using the 2nd argument as the context for the template rendering.prestoTimestamp
: Takes a time.Time object as the argument, and outputs a string timestamp. Usually this is used on.Report.ReportingStart
and.Report.ReportingEnd
.prometheusMetricPartitionFormat
: Takes a time.Time object as the argument, and outputs a string in the form ofyear-month-day
, eg:2006-01-02
. Usually this is used on.Report.ReportingStart
and.Report.ReportingEnd
.billingPeriodFormat
: Takes a time.Time object as the argument, and outputs a string timestamp that can be used for comparing toawsBilling
an ReportDataSource'spartition_start
andpartition_stop
columns.
In addition to the above functions, the reporting-operator includes all of the functions from Sprig - useful template functions for Go templates..
Before going into examples, there's an important convention that all the built-in ReportQueries
follow that is worth calling out, as these examples will demonstrate them heavily.
The convention I am referring to is the fact that there are quite a few ReportQueries suffixed with -raw
in their metadata.name
.
These queries are not intended to be used by Reports, but are intended to be purely for re-use.
Currently, these queries suffixed with -raw
in their name are generally have no filtering and are used by ReportDataSources to create views.
Additionally, -raw
queries often expose complex types (array, maps) which are incompatible with Reports
, which is why the ReportQueries
that are not suffixed in -raw
never expose those types in their columns list.
The example below is a built-in ReportQuery
that is installed with Operator Metering by default.
The query is not intended to be used by Reports, but instead is intended to be re-used by other ReportQueries
, which is why it only does simple extraction of fields, and calculations.
The important things to note with this query is that it's querying a database table containing Prometheus metric data for the pod-request-memory-bytes
ReportDataSource
, and it's getting the table name using the dataSourceTableName
template function.
apiVersion: metering.openshift.io/v1
kind: ReportQuery
metadata:
name: pod-memory-request-raw
labels:
operator-metering: "true"
spec:
columns:
- name: pod
type: varchar
unit: kubernetes_pod
- name: namespace
type: varchar
unit: kubernetes_namespace
- name: node
type: varchar
unit: kubernetes_node
- name: labels
tableHidden: true
type: map<string, string>
- name: pod_request_memory_bytes
type: double
unit: bytes
- name: timeprecision
type: double
unit: seconds
- name: pod_request_memory_byte_seconds
type: double
unit: byte_seconds
- name: timestamp
type: timestamp
unit: date
- name: dt
type: varchar
inputs:
- name: PodRequestMemoryBytesDataSourceName
type: ReportDataSource
default: pod-request-memory-bytes
query: |
SELECT labels['pod'] as pod,
labels['namespace'] as namespace,
element_at(labels, 'node') as node,
labels,
amount as pod_request_memory_bytes,
timeprecision,
amount * timeprecision as pod_request_memory_byte_seconds,
"timestamp",
dt
FROM {| dataSourceTableName .Report.Inputs.PodRequestMemoryBytesDataSourceName |}
WHERE element_at(labels, 'node') IS NOT NULL
This next example is also one of the built-in ReportQueries
.
This example, unlike the previous is designed to be used with Reports.
You can modify the ReportQuery to display all columns or to hide or show columns as needed. The full endpoint displays all columns.
To query report results using the reporting-operator API for full endpoint the api call is:
http://127.0.0.1:8001/api/v1/namespaces/metering/services/http:reporting-operator:api/proxy/api/v2/reports/metering/namespace-cpu-request/full?format=json
This example is showing the namespace-cpu-request
query in JSON format.
To use the TableHidden display feature:
- enter
TableHidden
field in query as true or false. Innode-cpu-capacity
thelabels
tableHidden value is set totrue
. - Next run a report.
To query report results using the reporting-operator API for tableHidden endpoint the api call is:
http://127.0.0.1:8001/api/v1/namespaces/metering/services/http:reporting-operator:api/proxy/api/v2/reports/metering/namespace-cpu-request/table?format=json