Metrics
Metrics are automatically created and kept in-memory by Datagen web-server.
API
As mentionned in previous section, an API: /metrics/all allows you to get all current metrics about data generation:
- The total number of rows generated
- the total number of generation made
Foreach service:
- The number of rows generated
- The number of files generated (only for Ozone, HDFS & local)
This API call renders a raw JSON file with all metrics in base path and with values as numeric (no null, default is 0).
Cloudera Manager
Cloudera Agent is gathering for us emtrics automatically from this API and feeds them into Cloudera Manager database.
Hence, in Cloudera Manager, go to Charts > Chart Builder and you can try following query:
SELECT datagen_hbase_rows, datagen_hdfs_avro_rows, datagen_hdfs_csv_rows, datagen_hdfs_json_rows, datagen_hdfs_orc_rows, datagen_hdfs_parquet_rows, datagen_ozone_avro_rows, datagen_ozone_csv_rows, datagen_ozone_json_rows, datagen_ozone_orc_rows, datagen_ozone_parquet_rows, datagen_hive_rows, datagen_kudu_rows, datagen_kafka_rows, datagen_solr_rows, datagen_local_avro_rows, datagen_local_csv_rows, datagen_local_json_rows, datagen_local_orc_rows, datagen_local_parquet_rows
Output will be this chart (for us):