%toc

Overview

The Conduit Visualization project visualizes data that Conduit has delivered within a given time period. The dashboard lets you select the time range, cluster and stream for which to plot the graph.

The dashboard queries the database populated by the feeder service to get the relevant information. The data flows cover all tiers.

Setup

Installation

Install the deb package databus-visualization.

Installing the package copies the conf files to /usr/local/conduit-visualization/conf, the war directly to /var/lib/tomcat7/webapps, and the environment file setenv.sh to /var/lib/tomcat7/bin.

Configuration

The conf files are audit-feeder.properties (the same file used for the feeder service) and visualization.properties.

All properties in visualization.properties should be set. The mandatory fields in audit-feeder.properties are jdbc.driver.class.name, db.url, db.username, db.password, audit.table.master and rollup.tilldays.
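The mandatory-field rule above can be checked before deploying. The following is a minimal sketch, not part of the project: the helper names `parse_properties` and `missing_keys` are hypothetical, and the parser is a simplification that handles only plain `key=value` lines, not the full Java properties syntax.

```python
# Keys the text above lists as mandatory in audit-feeder.properties.
MANDATORY_KEYS = (
    "jdbc.driver.class.name",
    "db.url",
    "db.username",
    "db.password",
    "audit.table.master",
    "rollup.tilldays",
)


def parse_properties(text):
    """Parse simple key=value lines, skipping blanks and # or ! comments.

    Simplification (assumption): line continuations, escapes and the ':'
    separator of the full Java .properties format are not handled.
    """
    props = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line[0] in "#!":
            continue
        key, sep, value = line.partition("=")
        if sep:
            props[key.strip()] = value.strip()
    return props


def missing_keys(props, required=MANDATORY_KEYS):
    """Return the required keys that are absent or empty, in listed order."""
    return [key for key in required if not props.get(key)]


if __name__ == "__main__":
    # Path taken from the Installation section above.
    path = "/usr/local/conduit-visualization/conf/audit-feeder.properties"
    with open(path) as handle:
        missing = missing_keys(parse_properties(handle.read()))
    print("missing mandatory keys:", missing or "none")
```

An empty result means every mandatory key is present with a non-empty value. It does not check that the values themselves are valid.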

Before restarting Tomcat, set the Hadoop home path in /var/lib/tomcat7/bin/setenv.sh, which is copied when the deb is installed.

Config Fields in visualization.properties

| Config Name | Description | Example |
|---|---|---|
| conduit.xml.path | The path where databus.xml is stored, from which the cluster and stream configuration for the dashboard is loaded | /usr/local/databus/conf/databus.xml |
| percentile.string | Comma-separated list of percentiles for calculating latency. Any number of percentiles can be given, but at least one is required | 80,90,95,99,99.9,99.99 |
| publisher.sla | The SLA in minutes for all the messages to reach publisher | 2 |
| agent.sla | The SLA in minutes for all the messages to reach agent | 2 |
| vip.sla | The SLA in minutes for all the messages to reach vip | 2 |
| collector.sla | The SLA in minutes for all the messages to reach collector | 2 |
| hdfs.sla | The SLA in minutes for all the messages to reach hdfs | 3 |
| local.sla | The SLA in minutes for all the messages to reach local tier | 3 |
| merge.sla | The SLA in minutes for all the messages to reach merge tier | 5 |
| mirror.sla | The SLA in minutes for all the messages to reach mirror tier | 6 |
| percentile.for.sla | Note: This value should be a part of the percentile.string list | 99.99 |
| percentage.for.loss | The percentage to be taken as the loss margin. If the difference between the numbers of the two entities being compared is more than percentage.for.loss of the compared-to entity, the entity is considered to be in unhealthy state | 0.2 |
| percentage.for.warn | The percentage to be taken as the warn margin. If the difference between the numbers of the two entities being compared is more than percentage.for.warn of the compared-to entity, the entity is considered to be in warning state. Note: percentage.for.warn should be less than percentage.for.loss | 0.02 |
| max.start.time | The earliest start date that can be selected in the filter; a start date before it cannot be selected. The date should be entered in the format dd-MM-yyyy | A date before 01-08, say 31-07, cannot be selected in the date filter of the dashboard |
| max.time.range.interval.in.hours | The maximum time range in hours that a user can query for in the dashboard | 24 |
| loss.warn.threshold.diff.in.mins | Difference in minutes between the loss SLA and the warn SLA. The difference is the same for all tiers | For the hdfs tier, if hdfs.sla is 3: latency > 3 is considered unhealthy; latency < 3 and latency > 3-(loss.warn.threshold.diff.in.mins) is in warn state; latency <= 3-(loss.warn.threshold.diff.in.mins) is healthy |
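The loss.warn.threshold.diff.in.mins rule for a tier can be sketched as a small function. This is an illustration only, not the dashboard's actual code: the function name `latency_state` and the difference value of 1 minute in the usage lines are hypothetical. The description does not say how a latency exactly equal to the SLA is treated; this sketch assumes it counts as warn.

```python
def latency_state(latency_mins, sla_mins, warn_diff_mins):
    """Classify a tier's latency as 'unhealthy', 'warn' or 'healthy'.

    sla_mins       -- the tier's SLA, e.g. the value of hdfs.sla
    warn_diff_mins -- the value of loss.warn.threshold.diff.in.mins
    """
    warn_threshold = sla_mins - warn_diff_mins
    if latency_mins > sla_mins:
        return "unhealthy"
    if latency_mins > warn_threshold:
        # Assumption: latency == sla_mins also lands here, since the
        # description leaves that boundary case unspecified.
        return "warn"
    return "healthy"


if __name__ == "__main__":
    # hdfs.sla = 3 as in the table; a difference of 1 minute is hypothetical.
    for latency in (3.5, 2.5, 2.0):
        print(latency, latency_state(latency, sla_mins=3, warn_diff_mins=1))
```

With these values the warn band is the interval above 2 minutes and up to 3 minutes. Because the same difference applies to all tiers, only the SLA argument changes from tier to tier.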

Logs

The dashboard logs can be checked at /var/log/visualization, and the Tomcat logs at either /var/log/tomcat7 or $CATALINA_HOME/logs.