Alerts & Notifications

UptimeDock allows you to configure alerts for your ClickHouse database to get notified when specific conditions are met. This guide covers all available alert types, how to configure them, and best practices for effective monitoring.

Overview

Alerts in UptimeDock help you proactively monitor your ClickHouse database health. When a configured condition is met (e.g., disk usage exceeds 80%), UptimeDock sends notifications to your selected notification group via email, Slack, or other configured channels.

You can view and manage all alerts for a check by clicking the "All Alerts" button in the check's dashboard. This opens a popup showing all configured alerts organized by category.

All Alerts overview popup

Alert Scopes

Alerts in UptimeDock have different scopes depending on what they monitor:

Global Alerts

Global alerts apply to the entire ClickHouse instance and can be created directly from the "All Alerts" popup. These include:

  • Connection & Overview Alerts – Monitor connection status and high-level database metrics
  • Disk Alerts – Track disk usage across the entire instance
  • Memory Alerts – Monitor memory consumption

Specific Alerts

Some alerts require a specific scope (database, table, or query) and must be created from the corresponding section in the Reports dashboard:

  • Database-Specific Alerts – Created from the Database section, applies to a specific database
  • Table-Specific Alerts – Created from the Table section, applies to a specific table
  • Query-Specific Alerts – Created from the Query section, applies to a specific query pattern
Creating Specific Alerts

To create database, table, or query-specific alerts, navigate to the corresponding section in your check's Reports dashboard and click the "Alerts" button for the item you want to monitor.

Alert Categories

Connection & Overview Alerts

These alerts monitor the overall health and connectivity of your ClickHouse instance.

Alert TypeDescriptionUse Case
Connection DownTriggers when UptimeDock cannot connect to your ClickHouse instanceCritical for detecting outages and connectivity issues
Connection Response TimeTriggers when connection latency exceeds a thresholdDetect slow connections before they impact performance
Database DB CountTriggers when the number of databases exceeds a thresholdMonitor database sprawl or unexpected database creation
Database Table CountTriggers when total table count exceeds a thresholdDetect table proliferation across all databases
Database Row CountTriggers when total row count exceeds a thresholdMonitor data growth across the instance
Create Connection Alert

Disk Alerts

Disk alerts help you prevent storage-related issues before they cause problems.

Alert TypeDescriptionRecommended Threshold
Disk Used SpaceTriggers when used disk space exceeds a specific size (GB/TB)Based on your capacity planning
Disk Free SpaceTriggers when free disk space falls below a threshold< 50 GB or < 20% of total
Disk Usage %Triggers when disk usage percentage exceeds a threshold> 80% (warning), > 90% (critical)
Disk Space is Critical

Running out of disk space can cause ClickHouse to stop accepting writes and potentially corrupt data. Always set up disk alerts with adequate lead time for capacity planning.

Memory Alerts

Memory alerts help you detect memory pressure before it leads to out-of-memory (OOM) errors.

Alert TypeDescriptionRecommended Threshold
Memory UsedTriggers when memory usage exceeds a specific size (MB/GB)70-80% of available memory

Creating Alerts

To create a new alert:

  1. Open your ClickHouse check dashboard
  2. Click "All Alerts" to open the alerts popup
  3. Find the alert category you want (e.g., "Disk Alerts")
  4. Click "+ Create New"
  5. Configure the alert settings (see below)
  6. Click "Save Alert"

Alert Configuration Options

When creating an alert, you'll configure the following options:

OptionDescription
ScopeThe scope of the alert (Global, Database, Table, or Query). Displayed but not editable for global alerts.
Alert TypeThe specific metric to monitor. Options depend on the alert category.
ConditionThe threshold that triggers the alert. Set the operator (>, <, =) and value with unit (MB, GB, TB, %).
Notification GroupSelect which notification group receives alerts. Manage groups here.
Alert AfterHow many consecutive check cycles the condition must be met before triggering. "Immediately" triggers on first occurrence.
Notify when recoveredWhen enabled, sends a recovery notification when the condition is no longer met.

Here are our recommended alert thresholds for ClickHouse monitoring:

MetricWarningCritical
Disk Usage %> 70%> 85%
Memory Used> 70% of available> 85% of available
Connection Response Time> 500ms> 2000ms
Threshold Tuning

These are general recommendations. Adjust thresholds based on your specific workload patterns, hardware specifications, and SLA requirements.

Best Practices

  • Set up Connection Down alerts first – This is the most critical alert that tells you when your database is unreachable.
  • Use tiered alerting – Create both warning and critical alerts for metrics like disk usage to get early warnings before critical situations.
  • Enable recovery notifications – This helps you confirm when issues are resolved without manually checking the dashboard.
  • Use "Alert After" wisely – For metrics that naturally fluctuate (like memory), consider setting "Alert After" to 2-3 cycles to avoid false positives.
  • Organize notification groups – Create separate notification groups for different severity levels or teams to ensure the right people are notified.
  • Review and tune regularly – As your database grows and usage patterns change, revisit your thresholds to ensure they're still appropriate.
Next Steps

After setting up basic alerts, explore Query Performance Monitoring to set up alerts for slow queries, or Disk Space Management for more detailed storage monitoring strategies.