Monitoring & Observability

Learn how to monitor your Moose application and gain insights into its performance and health.

Monitoring Overview

Moose provides built-in monitoring capabilities for all components:

Metrics: Quantitative measurements of your application
Logs: Detailed event records
Traces: Request flows through your application
Alerts: Notifications when issues occur

Metrics Console

Moose includes a built-in metrics console that provides real-time visibility into your application:

# Start the metrics console
moose metrics

This opens the Metrics Console UI at http://localhost:4002 with dashboards for:

Ingestion rates
Processing latency
API response times
Database query performance
Error rates
Resource utilization

Key Metrics

Ingestion Metrics

Metric	Description	Alert Threshold
`ingestion.requests.count`	Total ingestion requests	-
`ingestion.requests.rate`	Requests per second	>500/s
`ingestion.error.rate`	Failed ingestion requests	>1%
`ingestion.latency`	Request processing time	>200ms p95

Stream Processing Metrics

Metric	Description	Alert Threshold
`stream.records.count`	Total records processed	-
`stream.processing.rate`	Records per second	-
`stream.lag`	Processing delay	>5s
`stream.errors`	Processing errors	>0

Database Metrics

Metric	Description	Alert Threshold
`db.query.count`	Total queries executed	-
`db.query.latency`	Query execution time	>500ms
`db.write.rate`	Records written per second	-
`db.error.rate`	Database errors	>0

Log Collection

Moose logs are structured for easy analysis:

{
  "timestamp": "2023-03-28T14:32:11.342Z",
  "level": "info",
  "service": "ingest-api",
  "message": "Processed event batch",
  "details": {
    "batchId": "batch-123",
    "recordCount": 52,
    "processingTimeMs": 120
  }
}

Log Levels

ERROR: Application errors requiring attention
WARN: Potential issues or degraded performance
INFO: Normal operations and key events
DEBUG: Detailed information for troubleshooting

Distributed Tracing

Moose supports distributed tracing for request flows:

// Trace context is automatically propagated
ingestStream.addTransform(processedStream, async (record, context) => {
  // Trace span is automatically created
  // ...processing logic
  return processedRecord;
});

Key trace spans include:

Ingestion request handling
Stream processing
Database operations
External API calls

Alerting

Configure alerts in moose.config.toml:

[monitoring.alerts]
# Configure alert thresholds
ingestion_error_rate = { threshold = 0.01, window = "5m" }
processing_lag = { threshold = 10000, window = "1m" }
api_latency_p95 = { threshold = 500, window = "5m" }
 
# Configure notification channels
[monitoring.notifications]
email = "alerts@example.com"
slack_webhook = "https://hooks.slack.com/services/..."

Integration with External Tools

Moose supports integration with common monitoring tools:

Prometheus / Grafana

[monitoring.exporters]
prometheus = { enabled = true, port = 9090 }

ELK Stack

[monitoring.exporters]
elasticsearch = { 
  enabled = true, 
  endpoint = "https://elasticsearch:9200",
  index = "moose-logs"
}

Datadog

[monitoring.exporters]
datadog = { 
  enabled = true, 
  api_key = "YOUR_API_KEY"
}

Best Practices

Monitor All Components:
- Ingestion endpoints
- Stream processing
- Database queries
- API responses
Use Structured Logging:
- Include request IDs
- Add contextual information
- Use appropriate log levels
Set Up Alerts:
- Define critical thresholds
- Configure notification channels
- Create runbooks for common issues
Performance Baseline:
- Establish normal performance metrics
- Alert on significant deviations
- Track trends over time

See the Metrics Console documentation for more details on the built-in monitoring capabilities.

Scheduling & Triggering Workflows

Planning