# Glassmkr

> Bare-metal infrastructure monitoring. $3 per node per month, 3 free nodes. Self-hosted AI assistant (Furnace) for alert remediation. MIT-licensed Crucible agent on npm.

For the complete public documentation in a single file optimized for LLM consumption, see /llms-full.txt.

## Quick reference

- Homepage: https://glassmkr.com
- Documentation: https://glassmkr.com/docs
- Per-rule pages: https://glassmkr.com/docs/rules
- Pricing: https://glassmkr.com/#pricing
- Trust + security: https://glassmkr.com/trust
- Verticals: https://glassmkr.com/for-sre, https://glassmkr.com/for-self-hosted, https://glassmkr.com/for-gpu
- Comparison pages: https://glassmkr.com/vs/datadog, /vs/prometheus, /vs/netdata, /vs/checkmk, /vs/zabbix, /vs/librenms, /vs/cloudwatch, /vs/newrelic
- Agent source (MIT): https://github.com/glassmkr/crucible
- npm package: https://www.npmjs.com/package/@glassmkr/crucible
- Current Crucible version: 0.13.3

## Quick Start

Install the Crucible agent on your server:

```
curl -sf https://glassmkr.com/install.sh | bash
```

Sign up at https://app.glassmkr.com/register, add your server, and configure the API key in /etc/glassmkr/collector.yaml. By default, Crucible pushes a health snapshot every 60 seconds.

## API Reference

Base URL: https://app.glassmkr.com/api/v1

Authentication: Bearer token.

- Account API access: gmk_acct_live_xxx
- Crucible ingest: gmk_cru_live_xxx

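As a rough illustration of the Bearer authentication named above, the following Python sketch builds an authenticated request against the base URL without sending it. The helper name `build_request` is hypothetical, the key is the placeholder from the text, and the `Authorization: Bearer <token>` header is the standard Bearer scheme, which this document names but does not spell out.

```python
import urllib.request

# Base URL as given in the API reference.
BASE_URL = "https://app.glassmkr.com/api/v1"


def build_request(path: str, token: str, method: str = "GET") -> urllib.request.Request:
    """Build (but do not send) an API request carrying a Bearer token.

    `build_request` is an illustrative helper, not part of any Glassmkr SDK.
    """
    return urllib.request.Request(
        BASE_URL + path,
        method=method,
        # Assumption: the token travels in the standard Authorization header.
        headers={"Authorization": f"Bearer {token}"},
    )


# Placeholder account key; substitute a real one before sending.
req = build_request("/servers", "gmk_acct_live_xxx")
print(req.get_method(), req.full_url)
```

Passing the result to `urllib.request.urlopen` would send it. The response format is not described here, so the sketch stops at request construction.
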
### Server Management

- GET /servers; list all servers for authenticated customer
- GET /servers/:id; server details
- DELETE /servers/:id; remove server
- GET /servers/:id/health; latest snapshot with active alerts
- GET /servers/:id/alerts?status=active|all; alerts (active only or full history)
- GET /servers/:id/metrics?hours=24; time-series data for charts
- GET /servers/:id/history?hours=24; raw snapshot history

### Alert Management

- POST /alerts/:id/acknowledge; acknowledge an active alert
- GET /servers/:id/mutes; list muted rules for a server
- POST /servers/:id/mutes; mute a rule
- DELETE /servers/:id/mutes; unmute a rule

### Notification Channels

- GET /channels; list channels
- POST /channels; create channel
- PUT /channels/:id; update channel
- DELETE /channels/:id; delete channel

## Rule library

Glassmkr ships 61 alert rules tuned for bare-metal failure modes. Categories:

- Storage: drive SMART degradation, RAID state, NVMe wear, NVMe Critical Warning, I/O errors, latency
- ZFS: pool health, scrub errors, SLOG faulted
- Filesystem: capacity, projection, inodes, read-only mounts, fd exhaustion, LVM thin metadata
- Memory & CPU: utilisation, swap, OOM kills, load, iowait, PSI pressure
- Network: bonding/LACP, interface errors, saturation, link speed, conntrack, retrans, listen overflow, syn flood, softnet drops
- Hardware (BMC/IPMI): CPU temperature, ECC, MCE, PSU redundancy, SEL events, fan failure
- GPU: XID, ECC, thermal, NVLink, PCIe degradation, driver drift, power cap throttling (NVIDIA, Crucible v0.13.0+)
- Time & Services: clock drift, NTP, systemd service state, service flapping, unexpected reboot
- Security & Patching: SSH root, firewall, pending updates, kernel vulnerabilities, reboot pending, unattended-upgrades, server unreachable

Each rule has a public catalog entry at /docs/rules/ with title, summary, and priority.
Per-alert remediation guidance (command, prerequisites, validation, rollback) is rendered inside the dashboard at https://app.glassmkr.com on the alert detail page.
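
To find the alerts whose detail pages carry this guidance, one option is to query the alert endpoints listed under the API Reference. The Python sketch below only builds the URLs for those endpoints. The function names and the ids `srv-1` and `al-9` are hypothetical, and it assumes `hours` accepts any positive integer (the reference only shows `hours=24`).

```python
from urllib.parse import quote, urlencode

# Base URL as given in the API reference.
BASE_URL = "https://app.glassmkr.com/api/v1"


def alerts_url(server_id: str, status: str = "active") -> str:
    """URL for GET /servers/:id/alerts with status=active or status=all."""
    if status not in ("active", "all"):
        raise ValueError("status must be 'active' or 'all'")
    sid = quote(server_id, safe="")  # escape the id before placing it in the path
    return f"{BASE_URL}/servers/{sid}/alerts?{urlencode({'status': status})}"


def metrics_url(server_id: str, hours: int = 24) -> str:
    """URL for GET /servers/:id/metrics over the last `hours` hours."""
    if hours <= 0:
        raise ValueError("hours must be positive")  # assumption: positive integers only
    sid = quote(server_id, safe="")
    return f"{BASE_URL}/servers/{sid}/metrics?{urlencode({'hours': hours})}"


def acknowledge_url(alert_id: str) -> str:
    """URL for POST /alerts/:id/acknowledge."""
    return f"{BASE_URL}/alerts/{quote(alert_id, safe='')}/acknowledge"


# Hypothetical ids, for illustration only.
print(alerts_url("srv-1"))
print(metrics_url("srv-1", hours=6))
print(acknowledge_url("al-9"))
```

Each URL still needs the Bearer token header described under Authentication before it can be requested.
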