Production Deployment
Deploy the Geode graph database to production environments with confidence. This guide covers high-availability architecture, monitoring strategies, backup procedures, security hardening, and operational excellence for running Geode at scale.
Production Readiness
Geode is battle-tested for demanding production workloads:
Proven Reliability:
- 97.4% test coverage (1644/1688 tests passing)
- 100% GQL compliance (see conformance profile)
- ACID-compliant transactions
- Production deployments handling high query volumes
Enterprise Features:
- Row-level security for multi-tenant architectures
- Full transactional consistency with savepoints
- TLS 1.3 encryption for all connections
- Comprehensive audit logging
Architecture for Scale:
- Memory-mapped I/O for efficient storage access
- Connection pooling for concurrent clients
- Distributed deployment with up to 32 shards
Architecture Patterns
Single-Node Deployment
Use Cases:
- Development and testing environments
- Low-traffic production workloads
- Applications requiring ACID guarantees without replication complexity
- Budget-constrained deployments
Configuration:
# geode.yaml
server:
  listen: 0.0.0.0:3141
  max_connections: 1000
  tls:
    cert_file: /etc/geode/tls/server.crt
    key_file: /etc/geode/tls/server.key
    client_ca: /etc/geode/tls/ca.crt

storage:
  data_dir: /var/lib/geode/data
  wal_dir: /var/lib/geode/wal
  checkpoint_interval: 300s

logging:
  level: info
  output: /var/log/geode/server.log
  format: json

performance:
  query_timeout: 60s
  transaction_timeout: 300s
  max_query_memory: 2GB
  worker_threads: 8
Deployment:
# SystemD service
cat > /etc/systemd/system/geode.service <<EOF
[Unit]
Description=Geode Graph Database
After=network.target
[Service]
Type=simple
User=geode
Group=geode
ExecStart=/usr/local/bin/geode serve --config /etc/geode/geode.yaml
Restart=on-failure
RestartSec=5s
LimitNOFILE=65536
[Install]
WantedBy=multi-user.target
EOF
systemctl daemon-reload
systemctl enable geode
systemctl start geode
High-Availability Cluster
Use Cases:
- Business-critical applications requiring 99.99% uptime
- High-traffic workloads (throughput depends on workload and server limits)
- Geographic distribution for disaster recovery
- Applications with strict RTO/RPO requirements
Architecture:
┌─────────────┐      ┌─────────────┐      ┌─────────────┐
│   Leader    │─────▶│ Follower 1  │      │ Follower 2  │
│   (Write)   │      │   (Read)    │      │   (Read)    │
└──────┬──────┘      └──────┬──────┘      └──────┬──────┘
       │                    │                    │
       └────────────────────┴────────────────────┘
                     Raft Consensus
Configuration (Leader):
server:
  listen: 0.0.0.0:3141

cluster:
  enabled: true
  node_id: leader-1
  peers:
    - follower-1.internal:3141
    - follower-2.internal:3141
  election_timeout: 300ms
  heartbeat_interval: 100ms

replication:
  mode: synchronous   # or asynchronous for performance
  min_replicas: 1     # Wait for 1 replica before committing
Configuration (Follower):
server:
  listen: 0.0.0.0:3141

cluster:
  enabled: true
  node_id: follower-1
  leader: leader-1.internal:3141
  peers:
    - leader-1.internal:3141
    - follower-2.internal:3141

replication:
  mode: asynchronous
  catch_up_batch_size: 1000
Kubernetes Deployment
StatefulSet Configuration:
apiVersion: apps/v1
kind: StatefulSet
metadata:
  name: geode
  namespace: production
spec:
  serviceName: geode
  replicas: 3
  selector:
    matchLabels:
      app: geode
  template:
    metadata:
      labels:
        app: geode
    spec:
      affinity:
        podAntiAffinity:
          requiredDuringSchedulingIgnoredDuringExecution:
            - labelSelector:
                matchExpressions:
                  - key: app
                    operator: In
                    values:
                      - geode
              topologyKey: kubernetes.io/hostname
      containers:
        - name: geode
          image: geodedb/geode:0.1.3
          ports:
            - containerPort: 3141
              name: client
            - containerPort: 3142
              name: cluster
          env:
            - name: GEODE_NODE_ID
              valueFrom:
                fieldRef:
                  fieldPath: metadata.name
            - name: GEODE_CLUSTER_ENABLED
              value: "true"
          resources:
            requests:
              cpu: 2000m
              memory: 4Gi
            limits:
              cpu: 4000m
              memory: 8Gi
          volumeMounts:
            - name: data
              mountPath: /var/lib/geode
            - name: config
              mountPath: /etc/geode
          livenessProbe:
            exec:
              command: ["/usr/local/bin/geode", "health"]
            initialDelaySeconds: 30
            periodSeconds: 10
          readinessProbe:
            exec:
              command: ["/usr/local/bin/geode", "ready"]
            initialDelaySeconds: 5
            periodSeconds: 5
  volumeClaimTemplates:
    - metadata:
        name: data
      spec:
        accessModes: ["ReadWriteOnce"]
        storageClassName: fast-ssd
        resources:
          requests:
            storage: 100Gi
Headless Service:
apiVersion: v1
kind: Service
metadata:
  name: geode
  namespace: production
spec:
  clusterIP: None
  selector:
    app: geode
  ports:
    - port: 3141
      name: client
    - port: 3142
      name: cluster
Load Balancer for Reads:
apiVersion: v1
kind: Service
metadata:
  name: geode-read
  namespace: production
  annotations:
    service.beta.kubernetes.io/aws-load-balancer-type: nlb
spec:
  type: LoadBalancer
  selector:
    app: geode
    role: follower  # Only route reads to followers
  ports:
    - port: 3141
      targetPort: 3141
Security Hardening
TLS Configuration
Generate Certificates:
# Certificate Authority
openssl genrsa -out ca.key 4096
openssl req -new -x509 -days 3650 -key ca.key -out ca.crt \
-subj "/C=US/ST=State/L=City/O=Organization/CN=Geode CA"
# Server Certificate
openssl genrsa -out server.key 4096
openssl req -new -key server.key -out server.csr \
-subj "/C=US/ST=State/L=City/O=Organization/CN=geode.example.com"
# Sign with CA
openssl x509 -req -in server.csr -CA ca.crt -CAkey ca.key \
-CAcreateserial -out server.crt -days 365 \
-sha256 -extfile <(printf "subjectAltName=DNS:geode.example.com,DNS:*.geode.internal")
Server Configuration:
server:
  tls:
    cert_file: /etc/geode/tls/server.crt
    key_file: /etc/geode/tls/server.key
    client_ca: /etc/geode/tls/ca.crt
    min_version: "1.3"
    require_client_cert: true  # mTLS
Client Configuration:
from geode_client import Client

client = Client(
    "geode.example.com:3141",
    tls_verify=True,
    tls_cert="/path/to/client.crt",
    tls_key="/path/to/client.key",
    tls_ca="/path/to/ca.crt",
)
Authentication & Authorization
User Management:
-- Create admin user
CREATE USER admin WITH PASSWORD 'strong_password_here' ROLE administrator;
-- Create read-only user
CREATE USER analyst WITH PASSWORD 'another_password' ROLE reader;
-- Create application user with specific permissions
CREATE USER app_user WITH PASSWORD 'app_password' ROLE writer;
GRANT SELECT, INSERT, UPDATE ON GRAPH social_network TO app_user;
Row-Level Security:
-- Multi-tenant isolation policy
CREATE POLICY tenant_isolation ON User
  FOR ALL
  USING (user.organization_id = current_user_organization_id());

-- Data classification policy
CREATE POLICY sensitive_data ON Document
  FOR SELECT
  USING (
    document.classification = 'public'
    OR (document.classification = 'internal' AND current_user_role() IN ('employee', 'admin'))
    OR document.owner_id = current_user_id()
  );
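The second policy's precedence is easy to misread (AND binds tighter than OR). As a plain-Python sketch of the predicate the policy evaluates per row — the `doc` dict, `role`, and `user_id` arguments model `current_user_role()` / `current_user_id()` and are illustrative, not Geode APIs:

```python
# Plain-Python model of the sensitive_data policy predicate (illustrative).
def can_select(doc: dict, role: str, user_id: int) -> bool:
    return (
        doc["classification"] == "public"
        # AND binds tighter than OR: this clause only grants
        # 'internal' documents to employees and admins
        or (doc["classification"] == "internal" and role in ("employee", "admin"))
        # owners always see their own documents
        or doc["owner_id"] == user_id
    )
```

A guest sees public documents and their own, while an employee additionally sees internal ones — which is exactly what the policy above expresses.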
Network Security
Firewall Rules:
# Allow only application servers to connect
iptables -A INPUT -p tcp --dport 3141 -s 10.0.1.0/24 -j ACCEPT
iptables -A INPUT -p tcp --dport 3141 -j DROP
# Cluster communication
iptables -A INPUT -p tcp --dport 3142 -s 10.0.2.0/24 -j ACCEPT
iptables -A INPUT -p tcp --dport 3142 -j DROP
VPC Configuration (AWS):
resource "aws_security_group" "geode" {
  name        = "geode-production"
  description = "Geode database security group"
  vpc_id      = aws_vpc.main.id

  # Client connections from application tier
  ingress {
    from_port       = 3141
    to_port         = 3141
    protocol        = "tcp"
    security_groups = [aws_security_group.app_tier.id]
  }

  # Cluster communication
  ingress {
    from_port = 3142
    to_port   = 3142
    protocol  = "tcp"
    self      = true
  }

  egress {
    from_port   = 0
    to_port     = 0
    protocol    = "-1"
    cidr_blocks = ["0.0.0.0/0"]
  }
}
Monitoring & Alerting
Metrics Collection
Prometheus Configuration:
# prometheus.yml
scrape_configs:
  - job_name: 'geode'
    static_configs:
      - targets: ['geode-1:9090', 'geode-2:9090', 'geode-3:9090']
    metrics_path: /metrics
    scrape_interval: 15s
Key Metrics:
# Expose metrics endpoint (Python client application)
from prometheus_client import Counter, Histogram, Gauge, start_http_server

query_duration = Histogram('geode_query_duration_seconds', 'Query execution time')
query_counter = Counter('geode_queries_total', 'Total queries executed', ['status'])
connection_pool = Gauge('geode_connection_pool_active', 'Active connections')
transaction_duration = Histogram('geode_transaction_duration_seconds', 'Transaction time')

start_http_server(9090)  # serve /metrics on the port Prometheus scrapes

@query_duration.time()
async def execute_query(client, query):
    try:
        result, _ = await client.query(query)
        query_counter.labels(status='success').inc()
        return result
    except Exception:
        query_counter.labels(status='error').inc()
        raise
Grafana Dashboard:
{
  "dashboard": {
    "title": "Geode Production Monitoring",
    "panels": [
      {
        "title": "Query Latency (p95)",
        "targets": [
          { "expr": "histogram_quantile(0.95, rate(geode_query_duration_seconds_bucket[5m]))" }
        ]
      },
      {
        "title": "Queries Per Second",
        "targets": [
          { "expr": "rate(geode_queries_total[1m])" }
        ]
      },
      {
        "title": "Error Rate",
        "targets": [
          { "expr": "rate(geode_queries_total{status='error'}[5m]) / rate(geode_queries_total[5m])" }
        ]
      },
      {
        "title": "Connection Pool Utilization",
        "targets": [
          { "expr": "geode_connection_pool_active / geode_connection_pool_max * 100" }
        ]
      }
    ]
  }
}
Alerting Rules
# alerting_rules.yml
groups:
  - name: geode_alerts
    interval: 30s
    rules:
      - alert: HighQueryLatency
        expr: histogram_quantile(0.95, rate(geode_query_duration_seconds_bucket[5m])) > 1
        for: 5m
        labels:
          severity: warning
        annotations:
          summary: "High query latency detected"
          description: "p95 query latency is {{ $value }}s (threshold: 1s)"

      - alert: HighErrorRate
        expr: rate(geode_queries_total{status="error"}[5m]) / rate(geode_queries_total[5m]) > 0.05
        for: 5m
        labels:
          severity: critical
        annotations:
          summary: "High error rate detected"
          description: "Error rate is {{ $value | humanizePercentage }} (threshold: 5%)"

      - alert: ConnectionPoolExhaustion
        expr: geode_connection_pool_active / geode_connection_pool_max > 0.9
        for: 5m
        labels:
          severity: warning
        annotations:
          summary: "Connection pool nearly exhausted"
          description: "Pool utilization is {{ $value | humanizePercentage }}"

      - alert: ReplicationLag
        expr: geode_replication_lag_seconds > 10
        for: 2m
        labels:
          severity: warning
        annotations:
          summary: "Replication lag detected"
          description: "Follower is {{ $value }}s behind leader"

      - alert: NodeDown
        expr: up{job="geode"} == 0
        for: 1m
        labels:
          severity: critical
        annotations:
          summary: "Geode node is down"
          description: "Node {{ $labels.instance }} has been down for > 1m"
Backup & Disaster Recovery
Backup Strategy
Full Backups:
#!/bin/bash
# daily_backup.sh
set -euo pipefail

BACKUP_DIR="/backups/geode"
DATE=$(date +%Y%m%d_%H%M%S)
BACKUP_FILE="$BACKUP_DIR/geode_full_$DATE.tar.gz"

# Trigger consistent snapshot
./geode backup create --output "$BACKUP_FILE" --compress

# Verify the backup before shipping it off-site
./geode backup verify --file "$BACKUP_FILE"

# Upload to S3
aws s3 cp "$BACKUP_FILE" "s3://backups/geode/full/" --storage-class STANDARD_IA

# Retain only last 7 days locally
find "$BACKUP_DIR" -name "geode_full_*.tar.gz" -mtime +7 -delete
Incremental Backups:
#!/bin/bash
# hourly_incremental.sh
BACKUP_DIR="/backups/geode/incremental"
DATE=$(date +%Y%m%d_%H%M%S)
# Archive WAL segments since last backup
./geode wal-archive \
--since-checkpoint \
--output "$BACKUP_DIR/wal_$DATE.tar.gz"
aws s3 cp "$BACKUP_DIR/wal_$DATE.tar.gz" "s3://backups/geode/wal/"
Recovery Procedures
Full Restore:
# Stop Geode
systemctl stop geode
# Clear existing data
rm -rf /var/lib/geode/data/*
rm -rf /var/lib/geode/wal/*
# Restore from backup
./geode restore \
--input "/backups/geode/geode_full_20250124.tar.gz" \
--data-dir /var/lib/geode/data
# Start Geode
systemctl start geode
# Verify data integrity
./geode verify --data-dir /var/lib/geode/data
Point-in-Time Recovery (PITR):
# Restore base backup
./geode restore --input /backups/geode/geode_full_20250124.tar.gz
# Replay WAL to specific timestamp
./geode wal-replay \
--wal-archive /backups/geode/wal/ \
--target-time "2025-01-24T15:30:00Z" \
--data-dir /var/lib/geode/data
Capacity Planning
Sizing Guidelines
Memory Requirements:
- Base: 2GB for Geode process
- Working set: 50-70% of total graph size for hot data
- Query cache: 10-20% of memory
- Connection overhead: 10MB per 1000 connections
Example:
- 10GB graph, 1000 connections, 10% query cache
- Memory needed: 2GB base + (10GB * 0.6) working set + (10GB * 0.1) cache + 0.01GB for connections ≈ 9GB
- Recommended: 16GB for headroom
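The sizing rules above can be encoded as a small calculator. A sketch using the midpoints stated in the guidelines (60% working set, 10% query cache, 10MB per 1000 connections) — the function name and defaults are illustrative, not a Geode tool:

```python
def estimate_memory_gb(graph_gb: float, connections: int,
                       cache_fraction: float = 0.10,
                       working_set_fraction: float = 0.60,
                       base_gb: float = 2.0) -> float:
    """Rough memory estimate from the guidelines above (illustrative)."""
    conn_gb = connections / 1000 * 0.01  # 10MB per 1000 connections
    return (base_gb
            + graph_gb * working_set_fraction  # hot working set
            + graph_gb * cache_fraction        # query cache
            + conn_gb)                         # connection overhead

# The worked example: 10GB graph, 1000 connections
print(estimate_memory_gb(10, 1000))  # → 9.01
```

Round the result up to the next common instance size (here 16GB) to leave headroom for spikes and compaction.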
Storage Requirements:
- Data: 1.2-1.5x raw data size (compression + overhead)
- WAL: 20-30% of data size for write-heavy workloads
- Snapshots: Full data size per backup
- Indexes: 10-30% of data size depending on indexed properties
CPU Requirements:
- Query parsing and planning scale with query complexity and concurrency
- Transaction processing scales with write mix and index maintenance
- Replication overhead depends on follower count and network latency
- Benchmark your workload; Geode scales linearly up to 64 cores
Load Testing
# load_test.py using locust
import asyncio
import random

from locust import User, task, between
from geode_client import Client

class GeodeUser(User):
    wait_time = between(0.1, 0.5)

    def on_start(self):
        self.client = Client("geode.example.com:3141")

    @task(10)  # 10x weight
    def read_query(self):
        asyncio.run(self.client.execute(
            "MATCH (p:Person {id: $id}) RETURN p",
            id=random.randint(1, 100000)
        ))

    @task(1)  # 1x weight (writes less frequent)
    def write_query(self):
        asyncio.run(self.client.execute(
            "CREATE (p:Person {id: $id, name: $name})",
            id=random.randint(100001, 200000),
            name=f"User_{random.randint(1, 10000)}"
        ))
Run load test:
locust -f load_test.py --host geode.example.com --users 1000 --spawn-rate 10 --run-time 1h
Operational Runbooks
Runbook: High CPU Usage
Symptoms: CPU > 80% for 5+ minutes
Investigation:
- Check active queries: SELECT * FROM system.queries WHERE duration > 10s
- Profile slow queries: Use the PROFILE command
- Check connection count: Monitor the active_connections metric
- Review recent schema changes
Resolution:
- Kill long-running queries: KILL QUERY 'query-id'
- Add missing indexes
- Scale horizontally (add replicas)
- Increase CPU allocation
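The first two resolution steps can be scripted. A sketch that picks which rows of a hypothetical system.queries result to pass to KILL QUERY — the row shape (`id`, `duration_s`) is illustrative, not a documented Geode schema:

```python
def queries_to_kill(queries: list[dict], threshold_s: float = 10.0) -> list[str]:
    """Return IDs of queries exceeding the threshold, longest-running first.
    Illustrative: assumes rows shaped like {"id": ..., "duration_s": ...}."""
    slow = [q for q in queries if q["duration_s"] > threshold_s]
    # Kill the worst offenders first to recover CPU fastest
    return [q["id"] for q in sorted(slow, key=lambda q: q["duration_s"], reverse=True)]
```

Feed the returned IDs into KILL QUERY one at a time, re-checking CPU between kills so you stop as soon as the node recovers.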
Runbook: Replication Lag
Symptoms: Follower > 10s behind leader
Investigation:
- Check network latency between nodes
- Review follower logs for errors
- Check disk I/O on follower
- Verify follower isn’t overloaded with read queries
Resolution:
- Increase replication batch size
- Switch to asynchronous replication
- Add dedicated read replicas
- Upgrade network bandwidth
Runbook: Connection Pool Exhaustion
Symptoms: Connection timeouts, pool at 100%
Investigation:
- Check for connection leaks in application
- Review connection lifecycle management
- Analyze query patterns (long-running queries?)
Resolution:
- Increase pool size
- Reduce query timeout
- Fix application connection leaks
- Implement connection retry logic
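The last resolution step is a standard pattern. A minimal sketch of jittered exponential backoff, assuming the client raises ConnectionError on transient failures (this wrapper is not a geode_client feature):

```python
import random
import time

def with_retries(fn, attempts: int = 5, base_delay: float = 0.1,
                 max_delay: float = 2.0, sleep=time.sleep):
    """Retry fn() with jittered exponential backoff (illustrative pattern).
    Re-raises the last ConnectionError once attempts are exhausted."""
    for attempt in range(attempts):
        try:
            return fn()
        except ConnectionError:
            if attempt == attempts - 1:
                raise
            # 0.1s, 0.2s, 0.4s, ... capped at max_delay
            delay = min(max_delay, base_delay * 2 ** attempt)
            # jitter spreads retries so clients don't stampede the pool
            sleep(delay * random.uniform(0.5, 1.0))
```

Keep attempts small: unbounded retries against an exhausted pool only prolong the outage, and the backoff cap bounds how long a single call can stall.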
Production Checklist
Pre-Launch
- TLS certificates configured and tested
- Authentication and authorization policies defined
- Firewall rules implemented
- Backup strategy configured and tested
- Monitoring and alerting operational
- Load testing completed
- Disaster recovery procedures documented
- On-call rotation established
- Capacity planning reviewed
- Security audit completed
Post-Launch
- Monitor key metrics daily for first week
- Review logs for errors and warnings
- Validate backup integrity weekly
- Test disaster recovery procedures monthly
- Review and update capacity projections
- Conduct performance tuning based on real workload
- Update documentation with operational learnings
- Train operations team on runbooks
Related Topics
- Operations: Day-to-day operational procedures
- Monitoring: Metrics, logging, and observability
- Performance Tuning: Optimization techniques
- Security: Authentication, authorization, encryption
- DevOps: Automation and infrastructure as code