Skip to content

Hardware Sizing Guide

Choose the right hardware for your GraphnAI deployment based on the number of identities in your environment.

Audience: IT administrators planning a self-hosted GraphnAI deployment.

Related: Quick Start: Self-Hosted, Identity Bridge, Bridge Configuration

Deployment Architecture

GraphnAI runs as two components:

ComponentWhat It IncludesWhere It Runs
Server containerGraphnAI Platform (API + UI) and ArangoDB (graph database) in a single Docker Compose stackA dedicated VM or server with Docker
Identity BridgeLightweight agent that collects Active Directory dataOn-premises, one per network segment or AD forest

The server container handles all graph storage, analysis, and UI serving. The Bridge connects outbound to the server — it does not require inbound firewall rules.

Server Container Requirements

The server and ArangoDB share resources within the same Docker host. ArangoDB is the primary consumer of RAM and storage during identity sync operations.

EnvironmentIdentitiesEstimated EdgesRAMCPUStorage (SSD)
SmallUp to 10,000~200K8 GB2 cores30 GB
Medium10,000 - 100,0001 - 2M16 GB4 cores75 GB
Large100,000 - 500,0006 - 10M32 GB8 cores150 GB
Enterprise500,000+15 - 20M+64 GB8+ cores300 GB+

Understanding the estimates

  • Identities include users, groups, computers, service accounts, service principals, OUs, containers, and domain objects synced from Active Directory and Entra ID.
  • Edges represent access relationships (MemberOf, ACL permissions, containment, etc.). Environments with complex ACLs (many delegated permissions, fine-grained security descriptors) will have higher edge counts. The ratio is typically 10-20 edges per identity.
  • RAM is the most critical dimension. ArangoDB's write-ahead log (WAL), RocksDB block cache, and indexes all compete for memory during sync. Insufficient RAM causes sync failures.
  • Storage must be SSD. ArangoDB uses RocksDB, which performs background compaction. Spinning disks cause severe performance degradation. Allow headroom of roughly 2x the live data size to accommodate compaction.

Resource usage pattern

GraphnAI has a bursty workload:

  • Idle / query time (majority): Lightweight topology queries, dashboard reads, and analytics cache hits. Minimal resource usage.
  • Sync bursts (periodic): Heavy write activity during scheduled identity syncs — edge building, criticality evaluation, analytics pipeline. This is when RAM and CPU spike.

For most deployments, sizing for the sync burst is what matters. Query-time performance is comfortable even on the Small tier.

Identity Bridge Requirements

Each Bridge instance is lightweight. CPU is the main factor — ACL parsing from Active Directory security descriptors is CPU-bound.

ResourceRequirement
RAM2 - 4 GB
CPU2 - 4 cores (more cores = faster sync)
StorageMinimal (stateless agent, logs only)
NetworkOutbound HTTPS to server on port 8443

How many Bridges?

Deploy one Bridge per network segment that contains Active Directory domain controllers:

ScenarioBridges Needed
Single AD forest, one site1
Single AD forest, multiple isolated network segments1 per segment
Multiple AD forests1 per forest (minimum)
Entra ID only (no on-premises AD)0 — Entra sync runs server-side

Bridges are managed entirely from the platform UI. See Bridge Configuration for deployment instructions.

Cloud / SaaS Considerations

For cloud-hosted deployments, the bursty workload pattern makes auto-scaling attractive:

  • Scale up before sync: If syncs are scheduled (e.g., nightly), increase the container's memory allocation before the sync window starts, then scale back down after completion.
  • ArangoDB memory: RocksDB's block cache grows as data is accessed. Scaling up memory before the sync starts is more effective than scaling reactively mid-sync.
  • Server is stateless: The server container holds no state — all data lives in ArangoDB. This makes horizontal scaling straightforward for query workloads, though sync operations run single-threaded per domain.

Recommendations by Deployment Type

Small / Proof of Concept

A single VM with 8 GB RAM and 2 cores handles up to 10K identities comfortably. Suitable for evaluation, single-domain environments, or small organizations.

Medium / Multi-Domain

16 GB RAM with 4 cores supports most mid-size organizations. If you have multiple AD domains, plan for one Bridge per network segment. Syncs complete in a few minutes.

Large / Enterprise

32-64 GB RAM is recommended. At this scale, identity syncs produce millions of edges and sync duration depends on the ArangoDB write throughput. The platform uses adaptive throttling to prevent database overload — sufficient RAM ensures syncs complete without intervention.

Multi-Forest / Global

For organizations with 500K+ identities across multiple forests, contact your GraphnAI representative for architecture planning. Factors like cross-forest trust relationships, sync scheduling, and network topology affect the optimal deployment layout.

Support

Contact your GraphnAI account representative for sizing guidance specific to your environment.