Nexus (The Brain)
Nexus - The Regional Brain
16GB+ RAM · $2000+ · ML training capability
Nexus is the heavy compute tier designed for organizations that need local ML model training, regional threat aggregation, and fleet management.
What Nexus Enables
| Capability | Description |
|---|---|
| ML Training | Train threat models locally |
| Federated Learning | Coordinate weight sharing |
| Fleet Management | Manage regional nodes |
| ClickHouse Analytics | High-performance queries |
| GPU Acceleration | NVIDIA/AMD support |
| High Availability | Redundant deployment |
Best For
- Multi-site organizations
- Security research teams
- ML-heavy workloads
- Regional coordination
- Data sovereignty requirements
Hardware Requirements
| Resource | Minimum | Recommended |
|---|---|---|
| CPU | 8 cores | 16+ cores (Xeon, EPYC) |
| RAM | 16GB | 64GB+ |
| Storage | 256GB SSD | 1TB+ NVMe |
| Network | 10Gbps | Intel X710 or Mellanox |
| GPU | Optional | NVIDIA RTX 3060+ |
Recommended Hardware
Server Configuration:
- Intel Xeon E-2300 series or AMD EPYC
- 64GB ECC RAM
- NVMe storage array
- Dual 10GbE NICs
- NVIDIA GPU (optional)
Approximate cost: $2000-5000+
Installation
sudo ./install.sh --tier nexus --enable-gpu --enable-haInstallation Options
# Standard Nexussudo ./install.sh --tier nexus
# With GPU accelerationsudo ./install.sh --tier nexus --enable-gpu
# High availability modesudo ./install.sh --tier nexus --enable-ha
# Full feature setsudo ./install.sh --tier nexus --enable-gpu --enable-ha --enable-clickhouseML Training
Local Model Training
Train threat detection models on your data:
from hookprobe.ml import ThreatClassifier
classifier = ThreatClassifier( model_type="transformer", local_data="/var/hookprobe/data/flows.parquet")
# Train on local data - never leaves your networkclassifier.train(epochs=50, batch_size=32)
# Export for deployment to edge nodesclassifier.export("/var/hookprobe/models/threat-v4.onnx")Federated Learning
Coordinate learning across the mesh:
from hookprobe.federated import FederatedCoordinator
coordinator = FederatedCoordinator( aggregation="fedavg", min_participants=10, rounds=100)
# Nodes train locally, share only weight updatescoordinator.start_round()
# Aggregate updates (no raw data shared)global_weights = coordinator.aggregate()
# Push improved model to all nodescoordinator.distribute(global_weights)Model Visibility
Every ML decision is transparent:
{ "model": "hookprobe-threat-classifier-v4", "version": "4.2.1", "trained_on": "local_data", "training_date": "2025-12-01", "accuracy": 0.97, "decision": { "classification": "malware_c2", "confidence": 0.94, "features": { "dns_entropy": 4.8, "connection_pattern": "beacon", "timing_jitter": 0.02 }, "explanation": "High entropy DNS with periodic beacon pattern" }}ClickHouse Analytics
High-performance analytics for security data:
Query Examples
-- Top blocked domains this weekSELECT domain, count() as blocksFROM dns_blocksWHERE timestamp > now() - INTERVAL 7 DAYGROUP BY domainORDER BY blocks DESCLIMIT 10;
-- QSecBit trend analysisSELECT toStartOfHour(timestamp) as hour, avg(score) as avg_score, max(score) as max_scoreFROM qsecbit_historyWHERE timestamp > now() - INTERVAL 24 HOURGROUP BY hourORDER BY hour;
-- Attack pattern correlationSELECT signature, count() as occurrences, uniq(node_id) as affected_nodesFROM mesh_alertsWHERE timestamp > now() - INTERVAL 1 DAYGROUP BY signatureORDER BY affected_nodes DESC;Data Retention
| Data Type | Retention | Compression |
|---|---|---|
| Flow data | 30 days | LZ4 |
| DNS logs | 90 days | ZSTD |
| QSecBit scores | 365 days | LZ4 |
| ML metrics | 365 days | ZSTD |
Fleet Management
Regional Coordination
Nexus manages edge nodes in its region:
# View managed nodeshookprobe-ctl fleet list
# Push configuration updatehookprobe-ctl fleet config --update baseline.yaml
# Deploy new modelhookprobe-ctl fleet deploy --model threat-v4.onnx
# Health check all nodeshookprobe-ctl fleet healthFleet Dashboard
| Metric | Description |
|---|---|
| Node count | Active nodes in region |
| Average QSecBit | Regional threat level |
| Model version | Current deployment |
| Last sync | Configuration freshness |
High Availability
Deployment Options
ha: enabled: true mode: "active-passive" nodes: - nexus-primary.internal:7946 - nexus-secondary.internal:7946 vip: 10.0.0.100 failover_timeout: 30Data Replication
- ClickHouse replication
- Configuration sync
- Model distribution
- Certificate rotation
GPU Acceleration
Supported GPUs
| Vendor | Models |
|---|---|
| NVIDIA | RTX 3060+, A2000+, T4, A100 |
| AMD | RX 6700+ (ROCm) |
GPU Workloads
| Task | Speedup |
|---|---|
| Model training | 10-50x |
| Inference | 5-20x |
| Embedding generation | 20-100x |
Configuration
# Verify GPU detectionhookprobe-ctl gpu status
# Enable GPU for traininghookprobe-ctl ml train --use-gpuIntegration Points
Dashboard
Full web dashboard with:
- Fleet overview
- ML training status
- ClickHouse query interface
- Model performance metrics
- Regional threat map
API
# Fleet statuscurl http://localhost:8888/api/fleet/status
# ML metricscurl http://localhost:8888/api/ml/metrics
# ClickHouse querycurl -X POST http://localhost:8888/api/query \ -d '{"sql": "SELECT * FROM qsecbit_history LIMIT 10"}'Resource Usage
| Resource | Typical Usage |
|---|---|
| RAM | 8-12GB (16GB+ with training) |
| CPU | 40-60% (100% during training) |
| GPU | Idle to 100% (training) |
| Disk I/O | High (ClickHouse) |
| Network | 100Mbps-1Gbps |
Security Considerations
Data Sovereignty
- All training data stays local
- Only weight updates shared
- ClickHouse on encrypted storage
- Network segmentation supported
Access Control
| Role | Permissions |
|---|---|
| Admin | Full access |
| Analyst | Query, view models |
| Operator | Fleet management |
| Viewer | Read-only dashboard |
The Nexus Difference
Other security tools: “Trust our cloud AI” Nexus: “Train your own models, see exactly how decisions are made”
- Full visibility into ML decisions
- Local training - data never leaves
- Auditable models - export and verify
- Collective benefit - share intelligence, not data
Next Steps
- Architecture - System design details
- Protocols - HTP transport layer
- Operations - Deployment guide
AI Intelligence (HIP)
Nexus runs the full HookProbe Intelligence Platform with deep ML capabilities:
- Hardware Detection: All 13 NPU types + GPU detection (NVIDIA CUDA/ROCm)
- Inference Bridge: GPU/NPU-accelerated classification + large local LLM (Phi-3 to Llama-70B)
- AEGIS-Deep: Advanced agents with local model training and adversarial testing
- Federated Learning: Aggregation server for collective model updates with differential privacy
- Red/Purple Teaming: Automated attack simulation and digital twin testing
See Brain & NPU Detection and AEGIS AI Defense for details.