Skip to content

Nexus (The Brain)

Nexus - The Regional Brain

16GB+ RAM · $2000+ · ML training capability

Nexus is the heavy compute tier designed for organizations that need local ML model training, regional threat aggregation, and fleet management.

What Nexus Enables

CapabilityDescription
ML TrainingTrain threat models locally
Federated LearningCoordinate weight sharing
Fleet ManagementManage regional nodes
ClickHouse AnalyticsHigh-performance queries
GPU AccelerationNVIDIA/AMD support
High AvailabilityRedundant deployment

Best For

  • Multi-site organizations
  • Security research teams
  • ML-heavy workloads
  • Regional coordination
  • Data sovereignty requirements

Hardware Requirements

ResourceMinimumRecommended
CPU8 cores16+ cores (Xeon, EPYC)
RAM16GB64GB+
Storage256GB SSD1TB+ NVMe
Network10GbpsIntel X710 or Mellanox
GPUOptionalNVIDIA RTX 3060+

Server Configuration:

  • Intel Xeon E-2300 series or AMD EPYC
  • 64GB ECC RAM
  • NVMe storage array
  • Dual 10GbE NICs
  • NVIDIA GPU (optional)

Approximate cost: $2000-5000+

Installation

Terminal window
sudo ./install.sh --tier nexus --enable-gpu --enable-ha

Installation Options

Terminal window
# Standard Nexus
sudo ./install.sh --tier nexus
# With GPU acceleration
sudo ./install.sh --tier nexus --enable-gpu
# High availability mode
sudo ./install.sh --tier nexus --enable-ha
# Full feature set
sudo ./install.sh --tier nexus --enable-gpu --enable-ha --enable-clickhouse

ML Training

Local Model Training

Train threat detection models on your data:

from hookprobe.ml import ThreatClassifier
classifier = ThreatClassifier(
model_type="transformer",
local_data="/var/hookprobe/data/flows.parquet"
)
# Train on local data - never leaves your network
classifier.train(epochs=50, batch_size=32)
# Export for deployment to edge nodes
classifier.export("/var/hookprobe/models/threat-v4.onnx")

Federated Learning

Coordinate learning across the mesh:

from hookprobe.federated import FederatedCoordinator
coordinator = FederatedCoordinator(
aggregation="fedavg",
min_participants=10,
rounds=100
)
# Nodes train locally, share only weight updates
coordinator.start_round()
# Aggregate updates (no raw data shared)
global_weights = coordinator.aggregate()
# Push improved model to all nodes
coordinator.distribute(global_weights)

Model Visibility

Every ML decision is transparent:

{
"model": "hookprobe-threat-classifier-v4",
"version": "4.2.1",
"trained_on": "local_data",
"training_date": "2025-12-01",
"accuracy": 0.97,
"decision": {
"classification": "malware_c2",
"confidence": 0.94,
"features": {
"dns_entropy": 4.8,
"connection_pattern": "beacon",
"timing_jitter": 0.02
},
"explanation": "High entropy DNS with periodic beacon pattern"
}
}

ClickHouse Analytics

High-performance analytics for security data:

Query Examples

-- Top blocked domains this week
SELECT domain, count() as blocks
FROM dns_blocks
WHERE timestamp > now() - INTERVAL 7 DAY
GROUP BY domain
ORDER BY blocks DESC
LIMIT 10;
-- QSecBit trend analysis
SELECT
toStartOfHour(timestamp) as hour,
avg(score) as avg_score,
max(score) as max_score
FROM qsecbit_history
WHERE timestamp > now() - INTERVAL 24 HOUR
GROUP BY hour
ORDER BY hour;
-- Attack pattern correlation
SELECT
signature,
count() as occurrences,
uniq(node_id) as affected_nodes
FROM mesh_alerts
WHERE timestamp > now() - INTERVAL 1 DAY
GROUP BY signature
ORDER BY affected_nodes DESC;

Data Retention

Data TypeRetentionCompression
Flow data30 daysLZ4
DNS logs90 daysZSTD
QSecBit scores365 daysLZ4
ML metrics365 daysZSTD

Fleet Management

Regional Coordination

Nexus manages edge nodes in its region:

Terminal window
# View managed nodes
hookprobe-ctl fleet list
# Push configuration update
hookprobe-ctl fleet config --update baseline.yaml
# Deploy new model
hookprobe-ctl fleet deploy --model threat-v4.onnx
# Health check all nodes
hookprobe-ctl fleet health

Fleet Dashboard

MetricDescription
Node countActive nodes in region
Average QSecBitRegional threat level
Model versionCurrent deployment
Last syncConfiguration freshness

High Availability

Deployment Options

/etc/hookprobe/ha.yaml
ha:
enabled: true
mode: "active-passive"
nodes:
- nexus-primary.internal:7946
- nexus-secondary.internal:7946
vip: 10.0.0.100
failover_timeout: 30

Data Replication

  • ClickHouse replication
  • Configuration sync
  • Model distribution
  • Certificate rotation

GPU Acceleration

Supported GPUs

VendorModels
NVIDIARTX 3060+, A2000+, T4, A100
AMDRX 6700+ (ROCm)

GPU Workloads

TaskSpeedup
Model training10-50x
Inference5-20x
Embedding generation20-100x

Configuration

Terminal window
# Verify GPU detection
hookprobe-ctl gpu status
# Enable GPU for training
hookprobe-ctl ml train --use-gpu

Integration Points

Dashboard

Full web dashboard with:

  • Fleet overview
  • ML training status
  • ClickHouse query interface
  • Model performance metrics
  • Regional threat map

API

Terminal window
# Fleet status
curl http://localhost:8888/api/fleet/status
# ML metrics
curl http://localhost:8888/api/ml/metrics
# ClickHouse query
curl -X POST http://localhost:8888/api/query \
-d '{"sql": "SELECT * FROM qsecbit_history LIMIT 10"}'

Resource Usage

ResourceTypical Usage
RAM8-12GB (16GB+ with training)
CPU40-60% (100% during training)
GPUIdle to 100% (training)
Disk I/OHigh (ClickHouse)
Network100Mbps-1Gbps

Security Considerations

Data Sovereignty

  • All training data stays local
  • Only weight updates shared
  • ClickHouse on encrypted storage
  • Network segmentation supported

Access Control

RolePermissions
AdminFull access
AnalystQuery, view models
OperatorFleet management
ViewerRead-only dashboard

The Nexus Difference

Other security tools: “Trust our cloud AI” Nexus: “Train your own models, see exactly how decisions are made”

  1. Full visibility into ML decisions
  2. Local training - data never leaves
  3. Auditable models - export and verify
  4. Collective benefit - share intelligence, not data

Next Steps

AI Intelligence (HIP)

Nexus runs the full HookProbe Intelligence Platform with deep ML capabilities:

  • Hardware Detection: All 13 NPU types + GPU detection (NVIDIA CUDA/ROCm)
  • Inference Bridge: GPU/NPU-accelerated classification + large local LLM (Phi-3 to Llama-70B)
  • AEGIS-Deep: Advanced agents with local model training and adversarial testing
  • Federated Learning: Aggregation server for collective model updates with differential privacy
  • Red/Purple Teaming: Automated attack simulation and digital twin testing

See Brain & NPU Detection and AEGIS AI Defense for details.