SYRAG™ White Paper
System RAG for Procedures, Regulations, and Compliance
A Paradigm Shift in Institutional Knowledge Management
0. Origins & Evolution
From Super Yacht to SYstem
SYRAG™ was conceived in the complexity of superyacht operations. Managing a 60-meter vessel requires navigating:
- 300+ international maritime regulations (SOLAS, MARPOL, MLC, ISM Code)
- 50+ crew certifications (STCW, medical, specialized training)
- 20+ jurisdictional compliance frameworks (flag state, port state, tax, labor)
- 100+ operational procedures (maintenance schedules, safety drills, environmental protocols)
- Decades of institutional knowledge held by retiring crew and engineers
The Problem: When an experienced chief engineer retires, 30 years of troubleshooting expertise disappears. When regulations change, nobody updates the 500-page operations manual. When an inspector arrives, finding the right certification takes hours.
The Insight: This isn't a maritime problem. It's a universal problem.
- Healthcare: Hospital procedures, patient protocols, medical regulations, staff certifications
- Energy: Safety standards, environmental compliance, equipment maintenance, incident reports
- Finance: Regulatory frameworks, audit trails, policy documentation, training records
- Construction: Building codes, safety procedures, material certifications, inspector requirements
- Manufacturing: ISO procedures, quality controls, equipment manuals, supply chain compliance
Across all of these sectors the architecture stays identical. Only the document corpus changes.
What SYRAG™ Means
Originally: Super Yacht RAG (Retrieval-Augmented Generation system for procedures, regulations, compliance)
Today: SYstem RAG - A universal platform for structured knowledge management across any regulated industry.
The maritime heritage is our strength: if we can handle the complexity of international vessel operations, we can handle anything.
1. Executive Summary
The Problem
Organizations face a persistent crisis in institutional knowledge management:
- Expertise Erosion: Senior experts retire, taking decades of knowledge with them
- Regulatory Lag: Regulations evolve faster than documentation can be updated
- Compliance Complexity: Multi-jurisdictional frameworks create impossible-to-navigate webs
- Fragmented Knowledge: Critical information scattered across emails, PDFs, databases, and individual memories
- Audit Anxiety: Finding the right document during inspection is emergency archaeology
Traditional solutions fail:
- Cloud SaaS: Sensitive compliance data leaves your control, vendor lock-in, privacy concerns
- Offline Systems: Limited intelligence, no updates, manual maintenance burden
- Consultants: Expensive, temporary, don't scale, knowledge walks out the door again
The SYRAG™ Paradigm Shift
SYRAG™ is an AI consulting service for procedures, regulations, and compliance that operates 24/7 on-premises at your facility. The system combines:
- Local Intelligence: LLM running entirely offline for zero data leakage
- Hybrid Flexibility: Optional connectivity for regulatory updates (with your approval)
- Trustless Architecture: Blockchain-anchored audit trails, reproducible builds, hardware attestation
- Future-Proof Design: Upgradable hardware, swappable LLMs, no vendor lock-in
- Trained on Your Data: System learns from your organization's specific documentation and procedures
"We don't ask you to trust us. We give you the tools to verify us."
Service Model: SYRAG™ is not sold as a product. It's a comprehensive AI consulting service where hardware, software, maintenance, updates, and support are bundled into a transparent subscription.
2. Technical Architecture
Hardware: 100% AMD Open Stack
Why AMD? AMD's ROCm (Radeon Open Compute) stack is open source, whereas NVIDIA's CUDA is proprietary. The result is full transparency, no vendor lock-in, and native Linux support.
Three Configurations Available:
BASE Configuration:
- AMD Ryzen 7 (8C/16T, 3.4-4.6 GHz)
- AMD Radeon RX 6600 (8GB VRAM, RDNA2)
- 32GB DDR5 RAM
- 1TB NVMe SSD + 2TB HDD
- Target: 10,000 documents, 30-50 tokens/sec, 8K context
ADVANCED Configuration:
- AMD Ryzen 9 (12C/24T, 3.7-4.8 GHz)
- AMD Radeon RX 6700 XT (12GB VRAM, RDNA2)
- 64GB DDR5 RAM
- 2TB NVMe SSD + 4TB HDD
- Target: 50,000 documents, 60-80 tokens/sec, 16K context
PROFESSIONAL Configuration:
- AMD Ryzen 9 (16C/32T, 4.5-5.7 GHz, Zen 4)
- AMD Radeon RX 7700 XT (12GB VRAM, RDNA3)
- 128GB DDR5 RAM
- 4TB NVMe Gen4 SSD + 8TB HDD
- Target: 100,000+ documents, 100+ tokens/sec, 32K context
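The three targets above can be read as a simple sizing table. The following is a minimal sketch of such a lookup, not SYRAG™'s actual provisioning logic: the `Tier` class and `pick_tier` function are hypothetical names, "8K/16K/32K" is assumed to mean 8,192/16,384/32,768 tokens, and "100,000+" is treated as having no stated upper bound.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Tier:
    name: str
    max_documents: Optional[int]  # None = no stated upper bound ("100,000+")
    context_tokens: int           # assumption: "8K" means 8,192 tokens, etc.


# Targets copied from the three configurations listed above.
TIERS = [
    Tier("BASE", 10_000, 8_192),
    Tier("ADVANCED", 50_000, 16_384),
    Tier("PROFESSIONAL", None, 32_768),
]


def pick_tier(corpus_size: int, needed_context: int = 0) -> Tier:
    """Return the smallest listed tier whose targets cover the request."""
    for tier in TIERS:
        fits_docs = tier.max_documents is None or corpus_size <= tier.max_documents
        if fits_docs and needed_context <= tier.context_tokens:
            return tier
    raise ValueError("no listed configuration meets these requirements")


print(pick_tier(25_000).name)
```

A corpus of 25,000 documents exceeds the BASE target and falls within the ADVANCED one, so the lookup returns ADVANCED.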
For detailed pricing and service plans, please contact us directly.
Software Stack
- OS: Linux (hardened, minimal attack surface)
- LLM: Multiple choices (online and local)
- Embeddings: Multiple options
- Vector DB: ChromaDB (local SQLite persistence)
- Orchestration: Docker Compose (6 containers: Prometheus, Grafana, AlertManager, exporters)
- Updates: GPG-signed OTA updates, hash-chained blockchain log
3. Core Components
Interface Layer
- PyQt5 desktop application (cross-platform: Linux, Windows)
- Natural language queries: "What certifications expire this month?"
- Document ingestion: drag-and-drop PDFs, Word docs, emails, scanned images
- Multi-language output: English, Dutch, Italian, Spanish, French, German
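A question such as "What certifications expire this month?" ultimately has to resolve to a structured lookup. The sketch below shows one way that could look against a SQLite metadata index. The `certifications` table, its columns, and the sample rows are invented for illustration and are not SYRAG™'s actual schema or data.

```python
import sqlite3
from datetime import date


def expiring_this_month(conn: sqlite3.Connection, today: date):
    """Return (holder, certificate, expires_on) rows expiring in today's month."""
    first = today.replace(day=1)
    # First day of the following month, used as an exclusive upper bound.
    if first.month == 12:
        next_first = first.replace(year=first.year + 1, month=1)
    else:
        next_first = first.replace(month=first.month + 1)
    # ISO-8601 date strings sort lexicographically, so plain comparison works.
    cursor = conn.execute(
        "SELECT holder, certificate, expires_on FROM certifications "
        "WHERE expires_on >= ? AND expires_on < ? ORDER BY expires_on",
        (first.isoformat(), next_first.isoformat()),
    )
    return cursor.fetchall()


# Hypothetical sample data, for demonstration only.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE certifications (holder TEXT, certificate TEXT, expires_on TEXT)"
)
conn.executemany(
    "INSERT INTO certifications VALUES (?, ?, ?)",
    [
        ("Chief Engineer", "STCW", "2026-03-14"),
        ("Deckhand", "Medical", "2026-04-02"),
        ("Captain", "STCW", "2026-03-30"),
    ],
)

for row in expiring_this_month(conn, date(2026, 3, 10)):
    print(row)
```

In a full pipeline the LLM would only phrase the answer; the dates themselves would come from the index, which keeps the result auditable.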
RAG Engine
- Document Processing: PDF parsing, OCR, metadata extraction
- Embedding Generation: sentence-transformers (multiple models supported)
- Vector Storage: ChromaDB (local) + optional Pinecone (cloud hybrid)
- Metadata Indexing: SQLite for structured search (dates, categories, certifications)
- Semantic Search: Cosine similarity ranking, hybrid search (keyword + semantic)
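To make the hybrid search idea concrete, here is a minimal sketch that blends cosine similarity with a keyword-overlap score. The white paper does not say how the two signals are combined, so the linear blend, the `alpha` weight, the whitespace tokenization, and the toy three-dimensional vectors (stand-ins for real embeddings) are all assumptions.

```python
import math
from typing import List, Sequence, Tuple


def cosine(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity; returns 0.0 if either vector has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def keyword_score(query: str, text: str) -> float:
    """Fraction of distinct query terms that also appear in the text."""
    query_terms = set(query.lower().split())
    text_terms = set(text.lower().split())
    return len(query_terms & text_terms) / len(query_terms) if query_terms else 0.0


def hybrid_rank(
    query: str,
    query_vec: Sequence[float],
    docs: List[Tuple[str, str, Sequence[float]]],
    alpha: float = 0.7,  # assumed weight of the semantic score
) -> List[Tuple[str, float]]:
    """Rank (doc_id, text, vector) triples by a blended score, best first."""
    scored = [
        (doc_id, alpha * cosine(query_vec, vec) + (1 - alpha) * keyword_score(query, text))
        for doc_id, text, vec in docs
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Toy corpus with hand-made vectors, for illustration only.
docs = [
    ("drill-01", "fire drill procedure", [1.0, 0.0, 0.0]),
    ("maint-07", "engine maintenance schedule", [0.0, 1.0, 0.0]),
]
ranked = hybrid_rank("fire drill", [0.9, 0.1, 0.0], docs)
print(ranked[0][0])
```

The keyword term helps with exact identifiers, such as regulation numbers, that embeddings alone tend to blur.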
LLM Inference
- Local Models: Multiple choices, ROCm acceleration
- Cloud Models (Hybrid/Online): Multiple choices
- Model Selection: Manual
- Context Window: 4k-128k tokens depending on model
Trust & Traceability Layer
- OpenTimestamps: Blockchain anchoring for document integrity proofs
- IPFS/Pinata: Distributed storage for hashes and metadata (optional)
- PGP Identity: Device-specific key pair generated at first boot
- Audit Registry: Immutable log of all operations (local SQLite + blockchain)
- Smart Contract: Device registration on OTS, Bitcoin, Ethereum/Polygon (public verification)
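Timestamp anchoring operates on digests rather than on the documents themselves, which is what lets integrity proofs coexist with on-premises data. A minimal sketch of the digest step follows. It assumes SHA-256 and a hypothetical helper name `sha256_stream`, and it leaves out submitting the digest to a timestamping service.

```python
import hashlib
import io
from typing import BinaryIO


def sha256_stream(stream: BinaryIO, chunk_size: int = 1 << 20) -> str:
    """Hash a binary stream in chunks so large files never sit fully in memory."""
    digest = hashlib.sha256()
    for chunk in iter(lambda: stream.read(chunk_size), b""):
        digest.update(chunk)
    return digest.hexdigest()


# In practice the stream would be open(path, "rb"); BytesIO keeps the demo
# self-contained.
fingerprint = sha256_stream(io.BytesIO(b"example manual contents"))
print(fingerprint)
```

Only the 64-character hexadecimal fingerprint would need to be anchored; the document content stays on the device.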
Security Hardening
- On-premises deployment (no data leaves unless authorized)
- Local Key Management System (secrets encrypted at rest)
- Full-disk encryption (LUKS with TPM-sealed keys)
- Immutable logging of mode transitions (offline→hybrid tracked)
- Network isolation in Offline mode (hardware-enforced via TPM attestation)
Update Pipeline
- Signed updates (GPG detached signatures verified automatically)
- Hash-chained log (every update recorded on blockchain)
- Safe rollback (previous versions preserved, one-command revert)
- Public changelog (GitHub releases + transparency dashboard)
- Multi-party verification (community can rebuild binaries)
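The hash-chained log mentioned above can be sketched in a few lines. This is an illustrative sketch under stated assumptions rather than the shipped format: the entry layout, the all-zero genesis value, and the function names are hypothetical, and external anchoring of the latest hash is omitted.

```python
import hashlib
import json

GENESIS = "0" * 64  # assumed placeholder for "no previous entry"


def entry_hash(prev_hash: str, payload: dict) -> str:
    """Hash the previous entry's hash together with a canonical JSON payload."""
    body = json.dumps(payload, sort_keys=True, separators=(",", ":"))
    return hashlib.sha256((prev_hash + body).encode("utf-8")).hexdigest()


def append(log: list, payload: dict) -> None:
    """Append an entry that commits to everything recorded before it."""
    prev = log[-1]["hash"] if log else GENESIS
    log.append({"prev": prev, "payload": payload, "hash": entry_hash(prev, payload)})


def verify(log: list) -> bool:
    """Recompute every link; any edited or reordered entry breaks the chain."""
    prev = GENESIS
    for entry in log:
        if entry["prev"] != prev or entry["hash"] != entry_hash(prev, entry["payload"]):
            return False
        prev = entry["hash"]
    return True


log = []
append(log, {"update": "1.0.0", "artifact_sha256": "aa" * 32})
append(log, {"update": "1.0.1", "artifact_sha256": "bb" * 32})
print(verify(log))
```

Because each entry's hash covers its predecessor's hash, publishing or anchoring only the most recent hash is enough to commit to the whole history.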
Monitoring Stack (Production Ready)
Complete monitoring and compliance system based on Prometheus, AlertManager, Grafana and custom exporters:
- Prometheus 2.45.3: Time-series database with 15-day retention, 15s scrape interval
- AlertManager 0.26.0: Alert routing with email notifications (Yandex SMTP) and Slack, automatic escalation
- Grafana 12.3.1: Real-time dashboard with metric visualization, alert status, SLA compliance tracking
- Compliance Exporter: Custom exporter for SYRAG™-specific compliance metrics
- Node Exporter 1.7.0: System resource monitoring (CPU, RAM, disk, network)
- Blackbox Exporter: External endpoint monitoring (Google, GitHub, DNS providers) with 8 active probes
- Docker Orchestration: 6 containers managed with docker-compose, health checks, automatic restart, named volumes for data persistence
- SLA Target: 99.5% availability for critical services (≤216 min, about 3.6 hours, of downtime per 30-day month)
- Alert Rules: 12 active rules (5 recording rules + 7 alert rules) for SLA violations, performance, external monitoring
- CI/CD Pipeline: GitHub Actions with automated Docker image builds, config validation (Hadolint, YAML parser), multi-registry push (Docker Hub + GitHub Packages)
- Production Validation: 6 verified end-to-end tests (alert delivery, metrics flow, container recovery, data persistence, blackbox probes, load testing)
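An availability target translates into a downtime budget by simple arithmetic, which is worth checking whenever an SLA figure is quoted. The helper below is a small sketch; the function name and the 30-day month are assumptions.

```python
def downtime_budget_minutes(availability_pct: float, days: int = 30) -> float:
    """Minutes of allowed downtime for a given availability over `days` days."""
    total_minutes = days * 24 * 60
    return total_minutes * (100 - availability_pct) / 100


# Compare two common targets over a 30-day month.
for target in (99.5, 99.95):
    print(f"{target}% -> {downtime_budget_minutes(target):.1f} min")
```

Over 30 days (43,200 minutes), 99.5% leaves about 216 minutes of allowed downtime, while 99.95% leaves about 21.6 minutes.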
4. Operating Modes
- Offline: No external connectivity; ingestion and updates via verified channels.
- Hybrid: Limited connectivity for regulatory and LLM updates, authorized under the four-eyes principle (sign-off by two people); sensitive data remains on-premises.
- Online: Enabled only with explicit approval and with complete logging; the goal is maximum intelligence while preserving data governance.
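The three modes and their approval requirements can be modeled as a small state machine. The sketch below is one possible reading, not the actual implementation: it assumes that any move to a more connected mode needs two distinct approvers, that returning to Offline needs one, and that every attempt is logged, including refused ones. `Mode`, `ModeController`, and the log fields are hypothetical names.

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone
from enum import Enum
from typing import Iterable


class Mode(Enum):
    OFFLINE = "offline"
    HYBRID = "hybrid"
    ONLINE = "online"


@dataclass
class ModeController:
    mode: Mode = Mode.OFFLINE
    log: list = field(default_factory=list)

    def switch(self, target: Mode, approvers: Iterable[str]) -> bool:
        """Attempt a mode change; record the attempt whether or not it succeeds."""
        distinct = sorted(set(approvers))
        # Assumption: two distinct approvers for HYBRID/ONLINE, one for OFFLINE.
        needed = 1 if target is Mode.OFFLINE else 2
        allowed = len(distinct) >= needed
        self.log.append(
            {
                "at": datetime.now(timezone.utc).isoformat(),
                "from": self.mode.value,
                "to": target.value,
                "approvers": distinct,
                "allowed": allowed,
            }
        )
        if allowed:
            self.mode = target
        return allowed


controller = ModeController()
controller.switch(Mode.HYBRID, ["alice"])          # refused: one approver
controller.switch(Mode.HYBRID, ["alice", "bob"])   # accepted: two approvers
print(controller.mode.value, len(controller.log))
```

Logging refused attempts alongside successful ones is a deliberate choice here, since an audit trail is most useful when it also shows what was tried and denied.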
5. Security and Compliance
- GDPR: Client as Controller, SYRAG™ as Processor; dedicated DPA.
- Trustless Infrastructure: No traditional certifications. Instead: cryptographic proofs, blockchain anchoring, public audit logs, reproducible builds, hardware attestation. Don't trust us. Verify us.
- AI Transparency: Pre-certification disclaimers, human-in-the-loop.
6. Trust & Traceability
- Temporal anchoring on blockchain (OTS) for integrity proofs.
- Storage of hashes and metadata in immutable registries.
- Optional IPFS for referenced non-sensitive content.
- Device PGP identity automatically created at first boot.
- LLM update pipeline with GPG detached signature verification:
  - Hash/signature registration of every update in hash-chained log
  - Automatic GPG signature verification when present
  - Import of maintainer public keys into device keyring
  - Complete audit trail of every model modification
  - Verified and traced rollback
- Integrated compliance audit with update signature integrity checks.
7. Business Model – Founder Partners
SYRAG™ is not a product. It's a service.
Think of SYRAG™ as an AI consultant specialized in procedures, regulations, and compliance that:
- Operates 24/7 on-premises at your facility
- Is trained on your organization's specific documentation
- Provides instant answers to compliance questions
- Maintains complete data sovereignty and confidentiality
- Evolves with your needs through continuous updates
The device is not for sale. We provide an integrated AI consulting service where hardware, software, maintenance, updates, and support are bundled into a single monthly subscription.
Founder Partners Program:
- 9 pioneer organizations selected to validate system flexibility across sectors
- Co-development opportunity: Your feedback shapes the product roadmap
- Deadline: Applications open until March 1, 2026
Become a Founder Partner
Join the pioneers shaping the future of institutional knowledge management.
Contact: founders@syrag.net
Website: www.syrag.net
8. Open Source Strategy (Under Evaluation)
SYRAG™ is currently proprietary software while we evaluate legal and commercial implications of open sourcing.
Why Not Immediately Open Source
- Legal review required: Intellectual property, trademark, liability considerations
- Commercial viability assessment: Balancing transparency with sustainable business model
- Security implications: Device management and operational security need careful isolation
- Community readiness: Infrastructure for contributions (CLA, governance, security disclosure) must be robust
Future Open Source Vision (pending legal clearance)
- Open-core model: Core RAG components and SDK potentially public; device management remains proprietary
- Possible licenses: AGPL-3.0 (core protection), Apache-2.0 (SDK adoption), commercial dual-licensing
- Trademark policy: "SYRAG™" mark usage to be governed by dedicated policy
- Contributor framework: CLA (Contributor License Agreement) for orderly intake of contributions
Timeline
Open source strategy decision expected Q3-Q4 2026 after legal review and Founder Partners feedback.
Current Transparency Commitments
- Public White Paper (this document)
- Blockchain device registry (verifiable operations)
- Reproducible builds (binary verification)
- Public transparency dashboard
9. Governance & Accountability
Internal Governance (current)
- Founders review and approve all architectural decisions
- Security disclosures: Confidential channel at founders@syrag.net, with a 48-hour response time
- Quarterly White Paper updates with Founder Partners input
- Transparent roadmap published in this document
Future Community Governance (if open sourced)
- Maintainer review process, code of conduct
- Public issue tracker and roadmap
- CVE policy for security vulnerabilities
- Community advisory board from Founder Partners
10. Call to Action
Applications open until March 1, 2026. Contact: founders@syrag.net. Website: www.syrag.net.