betelgeusebytes/STACK.md

6.1 KiB
Raw Blame History

🧠 BetelgeuseBytes Full Stack Catalog

This document lists every major component deployed in the cluster, what it is used for today, and what it can be reused for.


Core Platform

Component Namespace Purpose Reuse
Kubernetes all Orchestration Any platform
Cilium kube-system Networking Secure clusters
NGINX Ingress ingress-nginx Traffic routing API gateway
cert-manager cert-manager TLS automation PKI

Databases & Messaging

Component URL / Access Purpose Reuse
PostgreSQL TCP via Ingress Relational DB App backends
Redis internal Cache Queues
Kafka kafka-ui UI Event streaming Streaming ETL
Elasticsearch Kibana UI Search + logs Fulltext search

Knowledge & Vector

Component URL Purpose Reuse
Neo4j neo4j.betelgeusebytes.io Knowledge graph Graph analytics
Qdrant vector.betelgeusebytes.io Vector search RAG

ML & AI

Component URL Purpose Reuse
Jupyter notebook UI Experiments Research
Label Studio label.betelgeusebytes.io Annotation Dataset creation
MLflow mlflow.betelgeusebytes.io Model tracking MLOps
Ollama / LLM llm.betelgeusebytes.io LLM inference Agents
Embeddings embeddings.betelgeusebytes.io Text embeddings Semantic search

Automation & DevOps

Component URL Purpose Reuse
Argo Workflows argo.betelgeusebytes.io Pipelines ETL
Argo CD argocd UI GitOps CI/CD
Gitea gitea UI Git hosting SCM
n8n automation UI Workflow automation Integration

Observability (LGTM)

Component Purpose Reuse
Grafana Dashboards Ops center
Prometheus Metrics Monitoring
Loki Logs Debugging
Tempo Traces Distributed tracing
Alloy Telemetry agent Standardized telemetry

Authentication

Component Purpose Reuse
Keycloak OIDC / SSO IAM

Why This Stack Matters

  • Covers data → ML → serving → observability endtoend
  • Suitable for research and production
  • Modular and futureproof

📚 Stack Catalog — Services, URLs, Access & Usage

This document lists every deployed component, how to access it, what it is used for now, and what it enables in the future.


🌐 Public Services (Ingress / HTTPS)

Component URL Auth What It Is Current Usage Future Usage
LLM Inference https://llm.betelgeusebytes.io none / internal CPU LLM server (Ollama / llama.cpp) Extract sanad & matn as JSON Agents, doc AI, RAG
Embeddings https://embeddings.betelgeusebytes.io none / internal Text Embeddings Inference (HF) Hadith & bio embeddings Semantic search
Vector DB https://vector.betelgeusebytes.io none Qdrant + UI Similarity search Recommendations
Graph DB https://neo4j.betelgeusebytes.io Basic Auth Neo4j Browser Isnād graph Knowledge graphs
Orchestrator https://hadith-api.betelgeusebytes.io OIDC FastAPI router Core AI API Any AI backend
Admin UI https://hadith-admin.betelgeusebytes.io OIDC Next.js UI Scholar review Any internal tool
Labeling https://label.betelgeusebytes.io Local / OIDC Label Studio NER/RE annotation Dataset curation
ML Tracking https://mlflow.betelgeusebytes.io OIDC MLflow UI Experiments & models Governance
Object Storage https://minio.betelgeusebytes.io Access key MinIO Console Datasets & artifacts Data lake
Pipelines https://argo.betelgeusebytes.io SA / OIDC Argo Workflows UI ML pipelines ETL
Auth https://auth.betelgeusebytes.io Admin login Keycloak SSO & tokens IAM
Observability https://grafana.betelgeusebytes.io Login Grafana Metrics/logs/traces Ops center

🔐 Authentication & Access Summary

System Auth Method Who Uses It
Keycloak Username / Password Admins
Admin UI OIDC (Keycloak) Scholars
Orchestrator API OIDC Bearer Token Apps
MLflow OIDC ML engineers
Label Studio Local / OIDC Annotators
Neo4j Basic Auth Engineers
MinIO Access / Secret key Pipelines
Grafana Login Operators

🧠 Internal Cluster Services (ClusterIP)

Component Namespace Purpose
PostgreSQL db Relational storage
Redis db Cache / temp state
Kafka broker Event backbone
Prometheus observability Metrics
Loki observability Logs
Tempo observability Traces
Alloy observability Telemetry agent

🗂 Storage Responsibilities

Storage Used By Contains
MinIO Pipelines, MLflow Datasets, models
Neo4j PVC Graph DB Isnād graph
Qdrant PVC Vector DB Embeddings
PostgreSQL PVC DB Metadata
Observability PVCs LGTM Logs, metrics, traces