> ## Documentation Index
> Fetch the complete documentation index at: https://docs.thoras.ai/llms.txt
> Use this file to discover all available pages before exploring further.

# What is Thoras?

> The autonomous control plane for compute efficiency.

Thoras is a self-hosted, AI/ML-powered platform that replaces reactive
Kubernetes autoscaling with predictive, autonomous resource management. It
eliminates waste, prevents capacity outages, and safely optimizes critical
infrastructure. Thoras is designed to earn your trust before running
autonomously.

<CardGroup cols={3}>
  <Card title="↓ 40%+ waste" icon="chart-line-down">
    Reduction in cloud compute waste from CPU and memory over-provisioning.
  </Card>

  <Card title="↑ Performance" icon="bolt">
    OOM kill loops resolved automatically. Latency spikes prevented through
    rightsized compute.
  </Card>

  <Card title="0 toil" icon="circle-check">
    Continuous right-sizing without manual tuning or on-call escalations.
  </Card>
</CardGroup>

## Philosophy

**Autonomy is earned, not assumed.** Thoras doesn't arrive in your cluster and
start making decisions. It observes, learns, surfaces recommendations, and only
acts autonomously once you've validated its reasoning. Your workload and
infrastructure context are always respected.

**Performance first, efficiency second.** Thoras reclaims waste from
under-utilized CPU and memory without compromising availability. It knows when
not to scale down as clearly as it knows when to scale up.

## How it works

<Steps>
  <Step title="Observe">
    Thoras ingests real-time telemetry from your existing metric sources
    (Prometheus, the Kubernetes metrics server, and any custom or external
    metrics you already use) and persists historical usage data inside your
    cluster. It builds per-workload demand profiles that account for
    seasonality, traffic patterns, and resource trends.
  </Step>

  <Step title="Forecast">
    AI/ML models continuously predict horizontal and vertical scaling needs
    ahead of time. Forecasts update on configurable intervals (typically minutes
    to hours), with model accuracy improving as patterns repeat.
  </Step>

  <Step title="Recommend">
    Before any autonomous action, Thoras surfaces predictions and right-sizing
    recommendations in a dashboard. Teams validate model accuracy first.
    Autonomous mode requires a minimum of 3 hours of historical data, and
    benefits from at least 48 hours for workloads with daily seasonality.
  </Step>

  <Step title="Act">
    Once enabled, Thoras scales pods and replica counts in advance of demand,
    remediates OOM kill loops in real time with compounding memory adjustments,
    and maintains configurable upper and lower bounds at all times. You can
    pause autonomous scaling cluster-wide at any moment.
  </Step>
</Steps>

## Capabilities

<CardGroup cols={2}>
  <Card title="Predictive vertical rightsizing" icon="arrows-up-down" href="/guides/vertical-pod-rightsizing">
    Pre-emptive CPU and memory request optimization. Day-one support for K8s
    1.33+ in-place pod resize, with rolling restart fallback.
  </Card>

  <Card title="Predictive horizontal scaling" icon="arrows-left-right" href="/guides/hpa">
    Replica counts adjusted ahead of demand spikes, not after. Integrates with
    your existing HPA.
  </Card>

  <Card title="OOM remediation" icon="shield-halved" href="/guides/oom-remediation">
    Detects kill loops and stabilizes workloads with compounding memory
    adjustments (1.2× per cycle) until the forecaster catches up.
  </Card>

  <Card title="JVM-aware scaling" icon="mug-saucer" href="/guides/jvm-scaling">
    Purpose-built handling for Java workloads, with heap and GC pressure
    awareness.
  </Card>

  <Card title="Cost visibility and ROI" icon="dollar-sign" href="/guides/cost-estimates">
    Real-time waste quantification against node pricing data. Savings estimated
    per workload.
  </Card>

  <Card title="Fleet policies" icon="layer-group" href="/reference/cluster-aiscale-template">
    `ClusterAIScaleTemplate` applies scaling policies across namespaces without
    per-workload config.
  </Card>
</CardGroup>

## Get started

<CardGroup cols={2}>
  <Card title="Quickstart" icon="rocket" href="/getting-started/quickstart">
    Install Thoras and see resource recommendations in minutes.
  </Card>

  <Card title="Integrations" icon="plug" href="/faq/integrations">
    How Thoras works alongside KEDA, Cluster Autoscaler, and Karpenter.
  </Card>
</CardGroup>
