Ensure Thoras installation is production-ready - run through the
production readiness checklist to
ensure optimal platform and model performance.
Check predictions in the dashboard - confirm metric suggestions for your
workload are accurate. The Thoras reasoning engine typically needs to see a
pattern twice to learn about it; so model performance can take between a few
minutes to a few days to reach maximum accuracy.
Understand application behavior - for vertical scaling, ensure the
workload and its dependencies can handle periodic restarts. See the
vertical pod rightsizing and
aiscaletarget pages for an overview of scaling
and restart configuration options.
Review pod resource limits - see
vertical scaling config breakdown
for details on how to factor in pod resource limits to your vertical scaling
behavior.
Start small - test on a non-critical workload first.
Ensure observability and alerts are in place - have monitoring dashboards
and alerts ready so you can spot unexpected performance impacts quickly.