
Portfolio site.

...
Go for performance. Python for automation. Everything else is there so the infrastructure runs itself.
Everything that can be automated should be automated. Manual operations mean risk.
Logs, metrics, traces. If you can't see a problem, it doesn't exist, right up until everything goes down.
Graceful degradation, circuit breakers, retries. The system must survive even when part of it dies.
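
To make the three resilience ideas above concrete, here is a minimal Python sketch of how they might fit together. All names (`CircuitBreaker`, `retry`, `with_fallback`) and all thresholds are hypothetical and chosen for illustration only; this is not taken from any specific project or library.

```python
import time


class CircuitOpenError(RuntimeError):
    """Raised when the breaker refuses a call without trying the dependency."""


class CircuitBreaker:
    """Opens after `max_failures` consecutive failures; allows a trial call
    once `reset_after` seconds have passed (illustrative defaults)."""

    def __init__(self, max_failures=3, reset_after=30.0, clock=time.monotonic):
        self.max_failures = max_failures
        self.reset_after = reset_after
        self.clock = clock
        self.failures = 0
        self.opened_at = None  # None means the circuit is closed

    def call(self, func, *args, **kwargs):
        if self.opened_at is not None:
            if self.clock() - self.opened_at < self.reset_after:
                raise CircuitOpenError("circuit open, skipping call")
            # Cool-down elapsed: let one trial call through (half-open).
        try:
            result = func(*args, **kwargs)
        except Exception:
            self.failures += 1
            if self.failures >= self.max_failures:
                self.opened_at = self.clock()
            raise
        # A success closes the circuit and resets the failure count.
        self.failures = 0
        self.opened_at = None
        return result


def retry(func, attempts=3, base_delay=0.1, sleep=time.sleep):
    """Call func up to `attempts` times with exponential backoff between tries."""
    for attempt in range(attempts):
        try:
            return func()
        except CircuitOpenError:
            raise  # retrying against an open circuit is pointless
        except Exception:
            if attempt == attempts - 1:
                raise
            sleep(base_delay * 2 ** attempt)


def with_fallback(func, fallback):
    """Graceful degradation: return a fallback value when func fails."""
    try:
        return func()
    except Exception:
        return fallback
```

A caller would typically nest them, for example `with_fallback(lambda: retry(lambda: breaker.call(fetch)), cached_value)`, where `fetch` and `cached_value` stand in for a real dependency call and a stale-but-usable result. The retries absorb short blips, the breaker stops hammering a dependency that is clearly down, and the fallback keeps the caller alive in the meantime.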