docs(pulse-docs): add Docker Swarm runbook + recovery commands + session & docker checklists + state snapshot

Pulse Agent
2026-05-20 11:00:36 -03:00
parent 25d6b3556d
commit 1ccf3ce727
4 changed files with 75 additions and 0 deletions
+37
@@ -0,0 +1,37 @@
# Docker Swarm Runbook — Pulse Agent
_Updated: 2026-05-20 | Owner: Pulse Agent_
## Services by Stack
| Stack | Services | Status |
|-------|----------|--------|
| bot | beebot | 🟢 |
| code | file (8dcode) | 🟢 |
| database | mongos-master, dbadmin | 🟡 degraded |
| design | penpot (7 containers) | 🟢 |
| dock | portainer, agent | 🟡 |
| git | gitea | 🟢 |
| pro | leantime, leantime-db | 🟡 |
| proxy | caddy (80/443) | 🟢 |
## Quick recovery
```bash
# Detailed status
docker stack ps --no-trunc --no-resolve <stack>
# Force a restart
docker service update --force <stack>_<service>
# Scale up, then back down (forces a new replica)
docker service scale <stack>_<service>=2
docker service scale <stack>_<service>=1
# Remove orphaned (exited) containers
docker ps -a -f 'status=exited' --format '{{.Names}}' | xargs -r docker rm -f
```
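To pick which service needs one of the commands above, it can help to list the services whose running replica count is below the desired count. The following is a minimal sketch, assuming the `<running>/<desired>` shape of the `.Replicas` column printed by `docker service ls`. The name `degraded_services` is a hypothetical helper, not part of the Docker CLI.

```bash
# Hypothetical helper (not a Docker command).
# Reads "<name> <running>/<desired>" lines on stdin and prints the
# names of services that have fewer running tasks than desired.
degraded_services() {
  awk '{
    split($2, r, "/")                      # r[1] = running, r[2] = desired
    if (r[1] + 0 < r[2] + 0) print $1
  }'
}

# Usage (requires a Swarm manager):
#   docker service ls --format '{{.Name}} {{.Replicas}}' | degraded_services
```

Keeping the parsing separate from the `docker` call means the filter can be checked against saved output without touching the cluster.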
## Health check coverage
- 3/19 containers have a health check defined
- TODO: add health checks for bot_office, gitea, pro_leantime-db, and all services in the design stack
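
A coverage figure like the one above could be recomputed on a node with a sketch along these lines. It assumes that a container without a health check has no `Config.Healthcheck` entry in `docker inspect` output; an image that sets `HEALTHCHECK NONE` may still be counted as covered. Both function names are hypothetical helpers.

```bash
# Hypothetical helpers (not Docker commands).

# Prints "<name> <yes|no>" for each running container on this node,
# where yes/no says whether a health check is configured.
list_health() {
  docker ps -q | xargs -r docker inspect \
    --format '{{.Name}} {{if .Config.Healthcheck}}yes{{else}}no{{end}}'
}

# Reads "<name> <yes|no>" lines on stdin and prints "<with>/<total>".
coverage_summary() {
  awk 'NF { total++ } $2 == "yes" { with++ } END { printf "%d/%d\n", with, total }'
}

# Usage:
#   list_health | coverage_summary       # prints <with>/<total>
#   list_health | awk '$2 == "no"'       # containers still missing a check
```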
+20
@@ -0,0 +1,20 @@
# Recovery Commands — Docker Swarm
## Emergency
```bash
docker node ls                              # check node health
docker stack rm <stack> && sleep 3          # remove the problematic stack
docker stack deploy -c <stack>.yml <stack>  # redeploy
```
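
`docker stack rm` returns before the stack's services and networks are fully gone, so a fixed `sleep 3` may be too short on a busy node. As a hedged alternative, the sketch below polls until nothing carrying the stack's `com.docker.stack.namespace` label remains. The function name `wait_stack_removed` and the 60-second default timeout are assumptions for illustration.

```bash
# Hypothetical helper (not a Docker command).
# Polls once per second until no services or networks labelled with the
# given stack namespace remain. Returns 0 when clean, 1 on timeout.
wait_stack_removed() {
  stack="$1"
  timeout="${2:-60}"                       # seconds; assumed default
  filter="label=com.docker.stack.namespace=${stack}"
  waited=0
  while [ "$waited" -lt "$timeout" ]; do
    left="$(docker service ls -q --filter "$filter"
            docker network ls -q --filter "$filter")"
    [ -z "$left" ] && return 0             # nothing left: safe to redeploy
    sleep 1
    waited=$((waited + 1))
  done
  return 1
}

# Usage:
#   docker stack rm <stack> && wait_stack_removed <stack> \
#     && docker stack deploy -c <stack>.yml <stack>
```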
## Specific service
```bash
docker service ps <stack>_<service>              # list tasks
docker service update --force <stack>_<service>  # force a new task
```
## Health check manual
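```bash
# The {{if}} guard avoids a template error on containers without a health check
docker inspect --format '{{if .State.Health}}{{.State.Health.Status}}{{else}}no-healthcheck{{end}}' <container_id>
# → healthy | unhealthy | starting | no-healthcheck
```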