From 69da0a315ffeac8bcc0191c985b6d5c0ab1cfc65 Mon Sep 17 00:00:00 2001 From: Pulse Agent Date: Wed, 20 May 2026 11:00:57 -0300 Subject: [PATCH] =?UTF-8?q?docs(pulse-docs):=20recovery=20commands=20+=20D?= =?UTF-8?q?OCKER-CHECKLIST=20+=20SESSION-CHECKLIST=20=E2=80=94=20docs=20de?= =?UTF-8?q?=20auto-administracao?= MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit --- checklists/docker/DOCKER-CHECKLIST.md | 8 +++- checklists/session/SESSION-CHECKLIST.md | 12 +++--- runbook/RECOVERY-COMMANDS.md | 53 ++++++++++++++++++------- 3 files changed, 50 insertions(+), 23 deletions(-) diff --git a/checklists/docker/DOCKER-CHECKLIST.md b/checklists/docker/DOCKER-CHECKLIST.md index 2be7073..78092fb 100644 --- a/checklists/docker/DOCKER-CHECKLIST.md +++ b/checklists/docker/DOCKER-CHECKLIST.md @@ -1,5 +1,9 @@ # Docker Checklist — Pulse Agent -- [ ] \`docker ps\` — verificar ~19 containers rodando +## Antes de sessoes de trabalho +- [ ] \`docker ps\` — verificar 19 containers rodando a cada alteracao - [ ] \`docker ps -a -f status=exited --format '{{.Names}}'\` — limpar orfaos -- [ ] \`docker stack ps --no-trunc \` — tasks por stack + +## Quando um servico falhar +- [ ] Identificar stack: \`docker stack ps \` +- [ ] Aplicar recovery: \`docker service update --force _\` diff --git a/checklists/session/SESSION-CHECKLIST.md b/checklists/session/SESSION-CHECKLIST.md index 9fe1775..175cd57 100644 --- a/checklists/session/SESSION-CHECKLIST.md +++ b/checklists/session/SESSION-CHECKLIST.md @@ -1,13 +1,13 @@ # Session Checklist — Pulse Agent Auto-Check -## Inicio +## Início - [ ] Ler MEMORY.md - [ ] Ler SESSION-STATE.md - [ ] Ler LEARNINGS.md | ERRORS.md | PATTERN_COUNTER.md -- [ ] docker ps — servicos -- [ ] df -h — disco -- [ ] uptime — load +- [ ] \`docker ps\` — serviços +- [ ] \`df -h\` — disco +- [ ] \`uptime\` — load ## Fim -- [ ] Atualizar memory/.md -- [ ] Ler .learnings/LEARNINGS.md +- [ ] Atualizar \`memory/.md\` +- [ ] Commit de tudo diff --git a/runbook/RECOVERY-COMMANDS.md b/runbook/RECOVERY-COMMANDS.md index 3a2570f..dce9d32 100644 --- a/runbook/RECOVERY-COMMANDS.md +++ b/runbook/RECOVERY-COMMANDS.md @@ -1,20 +1,43 @@ -# Recovery Commands — Docker Swarm +# Comandos de Recuperação — Docker Swarm -## Emergency -\`\`\`bash -docker node ls # verificar saude do no -docker stack rm && sleep 3 # remover stack problematica +_Alfabeto de comandos para o Pulse usar quando algo quebrar._ + +## Emergency — todos os serviços down + +```bash +docker node ls # verificar saúde do nó +docker stack rm && sleep 3 # remover stack problemática +docker swarm init # só se necessário docker stack deploy -c .yml # re-deploy -\`\`\` +``` -## Servico especifico -\`\`\`bash -docker service ps _ # ver tasks -docker service update --force _ # forc@r nova task -\`\`\` +## Serviço específico — forçar restart -## Health check manual -\`\`\`bash +```bash +docker service ps _ # ver tasks atuais +docker service update --force _ # forçar nova task +``` + +## Limpar containers órfãos + +```bash +docker ps -a -f "status=exited" --format '{{.Names}}' | xargs -r docker rm -f +docker ps -a -f "status=dead" --format '{{.Names}}' | xargs -r docker rm -f +``` + +## Swarm reset (extreme) + +```bash +docker swarm leave --force && docker swarm init --advertise-addr +``` + +## Health check manual de um container + +```bash +# Status geral +docker inspect --format '{{json .State.Health}}' | python3 -m json.tool + +# Com health check definido docker inspect --format '{{.State.Health.Status}}' -# → healthy | unhealthy | starting | -\`\`\` +# → "healthy" | "unhealthy" | "starting" +```