๐Ÿ“ [Wiki] Dr.Kube โ†’ AI-Powered Self-Diagnostic Agent for Kubernetes

:bookmark_tabs: [Dr.Kube] Official Wiki

KR: ์ด ํŽ˜์ด์ง€๋Š” Dr.Kube์˜ ๋น„์ „, ๊ธฐ์ˆ ์  ๋ฐฉํ–ฅ์„ฑ, ๊ทธ๋ฆฌ๊ณ  ํ˜‘์—… ๋ฐฉ์‹์„ ์ •์˜ํ•˜๋Š” ํ†ตํ•ฉ ๋ฌธ์„œ์ž…๋‹ˆ๋‹ค. ํŒ€์›๊ณผ ์™ธ๋ถ€ ๊ธฐ์—ฌ์ž๋“ค์ด ์กฐํ™”๋กญ๊ฒŒ ํ˜‘์—…ํ•  ์ˆ˜ ์žˆ๋„๋ก ๋•๋Š” ๊ณต์‹ ๊ฐ€์ด๋“œ๋ผ์ธ์ž…๋‹ˆ๋‹ค.

EN: This page serves as the comprehensive documentation defining the vision, technical direction, and collaboration methods for Dr.Kube. It is an official guideline to ensure seamless collaboration between the team and external contributors.

1. ํ”„๋กœ์ ํŠธ ๊ฐœ์š” (Project Overview)

  • Purpose: ํด๋Ÿฌ์Šคํ„ฐ์˜ '๋‹ค์ž‰ ๋ฉ”์‹œ์ง€โ€™์„ ๋ถ„์„ํ•ด ์Šค์Šค๋กœ ์žฅ์• ๋ฅผ ์ง„๋‹จํ•˜๊ณ  ํ•ด๊ฒฐ ๊ฐ€์ด๋“œ๊นŒ์ง€ ์ œ๊ณตํ•˜๋Š” ์ง€๋Šฅํ˜• AI ์—์ด์ „ํŠธ ; An autonomous AI agent that analyzes โ€˜dying messagesโ€™ from failing clusters to diagnose root causes and deliver actionable remediation guidelines.

  • Background / Introduction (KR):

    • Kubernetes์˜ ๊ณ ์งˆ์ ์ธ ๋ฌธ์ œ์ธ ์žฅ์•  ๋ฐœ์ƒ ์‹œ MTTR(ํ‰๊ท  ๋ณต๊ตฌ ์‹œ๊ฐ„)์„ ๋‹จ์ถ•ํ•˜๋Š” ๊ฒƒ์„ ๋ชฉํ‘œ๋กœ ํ•ฉ๋‹ˆ๋‹ค.
    • LangGraph ๊ธฐ๋ฐ˜ ์ถ”๋ก  ๋ฃจํ”„๋ฅผ ํ†ตํ•ด ๊ทผ๋ณธ ์›์ธ์„ ํƒ์ƒ‰ํ•˜๊ณ , Slack ๋“ฑ ๋ฉ”์‹ ์ €๋กœ ์šด์˜์ž์—๊ฒŒ ์ฆ‰๊ฐ์ ์ธ ์กฐ์น˜ ๊ฐ€์ด๋“œ๋ผ์ธ์„ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.
    • ์šด์˜์ž์˜ ์Šน์ธ ํ•˜์— ๋ช…๋ น์„ ์ˆ˜ํ–‰ํ•˜์—ฌ ์•ˆ์ „์„ฑ์„ ํ™•๋ณดํ•˜๋Š” Safe AI Ops ์ƒํƒœ๊ณ„๋ฅผ ๋งŒ๋“ญ๋‹ˆ๋‹ค.
  • Background / Introduction (EN):

    • Our objective is to minimize Mean Time to Recovery (MTTR), a persistent challenge in Kubernetes environments during system failures. By leveraging LangGraph-based reasoning loops, the system identifies root causes and delivers immediate remediation guidelines to operators via messaging platforms like Slack. We are building a โ€˜Safe AIOpsโ€™ ecosystem that ensures operational integrity by executing commands only upon explicit human-in-the-loop approval.
  • Core Values:

    • e.g., Knowledge Sovereignty (์ง€์‹ ์ฃผ๊ถŒ), Open Source (์˜คํ”ˆ ์†Œ์Šค ์ •์‹ ), Innovation (๊ธฐ์ˆ  ํ˜์‹ )

2. ํŒ€ ๊ตฌ์„ฑ (The Team)

Roles and responsibilities for the member team.

์ด๋ฆ„ (Name) ID ์—ญํ•  (Role) SNS ์ฃผ์š” ์ฑ…์ž„ (Responsibilities - KR/EN)
ํŒ€์› A @id_a Team Leader Link ๋กœ๋“œ๋งต ๋ฐ ์ตœ์ข… ์˜์‚ฌ๊ฒฐ์ • / Roadmap & Final decision-making
ํŒ€์› B @id_b Tech Lead Link ์•„ํ‚คํ…์ฒ˜ ๋ฐ ์ฝ”๋“œ ๋ฆฌ๋ทฐ / Architecture & Code reviews
ํŒ€์› C @id_c Core Dev Link ํ•ต์‹ฌ ๊ธฐ๋Šฅ ๊ตฌํ˜„ / Core logic & API implementation
ํŒ€์› D @id_d DevOps Link ์ธํ”„๋ผ ๋ฐ CI/CD ๊ด€๋ฆฌ / Infrastructure & CI/CD management
ํŒ€์› E @id_e Writer Link ๋ฌธ์„œํ™” ๋ฐ ์ปค๋ฎค๋‹ˆํ‹ฐ ๊ด€๋ฆฌ / Documentation & Community

3. ๊ธฐ์ˆ  ์Šคํƒ (Tech Stack)

  • Language: (e.g., TypeScript, Go, Python)
  • Infra: (e.g., Kubernetes, Docker, Terraform)
  • Communication: Discord, GitHub Issues

4. ๋กœ๋“œ๋งต (Roadmap)

  • Phase 1: MVP ์š”๊ตฌ์‚ฌํ•ญ ์ •์˜ (MVP Requirement Definition)
  • Phase 2: ํ•ต์‹ฌ ๋ชจ๋“ˆ ๊ฐœ๋ฐœ ๋ฐ ์•ŒํŒŒ ํ…Œ์ŠคํŠธ (Core Module Dev & Alpha Test)
  • Phase 3: ๊ธ€๋กœ๋ฒŒ ์ปค๋ฎค๋‹ˆํ‹ฐ ๊ณต๊ฐœ (Global Community Launch)

5. ์ฐธ์—ฌ ๋ฐฉ๋ฒ• (How to Contribute)

  • Issues: ๋ฒ„๊ทธ๋‚˜ ๊ธฐ๋Šฅ ์ œ์•ˆ์€ GitHub Issues๋ฅผ ํ™œ์šฉํ•˜์„ธ์š”. (Please use GitHub Issues for bug reports or feature requests.)
  • PRs: ๋ชจ๋“  Pull Request๋Š” Tech Lead์˜ ๊ฒ€ํ†  ํ›„ ๋ณ‘ํ•ฉ๋ฉ๋‹ˆ๋‹ค. (All PRs will be merged after review by the Tech Lead.)
  • Guide: [CONTRIBUTING.md] ํŒŒ์ผ์„ ์ฐธ๊ณ ํ•˜์„ธ์š”. (Please refer to the [CONTRIBUTING.md] file.)
  • Discord (Official): [Dr.Kube Invite Link]
    • KR: ์‹ค์‹œ๊ฐ„ ์†Œํ†ต ๋ฐ ๊ธฐ์ˆ  ์ง€์›์„ ์œ„ํ•œ ์ฑ„๋„์ž…๋‹ˆ๋‹ค.
    • EN: Official channel for real-time communication and technical support.

6. ๋ฆฌ์†Œ์Šค ๋ฐ ๋งํฌ (Resources & Links)

  • GitHub Repository: [Link]
  • Docs: [Architecture / API Specs]

| This is a space where knowledge is not merely consumed, but respected, sovereign, and connectedโ€”shared together with cloud industry professionals (Bros).|
| ์ง€์‹์ด ์†Œ๋น„๋˜์ง€ ์•Š๊ณ  ์กด์ค‘ยท์ฃผ๊ถŒ๋ณด์žฅยท์—ฐ๊ฒฐ๋˜๋Š” ๊ณต๊ฐ„์œผ๋กœ ํด๋ผ์šฐ๋“œ ํ˜„์—… ์ „๋ฌธ๊ฐ€(Bro)์™€ ํ•จ๊ป˜ ๊ณต์œ ํ•˜๊ณ  ์žˆ์Šต๋‹ˆ๋‹ค. |