[HoneyBeePF] Official Wiki
This page serves as the comprehensive documentation defining the vision, technical direction, and collaboration methods for HoneyBeePF. It is the official guideline for ensuring seamless collaboration between the team and external contributors.
Project Overview
A lightweight, eBPF-based observability platform for AI workloads.
Install in seconds via Helm Chart (Kubernetes) or standalone binary (bare-metal / VM), with zero code changes required.
What We Do
HoneyBeePF attaches directly to the kernel layer using eBPF to provide two critical capabilities for organizations running LLM workloads:
FinOps: LLM Cost Visibility
Track token consumption across any model (OpenAI, Anthropic, self-hosted) in real time. Know exactly how many tokens each team, service, or request is burning, before the invoice arrives.
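To illustrate the kind of per-team attribution this enables, here is a minimal sketch. The event fields and the per-1K-token prices are hypothetical placeholders, not HoneyBeePF's actual schema or any vendor's real rates:

```python
from collections import defaultdict

# Illustrative USD prices per 1K (input, output) tokens -- placeholder
# values for the sketch, not actual vendor rates.
PRICING = {
    "gpt-4o": (0.0025, 0.010),
    "claude-sonnet": (0.003, 0.015),
}

def attribute_costs(events):
    """Roll captured token-usage events up into per-team cost totals."""
    totals = defaultdict(float)
    for e in events:
        in_rate, out_rate = PRICING[e["model"]]
        totals[e["team"]] += (e["input_tokens"] / 1000 * in_rate
                              + e["output_tokens"] / 1000 * out_rate)
    return dict(totals)

# Example events shaped the way an agent might emit them (hypothetical fields).
events = [
    {"team": "search", "model": "gpt-4o", "input_tokens": 120_000, "output_tokens": 40_000},
    {"team": "support", "model": "claude-sonnet", "input_tokens": 50_000, "output_tokens": 20_000},
]
print(attribute_costs(events))
```

In production the events would stream from the agent rather than sit in a list, but the attribution logic has the same shape.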
Security: File Access Auditing
Monitor which files LLM-powered applications access during inference and fine-tuning. Detect when sensitive or restricted files are touched, enforcing corporate data security policies without modifying application code.
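A minimal sketch of the policy side, assuming a simple prefix-based restricted-path list. The paths below are illustrative examples, not a shipped default:

```python
from pathlib import PurePosixPath

# Assumed corporate policy: path prefixes whose contents LLM-powered
# processes must not read (illustrative values).
RESTRICTED_PREFIXES = ["/etc/secrets", "/data/pii"]

def is_violation(path: str) -> bool:
    """Return True if an accessed path falls under a restricted prefix."""
    p = PurePosixPath(path)
    # PurePath.is_relative_to requires Python 3.9+.
    return any(p.is_relative_to(prefix) for prefix in RESTRICTED_PREFIXES)

def audit(access_events):
    """Yield the subset of file-access events that violate policy."""
    for event in access_events:
        if is_violation(event["path"]):
            yield event
```

The kernel side would capture the file-access events themselves; a prefix check like this one can then run in user space over the event stream.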
Core Values
- Selective Observability: Collect only decision-driving data, not everything.
- Zero Instrumentation: No SDK, no sidecar, no code changes. eBPF does the work at the kernel level.
- Universal Deployment: Helm or binary. Works in Kubernetes clusters and traditional data centers alike.
The Team
Roles and responsibilities of the team members.
| Name | Role | SNS | Responsibilities |
|---|---|---|---|
| μμ€μ° | Team Leader | TBU | Roadmap & Feature Development |
| λ°λ―Όμ§ | Core Dev | TBU | CI/CD & Observability |
| μνμ€ | Core Dev | TBU | Feature Development |
| μ΄λͺ μΌ | Core Dev | TBU | CI/CD & Observability |
Who Needs This
Any organization running LLM workloads that can't answer these two questions:
- "How much is each team/service spending on LLM tokens right now?"
- "What restricted files did our LLM applications access in the last 24 hours?"
| Who | Why They Need It | Trigger Moment |
|---|---|---|
| ML Platform / Infra Teams | LLM API costs growing with no per-team attribution. CFO asks "why did our bill double?" and no one has the breakdown. | Monthly API invoice surprises |
| CISO / Security Teams | Regulators or internal audit require proof of what data LLMs touch. Current logging is application-level and incomplete. | Compliance audit or AI regulation enforcement (AI Basic Act, EU AI Act) |
| SRE / DevOps | Running multi-tenant LLM services; need per-tenant token attribution for billing and capacity planning without adding middleware overhead. | Custom observability middleware breaks on framework updates |
| Executives (CTO/CFO) | Need cost visibility and security posture for LLM adoption decisions. Can't justify scaling AI spend without governance. | Board-level AI budget review |
| Individuals using LLM on work devices | Company policy restricts which files LLM tools can access. No way to automatically detect violations on personal/work laptops. | Employee uses ChatGPT or Copilot with access to restricted company files |
Problem Statement
Problems We Solve
| Problem | Impact | HoneyBeePF Solution |
|---|---|---|
| Per-team and per-project LLM token costs are invisible until the bill arrives | Teams frequently overspend on unused or inefficient model calls with no attribution | Real-time per-service token tracking at the network layer |
| No visibility into which files or directories LLMs access | Sensitive data leaks go undetected; compliance violations follow | Kernel-level file access auditing with policy alerting |
| Existing observability tools require heavy instrumentation | Weeks of integration, code changes, sidecar overhead | Zero-instrumentation eBPF agent, deployed in minutes |
| Agent-based monitoring adds resource overhead | 5-15% CPU/memory tax from traditional agents | eBPF in-kernel processing with < 1% overhead |
Why Now
- Enterprise LLM adoption is accelerating, but cost governance lags behind.
- AI security regulations (Korea AI Basic Act, EU AI Act) are creating compliance requirements around data access transparency.
- Traditional APM tools (Datadog, New Relic) were built for HTTP microservices, not LLM-specific cost and security patterns.
Competitive Analysis
HoneyBeePF vs Other Tools in the Market
Competitors include Helicone, LangSmith, Datadog LLM Observability, and similar platforms.
| | HoneyBeePF | Other Tools |
|---|---|---|
| Install method | `helm install` or single binary drop onto node | SDK wrapper, proxy setup, or agent install with config |
| Time to first insight | < 3 minutes | ~2 hours (code changes + config required) |
| Code changes required | None | Yes, ranging from a 1-line URL swap to full SDK instrumentation |
| Collection layer | Kernel (eBPF) | Application layer (SDK, proxy, or agent) |
| Resource overhead | < 1% CPU/memory | 1-15% depending on tool (agent overhead, proxy latency, SDK weight) |
| File access auditing | Yes (kernel-level) | No or partial |
| Token cost tracking | Yes | Yes |
| Self-hosted option | Yes (fully) | Varies: some open-source, some cloud-only |
| Pricing model | TBU | $15-39 per seat or per host per month, plus usage-based overages |
| Node-level injection | Direct binary drop, no restart needed | Not supported, or requires agent install + restart |
Our Differentiators
- Fastest install in the market: single binary copy to a node or one `helm install` command. No application restarts, no code changes at all, not even a base URL swap. The closest competitor (Helicone) still requires a code-level change per service.
- Lightest footprint: eBPF runs inside the kernel, so there is no sidecar container, no proxy hop, no SDK overhead. Under 1% resource impact vs 5-15% for traditional agents.
- Direct node injection: drop the binary onto any Linux node and it starts collecting immediately. No orchestration dependency, no configuration files, no service mesh required. This is uniquely powerful for bare-metal AI data centers where Kubernetes isn't present.
- Security as a first-class feature: file access auditing at the kernel level is something no competitor offers. This isn't an add-on; replicating the same coverage at the application layer is architecturally impossible.
Use Cases
Note: The scenarios below are illustrative examples based on common patterns in the target market. Specific savings figures are projected estimates, not measured case studies.
Use case 1: API Usage
- Company: Spending $40K/month on OpenAI + Anthropic APIs
- Pain: CEO asked "why did our API bill double last quarter?" and has no per-team or per-feature breakdown
- Current workaround: Manually tagging API calls with team labels
- Needs: Drop-in solution that shows token usage by service/team without changing application code or infrastructure
Day 0: Install HoneyBeePF on staging cluster
→ No application changes, no dev team involvement needed
→ Grafana dashboard shows token flow within 3 minutes
→ Discovers API usage per team and project
→ Presents cost breakdown to CEO
Use case 2: Security/Compliance
- Company: Financial services firm whose employees use personal LLM tools on enterprise assets
- Pain: Regulators require audit trail of what data the LLM accesses.
- Current workaround: Manual review; no automated detection
- Needs: Continuous file access monitoring that satisfies audit requirements
Day 0: Install HoneyBeePF on the employee's laptop
→ No application changes, no dev team involvement needed
→ File access logs start flowing to the security database within minutes
Day 1+: Alert fires: an LLM service accessed a restricted directory or file
→ Incident response team investigates which team, project, or pod made the access
→ Policy violation caught before data left the network
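Tying the alert back to "which pod made the access" can be sketched by parsing the offending process's cgroup path. The kubepods layout assumed below is the common cgroupfs form; real layouts vary by container runtime and kubelet configuration, so the regex is an illustrative assumption:

```python
import re

# Matches the pod UID segment in a kubepods cgroup path, e.g.
# /kubepods/burstable/pod<uid>/<container-id> (assumed cgroupfs-style layout).
CGROUP_POD_RE = re.compile(r"kubepods[^/]*/(?:[^/]+/)*pod([0-9a-f-]+)")

def pod_uid_from_cgroup(cgroup_path):
    """Return the pod UID embedded in a kubepods cgroup path, or None."""
    m = CGROUP_POD_RE.search(cgroup_path)
    return m.group(1) if m else None
```

A non-Kubernetes path (e.g. a systemd user session on a bare-metal node) yields `None`, in which case the event would be attributed by host and process instead.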
| This is a space where knowledge is not merely consumed, but respected, sovereign, and connected, shared together with cloud industry professionals (Bros). |