Won's Blog

공부 및 실험 공유

A.X K1 Technical Report

A.X K1 논문 리뷰 — 519B MoE 모델의 아키텍처, 데이터 파이프라인, Think-Fusion 학습 전략

9 min read · 2026

TelAgentBench: A Multi-faceted Benchmark for Evaluating LLM-based Agents in Telecommunications

TelAgentBench 논문 리뷰 - 통신 도메인 LLM 에이전트의 5가지 핵심 역량 평가 벤치마크

24 min read · 2026

TelBench: A Benchmark for Evaluating Telco-Specific Large Language Models

TelBench 논문 리뷰 — 통신 도메인 특화 LLM 벤치마크의 설계, 구축, 평가

22 min read · 2026

FlashAttention-4: Algorithm and Kernel Pipelining Co-Design for Asymmetric Hardware Scaling

FlashAttention-4 논문 리뷰 — Blackwell GPU의 비대칭 스케일링에 맞춘 파이프라인 재설계와 소프트웨어 지수함수

11 min read · 2026

FlashAttention-3: Fast and Accurate Attention with Asynchrony and Low-precision

FlashAttention-3 논문 리뷰 — Hopper GPU의 비동기 실행과 FP8을 활용한 Attention 최적화

19 min read · 2026

Triton 07: Flash Attention 3 — Triton으로 어디까지 가능한가

Hopper 전용인 Flash Attention 3를 Triton으로 어디까지 따라잡을 수 있는가 — 확장 autotune·persistent kernel·실패한 실험까지

10 min read · 2026

Triton 06: Flash Attention 2 — FA1 대비 5가지 최적화

Flash Attention 2를 Triton으로 구현한다 — un-scaled 누적, exp2 트릭, Causal 2-stage, tl.dot accumulator, autotune

12 min read · 2026

Triton 05: Flash Attention — 종합 프로젝트

Flash Attention을 Triton으로 구현한다 — Forward/Backward 전체 구현과 RTX 4080·A100·H100·B200 아키텍처별 최적화 포인트

19 min read · 2026

FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning

FlashAttention-2 논문 리뷰 — non-matmul FLOPs 감소, 병렬화, warp partitioning 개선

12 min read · 2023

Kubernetes 확장과 생태계 — Operator와 CNCF Projects

CRD와 Custom Controller로 나만의 리소스를 만드는 Operator 패턴, cert-manager 실습, 그리고 CNCF 성숙도 3단계로 읽는 생태계 지도 — K8s 입문 시리즈 마지막 편

17 min read · July 07, 2026

2026 · kubernetes infra operator crd helm kustomize cncf · infra
Kubernetes 권한 관리 — ServiceAccount와 RBAC

인증과 인가의 차이부터 ServiceAccount, Role/RoleBinding 4요소, kubectl auth can-i 실습까지 — Kubernetes RBAC 입문

15 min read · July 06, 2026

2026 · kubernetes infra rbac serviceaccount security · infra
Kubernetes 스토리지와 설정 — PV/PVC, ConfigMap, Secret

컨테이너의 휘발성 문제를 푸는 PV/PVC/StorageClass 3계층 추상화, 그리고 설정과 비밀값을 이미지에서 분리하는 ConfigMap·Secret을 kind 실습으로 익힌다

15 min read · July 06, 2026

2026 · kubernetes infra storage pv pvc configmap secret · infra
Kubernetes 네트워킹 — Service와 Ingress

Pod IP는 재시작마다 바뀐다 — ClusterIP·NodePort·LoadBalancer 세 가지 Service 타입과 Ingress로 Kubernetes 트래픽 라우팅을 이해한다

25 min read · July 06, 2026

2026 · kubernetes infra service ingress networking hpa networkpolicy gateway-api · infra
Kubernetes 워크로드 — ReplicaSet, Deployment, StatefulSet, DaemonSet

Pod를 감싸는 상위 워크로드 리소스 4종 — Label/Selector부터 롤링 업데이트와 롤백, StatefulSet의 순차 기동, DaemonSet까지 kind 클러스터로 실습한다

24 min read · July 06, 2026

2026 · kubernetes infra deployment replicaset statefulset daemonset job cronjob namespace · infra

Won's Blog

공부 및 실험 공유

A.X K1 Technical Report

TelAgentBench: A Multi-faceted Benchmark for Evaluating LLM-based Agents in Telecommunications

TelBench: A Benchmark for Evaluating Telco-Specific Large Language Models

FlashAttention-4: Algorithm and Kernel Pipelining Co-Design for Asymmetric Hardware Scaling

FlashAttention-3: Fast and Accurate Attention with Asynchrony and Low-precision

Triton 07: Flash Attention 3 — Triton으로 어디까지 가능한가

Triton 06: Flash Attention 2 — FA1 대비 5가지 최적화

Triton 05: Flash Attention — 종합 프로젝트

FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning

Kubernetes 확장과 생태계 — Operator와 CNCF Projects

Kubernetes 권한 관리 — ServiceAccount와 RBAC

Kubernetes 스토리지와 설정 — PV/PVC, ConfigMap, Secret

Kubernetes 네트워킹 — Service와 Ingress

Kubernetes 워크로드 — ReplicaSet, Deployment, StatefulSet, DaemonSet