2026

an archive of posts from this year

May 26, 2026 ALMA: 9,000개 주석만으로 LLM을 정렬하기
May 26, 2026 PIKA: 난이도에 집중한 expert-level 합성 정렬 데이터셋
May 26, 2026 WildJailbreak: in-the-wild 탈옥을 대규모로 합성한 안전 학습 데이터셋
May 26, 2026 BeaverTails: helpfulness와 harmlessness를 분리한 안전 정렬 데이터셋
May 26, 2026 HarmfulQA & RED-INSTRUCT: Chain of Utterances로 유해 질문을 만들고 안전 정렬까지
May 26, 2026 HH-RLHF Red-Team Attempts: Anthropic의 38,961건 레드팀 대화 데이터셋
May 26, 2026 AdvBench: LLM 공격 평가의 사실상 표준이 된 유해 행동 데이터셋
May 25, 2026 에이전트란 무엇인가: 지능형 에이전트의 고전 정의부터 LLM 에이전트까지
May 25, 2026 AgentBench: Evaluating LLMs as Agents
May 25, 2026 GAIA: a benchmark for General AI Assistants
May 25, 2026 SWE-bench: Can Language Models Resolve Real-World GitHub Issues?
May 25, 2026 TravelPlanner: A Benchmark for Real-World Planning with Language Agents
May 25, 2026 MedAgentBench: A Realistic Virtual EHR Environment to Benchmark Medical LLM Agents
May 25, 2026 OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
May 18, 2026 Llama Guard: LLM-based Input-Output Safeguard for Human-AI Conversations
May 18, 2026 Constitutional AI: Harmlessness from AI Feedback
May 18, 2026 JailbreakBench: An Open Robustness Benchmark for Jailbreaking Large Language Models
May 18, 2026 HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal
May 16, 2026 AgentVigil: Generic Black-Box Red-teaming for Indirect Prompt Injection against LLM Agents
May 16, 2026 InjecAgent: Benchmarking Indirect Prompt Injections in Tool-Integrated Large Language Model Agents
May 16, 2026 AgenticRed: Evolving Agentic Systems for Red-Teaming
May 16, 2026 Auto-RT: Automatic Jailbreak Strategy Exploration for Red-Teaming Large Language Models
May 16, 2026 Curiosity-driven Red-teaming for Large Language Models
May 16, 2026 Many-shot Jailbreaking
May 16, 2026 Great, Now Write an Article About That: The Crescendo Multi-Turn LLM Jailbreak Attack
May 16, 2026 GPTFuzzer: Red Teaming Large Language Models with Auto-Generated Jailbreak Prompts
May 16, 2026 Tree of Attacks: Jailbreaking Black-Box LLMs Automatically
May 16, 2026 AutoDAN: Generating Stealthy Jailbreak Prompts on Aligned Large Language Models
May 16, 2026 Red Teaming Language Models to Reduce Harms: Methods, Scaling Behaviors, and Lessons Learned
May 16, 2026 Red Teaming Language Models with Language Models
May 11, 2026 TRL sequence packing → DeepSeek MLA: 누락된 cu_seqlens 복원
May 10, 2026 MLA 학습 시 modeling-side projection fusion: q_a/kv_a 배치 + K-side absorption
May 10, 2026 DeepSeek 계열 MoE 학습 가속: Python expert loop → grouped GEMM
Apr 29, 2026 CodeAttack: Code-based Adversarial Attacks for Pre-trained Programming Language Models
Apr 29, 2026 Jailbreaking Black Box Large Language Models in Twenty Queries
Apr 29, 2026 Universal and Transferable Adversarial Attacks on Aligned Language Models
Apr 14, 2026 K8s 시리즈 06: EKS 네트워킹·보안·비용·운영
Apr 14, 2026 K8s 시리즈 05: Amazon EKS — 아키텍처와 Worker Node
Apr 14, 2026 K8s 시리즈 04: ConfigMap, Secret, Storage — 설정과 데이터 관리
Apr 14, 2026 K8s 시리즈 03: Service, Ingress — 트래픽 라우팅과 외부 접근
Apr 14, 2026 K8s 시리즈 02: Pod, Deployment, Job, CronJob — K8s 워크로드 총정리
Apr 14, 2026 K8s 시리즈 01: Kubernetes란? 컨테이너부터 클러스터까지
Apr 12, 2026 A.X K1 Technical Report
Apr 12, 2026 TelAgentBench: A Multi-faceted Benchmark for Evaluating LLM-based Agents in Telecommunications
Apr 11, 2026 TelBench: A Benchmark for Evaluating Telco-Specific Large Language Models
Apr 11, 2026 LLM 엔지니어가 알아야 할 GPU 아키텍처: Ampere → Hopper → Blackwell
Apr 11, 2026 FlashAttention-4: Algorithm and Kernel Pipelining Co-Design for Asymmetric Hardware Scaling
Apr 09, 2026 FlashAttention-3: Fast and Accurate Attention with Asynchrony and Low-precision
Apr 01, 2026 Triton 07: Flash Attention 3 — Triton으로 어디까지 가능한가
Apr 01, 2026 Triton 06: Flash Attention 2 — FA1 대비 5가지 최적화
Apr 01, 2026 Triton 05: Flash Attention — 종합 프로젝트
Apr 01, 2026 Triton 04: Matrix Multiplication — 2D 타일링과 Autotune
Apr 01, 2026 Triton 03: RMSNorm — LLM에서 쓰이는 실전 커널
Apr 01, 2026 Triton 02: Fused Softmax — 커널 퓨전과 Reduction
Apr 01, 2026 Triton 01: Vector Addition — Triton 커널 기초
Apr 01, 2026 Triton 00: GPU 기초 — Triton을 시작하기 전에 알아야 할 것들