| May 26, 2026 | ALMA: 9,000개 주석만으로 LLM을 정렬하기 |
| May 26, 2026 | PIKA: 난이도에 집중한 expert-level 합성 정렬 데이터셋 |
| May 26, 2026 | WildJailbreak: in-the-wild 탈옥을 대규모로 합성한 안전 학습 데이터셋 |
| May 26, 2026 | BeaverTails: helpfulness와 harmlessness를 분리한 안전 정렬 데이터셋 |
| May 26, 2026 | HarmfulQA & RED-INSTRUCT: Chain of Utterances로 유해 질문을 만들고 안전 정렬까지 |
| May 26, 2026 | HH-RLHF Red-Team Attempts: Anthropic의 38,961건 레드팀 대화 데이터셋 |
| May 26, 2026 | AdvBench: LLM 공격 평가의 사실상 표준이 된 유해 행동 데이터셋 |
| May 25, 2026 | 에이전트란 무엇인가: 지능형 에이전트의 고전 정의부터 LLM 에이전트까지 |
| May 25, 2026 | AgentBench: Evaluating LLMs as Agents |
| May 25, 2026 | GAIA: a benchmark for General AI Assistants |
| May 25, 2026 | SWE-bench: Can Language Models Resolve Real-World GitHub Issues? |
| May 25, 2026 | TravelPlanner: A Benchmark for Real-World Planning with Language Agents |
| May 25, 2026 | MedAgentBench: A Realistic Virtual EHR Environment to Benchmark Medical LLM Agents |
| May 25, 2026 | OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments |
| May 18, 2026 | Llama Guard: LLM-based Input-Output Safeguard for Human-AI Conversations |
| May 18, 2026 | Constitutional AI: Harmlessness from AI Feedback |
| May 18, 2026 | JailbreakBench: An Open Robustness Benchmark for Jailbreaking Large Language Models |
| May 18, 2026 | HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal |
| May 16, 2026 | AgentVigil: Generic Black-Box Red-teaming for Indirect Prompt Injection against LLM Agents |
| May 16, 2026 | InjecAgent: Benchmarking Indirect Prompt Injections in Tool-Integrated Large Language Model Agents |
| May 16, 2026 | AgenticRed: Evolving Agentic Systems for Red-Teaming |
| May 16, 2026 | Auto-RT: Automatic Jailbreak Strategy Exploration for Red-Teaming Large Language Models |
| May 16, 2026 | Curiosity-driven Red-teaming for Large Language Models |
| May 16, 2026 | Many-shot Jailbreaking |
| May 16, 2026 | Great, Now Write an Article About That: The Crescendo Multi-Turn LLM Jailbreak Attack |
| May 16, 2026 | GPTFuzzer: Red Teaming Large Language Models with Auto-Generated Jailbreak Prompts |
| May 16, 2026 | Tree of Attacks: Jailbreaking Black-Box LLMs Automatically |
| May 16, 2026 | AutoDAN: Generating Stealthy Jailbreak Prompts on Aligned Large Language Models |
| May 16, 2026 | Red Teaming Language Models to Reduce Harms: Methods, Scaling Behaviors, and Lessons Learned |
| May 16, 2026 | Red Teaming Language Models with Language Models |
| May 11, 2026 | TRL sequence packing → DeepSeek MLA: 누락된 cu_seqlens 복원 |
| May 10, 2026 | MLA 학습 시 modeling-side projection fusion: q_a/kv_a 배치 + K-side absorption |
| May 10, 2026 | DeepSeek 계열 MoE 학습 가속: Python expert loop → grouped GEMM |
| Apr 29, 2026 | CodeAttack: Code-based Adversarial Attacks for Pre-trained Programming Language Models |
| Apr 29, 2026 | Jailbreaking Black Box Large Language Models in Twenty Queries |
| Apr 29, 2026 | Universal and Transferable Adversarial Attacks on Aligned Language Models |
| Apr 14, 2026 | K8s 시리즈 06: EKS 네트워킹·보안·비용·운영 |
| Apr 14, 2026 | K8s 시리즈 05: Amazon EKS — 아키텍처와 Worker Node |
| Apr 14, 2026 | K8s 시리즈 04: ConfigMap, Secret, Storage — 설정과 데이터 관리 |
| Apr 14, 2026 | K8s 시리즈 03: Service, Ingress — 트래픽 라우팅과 외부 접근 |
| Apr 14, 2026 | K8s 시리즈 02: Pod, Deployment, Job, CronJob — K8s 워크로드 총정리 |
| Apr 14, 2026 | K8s 시리즈 01: Kubernetes란? 컨테이너부터 클러스터까지 |
| Apr 12, 2026 | A.X K1 Technical Report |
| Apr 12, 2026 | TelAgentBench: A Multi-faceted Benchmark for Evaluating LLM-based Agents in Telecommunications |
| Apr 11, 2026 | TelBench: A Benchmark for Evaluating Telco-Specific Large Language Models |
| Apr 11, 2026 | LLM 엔지니어가 알아야 할 GPU 아키텍처: Ampere → Hopper → Blackwell |
| Apr 11, 2026 | FlashAttention-4: Algorithm and Kernel Pipelining Co-Design for Asymmetric Hardware Scaling |
| Apr 09, 2026 | FlashAttention-3: Fast and Accurate Attention with Asynchrony and Low-precision |
| Apr 01, 2026 | Triton 07: Flash Attention 3 — Triton으로 어디까지 가능한가 |
| Apr 01, 2026 | Triton 06: Flash Attention 2 — FA1 대비 5가지 최적화 |
| Apr 01, 2026 | Triton 05: Flash Attention — 종합 프로젝트 |
| Apr 01, 2026 | Triton 04: Matrix Multiplication — 2D 타일링과 Autotune |
| Apr 01, 2026 | Triton 03: RMSNorm — LLM에서 쓰이는 실전 커널 |
| Apr 01, 2026 | Triton 02: Fused Softmax — 커널 퓨전과 Reduction |
| Apr 01, 2026 | Triton 01: Vector Addition — Triton 커널 기초 |
| Apr 01, 2026 | Triton 00: GPU 기초 — Triton을 시작하기 전에 알아야 할 것들 |