2026-02-17 22:37 KST Β· by Angtiger Β· 맀일 10:00 KST μ—…λ°μ΄νŠΈ
5 sources
1 new posts

πŸ† AI λͺ¨λΈ 벀치마크

β–Ό

πŸ–₯️ Terminal-Bench 2.0 (Top 5)

πŸ† Chatbot Arena ELO (Top 5)

🧠 ARC-AGI-2 달성λ₯ 

84.6%
πŸ€– 84.6% β€” Gemini 3 Deep Think (Google) πŸ§‘ Human Panel = 100% κΈ°μ€€
← 전체 보기 πŸ“‚ Research & Papers× πŸ“… 2026-02-10×
총 1건
HF Daily Papers Benchmarking Knowledge-Extraction Attack and Defense on Retrieval-Augmented Generation
RAG(Retrieval-Augmented Generation)에 λŒ€ν•œ 지식 μΆ”μΆœ 곡격과 λ°©μ–΄λ₯Ό λ²€μΉ˜λ§ˆν‚Ή. μ—”ν„°ν”„λΌμ΄μ¦ˆ 챗봇, 의료 μ–΄μ‹œμŠ€ν„΄νŠΈ λ“±μ—μ„œ 지적 μž¬μ‚° λ„μš©κ³Ό ν”„λΌμ΄λ²„μ‹œ 유좜 μœ„ν—˜μ„ μ²΄κ³„μ μœΌλ‘œ ν‰κ°€ν•œλ‹€.