2026-02-17 22:38 KST Β· by Angtiger Β· 맀일 10:00 KST μ—…λ°μ΄νŠΈ
5 sources
2 new posts

πŸ† AI λͺ¨λΈ 벀치마크

β–Ό

πŸ–₯️ Terminal-Bench 2.0 (Top 5)

πŸ† Chatbot Arena ELO (Top 5)

🧠 ARC-AGI-2 달성λ₯ 

84.6%
πŸ€– 84.6% β€” Gemini 3 Deep Think (Google) πŸ§‘ Human Panel = 100% κΈ°μ€€
← 전체 보기 πŸ“‚ Research & Papers× πŸ“… 2026-02-13×
총 2건
HF Daily Papers SPILLage: Agentic Oversharing on the Web
SPILLage: LLM 기반 μ›Ή μ—μ΄μ „νŠΈκ°€ μ‚¬μš©μž λ¦¬μ†ŒμŠ€(이메일, μΊ˜λ¦°λ” λ“±)λ₯Ό 제3μžμ—κ²Œ κ³Όλ„ν•˜κ²Œ κ³΅μœ ν•˜λŠ” μ—μ΄μ „νŠΈ μ˜€λ²„μ…°μ–΄λ§ 문제λ₯Ό κ³΅μ‹ν™”ν•˜κ³  λΆ„μ„ν•œ 연ꡬ.
HF Daily Papers Nanbeige4.1-3B: A Small General Model that Reasons, Aligns, and Acts
3B νŒŒλΌλ―Έν„°λ§ŒμœΌλ‘œ μ—μ΄μ „νŠΈ 행동, μ½”λ“œ 생성, 일반 좔둠을 λ™μ‹œμ— λ‹¬μ„±ν•˜λŠ” 톡합 λ²”μš© μ–Έμ–΄ λͺ¨λΈ Nanbeige4.1-3B λ°œν‘œ. 졜초의 μ˜€ν”ˆμ†ŒμŠ€ μ†Œν˜• μ–Έμ–΄ λͺ¨λΈ(SLM)λ‘œμ„œ μ΄λŸ¬ν•œ λ‹€μž¬λ‹€λŠ₯함을 μ‹€ν˜„ν–ˆλ‹€.