2026-02-17 22:41 KST Β· by Angtiger Β· 맀일 10:00 KST μ—…λ°μ΄νŠΈ
5 sources
1 new posts

πŸ† AI λͺ¨λΈ 벀치마크

β–Ό

πŸ–₯️ Terminal-Bench 2.0 (Top 5)

πŸ† Chatbot Arena ELO (Top 5)

🧠 ARC-AGI-2 달성λ₯ 

84.6%
πŸ€– 84.6% β€” Gemini 3 Deep Think (Google) πŸ§‘ Human Panel = 100% κΈ°μ€€
← 전체 보기 πŸ“‚ Company News× πŸ“… 2025-12-16×
총 1건
Google DeepMind Gemma Scope 2: helping the AI safety community deepen understanding of complex language model behavior
Gemma 3 μ œν’ˆκ΅°μ„ μœ„ν•œ 포괄적 μ˜€ν”ˆ 해석가λŠ₯μ„± 도ꡬ λͺ¨μŒ Gemma Scope 2 λ°œν‘œ. AI μ•ˆμ „ 연ꡬλ₯Ό κ°€μ†ν™”ν•˜κΈ° μœ„ν•΄ λ³΅μž‘ν•œ μ–Έμ–΄ λͺ¨λΈ λ™μž‘μ˜ 이해λ₯Ό λ•λŠ”λ‹€.