Google DeepMindμ μκ΅ AI Security Institute(AISI)κ° μλ‘μ΄ μ°κ΅¬ ννΈλμμ ν΅ν΄ AI μΆλ‘ λͺ¨λν°λ§, μ¬νκ²½μ μ μν₯ νκ° λ± ν΅μ¬ μμ μ°κ΅¬ λΆμΌμμ νλ ₯μ κ°ννλ€.
5 sources
1 new posts
π AI λͺ¨λΈ λ²€μΉλ§ν¬
π₯οΈ Terminal-Bench 2.0 (Top 5)
π Chatbot Arena ELO (Top 5)
Source: Chatbot Arena
π§ ARC-AGI-2 λ¬μ±λ₯
π€ 84.6% β Gemini 3 Deep Think (Google)
π§ Human Panel = 100% κΈ°μ€
Source: ARC Prize Leaderboard
μ΄ 1건