2026-02-17 22:44 KST Β· by Angtiger Β· 맀일 10:00 KST μ—…λ°μ΄νŠΈ
5 sources
9 new posts

πŸ† AI λͺ¨λΈ 벀치마크

β–Ό

πŸ–₯️ Terminal-Bench 2.0 (Top 5)

πŸ† Chatbot Arena ELO (Top 5)

🧠 ARC-AGI-2 달성λ₯ 

84.6%
πŸ€– 84.6% β€” Gemini 3 Deep Think (Google) πŸ§‘ Human Panel = 100% κΈ°μ€€
← 전체 보기 πŸ“‚ Company News× πŸ“… 2026-01×
총 9건
Hugging Face Blog Introducing Daggr: Chain apps programmatically, inspect visually
Daggr μ†Œκ°œ: 앱을 ν”„λ‘œκ·Έλž˜λ° λ°©μ‹μœΌλ‘œ μ²΄μ΄λ‹ν•˜κ³  μ‹œκ°μ μœΌλ‘œ κ²€μ‚¬ν•˜λŠ” 도ꡬ.
Google DeepMind Project Genie: Experimenting with infinite, interactive worlds
Google AI Ultra κ΅¬λ…μž(λ―Έκ΅­)κ°€ λ¬΄ν•œν•˜κ³  μΈν„°λž™ν‹°λΈŒν•œ 세계λ₯Ό μƒμ„±ν•˜κ³  νƒν—˜ν•  수 μžˆλŠ” μ‹€ν—˜μ  연ꡬ ν”„λ‘œν† νƒ€μž… Project Genieλ₯Ό μ²΄ν—˜ν•  수 μžˆλ‹€.
Hugging Face Blog We Got Claude to Build CUDA Kernels and teach open models!
Claudeλ₯Ό ν™œμš©ν•˜μ—¬ CUDA 컀널을 λΉŒλ“œν•˜κ³  μ˜€ν”ˆ λͺ¨λΈμ— κ°€λ₯΄μΉ˜λŠ” 방법.
Hugging Face Blog Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective
LinkedIn의 GPT-OSSλ₯Ό μœ„ν•œ μ—μ΄μ „νŠΈ RL ν•™μŠ΅ μ‹€μ „ 회고둝.
Hugging Face Blog Alyah ⭐️: Toward Robust Evaluation of Emirati Dialect Capabilities in Arabic LLMs
TII의 Alyah: μ•„λžμ–΄ LLMμ—μ„œ 에미라티 λ°©μ–Έ λŠ₯λ ₯을 κ²¬κ³ ν•˜κ²Œ ν‰κ°€ν•˜κΈ° μœ„ν•œ 연ꡬ.
Hugging Face Blog Architectural Choices in China's Open-Source AI Ecosystem: Building Beyond DeepSeek
DeepSeek을 λ„˜μ–΄μ„  쀑ꡭ μ˜€ν”ˆμ†ŒμŠ€ AI μƒνƒœκ³„μ˜ μ•„ν‚€ν…μ²˜ 선택에 λŒ€ν•œ Hugging Face 뢄석.
Hugging Face Blog AssetOpsBench: Bridging the Gap Between AI Agent Benchmarks and Industrial Reality
IBM Research의 AssetOpsBench: AI μ—μ΄μ „νŠΈ λ²€μΉ˜λ§ˆν¬μ™€ μ‚°μ—… ν˜„μ‹€ κ°„μ˜ 격차λ₯Ό ν•΄μ†Œ.
Google DeepMind D4RT: Teaching AI to see the world in four dimensions
4D μž₯λ©΄ 볡원과 좔적을 μœ„ν•œ 톡합 AI λͺ¨λΈ D4RT μ†Œκ°œ. AIμ—κ²Œ 세계λ₯Ό 4μ°¨μ›μœΌλ‘œ λ³΄λŠ” 법을 κ°€λ₯΄μΉœλ‹€.
Google DeepMind Veo 3.1 Ingredients to Video: More consistency, creativity and control
Veo 3.1 μ—…λ°μ΄νŠΈ: μžμ—°μŠ€λŸ½κ³  역동적인 클립 생성 및 μ„Έλ‘œ μ˜μƒ 지원. 더 λ§Žμ€ 일관성, μ°½μ˜μ„±, μ œμ–΄λ ₯을 μ œκ³΅ν•œλ‹€.