2026-02-17 22:38 KST Β· by Angtiger Β· 맀일 10:00 KST μ—…λ°μ΄νŠΈ
5 sources
2 new posts

πŸ† AI λͺ¨λΈ 벀치마크

β–Ό

πŸ–₯️ Terminal-Bench 2.0 (Top 5)

πŸ† Chatbot Arena ELO (Top 5)

🧠 ARC-AGI-2 달성λ₯ 

84.6%
πŸ€– 84.6% β€” Gemini 3 Deep Think (Google) πŸ§‘ Human Panel = 100% κΈ°μ€€
← 전체 보기 πŸ“‚ Research & Papers× πŸ“… 2026-02-15×
총 2건
HF Daily Papers LM-Lexicon: Improving Definition Modeling via Harmonizing Semantic Experts
LM-Lexicon: 데이터 ν΄λŸ¬μŠ€ν„°λ§, μ‹œλ§¨ν‹± μ „λ¬Έκ°€ ν•™μŠ΅, 슀파슀 MoE μ•„ν‚€ν…μ²˜λ₯Ό κ²°ν•©ν•œ μ •μ˜ λͺ¨λΈλ§ 접근법. κΈ°μ‘΄ SOTA λŒ€λΉ„ BLEU 점수 7% ν–₯상 달성.
HF Daily Papers STATe-of-Thoughts: Structured Action Templates for Tree-of-Thoughts
STATe-of-Thoughts: κ³ μˆ˜μ€€ μΆ”λ‘  νŒ¨ν„΄μ„ νƒμƒ‰ν•˜λŠ” 해석 κ°€λŠ₯ν•œ Inference-Time-Compute 방법. κΈ°μ‘΄ Tree-of-Thoughts의 λ‹€μ–‘μ„± λΆ€μ‘± 문제λ₯Ό κ΅¬μ‘°ν™”λœ μ•‘μ…˜ ν…œν”Œλ¦ΏμœΌλ‘œ ν•΄κ²°ν•œλ‹€.