2026-02-17 22:38 KST Β· by Angtiger Β· 맀일 10:00 KST μ—…λ°μ΄νŠΈ
5 sources
1 new posts

πŸ† AI λͺ¨λΈ 벀치마크

β–Ό

πŸ–₯️ Terminal-Bench 2.0 (Top 5)

πŸ† Chatbot Arena ELO (Top 5)

🧠 ARC-AGI-2 달성λ₯ 

84.6%
πŸ€– 84.6% β€” Gemini 3 Deep Think (Google) πŸ§‘ Human Panel = 100% κΈ°μ€€
← 전체 보기 πŸ“‚ Research & Papers× πŸ“… 2026-02-11×
총 1건
HF Daily Papers DeepImageSearch: Benchmarking Multimodal Agents for Context-Aware Image Retrieval in Visual Histories
DeepImageSearch: μ‹œκ°μ  νžˆμŠ€ν† λ¦¬μ—μ„œ μ»¨ν…μŠ€νŠΈ 인식 이미지 검색을 μœ„ν•œ λ©€ν‹°λͺ¨λ‹¬ μ—μ΄μ „νŠΈ 벀치마크. 이미지 검색을 자율 탐색 과제둜 μž¬μ •μ˜ν•˜λŠ” μƒˆλ‘œμš΄ μ—μ΄μ „νŠΈ νŒ¨λŸ¬λ‹€μž„μ„ μ œμ‹œν•œλ‹€.