[2026-01-27] Youtu-VL: The Technical Revolution of a Unified Vision-Language Autoregressive Model Defined by "Vision-as-Target"
Figure 1: Youtu-VL achieves competitive performance on both general multimodal tasks and vision-centric tasks. The concentric rings illustrate the capability scope of different models across various ...
