Index10タイトルLearning hierarchical video-text relationship via large language model for cross-modal video retrieval出典2025 International Workshop on Advanced Image Technology (IWAIT2025)