Week 11 Study Notes

1. Key points from class:

Paper: Structure-CLIP: Towards Scene Graph Knowledge to Enhance Multi-modal Structured Representations. via: https://arxiv.org/pdf/2305.06152

What is the CLIP model?

Dataset: MSCOCO