Heads up that we’ve hit the late summer slump in arXiv submissions, so there’s less content than usual this week. P.S.: thanks to AI Supremacy for the Twitter shoutout! Enhancing CLIP with GPT-4: Harnessing Visual Descriptions as Prompts Normally when constructing the text description of a given class for a CLIP model, you use a simple template like “A photo of {classname}”. What if you instead use GPT-4 to generate text that conveys something about what the class looks like?
Looking forward to the new post