img2prompt
Overview
Methexis-Inc/img2prompt is a tool designed to generate approximate text prompts that match an image. This tool is particularly optimized for stable-diffusion (clip ViT-L/14).
The tool is based on the open-source CLIP Interrogator notebook created by @pharmapsychotic and utilizes the OpenAI CLIP models to match an image to a variety of artists, mediums, and styles.
The results of the comparison are then combined with BLIP captions to generate a text prompt that can be used to create additional images similar to the original.
The tool can be run via an API, or the GitHub repository and license can be accessed for more information. Predictions typically complete within 24 seconds and run on Nvidia T4 GPU hardware.
Releases
Top alternatives
-
Connect sighted volunteers with blind users for visual assistance.
-
Extract editable text from images instantlyMichael Watson🛠️ 1 tool 🙏 75 karmaJul 12, 2024@Picture To Text ConverterThis is a text extraction service. Not image to text.
-
Convert images to insightful conversations.First time I have tried it. I think it is terrific!
-
Transform handwritten notes into digital text instantly. -
Transform images into concise summaries with AI.its good, but uses a credit system and needs premuim
-
Translate image text instantly across 130+ languages.
MongoDB - Build AI That Scales

