AI: Integrate Florence 2 Vision AI for Auto Caption #2129

Open
opened 2026-02-20 01:06:47 -05:00 by deekerman · 2 comments
Owner

Originally created by @david-ng-hk on GitHub (Jun 26, 2024).

Microsoft release a new AI model recently, MIT license.
It will use Vision AI to auto generate a brief or detail caption for a photo or video.

I think this AI can generate the caption for the search in photo prism.
Please integrate, if not all GPU, then NVidia card go first.
Thank you.

model download
https://huggingface.co/microsoft/Florence-2-large-ft

demo
https://huggingface.co/spaces/SixOpen/Florence-2-large-ft

Originally created by @david-ng-hk on GitHub (Jun 26, 2024). Microsoft release a new AI model recently, MIT license. It will use Vision AI to auto generate a brief or detail caption for a photo or video. I think this AI can generate the caption for the search in photo prism. Please integrate, if not all GPU, then NVidia card go first. Thank you. model download https://huggingface.co/microsoft/Florence-2-large-ft demo https://huggingface.co/spaces/SixOpen/Florence-2-large-ft
Author
Owner

@GlassedSilver commented on GitHub (Jun 27, 2024):

Played around with the demo for a bit and what can I say, seems pretty darn good!

MIT license isn't too shabby either.

@GlassedSilver commented on GitHub (Jun 27, 2024): Played around with the demo for a bit and what can I say, seems pretty darn good! MIT license isn't too shabby either.
Author
Owner

@graciousgrey commented on GitHub (Dec 2, 2025):

Although there is no support for Florence 2 Vision, you can now use our Ollama integration to generate labels or captions:

@graciousgrey commented on GitHub (Dec 2, 2025): Although there is no support for Florence 2 Vision, you can now use our Ollama integration to generate labels or captions: - https://docs.photoprism.app/user-guide/ai/using-ollama/ - https://docs.photoprism.app/user-guide/ai/
Sign in to join this conversation.
No milestone
No project
No assignees
1 participant
Notifications
Due date
The due date is invalid or out of range. Please use the format "yyyy-mm-dd".

No due date set.

Dependencies

No dependencies set.

Reference
starred/photoprism#2129
No description provided.