mirror of
https://github.com/photoprism/photoprism.git
synced 2026-03-02 22:57:18 -05:00
AI: Animal pet detection #1122
Labels
No labels
ai
android
api
auth
awesome
bug
bug
ci
cli
config
database
declined
deprecated
docker
docs 📚
documents
duplicate
easy
enhancement
enhancement
enhancement
epic
faces
feedback wanted
frontend
hacktoberfest
help wanted
idea
in-progress
incomplete
index
invalid
ios
labels
live
live
low-priority
macos
member-feature
metadata
mobile
nas
needs-analysis
no-coding-required
no-coding-required
observability
performance
places
please-test
plus-feature
priority
pro-feature
question
raspberry-pi
raw
released
released
released
research
resolved
security
sharing
tested
tests
third-party-issue
thumbnails
upgrade
upstream-issue
ux
vector
video
waiting
won't fix
won't fix
No milestone
No project
No assignees
1 participant
Notifications
Due date
No due date set.
Dependencies
No dependencies set.
Reference
starred/photoprism#1122
Loading…
Add table
Add a link
Reference in a new issue
No description provided.
Delete branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Originally created by @ServerGeek82 on GitHub (Sep 28, 2021).
Just a thought. Future feature. Had a picture of my dog and it got recognized in the faces. A cat as well. Also had a face match on a green plastic cup. And the funniest was a person in a bunny outfit for an Easter picture. Running the reindexing for the library testing it's capabilities with the new face detection. First of all. Awesome.
32,000+ images / videos from decades of pictures and a few misses of my kiddos. But that is because it's missing some of the similarities, and differences.
@graciousgrey commented on GitHub (Sep 28, 2021):
Good to hear that you like the facial recognition :).
I personally would love to have pet detection as well (ideally one that can separate my two black cats ;)).
But it will take a while... we have many tasks with higher priority
@heisenberg2980 commented on GitHub (May 8, 2022):
Another vote for this one, it would be really cool to be able to filter pictures of our pets
@mateuszdrab commented on GitHub (Jun 16, 2022):
I've just started evaluating Photoprism today, face detection is impressive but object classification, not so well.
I think a better TF model is needed - which would make sense once Coral support is added https://github.com/photoprism/photoprism/issues/536
Example of an identified Dog:

@lastzero commented on GitHub (Jun 19, 2022):
It could almost be a dog. In order to find as many dogs as possible, we lowered the probability threshold for this label as we felt that was an acceptable compromise. I take note you see it differently.
@mateuszdrab commented on GitHub (Jun 19, 2022):
I get it, despite this, I have loads of photos of my actual dog which it didn't recognize.
Sometimes two similar photos where one is and one isn't recognized.
I'd like to try out the large version of the current model instead of the mobile one, just to see if it makes things better... It's a huge size difference of 20MB vs >300MB so I'd expect better results from it.
I mentioned this on the gitter chat actually.
@lastzero commented on GitHub (Jun 19, 2022):
Let us know about the results. Based on the stats and my experience, the difference is minimal. For TF acceleration and upgrade, watch the TensorFlow 2 issue / read its comments.
@mateuszdrab commented on GitHub (Jun 20, 2022):
I would, but I couldn't get the model to work by simply swapping files.
I was getting this error:
16T15:33:04Z" level=error msg="index: Could not find meta graph def matching supplied tags: { photoprism }. To inspect available tag-sets in the SavedModel, please use the SavedModel CLI: saved_model_cli"Can you provide some instructions/pointers on whether a straight replacement of the model files should work or perhaps it needs to be rebuilt?
@lastzero commented on GitHub (Jun 20, 2022):
Yep, sadly it's not as easy why developing this software took so long. The Go library needs a named save point which you have to use Python for to create. Also you have to change the code that passes the image thumbnails to TF as the resolution of this model is different.
@mateuszdrab commented on GitHub (Jun 21, 2022):
That makes sense, I may try to tinker around with it see if I can recompile for the larger model.
Just curious if it would make a difference, in theory it should since it's a higher resolution model.
@lastzero commented on GitHub (Jun 21, 2022):
Basically all models use square thumbnails, so it would be good to know if you find any connection between false negatives and the image format, in particular subject (dog) size or position. Because you can easily magnify parts of an image, a higher resolution is not as big an advantage as it seems. We already use three different crops, so the logical step would be to use more in different sizes and positions instead of switching to a much larger and slower model that significantly increases the overhead when doing this. Using multiple crops is a scientifically verified approach, I recommend reading a few papers as common sense doesn't really work for machine learning. Much is art.
@pgalbavy commented on GitHub (Dec 27, 2023):
Too. Many. Cats.
Actually, never enough, but too many cat photos for sure. Would be good to be able to apply a model to them :-)