mirror of
https://github.com/immich-app/immich.git
synced 2026-03-02 22:57:45 -05:00
[Bug] CUDA ML crash (CUDNN_STATUS_EXECUTION_FAILED) in v2.1.0 — fixed by downgrading to v2.0.0 #7072
Closed
opened 2026-02-20 04:19:33 -05:00 by deekerman
·
46 comments
No Branch/Tag specified
main
renovate/typescript-projects
release/next
renovate/github-actions
chore/translations
push-nwxlpmyzkyrl
push-wzwotsylzylq
push-zpwsovysllvn
push-zunuwtznrlpm
renovate/opentofu-1.x
renovate/node
push-rsywxvptwxuv
renovate/prom-prometheus
renovate/docker.io-valkey-valkey-9
push-suusrsrnmzrp
push-xyozownmuwqp
csp-policy
uhthomas/fix-mobile-video-state
uhthomas/fix-mobile-hero-height
feat/use-native-clients
uhthomas/chore-mobile-simplify-scroll
feat/mobile-edit-3-mobile-sync-handling
feat/panorama-tiles
refactor/zod-migration
fix/maintenance-reload
refactor/restores-file-interceptor
uhthomas/fix-mobile-inconsistent-asset-detials-background
postgres-socketio
claude/auto-screenshot-web-changes-Y7efI
visual-review/pr-26535
push-lvyturrtwkrq
feat/notification
feat/library-offline-stats
feat/checksum-algorithm-indicator
feat/library-offline-count
uhthomas/feat-mobile-search-results
fix/bring-back-globalkeys
fix/map-webgl-error
feat/mobile-ocr
feat/custom-date-range
fix/mobile-video-aspect-ratio
fix/ml-ocr-batch-size
fix/timeline-rtl
feat/integrity-checks-izzy
uhthomas/fix-mobile-search-results
renovate/flutter
uhthomas/feat-sort-smart-search
renovate/github-cqlabs-homebrew-dcm-1.x
feat/video-player
feat/mobile-editing
refactor/remove-replace-with-upload
uhthomas/chore-mobile-maplibre
uhthomas/mobile-fix-asset-details-album-pop
feat/crawl-wrapper
feat/open-in-browser
push-skvzqoozqkpl
feat/edit-filters
fix/locale-settings-desc
feat/pg-queue
refactor/asset-upload
renovate/connectivity_plus-7.x
better-project-structure
uhthomas/mobile-feat-asset-viewer-details
fix/ml-rocm-build
fix/25803
feat/asset-file-apis
midzelis/wip
feature/bottom-buttons-order
sqlite_thumbs
fix-keep-correct-ios-shared-album-asset
fix-memory-generation-and-display
push-vpxwmwwxwnvw
fix-migration-width-height
revert/prettier-translations
shared-deep-link-handler
feat/thumbnail-native-clients
feat/platform-clients
fix/foreground-cloud-sync
filter-by-person
feat/csp
refactor/sidebar
fix/disable-editing
fix/view-timeline-deeplink
image-zoom-on-slow-connection
fix/merged-edited-assets
open-api-fix
feat/create-job-with-dto
use-toast-primary
feat/vitest-4
feat/ios-fastlane-match
match-signing
fix-update-time-update-timeline
feat/modal-routes
feature/mobile-view-asset-owner
feat/system-settings
feature/show-activity-count
better-info-in-asset-viewer
fix/all-people-count
feat/location-favorites
feature/rearrange-buttons-2
fix/download-storage-template
feat/kb-shortcuts-mobile
fix/people-count
push-qolzzzzxrvvn
chore/originals-in-asset-files
feat/asset-size-columns
ben/tree-a11y
new-search-filter-ui
refactor/expectSelectedReadonly
refactor/mobile-grdb
push-qvuktpxmkknu
feat/mobile-native-local-sync
refactor/timeline_ops
fix/scrubber_end
feat/version.txt
feat/context-menus
feat/server-chunked-uploads
refactor/virtualsegment
refactor/rename_daymonth_groups
fix/restrict-android-bg-worker
feat/android-periodic-worker
fix-remote-sync-clean-up
refactor/timeline_move_ops
fix/timeline_split_selectable
feat/keyboard_actions_help_modal
feat/static_frontend
feat/notification-warnign-android
feat/plugins2
feat/plugins
test/create-workflow-token-action
fix/docs-force
debug/search-result-similarity
debug/cf-chunked-uploads
feat/eslint_rule
feat/search-filter-album/web
refactor/timeline_photostream
refactor/timelineasset_asset
feat/session-permissions
feat/timeline_photostream_assetnav
feat/timeline_minor_optimize
feat/timeline_perf_nocomp
feat/timeline_search_results_actions
feat/timeline_search_results_page
fix/timeline_padding
fix/timeline_search_reactivity_warnings
feat/timeline_scrollbar
feat/timeline_stream_withviewer
fix/timeline_back_forth_nav
refactor/timeline_photostream_component
fix/generated-files-checks
fix/locate-button-local
chore/base-image-mimalloc
refactor/timeline_assetlayout
refactor/timeline_selectable
refactor/timeline_aware_actions
refactor/timeline_monthsegment
feat/remove-old-pages
chore/deps-gradle
tmp_photostream
tmp/lcms
feat/mobile-dynamic-thumbnails
fix/mobile-finer-thumbnail-concurrency
refactor/timeline1
refactor/extract_photostream
refactor/rename_load_api
refactor/timeline2
refactor/timeline3
feat/multi-select-asset-viewer
feat-no-thumbhash-cache
refactor/asset_grid
feat/faster-access-checks
fix/18991
fix/19543
chore/temp-remove
fix/21419
feat/mobile-hdr-images
chore/update-mise-lockfile
feat/mise-server-checks
feat/mise-ci
feat/windows-2025
feat/dev_cli
refactor/mobile-migrate-clients
fix/map-theme
fix/require-checkbox
chore/use_swc
feat/efficient-thumbnail-decoding
refactor/mobile-thumbhash
refactor/mobile-thumbhash-new
feat/beta-background-upload
fix/beta-timeline-memories-setting
fix/failed-uploads-not-removed
feat/mobile-shared-album
feat/groups
drift-map-page
drift-auth-user-sync
fix/disable-memory
feat/add-to-album-action
edit-date-time-action
drift-people-page
sqlite-remove-isIn
chore/required-reviewers
refact/asset-manager
fix/folder-sort
pnpm
feat/widget-multiple-server-urls
chore/medium-tests-dbname
fix/web-no-iterator-find
fix/map-pan-interruption
track-livephotos
timeline_events
chore/oxlint-migration
feat/maintenance-worker
feat/dav
chore/demo-snapshot
refactor/server-side-dedupe
feat/integrity-checks
dev/recognition-eval
lighter_buckets_test
perf/postgres-queue
postgres-queue
focus_rings
refactor/web-stores-1
refactor/add-to-taken
feat/sort-places
vet
tmp/demo-snapshot-preview
fix/server-migration-file-extension
fix/asset-update-race-condition
rknn-toolkit-lite2
refactor/mobile-split-up-search-page
feature/Add-rocm-support-for-machine-learning
feat/rocm
chore/async-hash-file
feat/shared-link-view-count
feat/rotation
feat/graphql
feat/job-ids
feat/ignore-library-permission-error
feat/docker-compose-builder
feat/kysely-typeorm
mobile/onboarding
no-video-player
fix/server-qsv-output-format
chore/server-geodata-tweaks
mobile/native-video-player-no-hero
feat/xxhash
fix/docs-concurrency
feat/local-tileserver
refactor/exif-orientation
original-path-infix
refactor/mobile/login-form-1
feat/server-editor-endpoints
fix/server-qsv-vbr
fix-mobile-db-problems
feat/ml-armnn-conversion
feat/mobile/backup-with-album-info
feat/fast-initial-sync-1
chore/handle-output_dims
feat/unassign-faces
feat/shortcuts-on-asset-grid
feat/capacitor-mobile-app-poc
feat/server-nvenc-hw-decoding
fix/mobile-fetch-non-archive
web/automation-ui
feat/mobile-server-endpoint-save-dropdown
object-storage
feat/memories-animations
dev/metrics
ml/tflite
feat/ml-export-cli
v2.5.6
v2.5.5
v2.5.4
v2.5.3
v2.5.2
v2.5.1
v2.5.0
v2.4.1
v2.4.0
v2.3.1
v2.3.0
v2.2.3
v2.2.2
v2.2.1
v2.2.0
v2.1.0
v2.0.1
v2.0.0
v1.144.1
v1.144.0
v1.143.1
v1.143.0
v1.142.1
v1.142.0
v1.141.1
v1.141.0
v1.140.1
v1.140.0
v1.139.4
v1.139.3
v1.139.2
v1.139.1
v1.139.0
v1.138.1
v1.138.0
v1.137.3
v1.137.2
v1.137.1
v1.137.0
v1.136.0
v1.135.3
v1.135.2
v1.135.1
v1.135.0
v1.134.0
v1.133.1
v1.133.0
v1.132.3
v1.132.2
v1.132.1
v1.132.0
v1.131.3
v1.131.2
v1.131.1
v1.131.0
v1.130.3
v1.130.2
v1.130.1
v1.130.0
v1.129.0
v1.128.0
v1.127.0
v1.126.1
v1.126.0
v1.125.7
v1.125.6
v1.125.5
v1.125.4
v1.125.3
v1.125.2
v1.125.1
v1.125.0
v1.124.2
v1.124.1
v1.124.0
v1.123.0
v1.122.3
v1.122.2
v1.122.1
v1.122.0
v1.121.0
v1.120.2
v1.120.1
v1.120.0
v1.119.1
v1.119.0
v1.118.2
v1.118.1
v1.118.0
v1.117.0
v1.116.2
v1.116.1
v1.116.0
v1.115.0
v1.114.0
v1.113.1
v1.113.0
v1.112.1
v1.112.0
v1.111.0
v1.110.0
v1.109.2
v1.109.1
v1.109.0
v1.108.0
v1.107.2
v1.107.1
v1.107.0
v1.106.4
v1.106.3
v1.106.2
v1.106.1
v1.106.0
v1.105.1
v1.105.0
v1.104.0
v1.103.1
v1.103.0
v1.102.3
v1.102.2
v1.102.1
v1.102.0
v1.101.0
v1.100.0
v1.99.0
v1.98.2
v1.98.1
v1.98.0
v1.97.0
v1.96.0
v1.95.1
v1.95.0
v1.94.1
v1.94.0
v1.93.3
v1.93.2
v1.93.1
v1.93.0
v1.92.1
v1.92.0
v1.91.4
v1.91.3
v1.91.2
v1.91.1
v1.91.0
v1.90.2
v1.90.1
v1.90.0
v1.89.0
v1.88.2
v1.88.1
v1.88.0
v1.87.0
v1.86.0
v1.85.0
v1.84.0
v1.83.0
v1.82.1
v1.82.0
v1.81.1
v1.81.0
v1.80.0
v1.79.1
v1.79.0
v1.78.1
v1.78.0
v1.77.0
v1.76.1
v1.76.0
v1.75.2
v1.75.1
v1.75.0
v1.74.0
v1.73.0
v1.72.2
v1.72.1
v1.72.0
v1.71.0
v1.70.0
v1.69.0
v1.68.0
v1.67.2
v1.67.1
v1.67.0
v1.66.1
v1.66.0
v1.65.0
v1.64.0
v1.63.2
v1.63.1
v1.63.0
v1.62.1
v1.62.0
v1.61.0
v1.60.0
v1.59.1
v1.59.0
v1.58.0
v1.57.1
v1.57.0
v1.56.2
v1.56.1
v1.56.0
v1.55.1
v1.55.0
v1.54.1
v1.54.0
v1.53.0
v1.52.1
v1.52.0
v1.51.2
v1.51.1
v1.51.0
v1.50.1
v1.50.0
v1.49.0
v1.48.1
v1.48.0
v1.47.3
v1.47.2
v1.47.1
v1.47.0
v1.46.1
v1.46.0
v1.45.0
v1.44.0
v1.43.1
v1.43.0
v1.42.0_65-dev
v1.41.1_64-dev
v1.41.0_64-dev
v1.40.1_63-dev
v1.40.0_63-dev
v1.39.0_61-dev
v1.38.2_60-dev
v1.38.1_60-dev
v1.38.0_60-dev
v1.37.0_58-dev
v1.36.2_56-dev
v1.36.1_55-dev
v1.36.0_55-dev
v1.35.0_54-dev
v1.34.0_53-dev
v1.33.1_52-dev
v1.33.0_52-dev
v1.32.1_51-dev
v1.32.0_50-dev
v1.31.1_49-dev
v1.31.0_49-dev
v1.30.2_48-dev
v1.30.0_46-dev
v1.29.6_45-dev
v1.29.6_44-dev
v1.29.5_44-dev
v1.29.4_44-dev
v1.29.3_43-dev
v1.29.2_43-dev
v1.29.1_43-dev
v1.29.0_42-dev
v1.28.4_41-dev
v1.28.4_42-dev
v1.28.3_41-dev
v1.28.2_40-dev
v1.28.1_39-dev
v1.28.0_38-dev
v1.27.0_37-dev
v1.26.0_36-dev
v1.25.0_35-dev
v1.24.0_34-dev
v1.23.0_33-dev
v1.22.0_32-dev
v1.21.1_31-dev
v1.21.0_31-dev
v1.20.3_30-dev
v1.20.2_30-dev
v1.20.1_30-dev
v1.20.0_30-dev
v1.19.1_29-dev
v1.19.0_29-dev
v1.18.0_27-dev
v1.17.0_25-dev
v1.16.0_23-dev
v1.15.1_21-dev
v1.15.0_21-dev
v1.14.0_21-dev
v1.13.0_20-dev
v1.12.0_18-dev
v1.11.0_17-dev
v1.10.0_15-dev
v1.9.1_14-dev
v1.9.0_13-dev
v1.8.0_12-dev
v1.7.0_11-dev
v1.6.0_10-dev
v1.5.1+9-dev
v1.5.0+8-dev
v1.4.0+7-dev
v1.4.0+6-dev
v1.4.0-dev
v1.3.0-dev
v1.3.1-dev
v0.6-dev
v0.5-dev
v0.4-dev
v0.3-dev
v0.2-dev
first-android-release
Labels
Clear labels
accessibility
changelog:enhancement
changelog:security
changelog:skip
changelog:translation
cli
date-time
dependencies
documentation
external-library
format
good first issue
mobile-beta
mobile-beta
mobile-beta
needs-answer
nice to have
sharing
tech-debt
📱mobile
🖥️web
🗄️server
🧠machine-learning
No labels
accessibility
changelog:enhancement
changelog:security
changelog:skip
changelog:translation
cli
date-time
dependencies
documentation
external-library
format
good first issue
mobile-beta
mobile-beta
mobile-beta
needs-answer
nice to have
sharing
tech-debt
📱mobile
🖥️web
🗄️server
🧠machine-learning
Milestone
Clear milestone
No items
No milestone
Projects
Clear projects
No items
No project
Assignees
Clear assignees
No assignees
1 participant
Notifications
Due date
The due date is invalid or out of range. Please use the format "yyyy-mm-dd".
No due date set.
Dependencies
No dependencies set.
Reference
starred/immich#7072
Loading…
Add table
Add a link
Reference in a new issue
No description provided.
Delete branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Originally created by @TheOneWayTruth on GitHub (Oct 16, 2025).
I have searched the existing issues, both open and closed, to make sure this is not a duplicate report.
The bug
Summary
Immich 2.1.0 causes repeated CUDNN_STATUS_EXECUTION_FAILED errors in immich-machine-learning during ONNX inference on NVIDIA GPUs.
Downgrading the entire stack to v2.0.0 resolves the issue completely.
Workaround
Downgrading all containers to 2.0.0 eliminates the crash.
The OS that Immich Server is running on
Windows WSL
Version of Immich Server
2.1.0
Version of Immich Mobile App
Platform with the issue
Device make and model
No response
Your docker-compose.yml content
Your .env content
Reproduction steps
...
Relevant log output
Additional information
No response
@jdicioccio commented on GitHub (Oct 16, 2025):
same general issue here. searches are now failing
@cguertin14 commented on GitHub (Oct 16, 2025):
Same bug for me in v2.1.0, downgrading to v2.0.1 fixes it.
@cloudcogsio commented on GitHub (Oct 16, 2025):
+1 on this issue.
v2.1.0 Machine Learning (CUDA) does not work with Quadro P400 BUT it works with RTX A2000
RTX A2000 Details (v2.1.0 Works) (WSL 2):
NVIDIA-SMI 553.24 Driver Version: 553.24 CUDA Version: 12.4
Quadro P400 Details (v2.1.0 Fails) (Linux VM):
NVIDIA-SMI 550.163.01 Driver Version: 550.163.01 CUDA Version: 12.4
Last call of traceback:
@mertalev commented on GitHub (Oct 16, 2025):
There were no CUDA-related changes in this release, but I suspect that #17718 triggered a rebuild of the CUDA image that might be installing different dependency versions now. Will look into fixing this.
@V-e-n-i-m commented on GitHub (Oct 16, 2025):
I have several ML related issues on this version as well with my Quadro P2000.
@Sacryn commented on GitHub (Oct 16, 2025):
Same issue here, GTX-960
NVIDIA-SMI 570.172.08 Driver Version: 570.172.08 CUDA Version: 12.8
I'm also getting issues with
Conv_0@DrSpaldo commented on GitHub (Oct 17, 2025):
I just wanted to jump on and say that I have just started getting errors after updating, initially didn't find this thread and posted in this one
Interestingly, I am running two Immich servers, both on Unraid and both using the usual docker compose (not imagegenius app) and one seems to be working better than the other....
I have downgraded the 1070 machine to 2.0.1 and am re-indexing to see if that works, initially though, no errors.
So, without jumping the gun too much, I am thinking that the changes made by mertalev in https://github.com/immich-app/immich/pull/17718 are causing the issues. But not for all cards/drivers/CUDA versions, so I guess the changes in 2.1.0 need to be either reverted or narrowed down to the actual cause, then fixed...
2.1.0 works with
2.1.0 does not work with (but reverting to 2.0.1 makes it work again)
@Nullpo1nt commented on GitHub (Oct 17, 2025):
I'll add onto this, just upgraded to v2.1.0 today and encountered this issue. Reverting the ML service to v2.0.1 mitigates this for now.
@sigma-2 commented on GitHub (Oct 18, 2025):
I had the same problem. With v2.1.0 I encountered the same problem. Rolling back to v2.0.1 solved it.
@Magnus987 commented on GitHub (Oct 19, 2025):
I can confirm this issue with my setup as well. Downgrading to v2.0.1 resolved it completely.
Hardware:
Error logs from v2.1.0:
After downgrading to v2.0.1, everything works perfectly again.
Thanks to the Devs for having an eye on this and working on a fix!
@Nordtus commented on GitHub (Oct 19, 2025):
Same problem on V2.1.0
@ErichVonHampter commented on GitHub (Oct 19, 2025):
Installed TrueNAS Scale 25.04.2.4 on a DL20 Gen9 today, ML also doesnt work after following the guided. When i upload Images then i see that a python process is using the GPU (About 140mb vRAM used) and then gets 1% GPU-Util at max.
Downgrading from the current release to 2.0.1 seems nearly impossible without fully changing everything in TrueNAS Scale. Installing another container image looks nearly impossible. If anyone has a good guide on how to get back to v2.0.1 then i would be very thankfull.
GPU: Nvidia P1000
Driver: 550.142
CUDA: 12.4
@SecretAgentOne commented on GitHub (Oct 19, 2025):
I experienced the same issue on an NVIDIA Geforce GTX 1080 with v2.1.0. Reverting just the immich-machine-learning to v2.0.1-cuda made it work again.
EDIT: I'm running...
This is the error I saw in the logs:
@DrSpaldo commented on GitHub (Oct 19, 2025):
@alextran1502 do any of the dev team have NVidia cards that are say RTX 2xxx and 3xxx to see if that is what is causing the issue? Other than my 3060 that appears to be working, I don't see any other users that are having this issue (in this thread) that have newish cards.
I am wondering if the changes that @mertalev made in the 2.1.0 release have some type of unknown requirement that you are not aware of?
In any event, I think it would be reasonable that if you cannot find the fix that the changes made are rolled back, as it appears quite a few users are having issues after these changes were made. Remember, no more breaking changes ;)
@mertalev commented on GitHub (Oct 19, 2025):
I'll probably try to install the old CUDNN version for now to restore the old behavior, but I'm unsure as to how to handle these updates moving forward. It seems difficult to be confident an update won't cause issues like this given only some environments are affected.
@DrSpaldo commented on GitHub (Oct 19, 2025):
Pretty much what programming / testing is about?
@sigma-2 commented on GitHub (Oct 20, 2025):
I did this to downgrade it on truenas scale:
TrueNAS SCALE Apps don’t let you downgrade directly through the UI, but you can pin a specific image version manually:
latestorrelease) with:Hope this helps you!
@ErichVonHampter commented on GitHub (Oct 20, 2025):
Sadly no, it also seems like 1:1 the same thing that ChatGPT told me to do. I dont have "three dots → Edit." but "Edit and Three Dots next to it. Also there is no Chance to click on something that is somewhat near "App Configuration".
Thinking that "yeah i just install something Linux-Based and it will just work" wasnt the right way to think i guess. Maybe the best way for me would be to wait until a update comes along.
Edit 20.10.2025 #1: Deleting the "ghcr.io/immich-app/immich-machine-learning:v2.1.0-cuda" Image and using "ghcr.io/immich-app/immich-machine-learning:v2.0.1-cuda" without any Tag at least downloaded the right Image as it seems like. Going to check it further.
Edit 20.10.2025 #2: After deleting everything from the Image Container that had "2.1.0" in its name AND then pulling the images with 2.01 again the 2.1.0 do reapper after starting the app.
Edit 20.10.2025 #3: Changed Immich to "Custom App", changed the Image File in the then editable Textfile and now it works. But now Immich is a Custom App and i would guess that this will also give me other problems.
@sigma-2 commented on GitHub (Oct 20, 2025):
Right, I forgot about the custom app part, sorry. I changed it too to a custom app in the text file as per the instructions.
My guess is that in the future, updates will have to be done manually through this text file.
@ErichVonHampter commented on GitHub (Oct 20, 2025):
@sigma-2 Thank u for u help. It now works as intendet! :)
@tr1plus commented on GitHub (Oct 20, 2025):
Just want to jump in to increase visibility and mention I have the same issue.
@j8ith commented on GitHub (Oct 20, 2025):
I also have the same issue with 2.1.0, rolling back to 2.0.1 confirmed working.
@mertalev commented on GitHub (Oct 20, 2025):
Immich 2.0.1 installed cuDNN 9.8, while 2.1.0 installs 9.14. It seems Pascal and Maxwell cards are no longer supported as of cuDNN 9.11 (see 9.10 vs 9.11), which explains why everyone with the issue has a Pascal or Maxwell GPU. The solution will be to pin to 9.10, and probably add a new CUDA 13 image variant so newer GPUs can continue to receive updates.
@mertalev commented on GitHub (Oct 20, 2025):
Would anyone care to try
ghcr.io/immich-app/immich-machine-learning:pr-23110-cuda?@DrSpaldo commented on GitHub (Oct 20, 2025):
Great thinking @mertalev , glad you were able to look into it and come up with a plan. I’ll give that build a go in a few hours. Should I test on the 1070 and 3060 or just 1070
@mertalev commented on GitHub (Oct 20, 2025):
The 1070 is the important one to test since I can confirm it works on my 4090, but if you can test both then even better.
@dasunsrule32 commented on GitHub (Oct 20, 2025):
It worked perfectly on my 1080ti. Android App timeline was busted on that build though. haha
@thardie commented on GitHub (Oct 20, 2025):
Confirmed
ghcr.io/immich-app/immich-machine-learning:pr-23110-cudaworks on my Tesla P40. Thanks!@j8ith commented on GitHub (Oct 20, 2025):
Confirmed ghcr.io/immich-app/immich-machine-learning:pr-23110-cuda works on my 1060. Thanks!
@Sacryn commented on GitHub (Oct 21, 2025):
According to the release notes of 9.11, everything older than Turing (GTX 16xx & RTX 20xx cards) has been dropped.
The pr-image worked like a charm for my old GTX 960.
@KristianKarl commented on GitHub (Oct 21, 2025):
Confirmed ghcr.io/immich-app/immich-machine-learning:pr-23110-cuda works on my GTX 960 Thanks!
@tr1plus commented on GitHub (Oct 21, 2025):
What would be the "official" way to have this supported long term now?
Will a seperate machine-learning container/image be provided?
I am using the official docker compose and use ansible to modify certain locations for my use case (e.g. enable Machine learning, ...) - so I would like to know the expected approach to modify my ansible scripts.
@Jonathan-Ddn commented on GitHub (Oct 21, 2025):
Had the exact same issue, can confirm @mertalev solution works for Quadro P400.
@mertalev commented on GitHub (Oct 21, 2025):
The plan is to add a -cuda-12 tag for the current CUDA image and (later) -cuda-13 for newer cards. -cuda will start pointing to the latest CUDA version we support as of 3.0 (as a breaking change), which will likely be CUDA 13. That means using -cuda-12 will be the best way to avoid disruption as long as we support it.
@yeeahnick commented on GitHub (Oct 21, 2025):
For TrueNAS SCALE users who want the test tag fix, here’s what worked for me:
Stop the Immich app.
Edit the ix_values.yaml file located at /mnt/.ix-apps/app_configs/immich/versions/1.10.7/ and change the image tag from v2.1.0-cuda to pr-23110-cuda.
Go to Apps → Configuration → Manage Container Images and manually pull the new image.
Start the Immich app.
Edit the Immich app and simply click Save to trigger a redeploy.
@MWP commented on GitHub (Oct 22, 2025):
Same problem here with a GTX1060.
The "pr-23110-cuda" image works.
@wajer1 commented on GitHub (Oct 29, 2025):
Same problem here with a tesla m4
The "pr-23110-cuda" image works.
@DrSpaldo commented on GitHub (Oct 30, 2025):
Do we still need to include the pr-23110-cuda tag on the ML container? Ie. Is this now included in 2.2.0?
Edit; never mind, I was searching the changelog for CUDA not cudnn - found this
@zvarnes commented on GitHub (Oct 30, 2025):
Can someone tell me where I supposed to put this tag?
Also the fix was merged? I'm struggling to understand the core issue here, and how it's still a broken change in the latest release.
@benjoon90 commented on GitHub (Oct 30, 2025):
It's supposed to go onto your compose file under the immich_machine_learning container.
immich-machine-learning:
container_name: immich_machine_learning
# For hardware acceleration, add one of -[armnn, cuda, rocm, openvino, rknn] to the image tag.
# Example tag: ${IMMICH_VERSION:-release}-cuda
image: ghcr.io/immich-app/immich-machine-learning:${IMMICH_VERSION:-release}-cuda
It's fixed with 2.2.0 for NVIDIA GTX1070. Remember to pull the latest image after editing the compose file.
@zvarnes commented on GitHub (Oct 30, 2025):
Okay... I guess the issue is resolved in the new version. I was seeing errors again, but it seems to be done to something else. Thanks for the quick reply
@yeeahnick commented on GitHub (Nov 3, 2025):
Issue is resolved but does that mean newer cards won't work? I am upgrading my Nvidia GPU (Pascal to Turing) since Truenas stopped supporting older cards and am wondering if it will work with Immich. Was the change reverted or is there some sort of intelligence to detect which driver to use?
Thanks
@DrSpaldo commented on GitHub (Nov 3, 2025):
@yeeahnick , yep, they are keeping the older version of CUDA to support the older cards. There will be a new variable at some stage for those people with newer cards
@yeeahnick commented on GitHub (Nov 3, 2025):
Until then what happens to TrueNAS users? The latest version (Goldeneye) stopped supporting the old driver in favor of the newer one. The Immich community app in TrueNAS should be updated asap unless I'm missing something.
@DrSpaldo commented on GitHub (Nov 3, 2025):
@yeeahnick, I don't use TrueNAS, so can't really comment too much on it. But, most apps just pull the container from the official ones, so going to 2.2.0 or newer should revert the cudnn version to the more compatible version. Have you tried updating the version? While it wasn't working, did you revert the changes you previously made in ix_values.yaml ? I would think it just needs to be back to https://github.com/truenas/apps/blob/master/ix-dev/community/immich/ix_values.yaml
@yeeahnick commented on GitHub (Nov 5, 2025):
All good, I installed the new Turing GPU (Quadro RTX 4000) and it's working with Immich 2.2.2 on TrueNAS.