mirror of
https://github.com/Mintplex-Labs/anything-llm.git
synced 2026-03-02 22:57:05 -05:00
[FEAT]: Allow Agent Skills to Access Images Attached or Pasted in Chat #3210
Labels
No labels
Desktop
Docker
Integration Request
Integration Request
OS: Linux
OS: Mobile
OS: Windows
UI/UX
blocked
bug
bug
core-team-only
documentation
duplicate
embed-widget
enhancement
feature request
github_actions
good first issue
investigating
needs info / can't replicate
possible bug
question
stage: specifications
wontfix
No milestone
No project
No assignees
1 participant
Notifications
Due date
No due date set.
Dependencies
No dependencies set.
Reference
starred/anything-llm#3210
Loading…
Add table
Add a link
Reference in a new issue
No description provided.
Delete branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Originally created by @LudoE11 on GitHub (Feb 25, 2026).
What would you like to see?
Context
I'm building a custom agent skill for AnythingLLM that edits images using an external API (Replicate) to call an AI Image editing model. The ideal user experience is for users to simply attach or paste an image into the chat and have the agent skill process it automatically based on their prompt (send the input image and the prompt to the API).
Current Situation
Currently, when a user sends a message to an agent skill (e.g., using
@agent), only the text content of the message is passed to the skill handler. Any images attached or pasted into the chat are not accessible to the skill—there is no URL, file path, or base64 data provided. As far as I know.Problem
This limitation means users must manually convert images to base64 or upload them to a specific folder, which adds friction and makes the workflow less user-friendly. The skill cannot process images directly from the chat, which is a common and expected use case for image editing or analysis skills.
Questions
Desired Feature
It would be very useful and helfpuf if you could provide a way for agent skills to access images that users attach or paste into the chat, ideally as a URL, file path, or base64 string, so that skills can process them directly and provide a seamless user experience for image-based workflows.
Thanks in advance for your feedback.