[DOCS]: Add wording to Text splitting & Chunking Preferences to explain current strategy #934

Open
opened 2026-02-28 05:04:52 -05:00 by deekerman · 1 comment
Owner

Originally created by @chrisn-au on GitHub (May 18, 2024).

Description

Team, great project, thanks. I am just getting up to speed and have been struggling to understand the appropriate chunking strategy for my data. While I have no doubt there is a plan to add alternative chunking strategies in the short term, I wanted to check that you are OK with me opening a pull request to add some wording to the Text splitting & Chunking Preferences tab explaining that it uses LangChain's RecursiveCharacterTextSplitter, plus a link to an explanation of that strategy. I am happy to do the pull request. Again, awesome project, well done.
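
For readers unfamiliar with the strategy mentioned above, here is a minimal sketch of the idea behind recursive character splitting: try the coarsest separator first (paragraph breaks), and fall back to finer ones (lines, words, single characters) only for pieces that are still too long. This is an illustrative simplification and not LangChain's or AnythingLLM's actual code. The function name `recursive_split` is hypothetical, and chunk overlap, which the real splitter supports, is omitted.

```python
def recursive_split(text, chunk_size, separators=("\n\n", "\n", " ", "")):
    """Split text into chunks of at most chunk_size characters.

    Simplified illustration only: no chunk overlap, and separators are
    dropped at chunk boundaries.
    """
    if chunk_size < 1:
        raise ValueError("chunk_size must be at least 1")
    if len(text) <= chunk_size:
        return [text] if text else []

    # Pick the first (coarsest) separator that occurs in the text.
    # The empty string means "split into single characters".
    sep, finer = "", ()
    for i, candidate_sep in enumerate(separators):
        if candidate_sep == "" or candidate_sep in text:
            sep, finer = candidate_sep, separators[i + 1:]
            break

    parts = list(text) if sep == "" else text.split(sep)

    chunks, current = [], ""
    for part in parts:
        if not part:
            continue  # skip empty pieces produced by repeated separators
        merged = part if not current else current + sep + part
        if len(merged) <= chunk_size:
            current = merged  # keep packing pieces into the current chunk
            continue
        if current:
            chunks.append(current)
            current = ""
        if len(part) <= chunk_size:
            current = part
        else:
            # The piece alone is too long: recurse with finer separators.
            chunks.extend(recursive_split(part, chunk_size, finer))
    if current:
        chunks.append(current)
    return chunks


if __name__ == "__main__":
    sample = "aaa bbb\n\nccc ddd eee"
    for chunk in recursive_split(sample, chunk_size=7):
        print(repr(chunk))
```

The effect is that paragraphs stay together when they fit within the chunk size, and text is only broken at word or character level when no coarser boundary is available.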

Author
Owner

@timothycarambat commented on GitHub (May 18, 2024):

I think a small link in the UI would be appropriate, but even more so, a new page under https://docs.useanything.com/anythingllm-setup/embedder-configuration/overview that explains this in more detail would be the best solution.

Those docs are open source, so lengthy or wordy descriptions should for sure be offloaded there! https://github.com/Mintplex-Labs/anythingllm-docs
