Multiple regex-strings in DocumentCleaner? #8899
-
Hi! I didn't know where i should post this, but will start as a general question (if you think this is more suitable somewhere else, please let me know!). I am using the DocumentCleaner (https://github.com/deepset-ai/haystack/blob/main/haystack/components/preprocessors/document_cleaner.py) and have a few different regex i would like to apply to clean up my documents.
Is there anything i am missing? To me this looks inconsistent, but guess that someone might have thought about this.. |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 1 reply
-
Hi @danielsaaf Thanks for your question! We have an issue scheduled for our current sprint to extend the DocumentCleaner so this is helpful feedback! For the |
Beta Was this translation helpful? Give feedback.
Hi @danielsaaf Thanks for your question! We have an issue scheduled for our current sprint to extend the DocumentCleaner so this is helpful feedback! For the
remove_regex
parameter, I would say that if you have a few different regular expressions that you would like to apply, you can always combine them into a single regex with OR condition|
.