Allowing: Minimum Word Length & Frequencies of Numbers #465
Added self.min_word_length=0. Added conditional filter in WordCloud.process_text() for word length. When parameter set to positive integer, remove words that do not exceed the minimum word length.
.vscode/settings.json
@@ -0,0 +1,4 @@
{
can you please remove this file?
I would maybe not add two more examples as there already are quite a few. Can you maybe add one and also add tests for the two new options? Thanks!
Will do a clean pull request again, with just one commit and remove the files. Sorry about that.
You can also just change this one, with squash & merge it doesn't matter.
… get removed in word_cloud. Fix to allow single letter inclusion in RegEx later.
looks good, thanks!
Added parameters to WordCloud() to allow including numbers in the word frequencies, as well as filtering by minimum word length.
Including numbers is useful for visualizing patterns in numeric data; the dummy example uses random numbers and the first 10,000 digits of pi.
Minimum word length is a simple way of filtering out function words in foreign languages without having to write stopword lists or use pre-defined function word collections. The example filters A New Hope by length rather than by stopwords.
Note: these are obviously not the most efficient implementations, but I kept them in line with the style of the rest of the package.
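For readers skimming the thread, here is a rough standalone sketch of the two filters described above. It is not the code from this PR: `filter_tokens`, its tokenizing regex, and the inclusive `>=` comparison are assumptions made for illustration, and only the parameter name `min_word_length` comes from the description (`include_numbers` is a hypothetical name for the numbers option).

```python
import re


def filter_tokens(text, min_word_length=0, include_numbers=False):
    """Hypothetical stand-in for the filtering step described in this PR."""
    # Split the text into word-like tokens (assumed tokenizer, not the package's).
    tokens = re.findall(r"\w[\w']*", text)
    if not include_numbers:
        # Drop tokens that consist only of digits.
        tokens = [t for t in tokens if not t.isdigit()]
    if min_word_length:
        # Keep tokens with at least min_word_length characters
        # (whether the bound is inclusive is an assumption here).
        tokens = [t for t in tokens if len(t) >= min_word_length]
    return tokens


if __name__ == "__main__":
    line = "A long time ago in a galaxy far far away 1977"
    print(filter_tokens(line, min_word_length=4))
    print(filter_tokens("3 1 4 1 5 pi", include_numbers=True))
```

With a minimum length of 4, short function words such as "A", "in" and "far" fall away without any stopword list, which is the use case the description has in mind.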