-
-
Notifications
You must be signed in to change notification settings - Fork 1.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add Aggregation user input _index #2031
Conversation
9e3088d
to
13dd5c6
Compare
This necessitated a change in plots.js to prevent the supply defaults step running twice with the second run acting on fullData rather then userData.
13dd5c6
to
75e6e48
Compare
@@ -1039,9 +1039,21 @@ plots.supplyTransformDefaults = function(traceIn, traceOut, layout) { | |||
_module = transformsRegistry[type], | |||
transformOut; | |||
|
|||
/* | |||
* Supply defaults may run twice. First pass runs all supply defaults steps |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Questions:
- I assume this refers to the way
supplyTraceDefaults
runs on traces and then again on expanded traces? - Which
_module
is added? I think that sometimes refers to the trace module and sometimes to the transform module (and every now and then to the base plot module, though usually it's calledbaseModule
or something, I think)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Answer to second: the transform module.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yep that is what it is referring to
src/plots/plots.js
Outdated
* and adds the _module to any output transforms. | ||
* If transforms exist another pass is run so that any generated traces also | ||
* go through supply defaults. This has the effect of rerunning | ||
* supplyTransforms. If the transform does not have a `transform` function |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
*supplyTransformDefaults
?
src/plots/plots.js
Outdated
* it could not have generated any new traces and the second stage is | ||
* unnecessary. We detect this case with the following variables. | ||
*/ | ||
var isSecondStage = transformIn._module && transformIn._module === _module, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Is a third stage possible if multiple transforms expand things multiple times? (e.g. groupby + groupby) I think this makes sense to me, but just making sure transform module identity equates to isSecondStage
. (Or is it isNotFirstStage
?) My understanding is just a little loose here.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
right, more than 2 stages... need to think about that. Suggestions?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Right I think we are ok. I should call this variable isNotFirstStage
though. Thanks
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Things get fundamentally a bit shaky after one level. Like groupby
+ groupby
. It does mostly what you'd expect, but it turns out styling is the big challenge. Think SQL. You massage the query until you have a nice query that spits out data. Then you insert it into a plot and you're happy. Plotly's transforms incrementally apply styles as a sequence of transforms is applied so that you'd have to track styles as well as the pathway a data point took through the query to really sort things out after the fact. Though I agree that I think we're okay.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Sorry, specifics: groupby
adds a _group
property to traces so that you can figure out which group split generated a trace. But it should really be tracking that as a transform index and a group value. This comes into play with transform vs. calcTransform since you start having to worry about when a trace was split/grouped/styled/etc.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Apologies if that's still a bit loose on the specifics. /cc @alexcjohnson since we've talked this through before in a fair bit of detail. I think our conclusion at the time was that the basic use cases are handled just fine and account for 97+% of realistic use-cases.
}); | ||
|
||
expect(enabledAggregations[0].target).toEqual('x'); | ||
expect(enabledAggregations[0]._index).toEqual(0); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Is there any way to test this that doesn't rely upon an underscored property? Like could supplyTransformDefaults
delete the first aggregation if it's overridden by another? Would that be adequate to sort things out in the workspace or does the workspace need to know indices?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The only reason for indicies is for Workspace. supplyTransformDefaults doesn't care about them.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I need a way in Workspace to take a fullData transform Aggregration and write back to the correct userData index. I need some way to correlate userData indicies with the fullData equivalents.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
maybe it shouldn't be underscored? Something like .inputIndex
. the only problem with that is there are precedents for things being added to fullData for consumption by users that have an underscore. _input
for example.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
So the workspace accesses indices directly? (That's okay, just making sure. plotly.js's 'private' properties are fairly publicly accessed, so it's nothing new. Just means they need to be modified with great care.)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
yep, and the workspace is the only thing that needs them. Plotly.js doesn't care as it rebuilds new arrays each time.
Above corner-case comments notwithstanding, this all looks reasonable to me. |
Ok. Looks like @rreusser approves this. 💃 for me @bpostlethwaite This will be part of |
fixes #2024
and adds _index to fullData
aggregation.aggregations
to make back referencing into user data possible.cc @alexcjohnson