Update TrackClusterMergeSplitter to output track-cluster associations (PFA0) #1699

ruse-traveler · 2025-01-09T22:57:46Z

Briefly, what does this PR introduce?

This PR updates the TrackClusterMergeSplitter algorithm to output both edm4eic::TrackClusterMatch and MC paritcle-cluster associations. In this process, it reaps what was sown by originally writing the algorithm to operate on protoclusters rather than clusters: the algorithm will now ingest fully formed cluster and update relevant quantities.

What kind of change does this PR introduce?

Bug fix (issue #__)
New feature (issue Update Track-Cluster Merge/Splitter to output Track-Cluster Associations #1645 )
Documentation update
Other: __

Please check if this PR fulfills the following:

Tests for the changes have been added
Documentation has been added / updated
Changes have been communicated to collaborators

Does this PR introduce breaking changes? What changes might users need to make to their code?

No.

Does this PR change default behavior?

Yes. Track-cluster and MC particle-cluster associations will now be produced by the algorithm.

for more information, see https://pre-commit.ci

…/EICrecon into output-splitmerge-track-associations

for more information, see https://pre-commit.ci

…/EICrecon into output-splitmerge-track-associations

github-actions · 2025-02-04T22:13:50Z

Capybara summary for PR 1699

rec_dis_10x100_minQ2=0_craterlake
rec_dis_10x100_minQ2=1000_craterlake_tracking_only
rec_dis_18x275_minQ2=0_craterlake_18x275
rec_dis_18x275_minQ2=1000_craterlake_18x275
rec_dis_5x41_minQ2=0_craterlake_5x41
rec_e_1GeV_20GeV_craterlake
rec_pi_1GeV_20GeV_craterlake
^{_{Last updated 2025-03-12T22:00-04:00 f79708b}}

for more information, see https://pre-commit.ci

veprbl · 2025-02-06T21:07:38Z

src/algorithms/calorimetry/TrackClusterMergeSplitter.cc

+  // --------------------------------------------------------------------------
+  //! Calculate cluster shape parameters
+  // --------------------------------------------------------------------------
+  /*! Calculation originally written by Chao Peng, Dhevan Gangadharan,
+   *  and Sebouh Paul.  Code is copied from the `CalorimeterClusterRecoCoG`
+   *  algorithm.
+   */
+  void TrackClusterMergeSplitter::calculate_shape_parameters(edm4eic::MutableCluster& clust) const {


If this factorizes so well, we might as well move it to a separate factory.

(This is already a big PR. If you decide to go along with my suggestion, you could refactor existing code separately, and then rebase this to re-use that new facility)

I'd be very much in favor of that! Keeps it nice and modular 😉

But how would this work in the data model? Would we have one collection without shape parameters filled in (likely not saved by default) and then one with them filled in (saved by default)?

Started work on this in #1734 !

Rebase done! One new wrinkle I can immediately spot is how to handle the track-cluster matches: those will be pointing back to the split/merged clusters without cluster shapes...

A proposal: what if we had a new meta algorithm for copying associations? This could be useful for other algorithms that are one-to-one transformations. I have some ideas about how this might be implemented...

Regarding general studies using the matches, aren't these a bit special? In the sense that they were used to construct resulting clusters, but they might be not what a track-cluster matcher would find by looking at those clusters.

I think, we can be optimistic about our ability to follow up with a relation re-writer.

Regarding general studies using the matches, aren't these a bit special? In the sense that they were used to construct resulting clusters, but they might be not what a track-cluster matcher would find by looking at those clusters.

True! The matching procedure here does differ in that regard! But it's not clear to me how we would propagate the track-merged cluster relation downstream without outputting updated track-cluster matches...

I'm only talking about what is being written to the file. IIRC "dangling" relations are not allowed (there is a bug in PODIO which allows for OneToOne relations to not get checked, but that may be fixed at some point). To fix those, we would need to also save clusters without the shapes.

Ahhhh! I understand now! 😅
I'm absolutely good with disabling them in the output! 👍

Just pushed changes to disable the track-cluster associations!

src/detectors/EHCAL/EHCAL.cc

veprbl · 2025-02-06T21:21:35Z

src/algorithms/calorimetry/TrackClusterMergeSplitter.h

+  using MatrixF = std::vector<std::vector<float>>;
+  using VecMatrix = std::vector<MatrixF>;


Looks like, in some places you try to use maps, and here you give up and just rely on array indices? Is it true that every hit has a weight related to every cluster and every projection?

So I do think we need a weight that's related to every cluster and every projection pointing to the merged cluster:

the former is because the merged cluster will (naturally) be composed of all of the hits of the clusters being merged,

and the latter is because we'll need to know the sum of all projections' momenta for the weight, so that summing over the split clusters should give the merged cluster energy before splitting.

(Unless I'm misunderstanding how the splitting should work!)

All that being siad, I probably could rework it to use maps instead of indices. I was thinking of this map of weights (hit_weight = hit_weight(cluster, projection)) as a "matrix", and this was an easy way of implementing a dynamic 2D matrix of floats 🤷

So I do think we need a weight that's related to every cluster and every projection pointing to the merged cluster:

I think you're right. Looks like this only involves hits from the cluster group.

All that being siad, I probably could rework it to use maps instead of indices. I was thinking of this map of weights (hit_weight = hit_weight(cluster, projection)) as a "matrix", and this was an easy way of implementing a dynamic 2D matrix of floats 🤷

Ideally we write the code to maximize clarity. Having a consistent use of maps could help with that, if there is a neat way to use those. Using maps also provides basic sanity checks to the code.

If you have a preference to use tensors here, then just remember std::vector<std::vector<std::vector< has triple dereferences/allocations and spoils data locality, so it won't win that much in terms of performance. We already use eigen library, that could be better for your case, perhaps.

I see! I'll give it some thought and let you know which route (map or eigen) I go with!

After mulling it over, I think it a natural way might be to actually use a mix of maps and eigen: for each projection, we know how many clusters and hits we're going to need so we could easily define a map of projections onto MatrixXd! Let me give it a shot...

I stand corrected: it was much easier to just use std::map<edm4eic::CalorimeterHit, double>. Just pushed some changes that rewrite the splitting calculation to use that (it's a little cleaner now).

src/algorithms/calorimetry/TrackClusterMergeSplitter.h

…/EICrecon into output-splitmerge-track-associations

for more information, see https://pre-commit.ci

…/EICrecon into output-splitmerge-track-associations

ruse-traveler added 3 commits January 9, 2025 15:45

Add hooks for track-cluster match outputs

68672a9

Update algorithm to operate on clusters

90a9ccc

Begin filling in cluster reconstruction calculation

335d224

github-actions bot added topic: calorimetry relates to calorimetry topic: barrel topic: forward topic: backward labels Jan 9, 2025

pre-commit-ci bot and others added 14 commits January 9, 2025 22:58

[pre-commit.ci] auto fixes from pre-commit.com hooks

68fa03d

for more information, see https://pre-commit.ci

Add position calculation

f8605e8

Fix typos in input collection names

de78ffb

Merge branch 'output-splitmerge-track-associations' of github.com:eic…

1d04aa8

…/EICrecon into output-splitmerge-track-associations

Add missing edm4eic version header

30b92d3

Be safer with creating new clusters

021e11b

Rework weight calculation to accomodate cluster re-reconstruction

0307731

Fill track-cluster match output

f8d2050

[pre-commit.ci] auto fixes from pre-commit.com hooks

3dab6d8

for more information, see https://pre-commit.ci

Add missed edm4eic version header

bf31964

Merge branch 'output-splitmerge-track-associations' of github.com:eic…

178ecb8

…/EICrecon into output-splitmerge-track-associations

Add shape calculation

b4b0aca

Wire in associations

303f3f2

Copy associations of unused clusters into output

a2ff2f1

ruse-traveler temporarily deployed to github-pages February 4, 2025 22:12 — with GitHub Actions Inactive

Fill in mergerd cluster associations

f7476b4

ruse-traveler marked this pull request as ready for review February 4, 2025 22:53

[pre-commit.ci] auto fixes from pre-commit.com hooks

14deb24

for more information, see https://pre-commit.ci

pre-commit-ci bot temporarily deployed to github-pages February 4, 2025 23:33 Inactive

ruse-traveler requested review from Chao1009, veprbl and steinber February 5, 2025 15:13

veprbl reviewed Feb 6, 2025

View reviewed changes

src/detectors/EHCAL/EHCAL.cc Outdated Show resolved Hide resolved

veprbl reviewed Feb 6, 2025

View reviewed changes

src/algorithms/calorimetry/TrackClusterMergeSplitter.h Outdated Show resolved Hide resolved

ruse-traveler added 2 commits February 11, 2025 13:16

Merge branch 'main' into output-splitmerge-track-associations

b5a364c

Fix HcalEndcapNClusterAssociations typo

8141330

ruse-traveler mentioned this pull request Feb 11, 2025

Cluster Shape Parameter Calculation Could Be Factorized #1733

Closed

ruse-traveler added 2 commits February 11, 2025 15:19

Merge branch 'output-splitmerge-track-associations' of github.com:eic…

1c17703

…/EICrecon into output-splitmerge-track-associations

Template ObjectID comparator

0c75eb9

ruse-traveler temporarily deployed to github-pages February 11, 2025 21:23 — with GitHub Actions Inactive

ruse-traveler and others added 3 commits February 22, 2025 07:02

Rewrite splitting weight calculation to use maps

bd55165

Merge branch 'main' into output-splitmerge-track-associations

32af87f

[pre-commit.ci] auto fixes from pre-commit.com hooks

cc282d2

for more information, see https://pre-commit.ci

pre-commit-ci bot temporarily deployed to github-pages February 22, 2025 12:38 Inactive

ruse-traveler added 4 commits March 10, 2025 09:58

Merge main

143df2a

Remove shape calculation from merge/splitter

8288600

Merge branch 'output-splitmerge-track-associations' of github.com:eic…

30c9ab2

…/EICrecon into output-splitmerge-track-associations

Make split/merge shape parameters consistent with RecoCoG

9d33912

ruse-traveler temporarily deployed to github-pages March 10, 2025 17:37 — with GitHub Actions Inactive

IWYU

a8537b1

veprbl temporarily deployed to github-pages March 11, 2025 21:59 — with GitHub Actions Inactive

ruse-traveler added 3 commits March 12, 2025 21:15

Merge branch 'main' into output-splitmerge-track-associations

004b053

Disable saving split/merge track-cluster matches to output

87c0458

Merge branch 'output-splitmerge-track-associations' of github.com:eic…

f79708b

…/EICrecon into output-splitmerge-track-associations

ruse-traveler temporarily deployed to github-pages March 13, 2025 02:00 — with GitHub Actions Inactive

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update TrackClusterMergeSplitter to output track-cluster associations (PFA0) #1699

Update TrackClusterMergeSplitter to output track-cluster associations (PFA0) #1699

ruse-traveler commented Jan 9, 2025 •

edited

Loading

github-actions bot commented Feb 4, 2025 •

edited

Loading

veprbl Feb 6, 2025

veprbl Feb 6, 2025

ruse-traveler Feb 9, 2025

ruse-traveler Feb 11, 2025

ruse-traveler Mar 10, 2025

veprbl Mar 12, 2025

ruse-traveler Mar 12, 2025

veprbl Mar 12, 2025

ruse-traveler Mar 12, 2025

ruse-traveler Mar 13, 2025

veprbl Feb 6, 2025

ruse-traveler Feb 9, 2025 •

edited

Loading

veprbl Feb 11, 2025

ruse-traveler Feb 11, 2025

ruse-traveler Feb 22, 2025

ruse-traveler Feb 22, 2025

		using MatrixF = std::vector<std::vector<float>>;
		using VecMatrix = std::vector<MatrixF>;

Update TrackClusterMergeSplitter to output track-cluster associations (PFA0) #1699

Are you sure you want to change the base?

Update TrackClusterMergeSplitter to output track-cluster associations (PFA0) #1699

Conversation

ruse-traveler commented Jan 9, 2025 • edited Loading

Briefly, what does this PR introduce?

What kind of change does this PR introduce?

Please check if this PR fulfills the following:

Does this PR introduce breaking changes? What changes might users need to make to their code?

Does this PR change default behavior?

github-actions bot commented Feb 4, 2025 • edited Loading

Capybara summary for PR 1699

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

ruse-traveler Feb 9, 2025 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

ruse-traveler commented Jan 9, 2025 •

edited

Loading

github-actions bot commented Feb 4, 2025 •

edited

Loading

ruse-traveler Feb 9, 2025 •

edited

Loading