Fix attention fusion in conformer encoder #23711

kunal-vaishnavi · 2025-02-15T01:27:24Z

Description

This PR updates the attention fusion for conformer-encoder models. It is a follow-up to this PR.

Motivation and Context

Subsequent modeling code updates have changed (and will continue to change) the graph fusions. However, the three ending attention mask nodes (Cast --> Unsqueeze --> Equal) will remain. Thus, the attention fusion should work regardless of any future modeling code changes when handling the attention mask.

This reverts commit 47a0077.

### Description This PR updates the attention fusion for conformer-encoder models. It is a follow-up to [this PR](#23528). ### Motivation and Context Subsequent modeling code updates have changed (and will continue to change) the graph fusions. However, the three ending attention mask nodes (`Cast --> Unsqueeze --> Equal`) will remain. Thus, the attention fusion should work regardless of any future modeling code changes when handling the attention mask.

Fix attention fusion in conformer-encoder

7f9087e

kunal-vaishnavi requested a review from hanbitmyths February 15, 2025 01:27

hanbitmyths approved these changes Feb 15, 2025

View reviewed changes

kunal-vaishnavi merged commit 47a0077 into microsoft:main Feb 16, 2025
92 of 94 checks passed

jingyanwangms added a commit that referenced this pull request Feb 18, 2025

Revert "Fix attention fusion in conformer encoder (#23711)"

f09f319

This reverts commit 47a0077.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix attention fusion in conformer encoder #23711

Fix attention fusion in conformer encoder #23711

kunal-vaishnavi commented Feb 15, 2025

Fix attention fusion in conformer encoder #23711

Fix attention fusion in conformer encoder #23711

Conversation

kunal-vaishnavi commented Feb 15, 2025

Description

Motivation and Context