`DotLayer` reduce over dynamic axis should respect the seq mask #629

albertz · 2021-09-03T23:33:56Z

DotLayer should respect the dyn size (or seq mask) when you reduce dynamic axes, just like ReduceLayer etc would do over dynamic axes. The DotLayer currently completely ignores this. This is fine when the user explicitly performs masking beforehand. E.g. SoftmaxOverSpatialLayer will do the right thing. However, otherwise it will be wrong in general. This is esp also a problem for the concept of dynamic axes with extended dynamic sizes.

Implementing this for DotLayer is not so nice, though, because this additional masking is not needed for the common case where there was a SoftmaxOverSpatialLayer before. The additional masking is not wrong, but it would make it a bit slower. So we definitely want to avoid this. But how can we know when this can be skipped? It could explicitly check whether one of the inputs comes from SoftmaxOverSpatialLayer but this would be very ugly, hacky, and also fail in some cases (e.g. by any automatic internal wrapping layers such as SelectSearchSourcesLayer). But is there a better more generic way? Somehow some way that the input can say "I'm already masked, padded values are 0, no need to mask again". Other layers like ReduceLayer could also use this information. The padded values are also relevant. For DotLayer we want 0. For ReduceLayer with sum or avg we want 0. For ReduceLayer with max or also SoftmaxOverSpatialLayer we want -inf. But then, I'm not sure if we are maybe making it too complicated in end.

Originally posted by @albertz in #391 (comment)

The text was updated successfully, but these errors were encountered:

Fix #629

This was referenced Sep 3, 2021

CumConcatLayer #589

Merged

DotLayer, mask for dynamic axis #631

Merged

This comment has been minimized.

Sign in to view

albertz mentioned this issue Sep 7, 2021

DotLayer, use single reduce argument #636

Closed

albertz added a commit that referenced this issue Sep 7, 2021

DotLayer, mask dyn axes when needed

51b25e2

Fix #629

albertz added a commit that referenced this issue Sep 7, 2021

DotLayer, mask dyn axes when needed

61a7f70

Fix #629

albertz closed this as completed in e0367d2 Sep 7, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

`DotLayer` reduce over dynamic axis should respect the seq mask #629

`DotLayer` reduce over dynamic axis should respect the seq mask #629

albertz commented Sep 3, 2021

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

DotLayer reduce over dynamic axis should respect the seq mask #629

DotLayer reduce over dynamic axis should respect the seq mask #629

Comments

albertz commented Sep 3, 2021

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

`DotLayer` reduce over dynamic axis should respect the seq mask #629

`DotLayer` reduce over dynamic axis should respect the seq mask #629