How to stack embedding and pass the gradients? #108
Comments
Suppose the output of N1 and N2 on a sample x is N1(x) and N2(x) separately, what you want is to concatenate their outputs (i.e., [N1(x), N2(x)]) and pass the result to downstream layers, right?
Exactly: either concat or mean.
We need to pass the gradients and train end to end.
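For illustration, a minimal PyTorch sketch (the networks and layer sizes are made up, not from this thread) showing that both concatenation and averaging are ordinary differentiable ops, so gradients reach both N1 and N2:

```python
import torch
import torch.nn as nn

# Two hypothetical base networks with matching output sizes.
n1 = nn.Sequential(nn.Linear(10, 8), nn.ReLU(), nn.Linear(8, 4))
n2 = nn.Sequential(nn.Linear(10, 8), nn.ReLU(), nn.Linear(8, 4))

x = torch.randn(32, 10)
out1, out2 = n1(x), n2(x)

# Both aggregations are plain tensor ops, so autograd pushes gradients
# back into N1 and N2 during end-to-end training.
concat = torch.cat([out1, out2], dim=1)               # shape (32, 8)
mean = torch.stack([out1, out2], dim=0).mean(dim=0)   # shape (32, 4)

loss = concat.sum() + mean.sum()   # dummy loss, only to check gradient flow
loss.backward()
print(n1[0].weight.grad is not None, n2[0].weight.grad is not None)  # True True
```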
Hi, if you are going to take the mean of the outputs from all base estimators, the fusion ensemble is exactly what you want.
As to the concatenation, it seems a bit odd: all base estimators in the ensemble are doing the same thing, which makes concatenating their outputs somewhat redundant. Is there any paper or technical report demonstrating the effectiveness of concatenating the outputs of base estimators?
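For reference, a minimal sketch of the fusion ensemble mentioned above, assuming torchensemble's FusionClassifier API; the MLP base estimator and the toy data loader are placeholders:

```python
import torch
import torch.nn as nn
from torch.utils.data import DataLoader, TensorDataset
from torchensemble import FusionClassifier

# A small base estimator (hypothetical architecture, for illustration only).
class MLP(nn.Module):
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(20, 32), nn.ReLU(), nn.Linear(32, 2))

    def forward(self, x):
        return self.net(x)

# Toy data loader.
X, y = torch.randn(256, 20), torch.randint(0, 2, (256,))
train_loader = DataLoader(TensorDataset(X, y), batch_size=32, shuffle=True)

# The fusion ensemble averages the outputs of all base estimators before
# computing the loss, so the whole ensemble trains end to end.
model = FusionClassifier(estimator=MLP, n_estimators=2, cuda=False)
model.set_optimizer("Adam", lr=1e-3)
model.fit(train_loader, epochs=5)
```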
N1, N2, …, Nx are different NN models.
We aggregate their embedding outputs through concatenation:
BigX = [X1, …, Xn]
and feed BigX into another NN (i.e., a merging network).
This is used extensively (e.g., in Siamese networks…).
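A minimal end-to-end sketch of this pattern (all architectures and sizes here are hypothetical, not from the thread): two different encoders, their embeddings concatenated into BigX and fed to a merging network, trained with a single backward pass:

```python
import torch
import torch.nn as nn

# Two different base encoders; only their embedding dimensions need to be
# known in order to build the merging network.
class EncoderA(nn.Module):
    def __init__(self):
        super().__init__()
        self.body = nn.Sequential(nn.Linear(20, 64), nn.ReLU(), nn.Linear(64, 16))

    def forward(self, x):
        return self.body(x)          # embedding X1, shape (batch, 16)

class EncoderB(nn.Module):
    def __init__(self):
        super().__init__()
        self.body = nn.Sequential(nn.Linear(20, 32), nn.ReLU(), nn.Linear(32, 8))

    def forward(self, x):
        return self.body(x)          # embedding X2, shape (batch, 8)

class MergedModel(nn.Module):
    """Concatenate the encoders' embeddings and feed them to a merging NN."""
    def __init__(self):
        super().__init__()
        self.n1, self.n2 = EncoderA(), EncoderB()
        self.merge = nn.Sequential(nn.Linear(16 + 8, 32), nn.ReLU(), nn.Linear(32, 2))

    def forward(self, x):
        big_x = torch.cat([self.n1(x), self.n2(x)], dim=1)   # BigX = [X1, X2]
        return self.merge(big_x)

model = MergedModel()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
criterion = nn.CrossEntropyLoss()

x, y = torch.randn(32, 20), torch.randint(0, 2, (32,))
loss = criterion(model(x), y)
optimizer.zero_grad()
loss.backward()    # gradients flow through the merge head into both encoders
optimizer.step()
```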
We are NOT dealing with the output!
The output is of little use for end-to-end training…
We are dealing with the last embedding.
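One common way to expose that last embedding, sketched here under the assumption that the base model is a plain Sequential whose final layer is the output head:

```python
import torch
import torch.nn as nn

# A hypothetical classifier whose last hidden layer we treat as "the embedding"
# (the layer just before the output head).
classifier = nn.Sequential(
    nn.Linear(20, 64), nn.ReLU(),
    nn.Linear(64, 16), nn.ReLU(),   # <- last embedding comes out here
    nn.Linear(16, 2),               # output head, dropped when stacking embeddings
)

# Drop the output head and keep the rest as the encoder.
encoder = nn.Sequential(*list(classifier.children())[:-1])

x = torch.randn(4, 20)
embedding = encoder(x)              # shape (4, 16), still differentiable
print(embedding.shape)
```

When the model is not a plain Sequential, a forward hook on the penultimate layer achieves the same thing without rebuilding the module.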
Thanks for your kind explanation. Heterogeneous ensembles are not supported yet, since we have not come up with a succinct way of setting different optimizers for different base estimators 😢.
Sure.
For a first version, maybe we can use the same optimizer and scheduler for the whole ensemble model.
The goal is to have a one-liner for easy end-to-end ensembling.
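A rough sketch of that first version, assuming one shared optimizer and scheduler over the parameters of all (hypothetical, heterogeneous) sub-models:

```python
import itertools
import torch
import torch.nn as nn

# Hypothetical heterogeneous base estimators plus a merging head.
n1 = nn.Sequential(nn.Linear(20, 16), nn.ReLU())
n2 = nn.Sequential(nn.Linear(20, 8), nn.Tanh())
merge = nn.Linear(16 + 8, 2)

# One optimizer (and one scheduler) covering every sub-model, which sidesteps
# the per-estimator optimizer question for a first version.
params = itertools.chain(n1.parameters(), n2.parameters(), merge.parameters())
optimizer = torch.optim.Adam(params, lr=1e-3)
scheduler = torch.optim.lr_scheduler.StepLR(optimizer, step_size=10, gamma=0.5)

x, y = torch.randn(32, 20), torch.randint(0, 2, (32,))
loss = nn.functional.cross_entropy(merge(torch.cat([n1(x), n2(x)], dim=1)), y)
optimizer.zero_grad()
loss.backward()
optimizer.step()
scheduler.step()
```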
Kind of busy these days, will appreciate a PR very much ;-)
I have two neural nets, N1 and N2, and want to stack their output embedding layers.
How can I do this?