All you have to care is b=c
and you are done:
m1: [a x b], m2: [c x d]
m1
is [a x b]
which is [batch size x in features]
m2
is [c x d]
which is [in features x out features]
All you have to care is b=c
and you are done:
m1: [a x b], m2: [c x d]
m1
is [a x b]
which is [batch size x in features]
m2
is [c x d]
which is [in features x out features]