Added weight sharing options and positional encodings
Added the none, headwise, kv, and layerwise parameter sharing options. Also added positional encodings.