Skip to content

Added weight sharing options and pos enc

Compare
Choose a tag to compare
@tatp22 tatp22 released this 17 Jun 21:45
· 77 commits to master since this release

Added the none, headwise, kv, and layerwise parameter sharing options. Also, added positional encodings