Skip to content

Ongoing research training transformer models at scale

License

Notifications You must be signed in to change notification settings

kungfu-team/Megatron-LM

About

Ongoing research training transformer models at scale

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

  • Python 92.7%
  • C++ 4.9%
  • Shell 1.6%
  • Cuda 0.5%
  • C 0.2%
  • HTML 0.1%