MSCCL++ v0.3.0
- Updated interfaces
- Add Python bindings and interfaces
- Add Python unit tests
- Add more configurable parameters
- Add a new single-node AllReduce kernel
- Fix bugs
See details from #89.
Full Changelog: v0.2.0...v0.3.0
See details from #89.
Full Changelog: v0.2.0...v0.3.0