ARK v0.3.0
- Enable heuristic model graph optimization
- Revise Python interfaces
- Add more operators and support mixed-precision models and bfloat16
- Add a Llama2-7B example
- Fix connection setup bugs for large & distributed models
- Fix correctness bugs in a few operators
- Minor scheduler improvements
See #113 for details.