Add batch splitting in attention layer to hide NIC latency(#14) #4959
build_pr_documentation.yml
on: pull_request
build_documentation
3m 14s
Artifacts
Produced during runtime
Name | Size | |
---|---|---|
doc-build-artifact
|
219 KB |
|