Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

VCLAMP ASAN fix for F32 and F16 Neon #5918

Merged
merged 1 commit into from
Jan 13, 2024
Merged

Conversation

copybara-service[bot]
Copy link
Contributor

@copybara-service copybara-service bot commented Jan 13, 2024

VCLAMP ASAN fix for F32 and F16 Neon

  • Selective loads for remainder handler
  • Add f32-vclamp-bench benchmark
  • Add xnn_f32_vclamp_ukernel__neon_u16 for 4 neon vector units
  • Enable xnn_f32_vclamp_ukernel__neon_u16
  • vld2_dup to load params

@copybara-service copybara-service bot changed the title Add f32-vclamp-bench benchmark VCLAMP ASAN fix for F32 and F16 Neon Jan 13, 2024
@copybara-service copybara-service bot force-pushed the test_598075809 branch 2 times, most recently from eb526dc to f2ca05a Compare January 13, 2024 21:29
- Selective loads for remainder handler
- Add f32-vclamp-bench benchmark
- Add xnn_f32_vclamp_ukernel__neon_u16 for 4 neon vector units
- Enable xnn_f32_vclamp_ukernel__neon_u16
- vld2_dup to load params

PiperOrigin-RevId: 598236479
@copybara-service copybara-service bot merged commit edd71bc into master Jan 13, 2024
@copybara-service copybara-service bot deleted the test_598075809 branch January 13, 2024 21:30
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant