-
Notifications
You must be signed in to change notification settings - Fork 1.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Update gensymbol #4969
Update gensymbol #4969
Conversation
Fix a symbol in `lapackobjs2d`.
sorry, but no. dlag2d simply does not exist. (not in Reference-LAPACK either) |
Should we just remove the symbols from |
I think it should stay exactly as it is now. What are you trying to fix here ? |
I was trying to add some LAPACK symbols for quadruple precision in LBT, and our script to generate the symbols in our library depends on this script. |
I see, thanks - certainly an oddity in the Reference LAPACK but not a typo. (Note also that neither the Reference implementation nor OpenBLAS currently supports quadruple precision, though OpenBLAS has some stubs from the early days of GotoBLAS that could mislead one into thinking they are implemented. But you probably know that) |
Reference-LAPACK can be easily compiled in quadruple precision with The idea was to keep the A Makefile can be found in Reference-LAPACK: With LBT, we could still use the OpenBLAS library for the usual symbols and our local quadruple library for the other ones. |
oh right - however I'm not even sure if that was ever seriously tested. (On the OpenBLAS side I guess it would be possible to hook up some half-baked support using the "generic" C kernels - this would probably provide some performance improvement relative to the Reference BLAS already. Not sure I'd want to open another can of worms while I'm already struggling to keep the number of open issues at or near 100 though) |
Had another look at this - there appear to be two mutually exclusive (non-)implementations inherited from GotoBLAS, QUAD_PRECISION defines a struct of two "unsigned long" types to hold the quadruple-precision arguments but is otherwise completely unimplemented in the actual code, EXPRECISION adds |
Will it require a lot of work to simply compile the "generic" C kernels in quadruple precision and use the It could be great for prototyping and testing. |
I'm trying to find out - at least the barely implemented GEMM3M kernels are acting up, and apparently this made xianyi disable what little support there was sometime in 2015 |
Tentative fix now available as #4976 - adding EXPRECISION=1 should provide "long double" Q and X functions alongside their usual float/double/complex counterparts when compiling with GCC for the "GENERIC" or any optimized x86_64 target. |
Fix a symbol in
lapackobjs2d
.