You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Feng et al. proposed a log-space implementation of parallel scan for improved numerical stability. It should be fairly easy to implement, but I'm a bit out of practice with my CUDA skills and wanted to ask whether you don't already have it on your mind by any chance before I attempt an implementation by myself.
The text was updated successfully, but these errors were encountered:
Hi @kklemon! I did not need to make cuda implementations of log space scans because I usually was able to stabilize the recurrence and use addition instead of logsumexp/logadd.
Feng et al. proposed a log-space implementation of parallel scan for improved numerical stability. It should be fairly easy to implement, but I'm a bit out of practice with my CUDA skills and wanted to ask whether you don't already have it on your mind by any chance before I attempt an implementation by myself.
The text was updated successfully, but these errors were encountered: