Notable changes to the project will be documented in this file.
The format is based on Keep a Changelog and the project adheres to the Haskell Package Versioning Policy (PVP)
-
Instances of
Elt
are now derivable viaGeneric
-
The
stencil
functions now support fusion. Note however that the source (delayed) array will be evaluated at every access to the stencil pattern; if the delayed function is expensive, you may wish to explicitlycompute
the source array first, matching the old behaviour. -
Removed
Slice
constraint from some indexing operations -
(internal) Visible type applications are used instead of
Proxy
types -
(internal)
EltRepr
is now a class-associated type ofElt
-
(internal)
GArrayData
has been simplified -
(internal) SIMD representation has been improved and generalised
- Pattern synonyms for manipulating custom product types can now be created;
see
Pattern
- Drop support for GHC-7.10
Special thanks to those who contributed patches as part of this release:
- Trevor L. McDonell (@tmcdonell)
- Joshua Meredith (@JoshMeredith)
1.2.0.1 - 2018-10-06
- Build fix for ghc-8.6
1.2.0.0 - 2018-04-03
- Internal debugging/RTS options handling has been changed. Compiling this package now implies that backends are also compiled in debug mode (no need to set the
-fdebug
cabal flag for those packages as well). - Complex numbers are stored in the C-style array-of-struct representation.
- Improve numeric handling of complex numbers.
- Coercions (
bitcast
) now occur between the underlying representation types - Front-end performance improvements
- Support for half-precision floating-point numbers.
- Support for struct-of-array-of-struct representations. Currently this is limited to fields of 2,3,4,8, or 16-elements wide.
- Add equivalents for
Data.Functor
,Data.Semigroup
(ghc-8+) - Add instances and helper functions for
Maybe
andEither
types - Add rank generalised versions of
take
,drop
,head
,tail
,init
,slit
,reverse
andtranspose
. - Implement counters and reporting for
-ddump-gc-stats
Special thanks to those who contributed patches as part of this release:
- Trevor L. McDonell (@tmcdonell)
- Ryan Scott (@ryanglscott)
- Rinat Striungis (@Haskell-mouse)
1.1.1.0 - 2017-09-26
- Improve and colourise the pretty-printer
1.1.0.0 - 2017-09-21
-
Additional EKG monitoring hooks (#340)
-
Operations from
RealFloat
- Changed type of
scanl'
,scanr'
to return anAcc
tuple, rather than a tuple ofAcc
arrays. - Specialised folds
sum
,product
,minimum
,maximum
,and
,or
,any
,all
now reduce along the innermost dimension only, rather than reducing all elements. You can recover the old behaviour by firstflatten
-ing the input array. - Add new stencil boundary condition
function
, to apply the given function to out-of-bounds indices.
- #390: Wrong number of arguments in printf
1.0.0.0 - 2017-03-31
- Many API and internal changes
- Bug fixes and other enhancements
- Fix type of
allocateArray
- Bug fixes and performance improvements.
- New iteration constructs.
- Additional Prelude-like functions.
- Improved code generation and fusion optimisation.
- Concurrent kernel execution in the CUDA backend.
- Bug fixes.
- New array fusion optimisation.
- New foreign function interface for array and scalar expressions.
- Additional Prelude-like functions.
- New example programs.
- Bug fixes and performance improvements.
- Full sharing recovery in scalar expressions and array computations.
- Two new example applications in package
accelerate-examples
(both including a graphical frontend):- A real-time Canny edge detection
- An interactive fluid flow simulator
- Bug fixes.
- New Prelude-like functions
zip*
,unzip*
,fill
,enumFrom*
,tail
,init
,drop
,take
,slit
,gather*
,scatter*
, andshapeSize
. - New simplified AST (in package
accelerate-backend-kit
) for backend writers who want to avoid the complexities of the type-safe AST.
- Complete sharing recovery for scalar expressions (but currently disabled by default).
- Also bug fixes in array sharing recovery and a few new convenience functions.
- Streaming computations
- Precompilation
- Repa-style array indices
- Additional collective operations supported by the CUDA backend:
stencil
s, morescan
s, rank-polymorphicfold
,generate
. - Conversions to other array formats
- Bug fixes
- Bug fixes and some performance tweaks.
- More collective operations supported by the CUDA backend:
replicate
,slice
andfoldSeg
. Frontend and interpreter support forstencil
. - Bug fixes.
- Initial release of the CUDA backend