Releases: JuliaGPU/AMDGPU.jl
Releases · JuliaGPU/AMDGPU.jl
v2.1.4
v2.1.3
v2.1.2
AMDGPU v2.1.2
Merged pull requests:
- Even more BLAS tests and fixes (#832) (@kshyatt)
- Update buildkite for 1.12 (#833) (@kshyatt)
- Support for norm of Diagonal (#834) (@kshyatt)
- Support GPUToolbox 1.0 (#835) (@amontoison)
- Update discovery.jl with minimal approach to fix Windows discovery process (#839) (@jbiffl)
Closed issues:
v2.1.1
AMDGPU v2.1.1
Merged pull requests:
- Add Julia 1.11 Enzyme job to pipeline (#828) (@wsmoses)
- Updates for PrettyTables 3 (#829) (@kshyatt)
- fix ambiguity error in Julia 1.12 (#830) (@simeonschaub)
- A few more blas tests and fix (#831) (@kshyatt)
Closed issues:
- matrix matrix multiplication gives error (#827)
v2.1.0
AMDGPU v2.1.0
Merged pull requests:
- [rocSOLVER] Interface geblttrf_npvt and geblttrs_npvt (#746) (@amontoison)
- update hip wrappers (#813) (@simeonschaub)
- [rocSPARSE] Support more conversions (#816) (@amontoison)
- support cooperative groups (#817) (@simeonschaub)
- Support GPUToolbox 0.3 (#818) (@kshyatt)
- Bump actions/checkout from 4 to 5 (#820) (@dependabot[bot])
- Support lmul/rmul for ROCMatrix and Diagonal (#821) (@kshyatt)
- use correct wavefrontsize for device-side rng (#822) (@simeonschaub)
- fix CI: add JLD2 dependency (#823) (@simeonschaub)
- enable cooperative group tests only on 1.12 (#825) (@simeonschaub)
Closed issues:
v2.0.1
v2.0.0
AMDGPU v2.0.0
Breaking changes:
Update unsafe_wrap to match Base definition: remove lock kwargs and add own kwarg.
Changes:
- Better kernel exception reporting mechanism:
julia> using AMDGPU
julia> function ker!(x)
x[0] = 1
return
end
ker! (generic function with 1 method)
julia> x = ROCArray(zeros(Int, 1));
julia> @roc ker!(x);
julia> AMDGPU.synchronize()
ERROR: GPU Kernel Exception:
BoundsError: Out-of-bounds array access
workitemIdx: (x = 0x00000001, y = 0x00000001, z = 0x00000001)
workgroupIdx: (x = 0x00000001, y = 0x00000001, z = 0x00000001)
Stacktrace:
[1] error(s::String)
@ Base ./error.jl:35
[2] throw_if_exception(dev::HIPDevice)
@ AMDGPU ~/.julia/dev/AMDGPU/src/exception_handler.jl:149
[3] synchronize(stm::HIPStream; blocking::Bool, stop_hostcalls::Bool)
@ AMDGPU ~/.julia/dev/AMDGPU/src/highlevel.jl:40
[4] synchronize
@ ~/.julia/dev/AMDGPU/src/highlevel.jl:36 [inlined]
[5] synchronize()
@ AMDGPU ~/.julia/dev/AMDGPU/src/highlevel.jl:36
[6] top-level scope
@ REPL[5]:1- Disable eager GC by default (instead users should use
GPUArrays.@cached). - Implement
KA.pagelock!. - Remove old nonblocking sync (pre ROCm 6.0).
- Docs: Add FAQ entry for AMDGPU on Archlinux.
- Docs: Add SIMD docs & re-include installation tips.
Merged pull requests:
- Re-work exception reporting (take 2) (#800) (@pxl-th)
- Add FAQ entry for AMDGPU on Archlinux (#802) (@vchuravy)
- Update unsafe_wrap, implement
KA.pagelock!& remove old stuff (#803) (@pxl-th) - Fix gpu-less CI (#805) (@pxl-th)
- Disable eager GC by default (#806) (@pxl-th)
- Add SIMD docs & re-include installation tips (#807) (@pxl-th)
Closed issues:
- Improve exception reporting (#270)
- Support for atomic
maxonFloat(#339) - Implement revised KA functions
copyto!andpagelock!(#724) - 2D cumsum throwing GPU Kernel Exception (#742)
- CI Enzyme Julia 1.10 test failure (#765)
- CI failure on Julia 1.12 with GPUArrays linalg (#766)
- Noisy failure if ROCM not installed (#794)
- [Feature Request] Support KernelAbstractions pagelock! (#799)
v1.3.6
AMDGPU v1.3.6
Merged pull requests:
Closed issues:
- ROCM_PATH detected is actually ROCM_PATH/lib (#797)
v1.3.5
AMDGPU v1.3.5
Merged pull requests:
- Remove overload for 1.12-rc1 and higher (#784) (@pxl-th)
- Fix global hostcall detection (#787) (@pxl-th)
- Refactor initialization logic to allow for automatic detection in Fedora (#788) (@gbaraldi)
- Fix printf on 1.11+ (#789) (@pxl-th)
- follow up #788, improve docs (#791) (@simeonschaub)
- Remove unsupported assertions stuff (#792) (@pxl-th)
- actually run batchnorm tests (#793) (@simeonschaub)
Closed issues:
- Force matching artifact versions and LLVM version (#309)
- Device printing is broken on Julia 1.11 (#551)
- Trying to import AMDGPU fails with (an lvm?) error (#579)
reduce(f, A)fails to compile if Julia is started with--check-bounds=no(#581)- Tests not passing on MI300A (#779)
- incorrect first value for simple kernel on 1.12 (#780)
@rocprintfproduces memory access fault in 1.12 (#781)to_power_typedoes not exist any more inBase(#782)
v1.3.4
AMDGPU v1.3.4
Bug fixes
sync_workgroupwas lacking memory barrier semantics. They have been updated to match the expectation coming from HIP.
Merged pull requests:
- Update indexing.jl (#771) (@amontoison)
- Add memory barrier semantics to
sync_workgroup(#783) (@vchuravy)
Closed issues:
collectinvokes!error on julia 1.12.0-beta3 (#768)