Releases: JuliaGPU/AMDGPU.jl

v2.1.4

01 Dec 11:32
c2d2701

AMDGPU v2.1.4

Diff since v2.1.3

Merged pull requests:

v2.1.3

25 Nov 14:20
d794272

AMDGPU v2.1.3

Diff since v2.1.2

Merged pull requests:

Closed issues:

  • using AMDGPU fails after updating to ROCm 7.0.2 (#841)
  • call to unknown function jl_f_issubtype but CUDA works (#846)
  • Here is my errors in test(AMDGPU) with 780M. (#849)

v2.1.2

23 Oct 06:52
15824e6

AMDGPU v2.1.2

Diff since v2.1.1

Merged pull requests:

Closed issues:

  • Memory leak on AMD MI250X GPU when transferring data (#836)
  • AMDGPU cannot find HIP/rocm install on Windows (#838)

v2.1.1

03 Oct 17:02
eb928bc

AMDGPU v2.1.1

Diff since v2.1.0

Merged pull requests:

Closed issues:

  • matrix matrix multiplication gives error (#827)

v2.1.0

08 Sep 11:56
02711ff

AMDGPU v2.1.0

Diff since v2.0.1

Merged pull requests:

Closed issues:

  • AMDGPU.jl v2 forces downgrade of MPI.jl (#812)
  • Support GPUToolbox 0.3.0 (#815)
  • device-side rng assumes warp size of 32 (#819)
  • Cooperative group triggers out of bound memory error in CI (#824)

v2.0.1

24 Jul 17:16
d2eeba2

AMDGPU v2.0.1

Diff since v2.0.0

Merged pull requests:

  • Revert InexactError exception (#809) (@pxl-th)
  • Update docs & fix own kwarg handling for unsafe_wrap (#811) (@pxl-th)

Closed issues:

  • reinterpret now leads to invalid IR (#808)
  • Update doc to reflect changes in memory (#810)

v2.0.0

18 Jul 21:41
daff36d

AMDGPU v2.0.0

Diff since v1.3.6

Breaking changes:

Update unsafe_wrap to match Base definition: remove lock kwargs and add own kwarg.
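
As a rough illustration of the new call shape, the sketch below wraps a host array as a ROCArray and passes the own keyword explicitly. It assumes that own follows the Base.unsafe_wrap convention (own = false leaves ownership of the memory with the caller) and that wrapping a host pointer is supported on the device in use; the helper name wrap_and_increment is hypothetical and not part of AMDGPU.jl.

```julia
using AMDGPU

# Hypothetical helper: wrap host memory as a ROCArray, update it on the GPU,
# and return a fresh host copy of the result.
function wrap_and_increment(x::Array{Float32})
    GC.@preserve x begin
        # Assumption: `own = false` mirrors Base.unsafe_wrap, so Julia does not
        # take ownership of (or free) the memory behind `pointer(x)`.
        xd = unsafe_wrap(ROCArray, pointer(x), size(x); own = false)
        xd .+= 1f0
        AMDGPU.synchronize()
        return Array(xd)
    end
end

result = wrap_and_increment(zeros(Float32, 4, 4))
```

Code that previously passed the lock keyword would need to drop it, since the release note states that keyword has been removed.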

Changes:

  • Better kernel exception reporting mechanism:
julia> using AMDGPU

julia> function ker!(x)
           x[0] = 1
           return
       end
ker! (generic function with 1 method)

julia> x = ROCArray(zeros(Int, 1));

julia> @roc ker!(x);

julia> AMDGPU.synchronize()
ERROR: GPU Kernel Exception:
BoundsError: Out-of-bounds array access
workitemIdx: (x = 0x00000001, y = 0x00000001, z = 0x00000001)
workgroupIdx: (x = 0x00000001, y = 0x00000001, z = 0x00000001)
Stacktrace:
 [1] error(s::String)
   @ Base ./error.jl:35
 [2] throw_if_exception(dev::HIPDevice)
   @ AMDGPU ~/.julia/dev/AMDGPU/src/exception_handler.jl:149
 [3] synchronize(stm::HIPStream; blocking::Bool, stop_hostcalls::Bool)
   @ AMDGPU ~/.julia/dev/AMDGPU/src/highlevel.jl:40
 [4] synchronize
   @ ~/.julia/dev/AMDGPU/src/highlevel.jl:36 [inlined]
 [5] synchronize()
   @ AMDGPU ~/.julia/dev/AMDGPU/src/highlevel.jl:36
 [6] top-level scope
   @ REPL[5]:1
  • Disable eager GC by default (instead users should use GPUArrays.@cached).
  • Implement KA.pagelock!.
  • Remove old nonblocking sync (pre ROCm 6.0).
  • Docs: Add FAQ entry for AMDGPU on Archlinux.
  • Docs: Add SIMD docs & re-include installation tips.
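
With eager GC disabled by default, temporary GPU allocations in a hot loop can be reused through the GPUArrays caching allocator. The following is a minimal sketch, assuming GPUArrays is available in the active environment and provides AllocCache, @cached and unsafe_free! as described in its documentation; the function name run_cached and the array sizes are illustrative only.

```julia
using AMDGPU
using GPUArrays

# Illustrative helper: repeat a computation that allocates a temporary array,
# letting the cache hand the same buffer back on later iterations.
function run_cached(iterations::Int)
    cache = GPUArrays.AllocCache()
    x = AMDGPU.ones(Float32, 1024)
    acc = Ref(0f0)
    for _ in 1:iterations
        GPUArrays.@cached cache begin
            y = x .* 2f0 .+ 1f0   # temporary allocation served by the cache
            acc[] += sum(y)
        end
    end
    # Release the cached buffers once the loop is finished.
    GPUArrays.unsafe_free!(cache)
    return acc[]
end

total = run_cached(10)
```

Arrays allocated inside the @cached block should not be used after the block ends, because their memory may be handed out again on the next iteration.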

Merged pull requests:

Closed issues:

  • Improve exception reporting (#270)
  • Support for atomic max on Float (#339)
  • Implement revised KA functions copyto! and pagelock! (#724)
  • 2D cumsum throwing GPU Kernel Exception (#742)
  • CI Enzyme Julia 1.10 test failure (#765)
  • CI failure on Julia 1.12 with GPUArrays linalg (#766)
  • Noisy failure if ROCM not installed (#794)
  • [Feature Request] Support KernelAbstractions pagelock! (#799)

v1.3.6

14 Jul 10:56
182920b

AMDGPU v1.3.6

Diff since v1.3.5

Merged pull requests:

  • Fix return value of find_roc_path when ROCM_PATH is set (#798) (@giordano)

Closed issues:

  • ROCM_PATH detected is actually ROCM_PATH/lib (#797)

v1.3.5

02 Jul 21:43
a697f90

AMDGPU v1.3.5

Diff since v1.3.4

Merged pull requests:

Closed issues:

  • Force matching artifact versions and LLVM version (#309)
  • Device printing is broken on Julia 1.11 (#551)
  • Trying to import AMDGPU fails with (an lvm?) error (#579)
  • reduce(f, A) fails to compile if Julia is started with --check-bounds=no (#581)
  • Tests not passing on MI300A (#779)
  • incorrect first value for simple kernel on 1.12 (#780)
  • @rocprintf produces memory access fault in 1.12 (#781)
  • to_power_type does not exist any more in Base (#782)

v1.3.4

25 Jun 06:25

AMDGPU v1.3.4

Diff since v1.3.3

Bug fixes

  • sync_workgroup lacked memory barrier semantics. It has been updated to match the behavior expected from HIP.
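
The kind of kernel that relies on this guarantee looks roughly like the sketch below: each workitem writes to workgroup-local memory, and after sync_workgroup reads a value written by a different workitem. This is an illustration only, not the test case from the fix; the kernel name reverse_ker! and the size of 64 are arbitrary choices.

```julia
using AMDGPU

# Reverse a 64-element vector inside a single workgroup via local memory.
function reverse_ker!(out, inp)
    i = workitemIdx().x
    n = workgroupDim().x
    tmp = @ROCStaticLocalArray(Float32, 64)
    tmp[i] = inp[i]
    # Barrier: all writes to `tmp` must be visible before any workitem reads
    # an element written by another workitem.
    sync_workgroup()
    out[i] = tmp[n - i + 1]
    return
end

inp = ROCArray(Float32.(1:64))
out = AMDGPU.zeros(Float32, 64)
@roc groupsize=64 gridsize=1 reverse_ker!(out, inp)
AMDGPU.synchronize()
reversed = Array(out)
```

Without memory barrier semantics, the read of tmp[n - i + 1] could in principle observe a stale value, which is the class of problem this bug fix addresses.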

Merged pull requests:

Closed issues:

  • collectinvokes! error on julia 1.12.0-beta3 (#768)