
add Array template index and use long long for ProjDataInMemory #1676

Draft
KrisThielemans wants to merge 3 commits into UCL:master from KrisThielemans:ArrayTemplateIndex

Conversation

@KrisThielemans
Collaborator

Drastic revision that adds an indexT template parameter to VectorWithOffset, Array and IndexRange, to be able to use something other than int as the index type.
This is then used to allow for more bins in ProjDataInMemory.

fixes #1505

@KrisThielemans
Collaborator Author

C++ tests are fine. Weird things in the Python tests.

@z-k-li in principle you could check this with SIRF and "in memory" acquisition data. The STIR Python stuff will likely fail as per the tests.

Adding an indexT template parameter (defaulting to int) to various classes
related to arrays, including BasicCoordinate, IndexRange, VectorWithOffset
and Array.

This will allow using larger (or smaller) ranges for the indices.

Most code using these classes has not been changed, and therefore still uses
"int".

This was a bit more work than anticipated, as I had to move the
forward declarations to separate files, such that
- they are consistent
- there is only one place where the default is defined (as required
by C++)
This removes the limitation on the number of elements in the proj-data.

Fixes UCL#1505
- Looks like SWIG doesn't understand default template arguments in %template, unfortunately.
- cope with num_dimensions SWIG bug
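The SWIG limitation mentioned above can be illustrated with a hypothetical interface-file fragment (not STIR's actual .i files): in a %template directive, a default template argument is not filled in automatically, so all arguments have to be spelled out explicitly.

```swig
// Hypothetical SWIG interface fragment, assuming a class like:
// template <int num_dimensions, typename elemT, typename indexT = int>
// class Array { ... };

// %template(FloatArray3D) stir::Array<3, float>;
//   may fail: SWIG does not apply the default indexT here

%template(FloatArray3D) stir::Array<3, float, int>;  // workaround: spell out all arguments
```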
@KrisThielemans
Collaborator Author

The tests currently fail for unsigned indices (although I thought I had fixed that). However, unsigned indices are currently not used, except in the tests. I also still expect problems with SWIG, as above.

@ChristianHinge @z-k-li could you give this a go with parallelproj (either CPU or GPU). It doesn't have #1674 yet, but that fix doesn't affect parallelproj. (You'd have to disable building of STIR Python.)

For GPU, no doubt you will have to change

; this keyword allows increasing the number of chunks that the projector uses
; increase if you run out of GPU memory
num_gpu_chunks:=1

@ChristianHinge

@KrisThielemans I will try this out next week since we are just finalizing the MICCAI submission :-)! So just make sure I understand correctly - I will recompile the PR without python bindings and benchmark RAM + clock against the master branch. Is that correct?

@KrisThielemans
Collaborator Author

That's correct, but of course after installing CUDA drivers/toolkit, and switching the projector to parallelproj.

@ChristianHinge

@KrisThielemans I ran the recon with parallelproj using 4i5s, zoom 0.5, segment 4 (otherwise the same recon settings as the benchmark I ran for #1674, but using parallelproj)

Time: 118min (TOF CPU: 16 min)
RAM: 167GB (TOF CPU: 19GB)

I ran it on 4x A40 GPUs. When monitoring with nvidia-smi, I can see the work being offloaded to the GPUs, albeit the utilization is quite low since the GPU is idle most of the time. The resulting images look virtually identical to the master branch, but the voxel values differ a tiny bit.

@KrisThielemans
Collaborator Author

Thanks @ChristianHinge at least we know it works now (it used to crash), which is great. I do expect differences between parallelproj (Joseph projector) and the ray-tracing matrix (Siddon with a few lines), but images should look overall quite similar.

Indeed, most of the computation time sits in copying and reordering data. The GPU probably flies through it all. This will be work for our imminent hackathons.

@ChristianHinge

It is awesome to see the recon work on GPU! And quite cool that it distributes the workload evenly between the four A40s.



Successfully merging this pull request may close these issues:

- Calling parallelproj forward Segmentation fault (core dumped) for LAFOV
- ProjDataInMemory fails with more than 2^31 bins
