cuDSS OOM in global_positioning on RTX 3090 (24GB): `CUDSS_STATUS_ALLOC_FAILED` #239

<h1>Reproducible cuDSS allocation failure during <code>global_positioning</code> on 4262-image fisheye dataset</h1>

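<p>
<code>glomap mapper</code> fails reproducibly with a <strong>cuDSS allocation failure</strong>
(<code>CUDSS_STATUS_ALLOC_FAILED</code>) during <code>global_positioning</code> on a
<strong>4262-image fisheye dataset</strong>, running on an RTX 3090 with 24 GB of VRAM.
</p>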

<p>
This looks similar to <a href="https://github.com/colmap/glomap/issues/215">#215</a> and
<a href="https://github.com/colmap/glomap/issues/191">#191</a>, so I am posting the full details
needed to reproduce it on my setup.
</p>

<hr>

<h2>Environment</h2>

<ul>
  <li><strong>OS:</strong> Ubuntu 24.04.3 LTS</li>
  <li><strong>CPU:</strong> AMD Ryzen 9 5900XT (32 logical CPUs)</li>
  <li><strong>RAM:</strong> 64 GB</li>
  <li><strong>GPU:</strong> NVIDIA GeForce RTX 3090 (24 GB VRAM)</li>
  <li><strong>NVIDIA driver:</strong> 570.211.01</li>
</ul>

<h2>Build / Versions</h2>

<ul>
  <li><strong>GLOMAP binary:</strong> <code>/home/ubu/dev/src/glomap_cuda120/install/bin/glomap</code></li>
  <li><code>glomap --help</code> shows: <code>This version was compiled with CUDA!</code></li>
  <li>
    <strong>Local glomap repo revision used for this build:</strong><br>
    <code>14e9104c2e60bc3f68fbc8b645b0d79a62f97f7a</code>
  </li>
  <li>
    <strong>COLMAP in the same pipeline:</strong><br>
    <code>COLMAP 3.13.0 (Commit d69fddb2 on 2026-02-15 with CUDA)</code>
  </li>
</ul>

<h2>Dataset</h2>

<ul>
  <li><strong>4262 images</strong> (2-camera rig), total image size approximately <strong>2.4–2.6 GB</strong></li>
  <li><strong>Image dimensions:</strong> <code>3840x3840</code></li>
</ul>

<h3>Feature extraction settings</h3>

<ul>
  <li><code>SiftExtraction.max_image_size=3200</code></li>
  <li><code>SiftExtraction.max_num_features=16384</code></li>
  <li><code>SiftExtraction.peak_threshold=0.0033</code></li>
  <li>GPU feature extraction enabled</li>
</ul>

<h3>Extracted features (from logs)</h3>

<ul>
  <li><strong>min:</strong> 10426</li>
  <li><strong>max:</strong> 21180</li>
  <li><strong>mean:</strong> 15638.68 per image</li>
</ul>
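<p>
For reference, statistics like the ones above can be recomputed from a plain list of per-image feature counts.
This is a minimal sketch: the helper name <code>feature_stats</code> and the one-count-per-line input format are
assumptions for illustration, not something COLMAP or GLOMAP emits directly.
</p>

```shell
#!/usr/bin/env bash
# Hypothetical helper: reads one integer feature count per line on stdin
# and prints the minimum, maximum, and mean (two decimals).
feature_stats() {
  awk 'NR == 1 { min = max = $1 }
       { if ($1 < min) min = $1
         if ($1 > max) max = $1
         sum += $1 }
       END { if (NR > 0) printf "min=%d max=%d mean=%.2f\n", min, max, sum / NR }'
}

# Example (counts.txt is a hypothetical file with one count per line):
#   feature_stats < counts.txt
```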

<hr>

<h2>Command used</h2>

<pre><code>/home/ubu/dev/src/glomap_cuda120/run_glomap_120.sh mapper \
  --database_path /.../database/database.db \
  --image_path /.../distorted/images \
  --output_path /.../distorted/sparse \
  --GlobalPositioning.use_gpu 1 \
  --GlobalPositioning.gpu_index -1 \
  --BundleAdjustment.use_gpu 1 \
  --BundleAdjustment.gpu_index -1 \
  --BundleAdjustment.optimize_intrinsics 1 \
  --BundleAdjustment.optimize_principal_point 1 \
  --BundleAdjustment.optimize_rig_poses 1 \
  --BundleAdjustment.max_num_iterations 300 \
  --skip_pruning 0
</code></pre>

<p>
I also reproduced the same failure with <code>--skip_pruning 1</code>.
</p>

<hr>

<h2>Error output</h2>

<p>The failure occurs during the <strong>Running global positioning ...</strong> stage:</p>

<pre><code>Linear solver fatal error: cudssExecute with CUDSS_PHASE_FACTORIZATION failed Got error: CUDSS_STATUS_ALLOC_FAILED
Terminating: Linear solver failed due to unrecoverable non-numeric causes.
Ceres Solver Report ... Termination: FAILURE
</code></pre>
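<p>
One way to check whether the failure coincides with VRAM exhaustion is to sample <code>nvidia-smi</code> while the
mapper runs and look at the peak. The sketch below assumes the samples were written with
<code>--format=csv,noheader,nounits</code>; the helper name <code>peak_vram_mib</code> and the file name
<code>vram.csv</code> are hypothetical.
</p>

```shell
#!/usr/bin/env bash
# Hypothetical helper: reads "memory.used, memory.total" CSV samples (MiB,
# no header, no units) on stdin and prints the largest memory.used value.
peak_vram_mib() {
  awk -F', *' '$1 + 0 > max { max = $1 + 0 } END { print max + 0 }'
}

# Sampling sketch (run in a second terminal while the mapper is running):
#   nvidia-smi --query-gpu=memory.used,memory.total \
#              --format=csv,noheader,nounits -l 1 > vram.csv
# Afterwards:
#   peak_vram_mib < vram.csv
```

<p>
A peak close to the card's total (24 GB here) just before the error would support the out-of-memory reading;
a much lower peak would suggest a single large allocation request failed instead.
</p>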

<hr>

<h2>Important behavior</h2>

<p>
Even after this solver failure, <code>mapper</code> continues and exports a COLMAP model.
</p>

<p>
The model exported after such a failure can be invalid or corrupted
(for example, the analyzer output reports an extremely large reprojection error).
</p>
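<p>
Until the mapper itself fails hard, a pipeline can guard against using such a model by scanning the captured log
for the failure strings shown above. This is a sketch under assumptions: the helper name
<code>glomap_log_ok</code> is hypothetical, and it assumes the mapper output was captured to a file
(for example with <code>2&gt;&amp;1 | tee mapper.log</code>).
</p>

```shell
#!/usr/bin/env bash
# Hypothetical helper: returns 0 if the log contains none of the solver
# failure markers quoted in this report, and 1 if any of them is present.
glomap_log_ok() {
  local log="$1"
  if grep -q -e 'CUDSS_STATUS_ALLOC_FAILED' -e 'Termination: FAILURE' "$log"; then
    echo "solver failure found in $log; do not trust the exported model" >&2
    return 1
  fi
  return 0
}

# Example (mapper.log is a hypothetical capture of the mapper output):
#   glomap_log_ok mapper.log || exit 1
```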

<p>
If I force CPU solvers
(<code>GlobalPositioning.use_gpu=0</code>, <code>BundleAdjustment.use_gpu=0</code>),
the reconstruction completes with normal stats.
</p>
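<p>
For completeness, the CPU-only run looks roughly like the following. The flags are the ones from the command above
with both <code>use_gpu</code> options set to 0; the function name <code>run_mapper_cpu</code> and the
<code>GLOMAP_BIN</code> variable are hypothetical conveniences, and the paths are placeholders.
</p>

```shell
#!/usr/bin/env bash
# Hypothetical wrapper: runs the mapper with both GPU solvers disabled.
# GLOMAP_BIN is an assumed variable; it defaults to `glomap` on PATH.
run_mapper_cpu() {
  local db="$1" images="$2" out="$3"
  "${GLOMAP_BIN:-glomap}" mapper \
    --database_path "$db" \
    --image_path "$images" \
    --output_path "$out" \
    --GlobalPositioning.use_gpu 0 \
    --BundleAdjustment.use_gpu 0
}

# Example with placeholder paths:
#   run_mapper_cpu /path/to/database.db /path/to/images /path/to/sparse
```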

<hr>

<h2>Questions</h2>

<ol>
  <li>
    When cuDSS runs out of memory, is it expected that <code>mapper</code>
    continues and still exports a model,
    or should <code>mapper</code> fail hard in this case?
  </li>
  <li>
    Are there any recommended flags or settings to reduce cuDSS memory pressure
    in <code>global_positioning</code> for datasets of this scale?
  </li>
</ol>
