fix: gprofiler2 output files missing gene names in intersection columns #497 #9304

mohe1linux · 2025-10-29T15:27:29Z

The gprofiler2 module outputs were missing the actual gene names/IDs in the expected columns, making it impossible to identify which specific genes contribute to pathway enrichment.

Expected behavior:

*.gprofiler2.all_enriched_pathways.tsv should contain an intersection column with gene names/IDs
*.gprofiler2.[source].sub_enriched_pathways.tsv should contain actual gene names in the DE_genes_names column

Actual behavior:

all_enriched_pathways.tsv file lacks the intersection column entirely
sub_enriched_pathways.tsv files have DE_genes_names column containing numeric values (same as DE_genes) instead of gene names

Now with the fix

Enable g:Profiler evidence codes so the intersection column is emitted.
Populate sub-tables with both Ensembl IDs and symbols:
DE_genes_ids = originalintersection IDs DE_genes_names = gene symbols (from DE table where available, else gprofiler2::gconvert), fallback to IDs if unmapped

nextflow run . -profile test,docker
--gprofiler2_run true
--gprofiler2_organism mmusculus
--gprofiler2_evcodes true
--outdir test_gprofile_symbols

*all_enriched_pathways.tsv now contains intersection.
*sub_enriched_pathways.tsv now has DE_genes_ids and DE_genes_names (symbols present; IDs used as fallback).

Notes

No changes to output file names besides adding DE_genes_ids in sub tables.

…ntersection fixes nf-core#497

fix(gprofiler2) keep intersection:DE_genes_names, DE_genes_ids from i…

1e65153

…ntersection fixes nf-core#497

mohe1linux requested a review from SusiJo October 29, 2025 15:27

mohe1linux self-assigned this Oct 29, 2025

mohe1linux added this to Hackathon Barcelona October 2025 Oct 29, 2025

mohe1linux added the good first issue Good for newcomers label Oct 29, 2025

github-project-automation bot moved this to To do in Hackathon Barcelona October 2025 Oct 29, 2025

mohe1linux requested a review from suzannejin October 29, 2025 15:33

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix: gprofiler2 output files missing gene names in intersection columns #497 #9304

fix: gprofiler2 output files missing gene names in intersection columns #497 #9304

Uh oh!

mohe1linux commented Oct 29, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

fix: gprofiler2 output files missing gene names in intersection columns #497 #9304

Are you sure you want to change the base?

fix: gprofiler2 output files missing gene names in intersection columns #497 #9304

Uh oh!

Conversation

mohe1linux commented Oct 29, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant