Skip to content

Dictionary can't find capitalized words in Norwegian #190

@crosby88

Description

@crosby88

Describe the bug
I have a working local wiktionary dump for Norwegian. It has no issue looking up words that are fully lowercase. However, if a word has any capital letters in it then it can't find a definition.

To Reproduce
Steps to reproduce the behavior:

  1. Set target language to Norwegian, get a working wiktionary dump (this is a process on its own but whatever)
  2. Give it a Norwegian sentence where some words have capital letters
  3. Look up capitalized word -> no definitions found
  4. Edit word to be all lowercase -> definition found

Expected behavior
Give me a definition without having to make the word lowercase myself

Screenshots
Sentence as is, can't find definition for "Hvorfor"

Image

Now I made it "hvorfor" all is well

Image

Logs
VocabSieve version: 0.12.4
Python version: 3.11.9 (tags/v3.11.9:de54cf5, Apr 2 2024, 10:12:12) [MSC v.1938 64 bit (AMD64)] on Windows 10 AMD64
PyQt5 (Qt bindings) version: 5.15.11, Qt 5.15.2

2025-05-18 11:34:41.990 | DEBUG | vocabsieve.main:initSources:148 - Initializing sources
2025-05-18 11:34:41.991 | DEBUG | vocabsieve.main:initSources:154 - Source Group 1: ['kaikki.org-dictionary-NorwegianBokmål-words-maybefixed'] has been created.
2025-05-18 11:34:41.991 | DEBUG | vocabsieve.main:initSources:166 - Source Group 2 is disabled, emptying source widget.
2025-05-18 11:34:41.991 | DEBUG | vocabsieve.main:initSources:175 - Audio source group is empty, emptying audio source widget.
2025-05-18 11:34:41.992 | DEBUG | vocabsieve.main:initTimers:1066 - Initializing timers
2025-05-18 11:34:42.027 | DEBUG | vocabsieve.main:getKnownDataOnThread:426 - Some data sources aren't available, not getting known data now
2025-05-18 11:35:08.910 | DEBUG | vocabsieve.ui.searchable_boldable_text_edit:bold:11 - bolding Hvorfor
2025-05-18 11:35:08.915 | DEBUG | vocabsieve.ui.multi_definition_widget:lookup:138 - Looking up Hvorfor in [<vocabsieve.sources.local_dictionary_source.LocalDictionarySource object at 0x000001EEE11EBD50>]
2025-05-18 11:35:08.915 | DEBUG | vocabsieve.ui.multi_definition_widget:appendDefinition:167 - All sources have been looked up
2025-05-18 11:35:10.893 | DEBUG | vocabsieve.ui.multi_definition_widget:lookup:138 - Looking up Hvorfor in [<vocabsieve.sources.local_dictionary_source.LocalDictionarySource object at 0x000001EEE11EBD50>]
2025-05-18 11:35:10.894 | DEBUG | vocabsieve.ui.multi_definition_widget:appendDefinition:167 - All sources have been looked up
2025-05-18 11:35:14.122 | DEBUG | vocabsieve.ui.searchable_boldable_text_edit:bold:11 - bolding hvorfor
2025-05-18 11:35:14.129 | DEBUG | vocabsieve.ui.multi_definition_widget:lookup:138 - Looking up hvorfor in [<vocabsieve.sources.local_dictionary_source.LocalDictionarySource object at 0x000001EEE11EBD50>]
2025-05-18 11:35:14.132 | DEBUG | vocabsieve.ui.multi_definition_widget:appendDefinition:167 - All sources have been looked up

Desktop (please complete the following information):

  • OS: Windows 10
  • Vocabsieve version (if nightly, must be latest): 0.12.4

Additional context
My dictionary processing is set to lemma first, then original

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions