-
Notifications
You must be signed in to change notification settings - Fork 10.3k
Description
Current Behavior
Scanned receipt in landscape mode contains sums
Käibemaksuta kokku : 2.66
Kokku 22% käive : 3.25
Käibemaks 22% : 0.59
Kokku 0% käive: 0.00
Käibemaks 0%: 0.00
Ümardus : 0.00
Kokku KM : 0.59
Kokku : 3.25
Tesseract is used to extract text using command
tesseract salomepdfseestvoetud.jpg - -l est --psm 1
In result only first digit of sums or other characters are shown:
käibemaksuta kokku : 2
Kokku 22% käive : 3
Käibemaks 224 : (
Kokku = 0% käive: Oa
käibemaks = 0%: (0
Ümardus : 0
Kokku KM : 0
Kokku : 3
Scanned file:
Expected Behavior
Whole sums should extracted:
Käibemaksuta kokku : 2.66
Kokku 22% käive : 3.25
Käibemaks 22% : 0.59
Kokku 0% käive: 0.00
Käibemaks 0%: 0.00
Ümardus : 0.00
Kokku KM : 0.59
Kokku : 3.25
Suggested Fix
No response
tesseract -v
tesseract 5.3.0
leptonica-1.82.0
libgif 5.2.1 : libjpeg 6b (libjpeg-turbo 2.1.2) : libpng 1.6.39 : libtiff 4.5.0 : zlib 1.2.13 : libwebp 1.2.4 : libopenjp2 2.5.0
Found AVX512BW
Found AVX512F
Found AVX512VNNI
Found AVX2
Found AVX
Found FMA
Found SSE4.1
Found OpenMP 201511
Found libarchive 3.6.2 zlib/1.2.13 liblzma/5.4.1 bz2lib/1.0.8 liblz4/1.9.4 libzstd/1.5.4
Found libcurl/7.88.1 OpenSSL/3.0.16 zlib/1.2.13 brotli/1.0.9 zstd/1.5.4 libidn2/2.3.3 libpsl/0.21.2 (+libidn2/2.3.3) libssh2/1.10.0 nghttp2/1.52.0 librtmp/2.3 OpenLDAP/2.5.13
Operating System
Debian 12 Bookworm
Other Operating System
No response
uname -a
Linux uvn-76-202.tll01.zonevs.eu 6.1.0-38-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.147-1 (2025-08-02) x86_64 GNU/Linux
Compiler
Using debian tesseract package
CPU
Architecture: x86_64
CPU op-mode(s): 32-bit, 64-bit
Address sizes: 52 bits physical, 57 bits virtual
Byte Order: Little Endian
CPU(s): 4
On-line CPU(s) list: 0-3
Vendor ID: AuthenticAMD
BIOS Vendor ID: QEMU
Model name: AMD EPYC 9274F 24-Core Processor
BIOS Model name: pc-q35-8.0 CPU @ 2.0GHz
BIOS CPU family: 1
CPU family: 25
Model: 17
Thread(s) per core: 1
Core(s) per socket: 1
Socket(s): 4
Stepping: 1
BogoMIPS: 8087,84
Flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat ps
e36 clflush mmx fxsr sse sse2 syscall nx mmxext fxsr_opt pdpe1gb rdt
scp lm rep_good nopl cpuid extd_apicid tsc_known_freq pni pclmulqdq
ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_t
imer aes xsave avx f16c rdrand hypervisor lahf_lm cmp_legacy svm cr8
legacy abm sse4a misalignsse 3dnowprefetch osvw perfctr_core invpci
d_single ssbd ibrs ibpb stibp ibrs_enhanced vmmcall fsgsbase tsc_adj
ust bmi1 avx2 smep bmi2 erms invpcid avx512f avx512dq rdseed adx sma
p avx512ifma clflushopt clwb avx512cd sha_ni avx512bw avx512vl xsave
opt xsavec xgetbv1 xsaves avx512_bf16 clzero xsaveerptr wbnoinvd ara
t npt lbrv nrip_save tsc_scale vmcb_clean pausefilter pfthreshold v
vmsave_vmload vgif avx512vbmi umip pku ospke avx512_vbmi2 gfni vaes
vpclmulqdq avx512_vnni avx512_bitalg avx512_vpopcntdq la57 rdpid fsr
m flush_l1d arch_capabilities
Virtualization features:
Virtualization: AMD-V
Hypervisor vendor: KVM
Virtualization type: full
Caches (sum of all):
L1d: 256 KiB (4 instances)
L1i: 256 KiB (4 instances)
L2: 2 MiB (4 instances)
L3: 64 MiB (4 instances)
NUMA:
NUMA node(s): 1
NUMA node0 CPU(s): 0-3
Vulnerabilities:
Gather data sampling: Not affected
Indirect target selection: Not affected
Itlb multihit: Not affected
L1tf: Not affected
Mds: Not affected
Meltdown: Not affected
Mmio stale data: Not affected
Reg file data sampling: Not affected
Retbleed: Not affected
Spec rstack overflow: Mitigation; safe RET
Spec store bypass: Mitigation; Speculative Store Bypass disabled via prctl
Spectre v1: Mitigation; usercopy/swapgs barriers and __user pointer sanitization
Spectre v2: Mitigation; Enhanced / Automatic IBRS; IBPB conditional; STIBP disab
led; PBRSB-eIBRS Not affected; BHI Not affected
Srbds: Not affected
Tsa: Vulnerable: Clear CPU buffers attempted, no microcode
Tsx async abort: Not affected
Virtualization / Containers
No response
Other Information
No response