clip related crash, segmentation fault

it seems like 48956ffb874f9a2c1306b721a6896cac0a2a3256 introduced a bug and it causes crash on my laptop.
i tried to compile latest release but it was crashing. i was able the trace the issue back to that commit.

let me know if anything else is needed. i'm going to stay at ddc4a18b92dc58b2490d6528f7118a9a166c5d69 for now.

<details> 
<summary> output of execution with -v arg </summary>

> Option: 
    n_threads:                         8
    mode:                              img_gen
    model_path:                        /data/kuesji/archive/models/image_generation/models/sdxl_illustrious:hassakuxl:v2.2.gguf
    wtype:                             unspecified
    clip_l_path:                       
    clip_g_path:                       
    clip_vision_path:                  
    t5xxl_path:                        
    diffusion_model_path:              
    high_noise_diffusion_model_path:   
    vae_path:                          
    taesd_path:                        
    esrgan_path:                       
    control_net_path:                  
    embedding_dir:                     
    stacked_id_embed_dir:              
    input_id_images_path:              
    style ratio:                       20.00
    normalize input image:             false
    output_path:                       output.png
    init_image_path:                   
    end_image_path:                    
    mask_image_path:                   
    control_image_path:                
    ref_images_paths:
    increase_ref_index:                false
    offload_params_to_cpu:             false
    clip_on_cpu:                       false
    control_net_cpu:                   false
    vae_on_cpu:                        true
    diffusion flash attention:         false
    diffusion Conv2d direct:           false
    vae_conv_direct:                   false
    control_strength:                  0.90
    prompt:                            purple cat
    negative_prompt:                   
    clip_skip:                         -1
    width:                             576
    height:                            1024
    sample_params:                     (txt_cfg: 3.50, img_cfg: 3.50, distilled_guidance: 3.50, slg.layer_count: 3, slg.layer_start: 0.01, slg.layer_end: 0.20, slg.scale: 0.00, scheduler: karras, sample_method: euler_a, sample_steps: 8, eta: 0.00)
    high_noise_sample_params:          (txt_cfg: 7.00, img_cfg: 7.00, distilled_guidance: 3.50, slg.layer_count: 3, slg.layer_start: 0.01, slg.layer_end: 0.20, slg.scale: 0.00, scheduler: default, sample_method: euler_a, sample_steps: -1, eta: 0.00)
    moe_boundary:                      0.875
    flow_shift:                        inf
    strength(img2img):                 0.75
    rng:                               cuda
    seed:                              1329308695
    batch_count:                       1
    vae_tiling:                        false
    upscale_repeats:                   1
    chroma_use_dit_mask:               true
    chroma_use_t5_mask:                false
    chroma_t5_mask_pad:                1
    video_frames:                      1
    fps:                               16
System Info: 
    SSE3 = 1
    AVX = 1
    AVX2 = 1
    AVX512 = 0
    AVX512_VBMI = 0
    AVX512_VNNI = 0
    FMA = 1
    NEON = 0
    ARM_FMA = 0
    F16C = 1
    FP16_VA = 0
    WASM_SIMD = 0
    VSX = 0
[DEBUG] stable-diffusion.cpp:151  - Using Vulkan backend
[DEBUG] ggml_extend.hpp:62   - ggml_vulkan: Found 1 Vulkan devices:
[DEBUG] ggml_extend.hpp:62   - ggml_vulkan: 0 = AMD Radeon Graphics (RADV RENOIR) (radv) | uma: 1 | fp16: 1 | bf16: 0 | warp size: 64 | shared memory: 65536 | int dot: 0 | matrix cores: none
[INFO ] stable-diffusion.cpp:201  - loading model from '/data/kuesji/archive/models/image_generation/models/sdxl_illustrious:hassakuxl:v2.2.gguf'
[INFO ] model.cpp:1035 - load /data/kuesji/archive/models/image_generation/models/sdxl_illustrious:hassakuxl:v2.2.gguf using gguf format
[DEBUG] model.cpp:1052 - init from '/data/kuesji/archive/models/image_generation/models/sdxl_illustrious:hassakuxl:v2.2.gguf'
[INFO ] stable-diffusion.cpp:267  - Version: SDXL 
[INFO ] stable-diffusion.cpp:301  - Weight type:                 q4_K
[INFO ] stable-diffusion.cpp:302  - Conditioner weight type:     q4_K
[INFO ] stable-diffusion.cpp:303  - Diffusion model weight type: q4_K
[INFO ] stable-diffusion.cpp:304  - VAE weight type:             f32
[DEBUG] stable-diffusion.cpp:306  - ggml tensor size = 400 bytes
[WARN ] stable-diffusion.cpp:311  - !!!It looks like you are using SDXL model. If you find that the generated images are completely black, try specifying SDXL VAE FP16 Fix with the --vae parameter. You can find it here: https://huggingface.co/madebyollin/sdxl-vae-fp16-fix/blob/main/sdxl_vae.safetensors
[DEBUG] clip.hpp:171  - vocab size: 49408
[DEBUG] clip.hpp:182  - trigger word img already in vocab
[DEBUG] ggml_extend.hpp:1609 - clip params backend buffer size =  66.61 MB(VRAM) (196 tensors)
[DEBUG] ggml_extend.hpp:1609 - clip params backend buffer size =  380.09 MB(VRAM) (517 tensors)
[DEBUG] ggml_extend.hpp:1609 - unet params backend buffer size =  1960.49 MB(VRAM) (1680 tensors)
[INFO ] stable-diffusion.cpp:455  - VAE Autoencoder: Using CPU backend
[DEBUG] ggml_extend.hpp:1609 - vae params backend buffer size =  94.47 MB(RAM) (140 tensors)
[DEBUG] stable-diffusion.cpp:564  - loading weights
[DEBUG] model.cpp:1998 - loading tensors from /data/kuesji/archive/models/image_generation/models/sdxl_illustrious:hassakuxl:v2.2.gguf
  |==================================================| 2641/2641 - 1586.19it/s
[INFO ] model.cpp:2216 - loading tensors completed, taking 1.67s (process: 0.00s, read: 0.74s, memcpy: 0.00s, convert: 0.01s, copy_to_backend: 0.77s)
[INFO ] stable-diffusion.cpp:645  - total params memory size = 2501.66MB (VRAM 2407.19MB, RAM 94.47MB): text_encoders 446.70MB(VRAM), diffusion_model 1960.49MB(VRAM), vae 94.47MB(RAM), controlnet 0.00MB(VRAM), pmid 0.00MB(VRAM)
[INFO ] stable-diffusion.cpp:713  - running in eps-prediction mode
[DEBUG] stable-diffusion.cpp:724  - finished loaded file
[DEBUG] stable-diffusion.cpp:2180 - generate_image 576x1024
[INFO ] stable-diffusion.cpp:736  - running with Karras scheduler
[INFO ] stable-diffusion.cpp:2313 - TXT2IMG
[INFO ] stable-diffusion.cpp:858  - attempting to apply 0 LoRAs
[INFO ] stable-diffusion.cpp:878  - apply_loras completed, taking 0.00s
[DEBUG] stable-diffusion.cpp:879  - prompt after extract and remove lora: "purple cat"
[DEBUG] conditioner.hpp:345  - parse 'purple cat' to [['purple cat', 1], ]
[DEBUG] clip.hpp:311  - token length: 77
[DEBUG] ggml_extend.hpp:1437 - clip compute buffer size: 1.40 MB(VRAM)
Segmentation fault         (core dumped) ./build/bin/sd -m "$models/image_generation/models/sdxl_illustrious:hassakuxl:v2.2.gguf" --seed -1 --steps 8 --sampling-method euler_a --scheduler karras --cfg-scale 3.5 -W 576 -H 1024 -p "purple cat" --vae-on-cpu -v

</details>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

clip related crash, segmentation fault #837

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

clip related crash, segmentation fault #837

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions