fix: categorization uses configured LLM; MCP returns str; Unicode JSON preserved #3429

ErinnerMO · 2025-09-08T02:56:53Z

Description

Fix: Categorization now respects the configured LLM provider/model (DeepSeek or OpenAI). Previously it was hardcoded to an OpenAI model, so even when both the LLM and embeddings used non‑OpenAI providers, categorization still forced OpenAI. It now follows the configured provider/model for these two backends.
Fix: Normalize MCP server tool-handler return type to str per the MCP interface specification (some handlers previously returned a Python dict).
Enhancement: Preserve Unicode in JSON responses (no ASCII escaping, e.g., ensure_ascii=False), improving non‑ASCII handling.
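
As a rough illustration of the last two items, a tool handler can serialize its result once, at the boundary, so callers always receive a str and non-ASCII text is left intact. The helper name `to_tool_response` is hypothetical and not taken from this PR; only `json.dumps(..., ensure_ascii=False)` is the standard-library call the description refers to.

```python
import json
from typing import Any


def to_tool_response(result: Any) -> str:
    """Return a tool result as a string (hypothetical helper, not from the PR)."""
    if isinstance(result, str):
        # Already a string: pass it through unchanged.
        return result
    # ensure_ascii=False keeps characters such as "ñ" or "🚀" as-is
    # instead of emitting \uXXXX escape sequences.
    return json.dumps(result, ensure_ascii=False)


payload = to_tool_response({"memory": "mañana 🚀", "count": 1})
```

Serializing in one place keeps individual handlers free to build dicts internally while the server still returns a consistent type.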

Type of change

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Refactor (does not change functionality, e.g. code style improvements, linting)

How Has This Been Tested?

Test Script (manual verification using live provider APIs; no new unit tests were added)

Manual test setup:

Environment variables:
- OPENAI_BASE_URL = Aliyun DashScope’s OpenAI‑compatible endpoint
- OPENAI_API_KEY = API key for Aliyun DashScope’s OpenAI‑compatible endpoint
- DEEPSEEK_BASE_URL = official DeepSeek API base URL
- DEEPSEEK_API_KEY = official DeepSeek API key
Models:
- LLM: DeepSeek (model: deepseek-chat)
- Embeddings: Qwen3 (model: text-embedding-v4, via DashScope)
Note: At this time, categorization supports DeepSeek and OpenAI only; this run validates the DeepSeek path with Qwen3 embeddings.
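
The setup above implies that categorization picks its endpoint and credentials from the configured provider instead of a fixed OpenAI model. A minimal sketch of that selection follows. The function `resolve_categorization_backend`, the `BackendSettings` type, and the placeholder URL are illustrative assumptions, not the PR's actual code; only the environment variable names and `deepseek-chat` come from the description.

```python
import os
from dataclasses import dataclass
from typing import Mapping, Optional

# Env var names are from the test setup; the mapping itself is an assumption.
_ENV_BY_PROVIDER = {
    "openai": ("OPENAI_BASE_URL", "OPENAI_API_KEY"),
    "deepseek": ("DEEPSEEK_BASE_URL", "DEEPSEEK_API_KEY"),
}


@dataclass(frozen=True)
class BackendSettings:
    provider: str
    model: str
    base_url: Optional[str]
    api_key: Optional[str]


def resolve_categorization_backend(
    provider: str, model: str, env: Mapping[str, str] = os.environ
) -> BackendSettings:
    """Pick endpoint and key for the configured provider (hypothetical helper)."""
    key = provider.lower()
    if key not in _ENV_BY_PROVIDER:
        # The PR notes that only DeepSeek and OpenAI are supported for now.
        raise ValueError(f"unsupported categorization provider: {provider}")
    base_url_var, api_key_var = _ENV_BY_PROVIDER[key]
    return BackendSettings(key, model, env.get(base_url_var), env.get(api_key_var))


# Placeholder values for illustration only; no request is made.
fake_env = {
    "DEEPSEEK_BASE_URL": "https://deepseek.example/v1",
    "DEEPSEEK_API_KEY": "sk-test",
    "OPENAI_BASE_URL": "https://dashscope.example/v1",
    "OPENAI_API_KEY": "sk-other",
}
settings = resolve_categorization_backend("DeepSeek", "deepseek-chat", fake_env)
```

Passing the environment as a mapping keeps the selection logic testable without touching real credentials.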

Steps and expected results:

Categorization respects configured provider/model (DeepSeek)
- Configure LLM provider = DeepSeek (deepseek-chat); embeddings = Qwen3 text-embedding-v4 via DashScope (through OPENAI_BASE_URL).
- Trigger a memory entry categorization.
- Expectation: categorization calls DeepSeek (no OpenAI fallback). Verify via request/trace logs that calls hit DEEPSEEK_BASE_URL; no requests are sent to api.openai.com.
Unicode JSON responses
- Submit text containing non‑ASCII characters (e.g., “Unicode character test — mañana 🚀”).
- Expectation: Unicode remains unescaped (no \uXXXX sequences); response encoding is UTF‑8.
MCP tool call return type is normalized to string
- Invoke a tool handler that previously could return a Python dict.
- Expectation: server now returns a string payload per the MCP spec; isinstance(response, str) is True and json.loads(response) succeeds.
Lint sanity (F811 fix)
- Run pre‑commit hooks (e.g., pre-commit run -a).
- Expectation: no F811 (“redefinition of unused name”) warning; checks pass.
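
The expectations in the Unicode and MCP steps can be checked mechanically. The following is a sketch only: `check_tool_response` is a hypothetical helper, and the sample payload stands in for a real server response, which this snippet does not fetch.

```python
import json
import re
from typing import Any

# Matches JSON-style escapes such as \u00f1 in the raw payload text.
_ESCAPE = re.compile(r"\\u[0-9a-fA-F]{4}")


def check_tool_response(response: object) -> Any:
    """Validate a tool response against the expectations listed above."""
    if not isinstance(response, str):
        raise TypeError(f"expected str, got {type(response).__name__}")
    if _ESCAPE.search(response):
        raise ValueError("response contains \\uXXXX escape sequences")
    response.encode("utf-8")  # raises UnicodeEncodeError on lone surrogates
    return json.loads(response)  # raises json.JSONDecodeError on invalid JSON


# Stand-in for a server response; built locally for illustration.
sample = json.dumps({"text": "mañana 🚀"}, ensure_ascii=False)
parsed = check_tool_response(sample)
```

A dict response fails the first check, and a payload produced with the default `ensure_ascii=True` fails the second, which mirrors the two regressions this PR targets.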

Checklist:

My code follows the style guidelines of this project
I have performed a self-review of my own code
My changes generate no new warnings
I have checked my code and corrected any misspellings

Maintainer Checklist

N/A — no associated GitHub issue for this PR
Made sure Checks passed

CLAassistant · 2025-09-08T02:56:58Z

All committers have signed the CLA.

ErinnerMO added 4 commits September 5, 2025 18:21

chore: resolve F811 by deduping handle_post_message handlers

7e946d6

fix: stringify tool responses, fix search filter precedence, enable Unicode JSON

347e298

fix: use configured LLM for categorization and add DeepSeek support

96594e7

Merge branch 'mem0ai:main' into openmemory/feature/categorization-use-configured-llm

c107d07
