MtoRagSystem/RETRIEX_PATCH_21_LANGUAGE_CLEANUP_PROFILES_README.md
team 1 ff407d9fa1 p21
2026-05-03 20:25:08 +02:00


RetrieX Patch 21 - Language Cleanup Profiles groundwork

Goal

Prepare RetrieX 1.5.3 for simpler, centralized language cleanup without changing runtime behavior yet.

Changes

  • Extends config/retriex/language.yaml additively.
  • Keeps legacy retriex.stopwords.config.words unchanged.
  • Adds central groups for protected terms, German core stopwords, conversation noise, user instruction phrases, presentation/meta terms, and cleanup profiles.
  • Introduces initial profiles: commerce_query, rag_evidence, shop_context_fallback.
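Once the patch is installed, a quick way to confirm the additive changes landed is to look for the three profile names in the language config. The sketch below is only a convenience check, not part of the patch. It assumes each profile name appears literally somewhere in config/retriex/language.yaml (the exact key layout is not shown in this README), and the function name check_cleanup_profiles is hypothetical.

```shell
# Sketch: confirm the three profile names from this patch appear in the
# language config. Assumption: each name occurs literally in the YAML file.
# check_cleanup_profiles is a hypothetical helper, not a RetrieX command.
check_cleanup_profiles() {
  config_file="${1:-config/retriex/language.yaml}"
  missing=0
  for profile in commerce_query rag_evidence shop_context_fallback; do
    # grep -q exits non-zero when the name is absent or the file is unreadable.
    if ! grep -q -- "$profile" "$config_file"; then
      echo "missing profile: $profile" >&2
      missing=1
    fi
  done
  return "$missing"
}
```

Run it from the RetrieX root as `check_cleanup_profiles`, or pass another path as the first argument. A non-zero exit status means at least one profile name was not found.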

Non-goals

  • No external stopword library.
  • No Commerce/Agent runtime wiring yet.
  • No removal of existing lists in commerce.yaml, agent.yaml, or retrieval.yaml.
  • No domain-specific special cases.

Install

Unpack the patch archive over the current RetrieX root, then run the validation, regression, and audit commands below from that root.

unzip retriex-p21-language-cleanup-profiles-patch-only.zip -d /path/to/retriex
cd /path/to/retriex
bin/console mto:agent:config:validate
bin/console mto:agent:regression:test
bin/console mto:agent:config:audit-source --details
bin/console mto:agent:config:audit-patterns --details
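
The four checks above can also be run as one fail-fast step, so the first failing command is easy to spot. This is a minimal sketch, assuming it is run from the RetrieX root. The function name run_retriex_checks and the RETRIEX_CONSOLE override are hypothetical helpers for illustration; only the console commands themselves come from this README.

```shell
# Sketch: run the README's four checks in order and stop at the first failure.
# RETRIEX_CONSOLE is a hypothetical override for the console path, not a
# RetrieX setting; it defaults to bin/console as used in this README.
run_retriex_checks() {
  console="${RETRIEX_CONSOLE:-bin/console}"
  for check in \
    "mto:agent:config:validate" \
    "mto:agent:regression:test" \
    "mto:agent:config:audit-source --details" \
    "mto:agent:config:audit-patterns --details"
  do
    echo "==> $console $check"
    # $check is intentionally unquoted so "--details" becomes its own argument.
    if ! "$console" $check; then
      echo "FAILED: $check" >&2
      return 1
    fi
  done
  echo "All checks passed."
}
```

After unpacking the patch, `cd /path/to/retriex` and call `run_retriex_checks`. A non-zero return status points at the first command that did not pass.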

Expected result

All checks should remain green. Because the new groups and profiles are not wired into the runtime yet, this patch should not change answers.