37 lines
1.2 KiB
Markdown
37 lines
1.2 KiB
Markdown
# RetrieX Patch 21 - Language Cleanup Profiles groundwork
|
|
|
|
## Goal
|
|
|
|
Prepare RetrieX 1.5.3 for simpler, centralized language cleanup without changing runtime behavior yet.
|
|
|
|
## Changes
|
|
|
|
- Extends `config/retriex/language.yaml` additively.
|
|
- Keeps legacy `retriex.stopwords.config.words` unchanged.
|
|
- Adds central groups for protected terms, German core stopwords, conversation noise, user instruction phrases, presentation/meta terms, and cleanup profiles.
|
|
- Introduces initial profiles: `commerce_query`, `rag_evidence`, `shop_context_fallback`.
|
|
|
|
## Non-goals
|
|
|
|
- No external stopword library.
|
|
- No Commerce/Agent runtime wiring yet.
|
|
- No removal of existing lists in `commerce.yaml`, `agent.yaml`, or `retrieval.yaml`.
|
|
- No domain-specific special cases.
|
|
|
|
## Install
|
|
|
|
Copy the files from this patch over the current RetrieX root.
|
|
|
|
```bash
|
|
unzip retriex-p21-language-cleanup-profiles-patch-only.zip -d /path/to/retriex
|
|
cd /path/to/retriex
|
|
bin/console mto:agent:config:validate
|
|
bin/console mto:agent:regression:test
|
|
bin/console mto:agent:config:audit-source --details
|
|
bin/console mto:agent:config:audit-patterns --details
|
|
```
|
|
|
|
## Expected result
|
|
|
|
All checks should remain green. This patch should not change answers yet.
|