This commit is contained in:
team 1
2026-05-04 08:38:53 +02:00
parent c5e212f8f2
commit c00cb3a9b9
16 changed files with 482 additions and 88 deletions

@@ -123,3 +123,18 @@ Use this checklist for every relevant PR:
- [ ] `mto:agent:regression:test` is OK.
- [ ] The protected functional flows were manually checked if the touched area can affect them.
- [ ] README or patch README documents the reason for any intentionally accepted technical fallback.

## 9. Language cleanup ownership

Generic language cleanup must use `config/retriex/language.yaml` and its cleanup profiles.

Rules:

- Add generic German stopwords to `stopword_groups`, not to domain YAML files.
- Add user wording such as `ich suche`, `zeige mir` or `habt ihr` to `phrase_groups`.
- Add table, list and overview wording to `meta_term_groups`.
- Keep commerce-intent, product-role, measurement and routing terms in their owning domain YAML.
- Never remove protected terms such as `nicht`, `kein`, `testomat`, `indikator`, `ph`, `rx`, `th`, `tc` or `0,02` through generic cleanup.
- Prefer `cleanup_profile: ...` references over copied token lists.

See `RETRIEX_LANGUAGE_CLEANUP_GUIDE.md` for the detailed ownership rules.
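
The protected-term rule can be checked mechanically. The following is a minimal sketch, not the project's actual tooling. It assumes that `language.yaml` has already been loaded into a dict, and that each of `stopword_groups`, `phrase_groups` and `meta_term_groups` maps a group name to a list of strings. The function name `find_protected_violations`, the group names and the sample entries are hypothetical. The real schema may differ, so check it against `RETRIEX_LANGUAGE_CLEANUP_GUIDE.md` first.

```python
# Hypothetical sketch: the real schema of config/retriex/language.yaml may differ.
# Protected terms are copied from the rule list in this section.
PROTECTED_TERMS = {
    "nicht", "kein", "testomat", "indikator",
    "ph", "rx", "th", "tc", "0,02",
}

# Section names come from this checklist; their inner layout is an assumption.
GENERIC_SECTIONS = ("stopword_groups", "phrase_groups", "meta_term_groups")


def find_protected_violations(language_config):
    """Return (section, group, term) for each protected term listed for removal."""
    violations = []
    for section in GENERIC_SECTIONS:
        groups = language_config.get(section, {})
        for group_name, entries in groups.items():
            for entry in entries:
                # Multi-word phrases such as "ich suche" are checked token by token.
                for token in entry.lower().split():
                    if token in PROTECTED_TERMS:
                        violations.append((section, group_name, token))
    return violations


# Illustrative data only; group names and entries are made up for this sketch.
example_config = {
    "stopword_groups": {"german_generic": ["der", "die", "das", "und"]},
    "phrase_groups": {"user_wording": ["ich suche", "zeige mir", "habt ihr"]},
    "meta_term_groups": {"overview_wording": ["tabelle", "liste", "übersicht"]},
}

if __name__ == "__main__":
    for section, group, term in find_protected_violations(example_config):
        print(f"protected term {term!r} found in {section}.{group}")
```

A check of this kind could run next to the existing regression tests, so that a protected term added to a generic group fails review before it reaches the cleanup profiles.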