Maximilian Alexander Rupp
MAR — Maximilian Alexander Rupp
ValidatingStarted 2026Sustainable Fashion | Data | Compliance | EU Green Claims Directive

Clean Data Maker

Fashion catalogs, cleaned and compliant.

Fashion brands publish product data across Shopify, Zalando, AboutYou, Mirakl marketplaces and their own site. Each channel has different fields, different vocabularies, and different legal exposure. Under the EU Green Claims Directive and the updated UCPD, vague terms like eco friendly, natural or responsibly made without a verified scheme attached are now an active enforcement risk.

Clean Data Maker takes a product CSV in, runs it through a rule library mapped to the Directive plus live integrations with issuer databases like GOTS, OEKO TEX, GRS and Master of Linen, and returns a cleaned catalog with a compliance score per SKU and a drafted rewrite for every risky line.

The summary I wrote when I first noted this idea down:

Clean Data Maker is a service that cleans, validates and harmonizes fashion product data, with sustainability claims compliance at the core. Brands drop in their Shopify, Zalando or AboutYou CSV and get back a cleaned catalog, a risk report per SKU mapped to specific EU articles, and drafted compliant rewrites for every flagged claim.

How it would work

A brand uploads a product CSV from Shopify, Zalando, AboutYou or any feed with the standard columns. Within minutes the system returns three things: a cleaned CSV with normalized material composition and harmonized terminology, a compliance report per SKU with a red, amber or green risk score, and drafted compliant rewrites ready to paste back into the store.

Every flag links to the specific article it maps to, so a brand can defend the change to their lawyer or trust officer. Certifications get matched to issuer database numbers, vague green claims get rewritten into provable facts, and material composition gets sorted into the descending weight order the EU Textile Labelling Regulation requires.

Pricing is per SKU cleaned. No subscription, no minimum, no annual contract. The compliance report itself is free with cleaning. The upsell is the rewrites and the audit trail.

Who it is for

Indie and mid market fashion brands with 20 to 500 SKUs, selling across two or more channels, using one or more sustainability claims, with no in house legal or data team. The brands that feel the Directive coming but have no budget for a compliance consultancy.

Marketplace operators who want their seller catalogs cleaned before listing or after a feed migration. Premium resale sellers who want listings normalized and risk checked before they go live.

Why now

The EU Green Claims Directive entered into force in 2026 with full enforcement landing in 2027. The grace period is real but short, and most indie brands have no plan for what to do with their catalog when it ends. A service priced per SKU rather than per seat is the fastest way for them to get into compliance.

Large language models make the cleaning itself cheap. The defensible asset is the rule library plus the live issuer integrations, not the model. That is a moat that gets deeper every month the Directive moves through case law.

FAQ

Questions people usually ask.

How is this different from running my catalog through ChatGPT?
A chat tool can rewrite a line in isolation. It cannot verify a certification number against an issuer database, it does not know what your Shopify export looks like vs your Zalando feed, and it does not give you an audit trail of which Directive article each change responds to. Clean Data Maker is built around those three things.
Do you certify that my claims are legal?
No. We provide a risk score, the article reference, and a drafted alternative. Your lawyer remains the source of truth. The service is built to make their job faster and cheaper, not to replace them.
What if my CSV has different column names?
The MVP expects a standard set of columns. A mapping layer for Shopify, Zalando, AboutYou and Mirakl feeds is on the near term roadmap. For anything else, point us at one row and we can usually take it from there.
Is there a working demo?
Yes. The demo at /ideas/clean-data-maker/demo loads a sample fashion catalog and shows the cleaning, the per SKU risk report and the drafted rewrites side by side. Sign up below to get a slot for a paid pilot on your own catalog.
How is this connected to HACOY?
The rule engine grew out of work I did to keep HACOY itself defensible under the Directive. HACOY remains the test bed and the first case study. Clean Data Maker is the productized version for other brands.

Join the pilot list

First ten brands get a free audit of fifty SKUs and a paid pilot on the next four hundred and fifty if the audit lands well.

Your info goes only to me. No newsletter, no third party. I delete the list if the idea is shelved for good.