citee-methodology/data/swiece-sojowe-pl/market_metadata.json
Jacek Kubas 03a397343e Phase 1: brand catalog (Polish soy candles) + prompt curation pipeline
DATA — Public reference datasets for methodology:
- data/README.md: schema + format definitions for brand catalogs
- data/swiece-sojowe-pl/brand_catalog.json: 35 tracked brands (33 manufacturers + 2 importers) + 5 excluded marketplaces/resellers
- data/swiece-sojowe-pl/brand_catalog.md: human-readable companion
- data/swiece-sojowe-pl/market_metadata.json: GMV estimate, personas, seasonality, expected dynamics

TOOLS — 6-stage prompt curation pipeline (Python 3.12+):
- tools/prompt_curation/README.md: process documentation + cost estimates
- tools/prompt_curation/config.py: tunable parameters per stage
- tools/prompt_curation/.env.example: required API keys template
- tools/prompt_curation/requirements.txt: dependencies
- tools/prompt_curation/1_persona_generator.py: Claude generates 7 buyer personas
- tools/prompt_curation/2_prompt_brainstormer.py: 30 prompts per persona, written in that persona's voice
- tools/prompt_curation/3_reality_checker.py: Google Trends + Reddit cross-check
- tools/prompt_curation/4_validation_agents.py: 3 critic agents run asynchronously (real_buyer/methodology/exploit_hunter)
- tools/prompt_curation/5_pilot_test_runner.py: pre-flight run of a prompt sample against 3 LLMs
- tools/prompt_curation/6_human_review_export.py: CSV export for founder approval
- tools/prompt_curation/7_finalize.py: post-approval → closed prompts/{cat}/v{N}.json
- tools/prompt_curation/pipeline.py: orchestrator (stages 1–6, then human review, then 7)
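
The stage ordering described for pipeline.py can be sketched as follows. This is a minimal illustration, not the actual orchestrator: the stage names are taken from the script filenames listed above, while `plan_run` and its `approved` flag are hypothetical names introduced here to show the human-review gate before stage 7.

```python
# Stage names mirror the script filenames in tools/prompt_curation/.
# plan_run() and the `approved` flag are hypothetical, for illustration only.
AUTOMATED_STAGES = [
    "1_persona_generator",
    "2_prompt_brainstormer",
    "3_reality_checker",
    "4_validation_agents",
    "5_pilot_test_runner",
    "6_human_review_export",
]
FINAL_STAGE = "7_finalize"


def plan_run(approved: bool) -> list[str]:
    """Return the ordered stage names for one pipeline run.

    Stages 1-6 always run. Stage 7 is appended only once the human
    review of the exported CSV has been approved.
    """
    plan = list(AUTOMATED_STAGES)
    if approved:
        plan.append(FINAL_STAGE)
    return plan


if __name__ == "__main__":
    print(plan_run(approved=False))  # stops after the review export
    print(plan_run(approved=True))   # includes finalization
```

Keeping finalization behind an explicit flag means the closed prompt set cannot be written by an unattended run.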

GITIGNORE — Fixed .env.* exclusion to allow .env.example.

This commit completes Phase 1. Stage outputs (data/{cat}/personas.json,
raw_prompts.json, validated_prompts.json, critic_review.json, pilot_test_results.json,
for_human_review.csv) are runtime artifacts: public when committed, derived from
public methodology + public brand catalog. Final approved prompt strings in
prompts/{cat}/v{N}.json remain CLOSED (gitignored, to guard against Goodhart's Law).

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-05-03 18:40:12 +02:00


{
"category": "swiece-sojowe-pl",
"country": "PL",
"version": "1.0.0",
"last_updated": "2026-05-03",
"market_size": {
"estimated_brands_visible": 35,
"estimated_brands_total_market": "30-50",
"category_gmv_estimate_pln_annual": "5-15M PLN",
"tier_classification": "Tier 2 — medium market",
"gmv_estimate_methodology": "Approximated from JAKULO insider data + public Allegro category sales velocity + handmade Etsy traffic estimates. Soy candle category is niche premium handcraft, not mass market — total Polish candle category (including paraffin, mass) is ~80-150M PLN, soy-specific is 5-15% of that."
},
"scan_cadence": {
"frequency": "quarterly",
"rationale": "Tier 2: the market changes at medium velocity and doesn't need a monthly refresh. Typically 1-2 new entrants appear per quarter."
},
"buyer_personas_estimated": [
"Woman 30+ buying a gift for her mother",
"Self-care millennial, 25-35, unwinding after work",
"Interior decor enthusiast with a minimalist apartment",
"Man buying a Valentine's Day gift",
"Mother of small children looking for a safe fragrance",
"Eco-conscious consumer 30+",
"Hostess / caterer buying candles for decor"
],
"seasonality": {
"high_season": ["September", "October", "November", "December"],
"high_season_drivers": ["autumn nostalgia", "Halloween/All Saints' Day", "St. Nicholas Day (Mikołajki)", "Christmas (main peak)", "gift candles"],
"low_season": ["June", "July", "August"],
"low_season_drivers": ["summer: people don't burn candles, they open windows"],
"scan_implication": "The Q3 (July-September) scan will show the fewest buying-intent prompts citing brands; the Q4 (October-December) scan will show the most. Citee Score is not adjusted for seasonality (we want to measure REAL visibility, not correct it artificially)."
},
"key_decision_factors_for_buyers": {
"ranked_by_importance": [
"natural composition (soy wax, declared fragrance ingredients)",
"handmade",
"Polish origin / local production",
"packaging aesthetics (interior decor, gifting)",
"burn time (longer = better value perception)",
"size (120-365 ml range)",
"price (psychological threshold ~80 PLN retail, ~150 PLN premium)",
"reviews",
"brand recognition",
"availability / delivery time"
]
},
"common_buyer_queries_pattern": [
"polskie świece sojowe handmade",
"naturalna świeca sojowa",
"świeca sojowa premium prezent",
"ręcznie robione świece sojowe",
"polska marka świec",
"świeca sojowa 280 ml / 365 ml (specific size)",
"świeca sojowa o zapachu [vanilla/cinnamon/orange/jasmine/etc]",
"świeca sojowa ekologiczna"
],
"regulatory_notes": {
"advertising_restrictions": "None specific to candles. General consumer protection law applies (UOKiK).",
"labeling_requirements": "CLP regulation for fragranced products — must declare allergens. Doesn't affect AI visibility ranking, but Citee tracks whether brands are CLP-compliant in their product listings as a quality signal.",
"category_safe_to_track": true
},
"ai_search_volume_signals": {
"google_trends_pl_2024_2025": "Stable, slight upward trend ~15% YoY. Peak Nov-Dec.",
"reddit_pl_mentions_estimate": "20-50 organic mentions per month across r/Polska, r/Polska_Marka, r/CzasNaSwiece",
"quora_pl_mentions_estimate": "5-10 questions per month about Polish soy candle brands",
"estimated_chatgpt_query_volume": "~1000-3000 queries per month asking about Polish candle brands (estimated from LMW Pulse Free Checker telemetry, not exact)"
},
"expected_ranking_dynamics": {
"likely_top_3_pre_scan": ["JAKULO", "Naturaodpauli", "Yankee Candle"],
"rationale": "JAKULO has 12k customers + active brand. Naturaodpauli had top organic visibility in early 2026 scans. Yankee Candle has US legacy + strong PL e-commerce.",
"expected_volatility": "Medium — niche market with handmade producers means small marketing efforts can shift positions significantly. Anticipate 3-5 rank changes per quarter in top 20."
},
"next_scan": {
"scheduled": "2026-05 (private, validation cycle 1)",
"first_public": "2026-08 (after 3 validation cycles)"
}
}
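
The arithmetic behind `gmv_estimate_methodology` can be checked with a short script. This is a sketch under stated assumptions: the three ranges are copied by hand from the `market_size` block above (5-15M PLN stated, 80-150M PLN total candle market, 5-15% soy share), and the helper names `parse_range_millions` and `implied_range` are hypothetical, not part of the repository's tooling.

```python
import re

MILLION = 1_000_000


def parse_range_millions(text: str) -> tuple[int, int]:
    """Parse strings such as '5-15M PLN' into a (low, high) pair in PLN."""
    match = re.search(r"(\d+)\s*-\s*(\d+)\s*M", text)
    if match is None:
        raise ValueError(f"no 'A-BM' range found in {text!r}")
    return int(match.group(1)) * MILLION, int(match.group(2)) * MILLION


def implied_range(total: tuple[int, int], share_pct: tuple[int, int]) -> tuple[int, int]:
    """Widest range implied by a total range and a percentage share range."""
    return total[0] * share_pct[0] // 100, total[1] * share_pct[1] // 100


# Values copied from the market_size block; integer math avoids float noise.
stated = parse_range_millions("5-15M PLN")
total = parse_range_millions("~80-150M PLN")
implied = implied_range(total, (5, 15))

# The stated estimate should sit inside the range the methodology implies.
consistent = implied[0] <= stated[0] and stated[1] <= implied[1]

if __name__ == "__main__":
    print(f"stated:  {stated[0]:,} to {stated[1]:,} PLN")
    print(f"implied: {implied[0]:,} to {implied[1]:,} PLN")
    print(f"stated range within implied range: {consistent}")
```

With these inputs the implied range is 4.0M to 22.5M PLN, so the stated 5-15M PLN estimate is a narrower band inside what the methodology allows rather than a direct product of the endpoints.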