citee-methodology

jacek/citee-methodology

Fork 0

Commit graph

Author	SHA1	Message	Date
Jacek Kubas	03a397343e	Faza 1: brand catalog (świece sojowe PL) + prompt curation pipeline DATA — Public reference datasets for methodology: - data/README.md: schema + format definitions for brand catalogs - data/swiece-sojowe-pl/brand_catalog.json: 35 tracked brands (33 manufacturers + 2 importers) + 5 excluded marketplaces/resellers - data/swiece-sojowe-pl/brand_catalog.md: human-readable companion - data/swiece-sojowe-pl/market_metadata.json: GMV estimate, personas, seasonality, expected dynamics TOOLS — 6-stage prompt curation pipeline (Python 3.12+): - tools/prompt_curation/README.md: process documentation + cost estimates - tools/prompt_curation/config.py: tunable parameters per stage - tools/prompt_curation/.env.example: required API keys template - tools/prompt_curation/requirements.txt: dependencies - tools/prompt_curation/1_persona_generator.py: Claude generates 7 buyer personas - tools/prompt_curation/2_prompt_brainstormer.py: per persona × 30 prompts in voice - tools/prompt_curation/3_reality_checker.py: Google Trends + Reddit cross-check - tools/prompt_curation/4_validation_agents.py: 3 critic agents async (real_buyer/methodology/exploit_hunter) - tools/prompt_curation/5_pilot_test_runner.py: sample × 3 LLM models pre-flight - tools/prompt_curation/6_human_review_export.py: CSV export for founder approval - tools/prompt_curation/7_finalize.py: post-approval → closed prompts/{cat}/v{N}.json - tools/prompt_curation/pipeline.py: orchestrator (stages 1–6, then human review, then 7) GITIGNORE — Fixed .env.* exclusion to allow .env.example. This commit completes Faza 1. Stages outputs (data/{cat}/personas.json, raw_prompts.json, validated_prompts.json, critic_review.json, pilot_test_results.json, for_human_review.csv) are runtime artifacts — public when committed, derived from public methodology + public brand catalog. Final approved prompt strings in prompts/{cat}/v{N}.json remain CLOSED (gitignored, anti-Goodhart's Law). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-03 18:40:12 +02:00
Jacek Kubas	f76cf2858b	v1.0.0 — initial Citee Index Methodology release Foundational public methodology for the first open public ranking of brand visibility in AI search results (ChatGPT, Perplexity, Gemini, Claude). This release establishes the framework — no rankings have been computed or published yet. First scan cycle: late May 2026 (private validation). First public ranking publication target: August 2026, after 3 validation cycles. Includes: - methodology.json: machine-readable formulas, weights, policies - README.md: human-readable overview + open/closed boundary - CHANGELOG.md: versioning policy + v1.0.0 release notes - taxonomy.md: tier system + 11 PL pilot categories - LICENSE: MIT - .gitignore: closed operational data (exact prompts, anti-gaming thresholds) - prompts/README.md: 6-stage prompt curation process - prompts/example-swiece-sojowe-pl.md: illustrative framework for first category Strategic principles: - Algorithm-first, no advisory board - Open methodology + closed exact prompts (Goodhart's Law defense) - No retroactive changes (FIDE 2024 lesson) - No pay-to-play, hard rule (Moody's / Forbes 30 Under 30 lessons) - Subjective opinion disclaimer (Gartner v. NetScout 2020 First Amendment shield) Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-03 17:25:56 +02:00

Author

SHA1

Message

Date

Jacek Kubas

03a397343e

Faza 1: brand catalog (świece sojowe PL) + prompt curation pipeline

DATA — Public reference datasets for methodology:
- data/README.md: schema + format definitions for brand catalogs
- data/swiece-sojowe-pl/brand_catalog.json: 35 tracked brands (33 manufacturers + 2 importers) + 5 excluded marketplaces/resellers
- data/swiece-sojowe-pl/brand_catalog.md: human-readable companion
- data/swiece-sojowe-pl/market_metadata.json: GMV estimate, personas, seasonality, expected dynamics

TOOLS — 6-stage prompt curation pipeline (Python 3.12+):
- tools/prompt_curation/README.md: process documentation + cost estimates
- tools/prompt_curation/config.py: tunable parameters per stage
- tools/prompt_curation/.env.example: required API keys template
- tools/prompt_curation/requirements.txt: dependencies
- tools/prompt_curation/1_persona_generator.py: Claude generates 7 buyer personas
- tools/prompt_curation/2_prompt_brainstormer.py: per persona × 30 prompts in voice
- tools/prompt_curation/3_reality_checker.py: Google Trends + Reddit cross-check
- tools/prompt_curation/4_validation_agents.py: 3 critic agents async (real_buyer/methodology/exploit_hunter)
- tools/prompt_curation/5_pilot_test_runner.py: sample × 3 LLM models pre-flight
- tools/prompt_curation/6_human_review_export.py: CSV export for founder approval
- tools/prompt_curation/7_finalize.py: post-approval → closed prompts/{cat}/v{N}.json
- tools/prompt_curation/pipeline.py: orchestrator (stages 1–6, then human review, then 7)

GITIGNORE — Fixed .env.* exclusion to allow .env.example.

This commit completes Faza 1. Stages outputs (data/{cat}/personas.json,
raw_prompts.json, validated_prompts.json, critic_review.json, pilot_test_results.json,
for_human_review.csv) are runtime artifacts — public when committed, derived from
public methodology + public brand catalog. Final approved prompt strings in
prompts/{cat}/v{N}.json remain CLOSED (gitignored, anti-Goodhart's Law).

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

2026-05-03 18:40:12 +02:00

Jacek Kubas

f76cf2858b

v1.0.0 — initial Citee Index Methodology release

Foundational public methodology for the first open public ranking of brand
visibility in AI search results (ChatGPT, Perplexity, Gemini, Claude).

This release establishes the framework — no rankings have been computed
or published yet. First scan cycle: late May 2026 (private validation).
First public ranking publication target: August 2026, after 3 validation
cycles.

Includes:
- methodology.json: machine-readable formulas, weights, policies
- README.md: human-readable overview + open/closed boundary
- CHANGELOG.md: versioning policy + v1.0.0 release notes
- taxonomy.md: tier system + 11 PL pilot categories
- LICENSE: MIT
- .gitignore: closed operational data (exact prompts, anti-gaming thresholds)
- prompts/README.md: 6-stage prompt curation process
- prompts/example-swiece-sojowe-pl.md: illustrative framework for first category

Strategic principles:
- Algorithm-first, no advisory board
- Open methodology + closed exact prompts (Goodhart's Law defense)
- No retroactive changes (FIDE 2024 lesson)
- No pay-to-play, hard rule (Moody's / Forbes 30 Under 30 lessons)
- Subjective opinion disclaimer (Gartner v. NetScout 2020 First Amendment shield)

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

2026-05-03 17:25:56 +02:00

2 commits