输出凭证
选择一个提示,查看工具的并排回应,包括 token 成本、耗时和评审分数。每个输出都可审计、标注方法并缓存以便公平比较。
analyze
blog
- 1000-word blog intro on AI coding assistants for small teams预期: ~1000 words
Balanced comparison, includes a real table, identifies trade-offs, avoids vendor-speak.
- 800-word blog intro on sustainable fashion预期: ~800 words
Strong hook, scannable structure, specific stats, clear article preview, no filler.
code
- Python CSV dedupe with fuzzy matching预期: 80-150 lines
Streams input, correct rapidfuzz usage, outputs merge report, reasonable complexity.
- React TypeScript todo list component预期: 120-250 lines
Compiles, types are tight, handles edge cases, accessible, idiomatic.
- SQL retention cohort query (Postgres)预期: 30-80 lines
Correct cohort logic, efficient (uses generate_series), readable CTE layout.
marketing
social
summarize
- Summarize a 10K earnings call transcript预期: 200-350 words
Faithful, no hallucinated numbers, correct structure, scannable.
- Summarize a SaaS MSA contract into plain-English bullets预期: 300-500 words
Accurate, readable by non-lawyer, flags real negotiation levers.
- Summarize an NLP research paper abstract预期: 150-250 words
Plausible thesis, method/results/limitations are distinct, not vague marketing speak.
translate
- Translate a 500-word EN technical doc to Simplified Chinese预期: ~500 Chinese characters equivalent
Fluent zh-CN, technical accuracy, consistent terminology, code preserved.
- Translate a Chinese marketing tagline set to English预期: 200-350 words
Natural, idiomatic, preserves marketing energy, variants differ meaningfully.