Skip to main content
These are real prompt/response pairs from Copilot’s evaluation suite, selected from runs that demonstrated strong performance across all 12 evaluation dimensions: workflow compliance, source grounding, incumbent honesty, business plausibility, data quality, report structure, specificity, foundational concept integration, attribution quality, source balance, actionability, and freshness. Click any example to see the full unedited Copilot output.

Consumer Apps for Mobile Developers

Generated with Sonnet 4.6 · Translate your background
I’m a mobile developer — shipped a few apps on the App Store with 50k+ downloads. Never touched crypto before but I keep hearing about Solana being fast and cheap. What kinds of consumer apps are people building? I want something normies would actually use, not another dashboard for degens.
Copilot mapped the user’s mobile dev background directly to underserved verticals, surfacing Trepa (C3, React Native prediction app) and Cron (native iOS P2P payments) as proof points. It identified that native iOS is structurally underrepresented (React dominates at 41% of submissions) and recommended a live game show app with USDC prizes — solving HQ Trivia’s fatal unit economics problem ($500 in PayPal fees per 10K prizes vs $0.10 on Solana).

B2B Stablecoin Payments

Generated with Sonnet 4.6 · Validate a startup idea
I work in logistics and supply chain at a Fortune 500. We spend millions on cross-border payment fees and settlement takes 3-5 days. I’ve been researching stablecoins and think there’s a massive opportunity to cut out SWIFT entirely for B2B trade finance. Is anyone doing this on Solana? What would a realistic MVP look like?
Copilot immediately surfaced CargoBill (C3 accelerator, 1st Place Stablecoins) as the most direct predecessor, then classified the gap as “Partial — Segment”: CargoBill targets freight forwarders and logistics operators, not the Fortune 500 buyers who generate the payment flows. It identified OpenFX ($23M Accel, $10B annualized volume) and Fin ($17M) as the funded competitive threats, and recommended an ERP-integrated stablecoin treasury module targeting the corporate AP side of the same transaction.

Gaming Track: Radar vs. Renaissance

Generated with Sonnet 4.6 · Compare hackathons
Compare gaming track submissions between Radar and Renaissance.
Copilot pulled submission counts, prize winners, and project details for both hackathons’ gaming tracks, then surfaced the key thematic shift: Radar’s top prizes went to DeFi/trading game hybrids (The Arena, Windfall, TrenchTown), while Renaissance rewarded DePIN-gaming crossovers (MeshMap + City Champ) and infrastructure-layer thinking (Honeycomb). It identified the sophistication shift from wallet integration and NFT ownership toward ZK/VDF fairness, ephemeral rollups, and cross-track DePIN plays.

MEV Batch Auctions

Generated with Sonnet 4.6 · Challenge a thesis
My thesis is that Solana doesn’t need another DEX — it needs better execution infrastructure. I want to build a batch auction system that eliminates MEV for retail traders. Think CowSwap but native to Solana’s architecture. Has anyone tried this?
Copilot surfaced three accelerator companies already building MEV-protective DEXs on Solana: Archer Exchange (C4, DFBA batch auctions), URANI (C1, intent-based aggregator), and DARKLAKE (C2, ZK-proof matching). Rather than validating the DEX angle, it identified the real gap: no one has built the infrastructure layer — a batch-settlement routing layer beneath Jupiter that existing DEXs route through. Cited $370-500M extracted from Solana users in 16 months as the TAM proxy.

AI Agent Payments

Generated with Sonnet 4.6 · Find who’s building this
Everyone’s talking about AI agents but nobody’s figured out payments. I want to build a protocol where AI agents can pay for services programmatically — like a Stripe for autonomous agents. Has anyone in the Solana ecosystem tried this? What’s the architecture look like?
Copilot identified a dense cluster (crowdedness 325) and surfaced MCPay (C4 accelerator, 1st Place Stablecoins) as the most validated project — MCP tool monetization via x402. It cited Nick Szabo’s micropayments essay (machines have no mental accounting barrier) and a16z’s “Tourists in the Bazaar” framing to argue the real gap isn’t payment plumbing (x402 is winning) but agent spending policy engines — the “Brex for AI Agents” layer that manages budgets, credit lines, and compliance.

Privacy-Preserving Stablecoin

Generated with Sonnet 4.6 · Research concepts
I want to build a privacy-preserving stablecoin — like a Zcash-style shielded pool but for USDC on Solana. Users deposit USDC, get a private balance, and can transfer without anyone seeing amounts or recipients. I know Tornado Cash got sanctioned but I think there’s a compliant way to do this with selective disclosure and ZK proofs. What does the landscape look like and is this even possible on Solana technically?
Copilot identified a critical Token-2022 limitation: Confidential Balances hide transfer amounts but NOT sender/recipient addresses — a full Zcash-style shielded pool requires a separate program. It surfaced Umbra ($155M ICO commitments, Feb 2026 launch) as the closest competitor using MPC, then traced the compliance-first approach through a16z’s 2022 paper on privacy-protecting regulatory solutions. The recommended wedge: regulated B2B private payments (payroll, supplier payments) where institutions can’t use public chains.

Evaluation dimensions

Each example was scored across these 12 dimensions:
DimensionWhat it measures
Workflow complianceDid it follow the research workflow correctly?
Source groundingIs every claim traced to a specific source?
Incumbent validation honestyDoes it honestly flag competitors and saturation?
Business plausibilityIs the revenue model and GTM realistic?
Data quality feedbackAre the cited sources relevant and high-quality?
Report structure complianceDoes the output follow the required format?
Specificity scoreAre claims backed by concrete numbers?
Foundational concept integrationDo archive sources inform the thesis?
Attribution qualityDoes every claim map to a specific source?
Source balanceIs there a healthy mix of projects, archives, and web data?
ActionabilityCan a founder act on this report?
FreshnessIs the data current (within 6 months)?