New York
Startups
By : Syed Owais Date:October 13, 2025
Ready to Share Your Story or Recommend a Leader?
Join our mission to connect and empower entrepreneurial ecosystem.
Datacurve, a data infrastructure startup, has raised $15 million in a Series A round led by Mark Goldberg’s Chemistry, with participation from teams at DeepMind, OpenAI, Anthropic, and Vercel. TechCrunch Previously, the company raised $2.7 million in a seed round.
The startup is approaching data collection with a “bounty hunter” model: software engineers can claim paid tasks (bounties) to source the hardest-to-obtain datasets. So far, Datacurve has distributed over $1 million in bounties.
Co-founder Serena Ge emphasizes user experience as a critical differentiator. Unlike many data labeling operations, Datacurve treats its platform like a consumer product, optimizing for engagement and retention.
Though its current focus is on software engineering datasets, the model may extend into sectors like marketing, finance, or healthcare as data demands grow more complex.
Datacurve is entering at a pivotal moment. As AI companies mature, access to high-quality training data is becoming increasingly competitive. TechCrunch With Scale AI being one of the most notable incumbents, Datacurve is positioning itself as a nimble, expert-oriented alternative.
The bounty model allows Datacurve to tap into specialized talent globally, applying financial incentives to attract the best contributors. That compensates for the fact that data work often pays far less than other forms of engineering.
Their decision to prioritize UX (user experience) over pure scale is strategic: for tasks where quality is essential, a better experience helps retain contributors and maintain data integrity.
Launching in software engineering is smart, it’s a well understood domain with strong demand for structured datasets. But with success here, Datacurve may expand into verticals where tailored data is harder to source.
Successful AI models increasingly require not just large volume but domain-specific, high-quality datasets. The frontier is no longer generic text or images, it’s rare, contextual datasets that reflect real environments.
In that environment, platforms that can reliably coordinate, verify, and incentivize specialized data work will have an edge. Datacurve’s bounty system is one answer: break down difficult data tasks into discrete, payable chunks and distribute them to domain experts.
Meanwhile, Scale AI remains a formidable incumbent in data curation and pipeline infrastructure. Datacurve’s challenge will be differentiating on quality, cost, contributor experience, and domain flexibility.
If Datacurve succeeds, it could emerge as a specialist hub for rare, high-value data tasks that generalist data platforms struggle to serve. As AI demands more refined data, the ability to match task complexity, contributor skill, and incentive dynamically becomes a differentiator.
Because Datacurve treats its platform like a product, not just a labeling pipeline, it could build contributor loyalty and reputation systems that sustain long-term engagement.
As more verticals adopt AI, finance, health, simulation, robotics, demand for domain-rich data will only rise. Datacurve’s model, if scalable, could become a backbone for post-training data ecosystems.
Founder & Fractional CBO - Who loves to deliver value over hype. Aiming to build a no-BS community for founders (by founder), investors, venture capitalists, accelerators and journalists.
- Gain full access to our premium content
- Never miss a story with active notifications
- Browse free from up to 5 devices at once