Decentralized OORT AI data hits top ranks on Google Kaggle
By: bitcoin ethereum news|2025/05/15 13:30:07
An artificial intelligence training image data set developed by decentralized AI solution provider OORT has seen considerable success on Google’s platform Kaggle.
OORT’s Diverse Tools Kaggle data set listing was released in early April; since then, it has climbed to the first page in multiple categories. Kaggle is a Google-owned online platform for data science and machine learning competitions, learning and collaboration.
Ramkumar Subramaniam, core contributor at crypto AI project OpenLedger, told Cointelegraph that “a front-page Kaggle ranking is a strong social signal, indicating that the data set is engaging the right communities of data scientists, machine learning engineers and practitioners.”
Max Li, founder and CEO of OORT, told Cointelegraph that the firm “observed promising engagement metrics that validate the early demand and relevance” of its training data gathered through a decentralized model. He added:
“The organic interest from the community, including active usage and contributions, demonstrates how decentralized, community-driven data pipelines like OORT’s can achieve rapid distribution and engagement without relying on centralized intermediaries.”
Li also said that OORT plans to release multiple data sets in the coming months. Among those are an in-car voice commands data set, one for smart home voice commands and another for deepfake videos meant to improve AI-powered media verification.
First page in multiple categories
The data set in question was independently verified by Cointelegraph to have reached the first page in Kaggle’s General AI, Retail & Shopping, Manufacturing, and Engineering categories earlier this month. By the time of publication, it had lost those positions following a possibly unrelated data set update on May 6 and another on May 14.
While recognizing the achievement, Subramaniam told Cointelegraph that “it’s not a definitive indicator of real-world adoption or enterprise-grade quality.” He said that what sets OORT’s data set apart “is not just the ranking, but the provenance and incentive layer behind the data set.” He explained:
“Unlike centralized vendors that may rely on opaque pipelines, a transparent, token-incentivized system offers traceability, community curation, and the potential for continuous improvement assuming the right governance is in place.”
Lex Sokolin, partner at AI venture capital firm Generative Ventures, said that while he does not think these results are hard to replicate, “it does show that crypto projects can use decentralized incentives to organize economically valuable activity.”
High-quality AI training data: a scarce commodity
Data published by AI research firm Epoch AI estimates that human-generated text AI training data will be exhausted in 2028. The pressure is high enough that investors are now mediating deals that grant AI companies rights to copyrighted materials.
Reports concerning increasingly scarce AI training data and how it may limit growth in the space have been circulating for years. While synthetic (AI-generated) data is increasingly used with at least some degree of success, human data is still largely viewed as the better alternative: higher-quality data that leads to better AI models.
When it comes to images for AI training specifically, things are becoming increasingly complicated, with artists sabotaging training efforts on purpose. Nightshade, a tool meant to protect images from being used for AI training without permission, allows users to “poison” their images and severely degrade model performance.
Subramaniam said, “We’re entering an era where high-quality image data will become increasingly scarce.” He also recognized that this scarcity is made more dire by the increasing popularity of image poisoning:
“With the rise of techniques like image cloaking and adversarial watermarking to poison AI training, open-source datasets face a dual challenge: quantity and trust.”
In this situation, Subramaniam said that verifiable and community-sourced incentivized data sets are “more valuable than ever.” According to him, such projects “can become not just alternatives, but pillars of AI alignment and provenance in the data economy.”
Source: https://cointelegraph.com/news/oort-decentralized-ai-dataset-climbs-kaggle-rankings?utm_source=rss_feed&utm_medium=feed&utm_campaign=rss_partner_inbound