🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools
-
Updated
May 12, 2026 - Python
🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools
AKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库
pix2code: Generating Code from a Graphical User Interface Screenshot
Cleanlab's open-source library is the standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
Open source annotation tool for machine learning practitioners.
AI Observability & Evaluation
TFDS is a collection of datasets ready to use with TensorFlow, Jax, ...
搜索所有中文NLP数据集,附常用英文NLP数据集
TorchGeo: datasets, samplers, transforms, and pre-trained models for geospatial data
⚡FlashRAG: A Python Toolkit for Efficient RAG Research (WWW2025 Resource)
A framework for collecting and analyzing prediction market data, including the largest publicly available dataset of Polymarket and Kalshi market and trade data.
Papers and Datasets about Point Cloud.
Colour Science for Python
数据接口:百度、谷歌、头条、微博指数,宏观数据,利率数据,货币汇率,千里马、独角兽公司,新闻联播文字稿,影视票房数据,高校名单,疫情数据…
An extension of Open3D to address 3D Machine Learning tasks
Benchmark datasets, data loaders, and evaluators for graph machine learning
A repository that contains models, datasets, and fine-tuning techniques for DB-GPT, with the purpose of enhancing model performance in Text-to-SQL
The AI Datastore for Schemas, BLOBs, and Predictions. Use with your apps or integrate built-in Human Supervision, Data Workflow, and UI Catalog to get the most value out of your AI Data.
Language Understanding Evaluation benchmark for Chinese: datasets, baselines, pre-trained models,corpus and leaderboard
Add a description, image, and links to the datasets topic page so that developers can more easily learn about it.
To associate your repository with the datasets topic, visit your repo's landing page and select "manage topics."