Selected Datasets – OpenKG (Open Knowledge Graph)

Selected Datasets

The OpenKG community curates a collection of high-quality datasets spanning common sense, encyclopedic knowledge, finance, and medicine.

cnSchema

A schema reference standard managed and maintained by OpenKG, designed around the characteristics of the Chinese language and the application requirements of Chinese-language domains.

OneGraph

A knowledge graph project initiated and maintained by OpenKG that is built with large language models and serves large language model applications.

Covid-19KG

Centered on viruses and bacteria, this graph extends to related treatments and diseases and integrates encyclopedia knowledge to form an encyclopedic knowledge graph of the novel coronavirus.

OpenRichpedia

A large-scale multimodal knowledge graph whose contents can be applied in fields such as natural language processing and computer vision.

OpenBG

The first large-scale open business knowledge graph, released by Alibaba Group to promote a deeper understanding of retail data.

OpenConcepts

A large-scale Chinese concept knowledge graph based on knowledge extraction, containing a large number of fine-grained concepts.

CKGG

A Chinese knowledge graph for high school geography that can provide students with better computer-assisted education.

GAKG

A large-scale multimodal academic knowledge graph for the earth sciences.

Zhishi.me

A first attempt to construct a Chinese general knowledge graph by extracting structured data from open encyclopedia data.

CN-DBpedia

A large-scale, general-domain structured encyclopedia developed and maintained by the Knowledge Factory Laboratory of Fudan University.

PKU-PIE Knowledge Base

The Peking University Chinese encyclopedia knowledge graph, a knowledge base built by automatically collecting knowledge from multiple sources such as Wikipedia, DBpedia, and Baidu Encyclopedia.

七律 General Knowledge Graph

An encyclopedic knowledge graph built by Goosegrass Technology, covering things, facts, concepts, rules, and more.

Knowledge Graph of Common Household Diseases

A knowledge graph covering common diseases along with their symptoms, treatments, commonly used medicines, recommended recipes, and more.

CN-Probase

A large-scale Chinese concept graph developed and maintained by the Knowledge Factory Laboratory of Fudan University, with an accuracy of over 95% for isA relationships.

DiaKG: Diabetes Knowledge Graph Dataset

This dataset is derived from 41 publicly available diabetes guidelines and consensus articles, covering the most widely studied research topics and hot areas of recent years.

THUOCL

A high-quality Chinese lexicon compiled and released by the Natural Language Processing and Social Humanities Computing Laboratory of Tsinghua University.

XLORE Bilingual Encyclopedia Knowledge Graph

Built by extracting structured information from heterogeneous cross-lingual online encyclopedias, it is the first large-scale knowledge graph to balance Chinese and English knowledge.

《大词林》 (Big Cilin)

A system released by Harbin Institute of Technology that automatically crawls entities and their concepts from the Internet to build a general knowledge graph organized by hierarchical relationships.

Chinese Open Wordnet (汉语开放词网)

Chinese Open Wordnet collects and organizes important open knowledge bases and knowledge graph projects from China and abroad, and compiles related Chinese-language materials.

A clinical terminology published by Yidu Cloud, manually edited by Yidu Cloud doctors based on the distribution of real medical records, providing a basis for standardizing clinical terminology.