What is ConceptNet?
ConceptNet is a freely-available semantic network, designed to help computers understand the meanings of words that people use.
ConceptNet originated from the crowdsourcing project Open Mind Common Sense, which was launched in 1999 at the MIT Media Lab. It has since grown to include knowledge from other crowdsourced resources, expert-created resources, and games with a purpose.
Examples
To explore what's in ConceptNet, try browsing what it knows about any of these terms:
- en graph
- en knowledge
- en learn
- en natural language
- en semantic network
- mul 💡
Word vectors and recent publications
ConceptNet is used to create word embeddings -- representations of word meanings as vectors, similar to word2vec, GloVe, or fastText, but better.
These word embeddings are free, multilingual, aligned across languages, and designed to avoid representing harmful stereotypes. Their performance at word similarity, within and across languages, was shown to be state of the art at SemEval 2017.
The process for learning these word vectors is described in our AAAI 2017 paper, which also shows state-of-the-art results on solving analogy problems.
Support and discussion
Detailed documentation about ConceptNet appears on its GitHub wiki.
You can chat with ConceptNet developers and users on Gitter, or join the conceptnet-users mailing list.
Updates to ConceptNet and its supporting technologies appear on the ConceptNet blog.
Linked Open Data API
{
"@id": "/a/[/r/UsedFor/,/c/en/example/,/c/en/explain/]",
"dataset": "/d/conceptnet/4/en",
"end": {
"@id": "/c/en/explain",
"label": "explain something",
"language": "en",
"term": "/c/en/explain"
},
"license": "cc:by/4.0",
"rel": {
"@id": "/r/UsedFor",
"label": "UsedFor"
},
"sources": [
{
"activity": "/s/activity/omcs/omcs1_possibly_free_text",
"contributor": "/s/contributor/omcs/pavlos"
}
],
"start": {
"@id": "/c/en/example",
"label": "an example",
"language": "en",
"term": "/c/en/example"
},
"surfaceText": "You can use [[an example]] to [[explain something]]",
"weight": 1.0,
"@context": [
"//api.conceptnet.io/ld/conceptnet5.7/context.ld.json",
"//api.conceptnet.io/ld/conceptnet5.7/pagination.ld.json"
]
}
ConceptNet is a proud part of the ecosystem of Linked Open Data.
As a modern Linked Open Data resource, the data in ConceptNet is available in a JSON-LD API, a format that aims to make linked data easy to understand and easy to work with. If you don't care what JSON-LD is, it's just a JSON REST API with some extra metadata.
You can use ExternalURL links in ConceptNet to find the same terms in other vocabularies, such as WordNet, DBPedia, and OpenCyc, which can provide you with other forms of information.
For information on how to use the ConceptNet API, see the API documentation. Or just start browsing it and you'll probably figure it out.
Sources of knowledge
Previous versions of ConceptNet were a home-grown crowd-sourced project, where we ran a Web site (Open Mind Common Sense) collecting facts from people who came to the site. The Web of Data is much bigger than that now. Our data comes from many different sources, some of which you can contribute to and improve not just the state of computational knowledge, but of human knowledge.
- ConceptNet 5, like previous versions, contains the relational knowledge contributed to Open Mind Common Sense and its sister projects in other languages.
- We connect to a subset of DBPedia, which extracts knowledge from the infoboxes on Wikipedia articles.
- Much of our knowledge comes from Wiktionary, the free multilingual dictionary. This gives us information about synonyms, antonyms, translations of concepts into hundreds of languages, and multiple labeled word senses for many words.
- More dictionary-style knowledge comes from Open Multilingual WordNet.
- We imported a high-level ontology from OpenCyc (by Cycorp, formerly hosted at cyc.com).
- Some knowledge about people's intuitive word associations comes from "games with a purpose". We have learned facts in English from Verbosity, a word game formerly run by the GWAP project, and in Japanese from the "nadya.jp" game by Nihon Unisys and Dentsu.
If you believe a term should be understood by ConceptNet, the most straightforward way to add it to a future build is to add information about that term to Wiktionary, following their guidelines.
Attributing ConceptNet
To give proper attribution to ConceptNet's data, we suggest this text:
This work includes data from ConceptNet 5, which was compiled by the Commonsense Computing Initiative. ConceptNet 5 is freely available under the Creative Commons Attribution-ShareAlike license (CC BY SA 4.0) from https://conceptnet.io. The included data was created by contributors to Commonsense Computing projects, contributors to Wikimedia projects, Games with a Purpose, Princeton University's WordNet, DBPedia, OpenCyc, and Umbel.
A paper you can cite about ConceptNet is:
Robyn Speer, Joshua Chin, and Catherine Havasi. 2017. "ConceptNet 5.5: An Open Multilingual Graph of General Knowledge." In proceedings of AAAI 31.
Development
Development of ConceptNet takes place as an open-source project of Luminoso Technologies, Inc. The code that builds and powers ConceptNet is available on GitHub.
ConceptNet originated at the MIT Media Lab, and became part of the Commonsense Computing Initiative, a collaboration between MIT and other labs and companies around the world. This global collaboration helps us collect relational knowledge in many languages. The Commonsense Computing Initiative was founded by Catherine Havasi, now the CEO of Luminoso.
The development of ConceptNet 5 is led by Robyn Speer, a Luminoso co-founder, with contributions from several other people.