What is Ascent++?

Ascent++ KB is a commonsense knowledge base (CSKB) constructed from the C4 crawl using the Ascent++ pipeline. It consists of 2 million CSK assertions about 10K popular concepts, presented in the established ConceptNet schema with 19 predicates (e.g., AtLocation, CapableOf, HasProperty, etc.). As of 2022, it presents the highest-quality automated CSKB, both in terms of precision, and in terms of ranked recall.

Ascent++, a successor of Ascent, is a pipeline for automatically collecting, extracting and consolidating commonsense knowledge (CSK) from any English text corpus. Ascent++ is capable of extracting facet-enriched assertions, overcoming the common limitations of the triple-based knowledge model in traditional knowledge bases (KBs). Ascent++ also captures composite concepts with subgroups and related aspects, supplying even more expressiveness to CSK assertions.

This web interface allows you to browse the CSK assertions in the Ascent++ KB.

Examples

To explore what our CSKB captures, try out the following concepts:

Download

You can download 2 million CSK assertions in our KB here: ascentpp.csv.tar.gz.

Code

Code is stored in this Github repository.

Citation

Main publication
Tuan-Phong Nguyen, Simon Razniewski, Julien Romero, and Gerhard Weikum. 2022. Refined Commonsense Knowledge from Large-Scale Web Contents. IEEE Transactions on Knowledge and Data Engineering (2022). [pdf] []
@ARTICLE{ascentpp,
    author={Nguyen, Tuan-Phong and Razniewski, Simon and Romero, Julien and Weikum, Gerhard},
    journal={IEEE Transactions on Knowledge and Data Engineering},
    title={Refined Commonsense Knowledge from Large-Scale Web Contents},
    year={2022},
    doi={10.1109/TKDE.2022.3206505}
}
Related publications
Tuan-Phong Nguyen, Simon Razniewski, and Gerhard Weikum. 2021. Advanced Semantics for Commonsense Knowledge Extraction. The Web Conference 2021 (WWW '21). [pdf] []
@inproceedings{ascent,
    author = {Nguyen, Tuan-Phong and Razniewski, Simon and Weikum, Gerhard},
    title = {Advanced Semantics for Commonsense Knowledge Extraction},
    year = {2021},
    isbn = {9781450383127},
    publisher = {Association for Computing Machinery},
    address = {New York, NY, USA},
    doi = {10.1145/3442381.3449827},
    booktitle = {Proceedings of the Web Conference 2021},
    pages = {2636–2647},
    numpages = {12},
    location = {Ljubljana, Slovenia},
    series = {WWW '21}
}
Hiba Arnaout, Simon Razniewski, Gerhard Weikum, and Jeff Z. Pan. 2022. UnCommonSense: Informative Negative Knowledge about Everyday Concepts. 31st ACM International Conference on Information and Knowledge Management (CIKM '22). [pdf] []
@inproceedings{uncommonsense,
    author = {Arnaout, Hiba and Razniewski, Simon and Weikum, Gerhard and Pan, Jeff Z.},
    title = {UnCommonSense: Informative Negative Knowledge about Everyday Concepts},
    year = {2022},
    isbn = {9781450392365},
    publisher = {Association for Computing Machinery},
    address = {New York, NY, USA},
    doi = {10.1145/3511808.3557484},
    booktitle = {Proceedings of the 31st ACM International Conference on Information & Knowledge Management},
    pages = {37–46},
    numpages = {10},
    keywords = {knowledge base, commonsense, negation},
    location = {Atlanta, GA, USA},
    series = {CIKM '22}
}
Julien Romero, Simon Razniewski, Koninika Pal, Jeff Z. Pan, Archit Sakhadeo, and Gerhard Weikum. 2019. Commonsense Properties from Query Logs and Question Answering Forums. 28th ACM International Conference on Information and Knowledge Management (CIKM '19). [pdf] []
@inproceedings{quasimodo,
    author = {Romero, Julien and Razniewski, Simon and Pal, Koninika and Z. Pan, Jeff and Sakhadeo, Archit and Weikum, Gerhard},
    title = {Commonsense Properties from Query Logs and Question Answering Forums},
    year = {2019},
    isbn = {9781450369763},
    publisher = {Association for Computing Machinery},
    address = {New York, NY, USA},
    doi = {10.1145/3357384.3357955},
    booktitle = {Proceedings of the 28th ACM International Conference on Information and Knowledge Management},
    pages = {1411–1420},
    numpages = {10},
    keywords = {common-sense knowledge acquisition, web mining, information extraction},
    location = {Beijing, China},
    series = {CIKM '19}
}