About this Talk

This tutorial provides an introduction to building knowledge graphs by using open source libraries in Python. We’ll introduce the key concepts and abstractions, discuss engineering trade-offs, and emphasize hands-on coding exercises.

The coding exercises are based on progressive examples based on managing the content for a website, which illustrate how to integrate the use of:

rdflib - RDF triples, SPARQL queries, serialization
arrow - Parquet serialization of RDF graphs
networkx - graph algorithms
pyvis - interactive visualization
gensim - embedding (clean up annotations)
pslpython - probabilistic soft logic, to apply rules for graph-based inference, link prediction, testing data quality of annotations, etc.

Plus related use of pandas, numpy, matplotlib, pylev, and other libraries that help with building and analyzing KGs in open source Python.

We will work in Jupyter notebooks, available from a public repository on GitHub, which can be run locally. Semantic technologies used within these examples include OWL, FOAF, XSD (for literals), and some SKOS, which are represented in Turtle and JSON-LD formats.

Participants are encouraged to ask questions throughout the lectures, exercises, and during breaks.

Prerequisites:

Some coding experience in Python (you can read a 20-line program)
Interest in use cases that require knowledge graph representation

Preparation before class:

git clone https://github.com/DerwenAI/kglab.git
fill out the online survey https://forms.gle/uB9p7XBjWutR2fHd7
join our Slack channel for the class https://knowledgeconnexions.slack.com/archives/C01F95PAL31

Audience:

Python developers who need to work with KGs
Data Scientists and Machine Learning Engineers
Technical Leaders who want hands-on KG implementation experience

Key Takeaways:

Hands-on experience with popular open source libraries in Python for building KGs
Coding examples that can be used as starting points for your own KG projects
Understanding trade-offs for different approaches to building KGs

Knowledge Graphs Masterclasses

About The Speakers

Paco Nathan

Managing Partner, Derwen, Inc.

Known as a "player/coach", with core expertise in data science, natural language, machine learning, cloud computing; 38+ years tech industry experience, ranging from Bell Labs to early-stage start-ups. Advisor for Amplify Partners, IBM Data Science Community, Recognai, KUNGFU.AI, Primer. Lead committer PyTextRank. Formerly: Director, Community Evangelism @ Databricks and Apache Spark. Cited in 2015 as one of the Top 30 People in Big Data and Analytics by Innovation Enterprise.

François Scharffe

Knowledge Connector / Founder, Knowledge Connexions / KGC / The Data Chefs

On a mission to show the value of knowledge engineering.

KG 101: Hands-on coding in Python

This Talk Has Finished

About this Talk

Prerequisites:

Preparation before class:

Audience:

Key Takeaways:

About The Speakers

KG 101: Hands-on coding in Python

This Talk Has Finished

About this Talk

Prerequisites:

Preparation before class:

Audience:

Key Takeaways:

About The Speakers

We use cookies

We use cookies