# Set Theory

• Lesson title: Set Theory
• Keywords: Set Theory
• Objective/Description: In this lesson we will introduce set theory (how sets are built and how they can be used) and how it connects to graphs. We will then apply this knowledge on sets and graphs to semantic similarity and corresponding measures.
• Learning outcomes:
• Understanding what a set is and how they are used in science and other general domains
• Explaining how to build a set and how to perform basic operations on it
• Explaining how data properties are used to form sets and then do calculations on them (e.g. similarity)
• Explaining what semantic similarity is with examples illustrating its usefulness
• Distinguishing, comparing and using different approaches to semantic similarity

## Resources

These Q&A aim to validate a basic understanding of concepts discussed during the lecture. The use of such concepts is beyond the scope of this Q&Astudy set as it is addressed via projects and hands-on for those enrolled in the corresponding university course.

What is a set?

A collection of distinct unordered elements where all satisfy some specific common conditions/characteristics.
For example, the set of vertebrate animals comprises all those distinct animals that belong to the chordate subphylum Vertebrata.

How is the set theory relevant to semantics

Semantic resources can be seen as sets and can be analyzed using methods from the set theory.
For instance, we can talk about subclasses is the way as we talk about subsets.

What are some common operations between sets?

A union B, containing all distinct elements from A and B.
A intersection B, containing only the elements that appear in both A and B at the same time.

What is semantic similarity?

Similarity across two entities based on their semantics/semantic charecteristics.

Name the different types of semantic similarity

Semantic identity: two terms representing exactly the same concept.
Semantic similarity: two terms that are similar two each other so term A can replace term B withing the same context, e.g., sterling and euro are both currencies.
Semantic relatedness: two terms can be used in the same context but correspond to different functionalities, e.g., drink, dessert, meal.
Association: two terms relate to each other by any sort of relation, e.g., student is related to an educational institution.

Name some similarity measures

Jaccard index, number of shared terms between A and B regarding the total number of exisisting terms in A and B.
Jaccard distance, 1 - Jaccard index, it measures the dissimilarity of two sets.
Tanimoto index, similar to the Jaccard index but both A and B must have the same size.
Cosine similarity, measure of the angle between two vectors in the same space.

'Set Theory' 'Set Theory' 'In this lesson we will introduce set theory (how sets are built and how they can be used) and how it connects to graphs. We will then apply this knowledge on sets and graphs to semantic similarity and corresponding measures.' - '"Understanding what a set is and how they are used in science and other general domains"' - '"Explaining how to build a set and how to perform basic operations on it"' - '"Explaining how data properties are used to form sets and then do calculations on them (e.g. similarity)"' - '"Explaining what semantic similarity is with examples illustrating its usefulness"' - '"Distinguishing, comparing and using different approaches to semantic similarity"'