Example: Similarity of "Barock", "Dresden", "University"
```
Dresden ==>> [1,0,0]
Barock ==>> [0,1,0]
University ==>> [0,0,1]
```
Notes:
- Each word is assigned its own axis in a 3-dimensional vector space, so the three word vectors are mutually orthogonal.
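
As a minimal sketch, the assignment above can be checked numerically. The helper `cosine_similarity` and the dictionary `word_vectors` are illustrative names, not part of the original notes. With one axis per word, a word compared with itself scores 1 and any two different words score 0.

```python
import math

def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

# One axis per word, exactly as in the example above.
word_vectors = {
    "Dresden": [1, 0, 0],
    "Barock": [0, 1, 0],
    "University": [0, 0, 1],
}

# A word compared with itself.
sim_same = cosine_similarity(word_vectors["Dresden"], word_vectors["Dresden"])

# Two different words: the vectors share no axis, so the dot product is 0.
sim_diff = cosine_similarity(word_vectors["Dresden"], word_vectors["Barock"])

print(sim_same, sim_diff)
```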
---
Cosine Similarity Issues:
- Within a given document, repeated words should not be counted too heavily.
- Some words are simply very common and appear in many documents (“the”, “and”, “or”, etc.).
- Syntactic structure of sentences is lost.
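
The issues listed above can be reproduced with a small sketch, assuming documents are represented by raw word counts (a plain bag of words). The function names and sample sentences are made up for illustration. The first pair differs only in word order yet gets a similarity of about 1; the second pair shares only the repeated word "the" yet still scores above 0.9.

```python
import math
from collections import Counter

def bag_of_words(text):
    """Raw word counts; word order is discarded at this step."""
    return Counter(text.lower().split())

def cosine_similarity(a, b):
    """Cosine similarity between two non-empty count vectors."""
    shared = a.keys() & b.keys()
    dot = sum(a[w] * b[w] for w in shared)
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    return dot / (norm_a * norm_b)

# Syntactic structure is lost: opposite meanings, identical counts.
order_sim = cosine_similarity(
    bag_of_words("the dog bites the man"),
    bag_of_words("the man bites the dog"),
)

# A repeated, very common word dominates: the only shared word is "the".
common_sim = cosine_similarity(
    bag_of_words("the the the the church"),
    bag_of_words("the the the the engine"),
)

print(order_sim, common_sim)
```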
---