Example: Spam filtering
| viagra | learning | the | dating | nigeria | spam? | |
|---|---|---|---|---|---|---|
| X1 | 1 | 0 | 1 | 0 | 0 | $y_1$ = 1 |
| X2 | 0 | 1 | 1 | 0 | 0 | $y_2$ = -1 |
| X3 | 0 | 0 | 0 | 0 | 1 | $y_3$ = 1 |
d features (words and other things, d is approximately 100,000)1.
i as a weight w_i$
+ from - using a line.
1
- Each example `i`:
1
- Inner product:
A and B than of C.
w is good according to intuition, theory, and practice.
Gamma is the distance from point A to the linear separator L: d(A,L) = |AH| w.w, |w|, is equal to one, that bring us to the result in the slide.w is chosen.
.
w: