Example: Spam filtering
| viagra | learning | the | dating | nigeria | spam? | |
|---|---|---|---|---|---|---|
| X1 | 1 | 0 | 1 | 0 | 0 | $y_1$ = 1 |
| X2 | 0 | 1 | 1 | 0 | 0 | $y_2$ = -1 |
| X3 | 0 | 0 | 0 | 0 | 1 | $y_3$ = 1 |
d features (words and other things, d is approximately 100,000)1.
i as a weight w_i$

+ from - using a line.
i:


A and B than of C.

w is good according to intuition, theory, and practice.


Gamma is the distance from point A to the linear separator L: d(A,L) = |AH| w.w, |w|, is equal to one, that bring us to the result in the slide.w is chosen.For the ith data point:
.

w:


