Discussion on data mining pitfalls
After a few comments on the post Garbage in, garbage out, I find interesting to discuss more precisely about existing pitfalls when applying data mining techniques. I warmly encourage you to give your ideas. Here are two possible pitfalls that I have now in mind:
- Underfitting/overfitting
- Data preparation (i.e. normalization, etc.)