I am trying to use rpart for a classification problem with 10,000 observations.
The dependent variable has 11 levels and I have 4 predictors: 2 continuous and
2
categorical (with 5 and 4 levels respectively). It runs fine if I take the
defaults, but if I use a loss matrix (with off-diagonal elements equal to the
absolute value of the misclassification error it just runs forever and never
finishes).
Am I being too ambitious here and the problem is just too big? (10,000 is 25%
of
the full dataset).
Thanks for any input you can provide.
(I'm using S-Plus 3.4 on Sun Solaris (2.0?) with 512M of RAM)
Carlos Alzola
calzola@apa.com
-----------------------------------------------------------------------
This message was distributed by s-news@wubios.wustl.edu. To unsubscribe
send e-mail to s-news-request@wubios.wustl.edu with the BODY of the
message: unsubscribe s-news
|