Processing with DRILLTHROUGH enabled will typically take just a few seconds
more than without the flag. The algorithm will make an extra pass over the
training data and save the associations between training cases and tree
nodes.
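
For reference, here is a minimal sketch of how drillthrough is typically enabled and then used to read the cases behind a node. The column names (CustomerKey, Age, BikeBuyer) and the choice of Microsoft_Decision_Trees are hypothetical, picked only for illustration; MyModel and the node placeholder are the same ones used elsewhere in this post. Adapt everything to your own model definition.

```sql
-- Hypothetical model definition; the columns are illustrative assumptions.
-- WITH DRILLTHROUGH is what makes the extra pass store the
-- case-to-node associations during processing.
CREATE MINING MODEL [MyModel]
(
    [CustomerKey] LONG KEY,
    [Age]         LONG CONTINUOUS,
    [BikeBuyer]   LONG DISCRETE PREDICT
)
USING Microsoft_Decision_Trees
WITH DRILLTHROUGH

-- After the model has been processed, return the training cases
-- that were assigned to one particular tree node.
SELECT * FROM [MyModel].CASES
WHERE IsInNode('<Node_Unique_Name_for_your_target_node>')
```

The second statement only works on a model that was created with drillthrough enabled and has already been processed.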
An alternate way of getting the cases (if Drillthrough is not enabled):
SELECT T.* FROM MyModel NATURAL PREDICTION JOIN
OPENQUERY(...., <Original Training Data>) AS T
WHERE PredictNodeId(<Predictable_Column>) = '<Node_Unique_Name_for_your_target_node>'
--
This posting is provided "AS IS" with no warranties, and confers no rights.
Please do not send email directly to this alias. It is for newsgroup
purposes only.
thanks,
bogdan
Post by Dejan Sarka
Post by Jesse
One other thing: can you give me an idea of what the impact of specifying
WITH DRILLTHROUGH on model build time will be? For example, if I have
a model that took 6 hours to process with drillthrough turned off, how
long can I expect it to run with drillthrough enabled? Just an
estimate is all I'm looking for.
Jesse, I think enabling drillthrough is just a flag in the model
metadata; I do not think processing should take longer. However, just to be
sure, you might want to reprocess just a subset of the data and compare times
with and without drillthrough.
--
Dejan Sarka
http://blogs.solidq.com/EN/dsarka/default.aspx