Version 2.14.0
general
- add option to use fully predefined indices in resampling (
makeResampleDesc(fixed = TRUE)
) (@pat-s, #2412). Task
help pages are now split into separate ones, e.g.RegrTask
,ClassifTask
(@pat-s, #2564)
functions - new
deleteCacheDir()
: Clear the default mlr cache directory (@pat-s, #2463)getCacheDir()
: Return the default mlr cache directory (@pat-s, #2463)
functions - general
getResamplingIndices(inner = TRUE)
now correctly returns the inner indices (before inner indices referred to the subset of the respective outer level train set) (@pat-s, #2413).
filter - general
- Caching is now used when generating filter values.
This means that filter values are only computed once for a specific setting and the stored cache is used in subsequent iterations.
This change inherits a significant speed-up when tuningfw.perc
,fw.abs
orfw.threshold
.
It can be triggered with the newcache
argument inmakeFilterWrapper()
orfilterFeatures()
(@pat-s, #2463).
filter - new
- praznik_JMI
- praznik_DISR
- praznik_JMIM
- praznik_MIM
- praznik_NJMIM
- praznik_MRMR
- praznik_CMIM
- FSelectorRcpp_gain.ratio
- FSelectorRcpp_information.gain
- FSelectorRcpp_symuncert
Additionally, filter names have been harmonized using the following scheme: _.
Exeptions are filters included in base R packages.
In this case, the package name is omitted.
filter - general
-
Added filters
FSelectorRcpp_gain.ratio
,FSelectorRcpp_information.gain
andFSelectorRcpp_symmetrical.uncertainty
from packageFSelectorRcpp
.
These filters are ~ 100 times faster than the implementation of theFSelector
pkg.
Please note that both implementations do things slightly different internally and theFSelectorRcpp
methods should not be seen as direct replacement for theFSelector
pkg. -
filter names have been harmonized using the following scheme: _. (@pat-s, #2533)
information.gain
->FSelector_information.gain
gain.ratio
->FSelector_gain.ratio
symmetrical.uncertainty
->FSelector_symmetrical.uncertainty
chi.squared
->FSelector_chi.squared
relief
->FSelector_relief
oneR
->FSelector_oneR
randomForestSRC.rfsrc
->randomForestSRC_importance
randomForestSRC.var.select
->randomForestSRC_var.select
randomForest.importance
->randomForest_importance
-
fixed a bug related to the loading of namespaces for required filter packages (@pat-s, #2483)
learners - new
- classif.liquidSVM (@PhilippPro, #2428)
- regr.liquidSVM (@PhilippPro, #2428)
learners - general
- regr.h2o.gbm: Various parameters added,
"h2o.use.data.table" = TRUE
is now the default (@j-hartshorn, #2508) - h2o learners now support getting feature importance (@markusdumke, #2434)
learners - fixes
- In some cases the optimized hyperparameters were not applied in the performance level of a nested CV (@berndbischl, #2479)
featSel - general
- The FeatSelResult object now contains an additional slot
x.bit.names
that stores the optimal bits - The slot
x
now always contains the real feature names and not the bit.names - This fixes a bug and makes
makeFeatSelWrapper
usable with custombit.names
. - Fixed a bug due to which
sffs
crashed in some cases (@bmihaljevic, #2486)