The joblib Memory
class is a utility class that facilitates caching of function or method results to disk. We create a Memory
object by specifying a caching directory. We can then decorate the function to cache or specify methods to cache in a class constructor. If you like, you can specify the arguments to ignore. The default behavior of the Memory
class is to remove the cache any time the function is modified or the input values change. Obviously, you can also remove the cache manually by moving or deleting cache directories and files.
In this recipe, I describe how to reuse a scikit-learn regressor or classifier. The naïve method would be to store the object in a standard Python pickle or use joblib. However, in most cases, it is better to store the hyperparameters of the estimator.
We will use the ExtraTreesRegressor
class as estimator. Extra trees (extremely randomized trees) are a variation of the random forest algorithm, which is covered in the Learning with...