autoCaret is a R
package for helping business analysts and other enthusiasts understand how to begin building predictive models in R
via automation. View our intro presentation!
It leverages and wraps underlying features and functionality provided by both the caret, (short for Classification And REgression Training) and caretEnsemble R
packages in an effort to provide a simple programmatic interface for analysts who would like to begin working on binary classification problems .
Also included in the package is also an intuitive graphical interface - in the form of an RStudio Add-In - that allows for an easy introduction into the package’s main functionality - producing an ensemble model via autocaret::automodel()
- in order to help speed the learning and development process.
autoCaret
?We fundamentally believe that the best ideas and concepts in machine learning are simple but that current literature and accessibility to “getting started” with machine learning sometimes puts walls up against these ideas.
We think:
Machine learning is a field that will only continue to pervade modern life. We think that additional tools need to be built to get analysts, who might not have much experience using R
or other programming languages, engaged and excited about using machine learning in their day to day.
Why is this so important?
While R
tools like caret
have brought us a long way in the effort to standardize many of the commonly repeated parts of the process required for building predictive models, there is no reason we can’t further streamline this process.
Python tools like TpoT have attempted to do this using genetic programming. Additionally there are a number of proprietary tools built by companies like BigMl and DataRobot that also seek to automate machine learning tasks.
The autoCaret
package intends to take an analagous but simpler approach:
summary()
and predict()
autoCaret
fit?Both sourcing raw data and cleaning it are the responsibility of the end user. Otherwise, many of the most tedious parts of the predictive model process are covered by autoCaret
! Be sure to see the getting started guide included with the package or explore other examples shown on this page.
There are too many potential possibilities for a R
package like autoCaret
to be able to provide functionality that would provide acceptable performance or even begin to be able to automate the data cleaning process. There are, however; a great wealth of tools that do help and should be explored – the github page RStartHere is a great place to get begin getting acquainted!