R and Python.

Python + Pandas + scikit-learn:

Pros:
* scikit-learn offers better performance for extremely large data sets
* Large-data manipulation tools
* Fairly good set of ML algorithms

Cons:
* High barrier to entry, in terms of skill and knowledge
* Fairly labor intensive to create a large number of experiments

R + caret:

Pros:
* Very large selection of ML algorithms (200-plus, so many that the choice can be paralyzing)
* Good performance, unless the data set is extremely large

Cons:
* High barrier to entry
* Data manipulation is a pain; you will probably want another tool to pre-treat the data before loading it into R data frames
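
To make the last point concrete, here is a minimal sketch of what "pre-treating the data with another tool" might look like, using only the Python standard library. Nothing in it comes from the review itself: the function names `pretreat_rows` and `pretreat_csv`, the list of missing-value tokens, and the sample data are all hypothetical and would need adjusting for a real data set.

```python
import csv
import io

# Hypothetical tokens treated as "missing"; adjust for the real data.
MISSING_TOKENS = {"", "n/a", "na", "null", "none", "?", "-"}


def pretreat_rows(rows):
    """Yield cleaned rows: tidy header names, strip whitespace, mark missing as "NA".

    `rows` is an iterable of lists of strings, with the header row first.
    """
    rows = iter(rows)
    header = next(rows, None)
    if header is None:
        return  # empty input, nothing to clean
    header = [name.strip().lower().replace(" ", "_") for name in header]
    yield header
    for row in rows:
        # Drop rows whose column count does not match the header.
        if len(row) != len(header):
            continue
        cleaned = [cell.strip() for cell in row]
        yield ["NA" if cell.lower() in MISSING_TOKENS else cell for cell in cleaned]


def pretreat_csv(text):
    """Return cleaned CSV text for the given raw CSV text."""
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    writer.writerows(pretreat_rows(csv.reader(io.StringIO(text))))
    return out.getvalue()


# Made-up sample with stray whitespace, missing markers, and a short row.
raw = "Age, Income ,City\n 34, 52000 ,Lisbon\n?, n/a ,Porto\n41,61000\n"
cleaned = pretreat_csv(raw)
print(cleaned)
```

Writing missing values as the literal string `NA` is a deliberate choice here: R's `read.csv` treats `NA` as missing by default, so the cleaned file should load into a data frame without extra arguments.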
