A Basket Full of Snakes: Python Modules for Data Science
Anyone who has read my earlier blog posts knows that I am a big fan of both R and Python in my daily work.
As powerful as R is for data analysis and modeling, the enthusiasm fades quickly during heavy number crunching once RAM usage hits its limit.
In that situation, a well-equipped server with plenty of hardware (e.g. 96 GB of RAM) works wonders.
Since that option is not always available, I have made a virtue of necessity and turned to the more performant route: the Python-based alternatives to R. This was an easy step, as I have been using Python for ETL jobs and data preparation for a long time.