How's this for my topic on Thursday?

  Spark and the Resilient Distributed Dataset -
  Fast, robust, general-purpose cluster computing for data science

Anything in particular that folks want to know about Spark?

Cheers,

Neal McBurnett
http://neal.mcburnett.org/