A picture of Matthias Buehlmaier.

Matthias Buehlmaier

Matthias Buehlmaier is a principal lecturer in finance and the BBA(IBGM) program director at HKU Business School, University of Hong Kong (HKU). He is a winner of several teaching and research awards, e.g. the Outstanding Teaching Award and the Teaching Innovation Award granted by HKU. His research has appeared in the Review of Financial Studies (Oxford University Press) and has been featured in the Harvard Law School Forum on Corporate Governance and Financial Regulation.

http://www.buehlmaier.net

Clojask: Inviting Data Scientists to Distributed Computing

Clojask is a distributed dataframe with a focus on usability and scalability. On one hand, Clojask is simple to use so that data scientists without any distributed systems experience can use Clojask immediately. The API design is inspired by R's data.table and SQL, so the learning curve is flat. On the other hand, Clojask is optimized for larger-than-memory datasets. Memory overflow will not be a problem even for tasks with massive datasets. Both technical considerations are determined to attract and benefit users with prior data science experience to Clojure. In our session, we would like to cover topics such as a functionality walkthrough (with reference to R data.table and SQL), comparisons with Dask (in Python) and Spark as well as what Clojask can bring to the Clojure data science community.