iml-wg · rustil · Apr 12, 2018
diff --git a/benchmarks/README.md b/benchmarks/README.md
@@ -0,0 +1,25 @@
+# ROOT-C++-Python - Benchmarking, comparing, best practices
+
+## problem to solve
+More and more people see the benefits of using ML techniques and, alongside or apart from that, of taking advantage of the large data science ecosystem (scipy, numpy, pandas, matplotlib and many more) in addition to their ROOT based analyses. Yet instead of adopting these tools, many remain cautious, mainly because:
+
+- people are not necessarily aware of easy ways to connect ROOT based data with python data science tools
+- people fear that using python will be _significantly_ slower than the ROOT based approach
+  - strongly connected to that: people are not necessarily aware of how to parallelise in python
+
+## desired outcome
+The best case scenario would be to come out of this hackathon with a comprehensive but simple presentation (mini-tutorial) that shows best practices for integrating non-ROOT tools into an overall ROOT based analysis, explains how to transfer data between the ecosystems, and includes some performance comparisons between the different approaches.
+In short, the goal is a talk that can be used to ease the fear of stepping outside of a purely ROOT based analysis and that gives concrete starting points for doing so.
+
+I think that the workload of this project would be threefold:
+- actually compute some performance comparisons
+- search for performance comparisons, tutorials, and talks on the topic that already exist, and add them to the repository (for an example of such a collection, see e.g. https://github.com/ChristosChristofidis/awesome-deep-learning )
+- compile a talk (maybe a notebook, maybe something else) with a high pedagogical value :)
+
+## skills / knowledge needed (for the project, not per person)
+- didactic skills
+- literature research skills
+- some programming skills
+- ROOT
+- other data storage solutions
+