Time series and event data is becoming huge for every business — in terms of volume, and importance of deriving real-time insights. In this talk I share the story of developing and productionizing FiloDB, an open source time series solution built with the Scala, Akka, Kafka, and Cassandra reactive stack. I will share how our solution fits into the real-time, time series world and what works and doesn't work with Akka and Akka Cluster at scale and in production. What are the benefits and challenges of off-heap, in-memory data processing at scale? How does one implement persistent, recoverable data at scale with Kafka and Cassandra?
SHARE THIS TALK
Senior Software Engineer
Evan loves to design, build, and improve bleeding edge distributed data and backend systems using the latest in open source technologies. He is the creator of the FiloDB open-source distributed time-series database, as well as the Spark Job Server. He has led the design and implementation of multiple big data platforms based on Storm, Spark, Kafka, Cassandra, and Scala/Akka, including a columnar real-time distributed query engine. He has been an active contributor to the Apache Spark project, and a two-time Datastax Cassandra MVP. He has built Spark applications since Spark 0.8, Cassandra since 0.6. He is a big believer in GitHub, open source, and meetups, and have given talks at various conferences including Strata, Spark Summit, Cassandra Summit, FOSS4G, and Scala Days. He has a Bachelors and Masters of Electrical Engineering, with distinction, from Stanford University. In his spare time he is a family man, photographer, foodie, avid Oakland Athletics fan, and committed follower of Jesus.