Thursday, November 16 • 11:40am - 12:20pm
Building a High-Performance Database with Scala, Akka, and Spark

#distributedsystems #scala #akka #spark #FiloDB #cassandra

Scala and its large ecosystem of libraries are increasingly used to build highly scalable, performant data systems. In this talk, I share years of experience building high-performance data systems using Scala, Akka, and Spark, plus recent experience building FiloDB, a high-performance analytics database built on these technologies. How do we balance Scala and functional programming with very high performance demands? What should you watch out for when writing very fast Scala code?
  • Why build a new database for streaming applications?
  • Why Scala and Akka make a great foundation for building a database
  • When to use Futures, Actors, Reactive Streams
  • Using Akka Cluster to coordinate and implement distributed ingestion
  • Monix and use of reactive streams
  • Reactive/async tracing and production metrics
  • Filo: summing integers at billions of ops per second, taking advantage of processor cache and SIMD with super fast vector operations
  • Serialization, GC, and off-heap: how to leverage binary data structures for the win
  • JVM method dispatch, inlining, and writing lots of small methods

Speakers
Evan Chan

Senior Software Engineer
Evan loves to design, build, and improve bleeding-edge distributed data and backend systems using the latest in open source technologies. He is the creator of the FiloDB open-source distributed database, as well as the Spark Job Server. He has led the design and implementatio...


Thursday November 16, 2017 11:40am - 12:20pm
Data
