Sean T Allen

How did I get here? Building Confidence in a Distributed Stream Processor

VP of Engineering @ Sendence

When we build a distributed application, how do we gain confidence that our results are correct? We can test our business logic over and over, but if the engine executing it isn't trustworthy, we can't trust our results.

How can we build trust in our execution engines? We need to test them. It's hard enough to test a stream processor that runs on a single machine; it gets even more complicated when that stream processor is distributed. As Kyle Kingsbury's Jepsen series has shown, we have a long way to go in creating tests that can provide confidence that our systems are trustworthy.

At Sendence, we're building a distributed streaming data analytics engine that we want to prove is trustworthy. This talk will focus on the various means we have come up with to create repeatable tests that let us start trusting that our system gives correct results. You’ll learn how to combine repeatable programmatic fault injection, message tracing, and auditing to create a trustworthy system. Together, we’ll move through the design process, repeatedly answering the questions “What do we have to do to trust this result?” and “If we get the wrong result, how can we determine what went wrong so we can fix it?” Hopefully you’ll leave this talk inspired to apply a similar process to your own projects.
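To give a flavor of how those pieces fit together, here is a minimal sketch in Python (illustrative only, not Sendence's actual tooling or engine code): a seeded random source makes the fault injection repeatable, every message carries an id so it can be traced, and an auditing step compares what the sink received against what the source sent.

```python
# Minimal sketch: repeatable fault injection + message tracing + auditing.
# All names here are hypothetical; the real system under test would be a
# distributed stream processor, not this toy single-process pipeline.
import random

def run_pipeline(messages, inject_faults, seed=42):
    """Push (msg_id, value) pairs through a toy 'double each value' stage,
    dropping or duplicating messages in a repeatable (seeded) way."""
    rng = random.Random(seed)          # same seed -> same faults on every run
    delivered = []
    for msg_id, value in messages:
        if inject_faults and rng.random() < 0.10:
            continue                   # simulate a dropped message
        delivered.append((msg_id, value * 2))
        if inject_faults and rng.random() < 0.05:
            delivered.append((msg_id, value * 2))  # simulate a duplicate
    return delivered

def audit(sent, received):
    """Use the trace ids to report which messages were lost or duplicated."""
    sent_ids = {msg_id for msg_id, _ in sent}
    seen = {}
    for msg_id, _ in received:
        seen[msg_id] = seen.get(msg_id, 0) + 1
    lost = sorted(sent_ids - seen.keys())
    dupes = sorted(m for m, n in seen.items() if n > 1)
    return lost, dupes

if __name__ == "__main__":
    source = [(i, i) for i in range(1, 101)]        # 100 traced messages
    out = run_pipeline(source, inject_faults=True)  # repeatable faulty run
    lost, dupes = audit(source, out)
    print(f"lost: {lost}")
    print(f"duplicated: {dupes}")
```

Because the faults are driven by a fixed seed, any failure the auditor reports can be reproduced exactly, which is what makes the test results debuggable rather than merely alarming.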

Talk objectives:

- Understand the need to verify distributed systems.
- Learn approaches and techniques for verifying distributed systems.
- Understand some of the particular challenges, and solutions, involved in verifying stream processing systems.

Target audience:

- Developers and architects interested in practical approaches to verifying correctness in distributed systems.