<div dir="ltr">Hey Tom,<div><br></div><div>If I were looking to start something new, I'd look hard into docker. Spark and Storm are neat if you like stream processing. Kafka is a niche. It needs to mature or it will be replaced.</div><div><br></div><div>$0.02.</div><div><br></div><div>Good luck.</div></div><div class="gmail_extra"><br><div class="gmail_quote">On Fri, Nov 21, 2014 at 10:56 PM, Tom Hanlon <span dir="ltr"><<a href="mailto:tom@functionalmedia.com" target="_blank">tom@functionalmedia.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">Thanks Chris,<br>
<br>
As a hadoop instructor of 4 years, I was looking to get into something<br>
more interesting than the jumble of tools, and focus on one tool.<br>
<br>
Kafka, storm and spark are all candidates, and perhaps docker.<br>
<br>
My plan is to learn one of them well enough to teach it. Kafka and<br>
Docker and Storm seem to be in demand, but not well served with<br>
training. Spark is in demand, but some training exists.<br>
<br>
Your summary is useful, I think I will read what documentation is out<br>
there, and have a look at the code and see if I like it well enough to<br>
keep learning.<br>
<br>
--<br>
Tom<br>
<div class="HOEnZb"><div class="h5"><br>
<br>
<br>
On Fri, Nov 21, 2014 at 9:42 PM, Chris Embree <<a href="mailto:cembree@ez-as.net">cembree@ez-as.net</a>> wrote:<br>
> 8-o So many questions.<br>
><br>
> List is fine, nothing secret here, just my opinions which are now<br>
> worth slightly less than you paid for them.... sorry.<br>
><br>
> We run a limited size cluster due to physical limits. That said, it's<br>
> anywhere from 10-14 kafka nodes. Each w/ 2 dedicated 10k disks. GC<br>
> hasn't shown up as an issue so far, but it might be the culprit behind<br>
> a couple of anomalous issues.<br>
><br>
> Generally, as a cluster it seems somewhat immature. It works well<br>
> when it works well, otherwise things get ugly.<br>
><br>
> It can use JMX for monitoring, but management tools are somewhat<br>
> limited. One of the smart guys on my team built a tool, dubbed Kurator<br>
> (play on ES tools) that uses some Python Kafka APIs to provide some<br>
> insight. However, it relies heavily on Zookeeper status and Kafka<br>
> telling the truth. We've seen a few issues that raise doubt about<br>
> Kafka's agreement w/ ZK on what's real.<br>
><br>
> HOWEVER: Our use case is extremely abusive. We're looking for 1.2M<br>
> 1K transactions per second. If you are anywhere south of 100K/s<br>
> chances are extremely good you can construct a highly reliable Kafka<br>
> Cluster.<br>
><br>
> On the fence: We've had little luck re-allocating partitions to<br>
> recover from a lost node. Listing Kafka topics will show you the #<br>
> and nodes hosting In Sync Replicas (ISR). The re-balance feature is<br>
> somewhat new and lightly documented, at least at last Google. I've<br>
> had little luck re-syncing after a node loss.<br>
><br>
> Kafka is a minor part of our solution in the grand scheme of things.<br>
> I feel ill equipped to give a talk on the subject in any reasonable<br>
> depth.<br>
><br>
> That said, I will be speaking at Cisco Live Milan (EMEA). I'd be<br>
> happy to re-present at a COLUG if there is any value. The talk is a 4<br>
> hour tech session (I'm only 1 of 3 speakers) on the entire openSOC<br>
> project. My focus will be on the platform side and it may not be the<br>
> best fit for a LUG. It's more of a HUG topic.<br>
><br>
> FWIW, I absolutely hate the name Hadoop. ;)<br>
><br>
> I hope that helps.<br>
><br>
> Chris<br>
><br>
> On 11/21/14, Tom Hanlon <<a href="mailto:tom@functionalmedia.com">tom@functionalmedia.com</a>> wrote:<br>
>> Chris,<br>
>><br>
>> Can you talk about it on the list ? If not maybe we can send some<br>
>> private emails.<br>
>><br>
>> How big is the kafka cluster ? How many events are handled?<br>
>><br>
>> What are the details of the hiccups ? Java Garbage collection?<br>
>> Configuration changes ? General strangeness ?<br>
>><br>
>> Does it provide any hooks for monitoring or managing? Nagios for<br>
>> monitoring ? Some api hooks for management ?<br>
>><br>
>> Thanks,<br>
>> Tom<br>
>><br>
>><br>
>> On Fri, Nov 21, 2014 at 1:17 PM, Chris Embree <<a href="mailto:cembree@ez-as.net">cembree@ez-as.net</a>> wrote:<br>
>>> Sadly, yes.<br>
>>><br>
>>> We're using Kafka as the buffering queue for OpenSOC (<a href="http://getopensoc.com" target="_blank">getopensoc.com</a>)<br>
>>> and while it works well when things are fine, it has significant<br>
>>> difficulty recovering from hiccups.<br>
>>><br>
>>> Also, there are few tools for managing it from an Admin point of view.<br>
>>> Deleting a topic is a non-trivial task, for example.<br>
>>><br>
>>> Chris<br>
>>><br>
>>> On 11/21/14, Tom Hanlon <<a href="mailto:tom@functionalmedia.com">tom@functionalmedia.com</a>> wrote:<br>
>>>> Colug,<br>
>>>><br>
>>>> Are there any kafka users on this list?<br>
>>>><br>
>>>> <a href="http://kafka.apache.org/" target="_blank">http://kafka.apache.org/</a><br>
>>>><br>
>>>> I am looking to dive into kafka and some use-case, war-story,<br>
>>>> discussion with a user would be helpful.<br>
>>>><br>
>>>> If there is broader interest perhaps we can make a meeting<br>
>>>> presentation out of it.<br>
>>>><br>
>>>> Thanks,<br>
>>>><br>
>>>> Tom<br>
>>>> _______________________________________________<br>
>>>> colug-432 mailing list<br>
>>>> <a href="mailto:colug-432@colug.net">colug-432@colug.net</a><br>
>>>> <a href="http://lists.colug.net/mailman/listinfo/colug-432" target="_blank">http://lists.colug.net/mailman/listinfo/colug-432</a><br>
>>>><br>
>><br>
</div></div></blockquote></div><br></div>