Wednesday, December 17, 2014

Packt’s $5 eBonanza returns

Just got these news from Packt (of course this includes our recently published book)

Following the success of last year’s festive offer, Packt Publishing will be celebrating the Holiday
season with an even bigger $5 offer. From Thursday 18th December, every eBook and video will be available on the publisher’s website for just $5. Customers are invited to purchase as many as they like before the offer ends on Tuesday January 6th, making it the perfect opportunity to try something new or to take your skills to the next  level as 2015 begins.

With all $5 products available in a range of formats and DRM-free, customers will find great value
content delivered exactly how they want it across Packt’s website this Xmas and New Year.

Find out more at


Media Contact:

About Packt Publishing

Founded in 2004 in Birmingham, UK, Packt’s mission is to help the world put software to work in new ways, through the delivery of effective learning and information services to IT professionals. Working towards that vision, we have published over 2000 books and videos so far, providing IT
professionals with the actionable knowledge they need to get the job done –whether that’s specific
learning on an emerging technology or optimizing key skills in more established tools.

As part of our mission, we have also awarded over $1,000,000 through our Open Source Project Royalty scheme, helping numerous projects become household names along the way.

Wednesday, November 26, 2014

Big Data Cartoon - What is text analytics?

Analytics may be the next big thing in Big Data, but it is very hard to define what it really is. Firstly, this word shows as misspelled in the browser and in Word or OpenOffice. Secondly, it's too vague and nebulous. As always, when in doubt, we turn to our illustrator, and our RK can illuminate us with a simple to understand cartoon that even data scientists can get.

Thursday, November 20, 2014

Big Data Cartoon - What's the latest and greatest in Hadoop?

What's the latest and greatest in Hadoop? Ask this question, and many people will say "Real-time" and point to Spark. Look at Berkeley's AMP labs two-day seminar going on right now, for example.

But what is Spark, really? What are those RDD's? They stand for Resilient Distributed Datasets, but is it any clearer? We asked our illustrator to clarify this, and hopefully we got it explained.

Wednesday, November 12, 2014

Announcing HBase Design Patterns Book

Happy to announce the "HBase Design Patterns" book, by Mark Kerzner and Sujee Maniyam. The book just went into production and can be pre-ordered using this link:

The book offers an HBase and NoSQL developer practical guidance in designing and implementing real-world applications. Subjects covered include

  • Various HBase install options
  • Single entity tables
  • Key generation
  • Storing large files
  • Dealing with time series data
  • Advanced modeling
  • Performance optimization
  • A number of labs and exercises

Based on the authors' own work, research and experience gained  in writing the open source book "Hadoop Illuminated." Oh, and did we forget to mention cartoons by RK? Each chapter has at least one.


Mark & Sujee

Tuesday, November 11, 2014

An excellent presentation by Rohit Jain about exciting new open source product Trafodion

Rohit Jain drove from Austin and presented Trafodion (Welsh for "Transaction"), pronounced "Travodion" - for those in the know. Rohit is an HP Database Distinguished and Chief Technologist. The breadth and depth of his knowledge is amazing.

In turn, the audience did not betray the expectations.  Houston is getting its Big Data people, by importing them, and people from Cloudera, Hortonworks and DataStax were all represented.

Pizza was sponsored by HP - thank you - and Rohit has already uploaded the slides to the Meetup. Here are the main slides, and the architecture

, with this note from Rohit: "There was interest in the Trafodion Distributed Transaction Management (DTM) architecture. However, it is a bit dated. Since this presentation, DTM has now been implemented as HBase co-processor code & THLOG has been integrated with the HBase HLOG."

My comment: I started Houston Hadoop Meetup in 2010, with the expectation of an imminent Big Data Boom in Houston. I am still expecting. This was the first meetup though where we had active Big Data professionals, but they were all imported, as I said, from Big Data companies. We are still yet to see native Houstonians and Houston companies doing Big Data. Again, it's coming, and our meetup is one of the focal points.

Monday, October 27, 2014

Big Data Cartoon: Big Data needs big muscle

Inspired possibly by this cartoon in New Yorker, our illustrator has set out to tell us that being in Big Data, you travel a lot, and of course avail yourself of the exercise facilities found in each and every hotel. My latest was a packed gym in downtown San Francisco.

Lately, I've been noticing that trainers at Elephant Scale have been gaining muscle weight.

Tuesday, September 30, 2014

Got an Ubuntu laptop!

Quite powerful and good-looking, from System76. (It is the one in the middle). Now I have a chance to be productive while traveling or working in friends' place.

I am planning to add Windows in a VM, stay tuned...