Monday, October 27, 2014

Big Data Cartoon: Big Data needs big muscle

Possibly inspired by this cartoon in the New Yorker, our illustrator set out to show that working in Big Data means traveling a lot and, of course, making use of the exercise facilities found in every hotel. My latest was a packed gym in downtown San Francisco.

Lately, I've been noticing that trainers at Elephant Scale have been gaining muscle weight.

Tuesday, September 30, 2014

Got an Ubuntu laptop!

It is quite powerful and good-looking, from System76 (it is the one in the middle). Now I have a chance to be productive while traveling or working at a friend's place.

I am planning to add Windows in a VM, stay tuned...

Sunday, September 7, 2014

Big Data Cartoon: NY is new Silicon Valley

Silicon Valley may be the leader in Big Data, but its lead over New York is underwhelming. Indeed.com lists 994 Hadoop jobs in NY and 1719 in Silicon Valley, less than twice as many.

What's more, if you are a financial startup, you simply must be in New York. You might have an office at TechSpaces in SF, but that's about it. This view is fully supported by our illustrator and cartoon author, who now appropriately resides in Manhattan.

Silicon Valley, pay attention!

Thursday, July 24, 2014

FreeEed does Concordance (R)


The latest release of FreeEed (V4.4) allows import into Concordance (R) eDiscovery management software. Here are the instructions.

It also contains a number of fixes. You can use FreeEed in so many ways:
  • Start a FreeEed server on Amazon, no hardware needed;
  • Download a virtual machine to your workstations;
  • Install in Windows, Linux, or Mac.
Download page: here

And all of the popcorn advantages still apply.


PS. Sneak preview: we are working on a document processing engine for today's three V's: volume, velocity, and variety. It is 10-100 times faster and supports dynamic data sources.

Wednesday, July 2, 2014

Run FreeEed in the cloud, no downloads or hardware needed

Hi all,

We now have another option for running FreeEed: on the Amazon AWS cloud. There are three steps: (1) start the server, (2) connect to it with X2GO, (3) download and unzip the latest software. The rest of the environment is already prepared for you. Check it out here.

We are planning regular webinars that teach this setup. Please write to let us know if you are interested.

Cheers,
FreeEed team

Sunday, June 22, 2014

I am a reviewer of the Apache Solr High Performance book


As always, I acknowledge my colleagues, my friend and partner Sujee, and my multi-talented family.

Next time, I will thank more of the friends who always help.

Friday, June 13, 2014

Houston Hadoop Meetup - Marco Vasquez presents Apache Spark

Invited speaker Marco Vasquez told the group about his work as a Data Scientist at MapR and how he uses Spark in it. Thanks to YARN in Hadoop 2, Spark has become a part of every major distribution, either as a release or as an early preview.

The group was quite technical and asked a lot of detailed questions. Thanks to everyone, and to MapR for sponsoring the pizza and drinks.

And here are the slides: http://www.slideshare.net/MapRTechnologies/spark-v1