Banner Image

Skills

  • Amazon AWS
  • Analytics
  • Apache
  • Business Intelligence
  • Cassandra
  • Clojure
  • Data Management
  • Data Science
  • Distributed Computing
  • Erlang
  • Hadoop
  • Java
  • Machine Learning
  • Python
  • Scala

Sign up or Log in to see more.

Services

  • Principal Architect (Big Data & Cloud)

    $50/hr Starting at $500 Ongoing

    Dedicated Resource

    Building data pipelines and analytical systems at massive scale. My experience lies in distributed systems, focusing on data driven large-scale systems (+ nodes). For the highly concurrent world my choice...

    Amazon AWSAnalyticsApacheApache HiveApache Solr

About

TCO efficient CTO

Building data pipelines and analytical systems at massive scale. My experience lies in distributed systems, focusing on data driven large-scale systems (+ nodes).

For the highly concurrent world my choice of development environment is Erlang(BEAM) and Clojure (JVM). Using functional languages that supports thousands of lightweight threads communicating with message passing and having inverted concurrency control enables low latency and high throughput with thread safe software.

Storing data at scale has been an interesting subject to me, I am familiar with the RCFile whitepaper and the more recent publication about ORC and Parquet. I have been using columnar stores beside the classical row oriented stores (SQL servers) and key-value stores (Riak, Couchbase).

Analysis of large datasets is sometimes challenging. Using caching and sampling and few other techniques makes it possible to query these sets. I am familiar with few query engines (Hive, PrestoDB, Tez)

Work Terms

I mostly work at CET working hours, but I'm flexible. I don't work cheap but I don't play cheap in return.

Attachments (Click to Preview)