Innovation is a way of Life!

Blogs


Big Data and Your Privacy: How Concerned Should You Really Be?

Today, every IT-related service, online or offline, is driven by data. In the last few years alone, the explosion of social media has given rise to an enormous amount of data, which is practically impossible to process without specialized high-end computing systems. In general, normal...

Is There an Alternative to Hadoop?

Hadoop: Using big data technologies for your business is really attractive, and Hadoop makes it even more appealing nowadays. Hadoop is a massively scalable data storage platform that is used as a foundation for many big data projects. Hadoop is powerful, however it...

Product Catalogue Classification Using Ensemble

The Problem Statement: The Otto Product Classification Challenge was a competition hosted on Kaggle, a website dedicated to solving complex data science problems. The purpose of this challenge was to classify products into the correct category, based on their recorded features. Data: The organizers had provided a training data set containing...

Hyperscaling Applications Using Mesos & Marathon

In a previous blog post, we saw what Apache Mesos is and how it helps to dynamically partition our available resources, which results in increased utilization and efficiency, reduced latency, and better ROI. We also discussed how to install, configure and run Mesos...

Predicting Restaurant Revenue using Ensemble Methods in Machine Learning

The Problem Statement: The TFI Restaurant Revenue Prediction Challenge was a competition hosted on Kaggle, a website dedicated to solving complex data science and machine learning problems. The purpose of this challenge was to predict the annual sales of restaurants based on given objective measurements. Data: The organizers...

How to Implement Big Data for Your Organization in the Right Way

Let’s start with an introduction to what IT across the globe calls “big data.” From a use case perspective, few terms are as overused and hackneyed as big data. Some people say it’s all the data in your company, while some others say it’s anything...

Monitoring Your Servers With Nagios Using NRPE and ELK Stack

In this blog, let’s look at the power of three tools, Elasticsearch, Logstash, and Kibana (together known as ELK), for collecting, analyzing, and visualizing all types of structured and unstructured data. You will see the advantages of these tools, and by the end of the article,...

Hadoop Cluster Automation Using Ironfan

Recently, we faced a unique challenge: setting up DevOps and management for a relatively complex Hadoop cluster on the Amazon EC2 cloud. The obvious choice was to use a configuration management tool. Having extensively used Opscode's Chef and given the flexibility and extensibility Chef provides;...