We are pleased to announce the onboarding of Mr. Amar Sharma. He is a graduate of IIT Roorkee (Electrical) and IIT Delhi (Computers). He has deep knowledge of, and arguably some of the best experience in, Big Data and Analytics. He has worked for many large companies (Yahoo, Microsoft, Motorola, Synopsys, etc.) and has provided consultancy to successful startups (Tiffin Ala-Carte, Cloud Theta, etc.).
Big Data analytics is the process of collecting, organizing and analyzing large sets of data (called Big Data) to discover patterns and other useful information. It can help organizations better understand the information contained in their data and identify the data that matters most to the business and to future business decisions. Analysts working with Big Data typically want the knowledge that comes from analyzing the data.
Today's advances in analyzing big data allow researchers to decode human DNA in minutes, predict where terrorists plan to attack, determine which gene is most likely to be responsible for certain diseases and, of course, which ads you are most likely to respond to on Facebook.
Hadoop is an open source distributed processing framework that manages data processing and storage for big data applications running in clustered systems. It is at the center of a growing ecosystem of big data technologies that are primarily used to support advanced analytics initiatives, including predictive analytics, data mining and machine learning applications. Hadoop can handle various forms of structured and unstructured data, giving users more flexibility for collecting, processing and analyzing data than relational databases and data warehouses provide.
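As a rough illustration of the style of processing Hadoop is known for, the sketch below imitates the map, shuffle and reduce phases of a word count in plain Python on a single machine. It does not use Hadoop itself; the function names (map_phase, shuffle_phase, reduce_phase, word_count) and the sample records are assumptions made purely for illustration.

```python
from collections import defaultdict


def map_phase(records):
    """Emit a (word, 1) pair for every word in every record."""
    for record in records:
        for word in record.lower().split():
            yield word, 1


def shuffle_phase(pairs):
    """Group values by key, as a cluster would do before reducing."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(grouped):
    """Sum the counts collected for each word."""
    return {key: sum(values) for key, values in grouped.items()}


def word_count(records):
    """Chain the three phases; hypothetical helper, not a Hadoop API."""
    return reduce_phase(shuffle_phase(map_phase(records)))


if __name__ == "__main__":
    sample = ["big data needs big clusters", "data drives decisions"]
    print(word_count(sample))
```

On a real cluster the map and reduce steps would run in parallel on many machines, with the framework handling the grouping step and the storage between them.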
The ELK Stack is one way modern organizations choose to collect, search and visualize large volumes of log data. As the name ("stack") implies, ELK is not a single tool but a combination of three different tools, Elasticsearch, Logstash and Kibana, hence ELK. All three are open source projects maintained by Elastic.
Respectively, the tools provide fast search over large data sets, collect and distribute large amounts of log data, and visualize the collected and processed data.
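To make the division of labour between the three tools concrete, here is a toy, single-process sketch in plain Python. It does not call Elasticsearch, Logstash or Kibana; the function names (collect, build_index, search, summarize) and the "LEVEL message" log format are assumptions chosen only to mirror the three roles described above.

```python
from collections import Counter, defaultdict


def collect(lines):
    """Logstash-like step: parse raw 'LEVEL message' lines into events."""
    events = []
    for line in lines:
        level, _, message = line.partition(" ")
        events.append({"level": level, "message": message})
    return events


def build_index(events):
    """Elasticsearch-like step: map each lowercase word to event positions."""
    index = defaultdict(set)
    for position, event in enumerate(events):
        for word in event["message"].lower().split():
            index[word].add(position)
    return index


def search(index, events, term):
    """Return the events whose message contains the given word."""
    return [events[i] for i in sorted(index.get(term.lower(), ()))]


def summarize(events):
    """Kibana-like step: count events per level, ready for charting."""
    return Counter(event["level"] for event in events)


if __name__ == "__main__":
    raw = ["ERROR disk full", "INFO job started", "ERROR disk offline"]
    events = collect(raw)
    index = build_index(events)
    print(search(index, events, "disk"))
    print(summarize(events))
```

The real stack distributes each of these steps across servers and persists the index; the sketch only shows how collection, indexed search and summarization fit together.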
Apache Cassandra is a free and open-source distributed wide column store NoSQL database management system, ideal for high-speed, online transactional data. It was designed to handle large amounts of data across many commodity servers, providing high availability with no single point of failure.
Cassandra was initially developed at Facebook to power the Facebook inbox search feature.