Interview: M.C. Srivas, CTO, MapR on Data Agility – The Next Frontier of Big Data
Tags: Big Data, Business Strategy, Challenges, Competition, Education, Interview, M. C. Srivas, MapR, Trends
We discuss the competitive differentiation of MapR, challenges in consumerizing Big Data, trends, strategy recommendations, desired skills and more.

Srivas was Chief Architect at Spinnaker Networks (now NTAP) which built the industry's fastest single-box NAS filer, as well as the industry's most scalable clustered filer. Previously, he managed the Andrew File System (AFS) engineering team at Transarc (now IBM). AFS is now standard classroom material in operating systems courses. While not writing code, Srivas enjoys playing tennis, badminton and volleyball. M.C. has an MS in Computer Science from University of Delaware, and a B.Tech. in electrical engineering from IIT Delhi.
First part of interview
Here is second and last part of my interview with him:
Anmol Rajpurohit Q7. How do you distinguish MapR from the competition such as Cloudera and Hortonworks?
M.C. Srivas: MapR is the only Hadoop distribution that brings the kind of integrity, reliability, performance, and manageability that enterprises have long desired

MapR is the only distribution for Hadoop, and in fact the only commercial data processing system, that can work across data centers worldwide, automatically move petabytes, and switch over from one to another and back, whether due to “follow the sun” operations, or due to disaster recovery. The Aadhaar project in India moved to the MapR Distribution including Hadoop to take advantage of such advanced capabilities.
AR: Q8. Given the current state of technology and talent, what do you see as the biggest challenges in the way of consumerizing Big Data on a massive scale?
MCS: The “consumerization,” as you term it, has already taken place at Yahoo, Facebook, Twitter, LinkedIn, and at many, many other websites around the world. The challenge is always to balance privacy while delivering these services. People have started to get afraid of being tracked, and have started hiding themselves.

Real life is about “forgive and forget” … you are innocent until proven guilty, and if you have done something wrong and paid your dues, then you are entitled to a new start. The web, as it gets more and more entangled in real life, needs to mimic these human aspects and needs to get more forgiving and forgetting.
If they don’t do it themselves, it will get legislated and that can take a turn for the worse. For example, the European Union already has passed some bizarre laws about people who can ask web sites to not link to content that is already a matter of public record. Big Data is a powerful tool, and it must be used with care responsibly.
AR: Q9. Which trends do you expect to dominate across the big data industry over the next 2-3 years?
MCS: Legacy databases and data warehouses are expensive because DBA resources are required to flatten, summarize and fully structure the data. Upfront DBA costs delay access to new data sources, and

Additionally, data lakes and data hubs have represented a popular first deployment for Hadoop. A data lake or data hub is a scalable infrastructure that’s both economically attractive (reduced per-terabyte cost) and designed for flexibility, and it has the ability to store various forms of both structured and unstructured data. During the next few years, data lakes will evolve as organizations move from batch to real-time processing and integrate file-based Hadoop and database engines into their large-scale processing platforms.
AR: Q10. What is the best advice you have got in your career?
MCS: What you do today will follow you for the rest of your life. Therefore, don’t ever compromise on pushing the ceiling. Quality is lasting. You want to be known as a person that demands and delivers the highest quality products, in every sense of the word.
AR: Q11. What strategy would you recommend to companies that are trying to make big data an integral part of their business strategy?

MCS: Start simple. Pick a small project that can succeed easily. The idea is to focus on learning how to implement and deploy the new technology first. Subsequent projects can be more complex, but it’s important to first get success on a small scale.
AR: Q12. Recently, we have seen a lot of universities offering big data-related programs and certificates. Do you think the universities are doing a good job in preparing students for a technical career in big data? What changes would you suggest to the big data-related academic curriculum?
MCS: Universities are doing a good job with big data, in many ways. Some tend to focus on the technology itself, while others look at combining various technologies to build new systems. Both are important. We have a pretty strong internship program every summer, where students in their senior or final years work at MapR for 3-6 months on various projects. We have been quite impressed by the quality of the students coming in. Many of these students have joined MapR after they’ve graduated.
AR: Q13. What key qualities do you look for when interviewing for data science-related positions on your team?
MCS: Attitude. Attitude and the ability to work as part of a team are the most important qualities. It goes without saying that the person has to have the relevant knowledge, intelligence, and

AR: Q14. What was the last book that you read and liked? What do you like to do when you are not working?

Related:
Top Stories Past 30 Days
|
|