Google Launches Cloud Dataproc

Google launches new product Cloud Dataproc, adding on to its big data services. Cloud Dataproc is a managed Spark and Hadoop service that lets you take advantage of open source data tools.

The New Google Cloud Dataproc is now available in beta manages the Spark data processing engine or Hadoop framework directly on virtual machines. A fully managed service like Cloud Dataflow, which lets you orchestrate your data on Google’s platform.
 
Dataproc allows its users to set up ad-hoc clusters whenever required. This will be managed and administered by Google.
 
As stated via a blog post:
 
"Cloud Dataproc is a managed Spark and Hadoop service that lets you take advantage of open source data tools for batch processing, querying, streaming, and machine learning. Cloud Dataproc automation helps you create clusters quickly, manage them easily, and save money by turning clusters off when you don't need them. With less time and money spent on administration, you can focus on your jobs and your data. In the time it takes you to read this blog post, you can have a Spark or Hadoop cluster created, configured, and ready to work for you."
 
 
Image Source: googlecloudplatform.blogspot.in
 
Greg DeMichillie, director of product management, believes that Cloud Dataproc is an important addition to the range of Big Data services.
 
DeMichillie goes on to acknowledge that there are users who want complete control over their data, and hence these users might be willing to manage and operate their own virtual machines. The Dataproc users will not face any issues if they want to set up their own infrastructure.
 
For more information, visit here.


Next Recommended Reading Google Announces Cloud Dataproc Service