

Cloudera Announces the General Availability of Cloudera DataFlow for the Public...
source link: https://www.infoq.com/news/2021/08/cloudera-dataflow-public-cloud/
Go to the source link to view the article. You can view the picture content, updated content and better typesetting reading experience. If the link is broken, please click the button below to view the snapshot at that time.

Cloudera Announces the General Availability of Cloudera DataFlow for the Public Cloud
Aug 21, 2021 2 min read
The enterprise data cloud company Cloudera recently announced the general availability (GA) of Cloudera DataFlow for the Public Cloud, a cloud-native service for data flows to process hybrid streaming workloads on the Cloudera Data Platform (CDP). With Cloudera DataFlow for the Public Cloud, customers can automate complex data flow operations, improve the operational efficiency of streaming data flows with auto-scaling capabilities, and reduce cloud costs by removing infrastructure sizing guesswork.
With Cloudera DataFlow for the Public Cloud, the company brings a FlowOps service with several capabilities such as:
- Central Flow Catalog for manageability, discovery, and version control
- Central dashboard for monitoring, troubleshooting, and performance tuning of data flows across multiple cloud clusters
- Simple deployment wizard and robust APIs for auto-scaling flows on Kubernetes managed by CDP
- Pre-built flows called "ReadyFlows" for some of the common streaming use cases
Under the hood, Cloudera DataFlow for the Public Cloud leverages Kubernetes as the scalable runtime, and it provisions NiFi clusters on top of it as needed. The foundation is a brand new Kubernetes Operator developed from the ground up to manage the lifecycle of Apache NiFi clusters on Kubernetes. Through this operator requests for clusters lead to the provisioning of them. Furthermore, once the provisioning is complete, the operator will also take care of other life cycle aspects, like upgrading Apache NiFi to a new version or terminating a cluster.
Cloudera DataFlow for the Public Cloud users can access the service through the hosted CDP Control Plane, which hosts critical components of CDF-PC like the Catalog, the Dashboard, and the ReadyFlow Gallery.
Source: https://blog.cloudera.com/cloudera-dataflow-for-the-public-cloud-a-technical-deep-dive/
Today, many organizations leverage Apache NiFi to capture and process data across hybrid cloud architectures by visually designing no-code data flows. However, one of the challenges with Apache NiFi is deploying multiple data flows into a single cluster, and these flows compete for resources – leading to performance issues. Some mitigate that issue by sizing a more significant amount of infrastructure than necessary and thus ending up with underutilized infrastructure and higher costs. Furthermore, other challenges they can face are scaling or not having a central overview of the flows.
Dinesh Chandrasekhar, head of product marketing, Data-in-Motion at Cloudera, said in a Cloudera press release:
Cloudera DataFlow automates and manages cloud-native data flows on Kubernetes - and it is something only we offer. Now it is easy for our customers to boost the operational efficiency of their streaming workloads and save on infrastructure costs in the public cloud.
Initially, Cloudera DataFlow for the Public Cloud will be available on the Amazon Web Services (AWS) platform, and Microsoft Azure will be next. And lastly, the pricing details of Cloudera DataFlow for the Public Cloud are available on the pricing page.
Recommend
-
13
Google Announces the General Availability of A2 Virtual Machines Apr 07, 2021...
-
7
AWS Announces the General Availability of the Red Hat OpenShift Service on AWS Apr 12, 2021...
-
9
HashiCorp Announces the General Availability of HCP Vault on AWS Apr 14, 2021...
-
6
Catchpoint Announces General Availability of Enhanced WebPageTest Performance Testing API Apr 13, 2021...
-
8
AWS Announces General Availability of New Application Migration Service May 27, 2021...
-
12
AWS Announces the General Availability of Amazon HealthLake Jul 21, 2021...
-
5
Gitpod Announces General Availability for Public and Private Repositories Aug 30, 2021...
-
9
Amazon Announces QuickSight Q General Availability Oct 02, 2021...
-
10
AWS Announces the General Availability and Open Sourcing of the Amazon Genomics CLI Oct 06, 2021...
-
8
Microsoft Announces the General Availability of Azure Purview Oct 10, 2021...
About Joyk
Aggregate valuable and interesting links.
Joyk means Joy of geeK