

GitHub - confluentinc/ksql: KSQL - the Streaming SQL Engine for Apache Kafka
source link: https://github.com/confluentinc/ksql
Go to the source link to view the article. You can view the picture content, updated content and better typesetting reading experience. If the link is broken, please click the button below to view the snapshot at that time.

README.md
KSQL - Streaming SQL for Apache Kafka
Important: We recently announced the General Availability of KSQL. It will be available for download in early April at http://confluent.io/ksql. Until then you can download the latest KSQL Developer Preview release at https://github.com/confluentinc/ksql/releases.
KSQL is an open source streaming SQL engine for Apache Kafka. It provides a simple and completely interactive SQL interface for stream processing on Kafka; no need to write code in a programming language such as Java or Python. KSQL is open-source (Apache 2.0 licensed), distributed, scalable, reliable, and real-time. It supports a wide range of powerful stream processing operations including aggregations, joins, windowing, sessionization, and much more.
Click here to watch a screencast of the KSQL demo on YouTube.
Quick Start
If you are ready to see the power of KSQL, try out these:
- KSQL Quick Start: Demonstrates a simple workflow using KSQL to write streaming queries against data in Kafka.
- Clickstream Analysis Demo: Shows how to build an application that performs real-time user analytics.
Use Cases and Examples
Streaming ETL
Apache Kafka is a popular choice for powering data pipelines. KSQL makes it simple to transform data within the pipeline, readying messages to cleanly land in another system.
CREATE STREAM vip_actions AS SELECT userid, page, action FROM clickstream c LEFT JOIN users u ON c.userid = u.user_id WHERE u.level = 'Platinum';
Anomaly Detection
KSQL is a good fit for identifying patterns or anomalies on real-time data. By processing the stream as data arrives you can identify and properly surface out of the ordinary events with millisecond latency.
CREATE TABLE possible_fraud AS SELECT card_number, count(*) FROM authorization_attempts WINDOW TUMBLING (SIZE 5 SECONDS) GROUP BY card_number HAVING count(*) > 3;
Monitoring
Kafka's ability to provide scalable ordered messages with stream processing make it a common solution for log data monitoring and alerting. KSQL lends a familiar syntax for tracking, understanding, and managing alerts.
CREATE TABLE error_counts AS SELECT error_code, count(*) FROM monitoring_stream WINDOW TUMBLING (SIZE 1 MINUTE) WHERE type = 'ERROR' GROUP BY error_code;
Latest news
- Press Release: KSQL GA announced for early April 2018 -- until then you can download the latest KSQL Developer Preview release at https://github.com/confluentinc/ksql/releases.
- KSQL Feb 2018 release available -- bug fixes, performance and stability improvements
- Secure Stream Processing with Apache Kafka, Confluent Platform and KSQL -- stream processing examples using KSQL that show how companies are using Apache Kafka to grow their business and to analyze data in real time; how to secure KSQL and the entire Confluent Platform with encryption, authentication, and authorization
- KSQL in Action: Real-Time Streaming ETL from Oracle Transactional Data -- replacing batch extracts with event streams, and batch transformation with in-flight transformation; we take a stream of data from a transactional system built on Oracle, transform it, and stream the results into Elasticsearch
- KSQL Jan 2018 release available
-- improved data exploration with
PRINT TOPIC
,SHOW TOPICS
; improved analytics withTOPK
,TOPKDISTINCT
aggregations; operational improvements (command line tooling for metrics); distributed failure testing in place - KSQL Dec 2017 release available
-- support for Avro and Confluent Schema Registry; easy data
conversion between Avro, JSON, Delimited data; joining streams and tables across different data formats; operational
improvements (
DESCRIBE EXTENDED
,EXPLAIN
, and new metrics); optimizations (faster server startup and recovery times, better resource utilization) - KSQL Nov 2017 release available -- focus on community-raised issues and requests (369 pull requests, 50 closed issues)
Documentation
You can find the KSQL documentation here
Join the Community
Whether you need help, want to contribute, or are just looking for the latest news, you can find out how to connect with your fellow Confluent community members here.
- Ask a question in the #ksql channel in our public Confluent Community Slack. Account registration is free and self-service.
- Join the Confluent Google group.
Contributing
Contributions to the code, examples, documentation, etc, are very much appreciated. For more information, see the contribution guidelines.
- Report issues and bugs directly in this GitHub project.
- Read the Developer notes
License
The project is licensed under the Apache License, version 2.0.
Apache, Apache Kafka, Kafka, and associated open source project names are trademarks of the Apache Software Foundation.
Recommend
-
50
Confluent's Python Client for Apache KafkaTM confluent-kafka-python provides a high-level Producer, Consumer and AdminClient compatible with all Apache Kafka
-
93
README.md Docker Images for Confluent Plaform Docker images for deploying and running the Confluent Platform. The images are currently available on
-
71
README.md Schema Registry Schema Registry provides a serving layer for your metadata. It provides a RESTful interface for storing and retrieving Avro s...
-
38
README.md Kafka Tutorials The source code for the Kafka Tutorials microsite. Setup ...
-
51
README.md Kafka REST Proxy The Kafka REST Proxy provides a RESTful interface to a Kafka cluster. It makes it easy to produce and consume messages, view...
-
41
README.md Confluent's .NET Client for Apache KafkaTM
-
57
README.md Kafka Connect HDFS Connector kafka-connect-hdfs is a Kafka Connector
-
23
README.md Confluent's Golang Client for Apache KafkaTM confluent-kafka-go is Confluent's Golang client for
-
11
By Soumyadeb Mitra/September 07, 2020Overview In this post, we answer the all-important question – “Why we did not prefer Apache Kafka over PostgreSQL for building RudderStack”. We discuss some of the...
-
11
Intelligent Kafka message routing using Drools DMN Engine and Apache CamelIn this post I want to share an inte...
About Joyk
Aggregate valuable and interesting links.
Joyk means Joy of geeK