Apache Kafka is an open-source distributed event streaming platform used for stream processing, real-time data pipelines, and large-scale data integration. Originally created at LinkedIn in 2011 to handle real-time data feeds, Kafka quickly evolved from a messaging queue into a full-fledged event streaming platform that can process more than a million messages per second, or trillions of messages per day.