Apache Flink is an open source stream processing framework with both batch processing and data streaming programs.
Apache Flink requires Java to run. Flink is implemented in Java and runs on the Java Virtual Machine (JVM). It leverages the Java programming language and its runtime environment to execute data processing tasks and manage distributed computations:
You can download the JDK from the
official Oracle website
.
Or use an open-source distribution like
OpenJDK
.
Set Java Environment Variable
You can find all the versions and components on the official Flink website
Before you configure Flink, you should learn about two important words: JobManager and TaskManager
Replace JobManager_IP
in the command above with the IP address of the JobManager in your setup. You can also use the default setting of “localhost:8081”.
Replace TaskManager_1_IP
, TaskManager_2_IP
and TaskManager_3_IP
in the command above with the IP addresses of the TaskManagers in your setup. You can also use localhost in place of the IP addresses.