This application reads messages from a Kafka topic, selects the messages that match certain criteria (they contain specific words), and writes the selected messages to another topic.
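The project's own source implements the selection logic; the sketch below only illustrates the flow using Spark Streaming's Kafka 0.8 integration (the package named in the launch command further down). The keyword list, topic names, and the use of the kafka-python producer for the write-back are assumptions for the example, not the application's actual code.

```python
# Minimal sketch of the read-filter-write flow, not the project's real source.
# Assumes the kafka-python package for the producer side and the
# spark-streaming-kafka-0-8 integration named in SPARK_OPTIONS below.
from pyspark import SparkContext
from pyspark.streaming import StreamingContext
from pyspark.streaming.kafka import KafkaUtils
from kafka import KafkaProducer

BROKERS = "apache-kafka:9092"
WORDS = ["error", "exception"]   # hypothetical selection criteria

def publish(rdd):
    # Create a producer on each partition and forward the matching messages.
    def send_partition(records):
        producer = KafkaProducer(bootstrap_servers=BROKERS)
        for value in records:
            producer.send("topic2", value.encode("utf-8"))
        producer.flush()
    rdd.foreachPartition(send_partition)

sc = SparkContext(appName="error-selector-sketch")
ssc = StreamingContext(sc, 5)  # 5-second micro-batches

stream = KafkaUtils.createDirectStream(
    ssc, ["topic1"], {"metadata.broker.list": BROKERS})

# Messages arrive as (key, value) pairs; keep values containing any keyword.
selected = stream.map(lambda kv: kv[1]) \
                 .filter(lambda msg: any(w in msg for w in WORDS))
selected.foreachRDD(publish)

ssc.start()
ssc.awaitTermination()
```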
Because this project utilizes Spark, it is easiest to deploy on OpenShift using the RADanalytics tooling. Since the application is built with source-to-image, a Spark cluster must be available. The shortest path to making that connection is to use the Spark clusters that are automatically spawned by the Oshinko source-to-image utilities. Please see that documentation for more information about this process.
- see the radanalytics.io Get Started page for instructions on installing that tooling
- launch the skeleton with the following command:
```
oc new-app --template=oshinko-pyspark-build-dc \
  -p APPLICATION_NAME=error-selector \
  -p GIT_URI=https://github.com/redhathackfest/spark-error-selector \
  -p APP_ARGS='--servers=apache-kafka:9092 --in=topic1 --out=topic2 --count=topic3' \
  -p SPARK_OPTIONS='--packages org.apache.spark:spark-streaming-kafka-0-8_2.11:2.1.0'
```
In this example, the application subscribes to messages on the Kafka topic `topic1`, publishes the filtered messages on the topic `topic2`, and publishes information about message counts on `topic3`, using the broker at `apache-kafka:9092`.
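Once the application is running, a quick way to check the pipeline end to end is to publish a few messages to `topic1` and read whatever the selector forwards to `topic2`. The snippet below is a hypothetical smoke test using the kafka-python package (not part of this project); it assumes `apache-kafka:9092` is reachable from wherever the script runs, for example from a pod inside the cluster.

```python
# Hypothetical smoke test; kafka-python and the sample messages are
# assumptions for illustration, not part of the project.
from kafka import KafkaProducer, KafkaConsumer

BROKER = "apache-kafka:9092"

# Publish a couple of sample messages; which ones are forwarded
# depends on the selector's keyword criteria.
producer = KafkaProducer(bootstrap_servers=BROKER)
producer.send("topic1", b"all systems nominal")
producer.send("topic1", b"error: disk almost full")
producer.flush()

# Read the selector's output; stop after ten seconds of inactivity.
consumer = KafkaConsumer("topic2",
                         bootstrap_servers=BROKER,
                         auto_offset_reset="earliest",
                         consumer_timeout_ms=10000)
for record in consumer:
    print(record.value)
```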