Stream Processing

less than 1 minute read

Streaming exchange rates on binance

I created a stream processing application using the AWS Kinesis R package’s daemon + Redis to record the overall amount of coins exchanged on Binance (per symbol) in the most recent micro-batch. Then I created a Jenkins job that reads from this Redis cache and prints the overall value (in USD) of the transactions – based on the coin prices reported by the Binance API at the time of request. I report two charts to the “#bots-final-project” Slack channel.

I documented my steps to show my general understanding on how to build data pipelines using Amazon Web Services and R and how to implement a stream processing application (either running in almost real-time or batched/scheduled way) in practice.

-> find the PDF here

Apache Spark

In this project I use Spark in a practical manner: I look at cloud configurations, examine databricks (a software as a service running on Amazon)as well as managed Spark EMR, or elastic map reduce, on Amazon and many others.