IMDB Movie Data ETL Pipeline using S3, Glue, Redshift, EventBridge, SNS
Updated Aug 6, 2024 - Python
This project creates a serverless data pipeline to extract data from the Colombo Stock Market ASI Index API using AWS Lambda, Kinesis Firehose, and S3. An AWS Glue workflow processes and transforms the data, storing it in an Apache Iceberg table via Athena and Glue ETL jobs.
Used AWS Glue to perform ETL operations and load the resulting data into AWS Redshift. In the second phase, used AWS CloudWatch rules and Lambda to run the Glue jobs automatically.
ETL using application streaming and creating a Data Lake
This project outlines the final project requirements for DAV6100 - Information Architectures, focusing on group assignments, scoring criteria, topic selection, core requirements, and project components such as design, development, visualization, and executive presentation.
Data Streaming and Batch processing using AWS Services
Terraform configuration that creates several AWS services, uploads data to S3, and starts the Glue Crawler and Glue Job.
The project was designed to establish an architecture on AWS, originating from the migration of an existing database in a local (on-premise) environment.
This project aims to automate the process of infrastructure creation.
This project aims to analyze the popularity of YouTube content across different regions by leveraging datasets sourced from Kaggle. It employs a systematic approach to data preprocessing, cleaning, and analysis using various AWS (Amazon Web Services) services including S3, Lambda, Glue, and others, to build an automated ETL pipeline.
ETL pipeline on AWS
Glue Data Quality Example - Deploy to your AWS Account w/ Terraform to test
Data engineering project using streaming data produced by Python applications, an ETL process, and availability for ad-hoc SQL queries in the AWS cloud
This is a data pipeline built with the purpose of serving a business team.
Terraform module to create and manage an AWS Glue job