đź“ť Research :https://ojitha.blogspot.com.au
for my lengthy articles.
Notes on Introduction to Advanced Bash Usage
While I am going through the following, the youtube talk and it’s associated presentation, my hand-ons were recorded here. It is recommended to go through the basics first. You can also refer to the Bash Ref Manual for more information.
Pandas type conversion
Sometimes we need to remove unnecessary data and save the column in the right format in the Pandas data frames.
AWS Glue run locally
This blog explains how to create an AWS Glue container1 to develop PySpark scripts locally. I’ve already explained how to run the Glue locally using Glue Development using Jupyter.
Access AWS SSM via AWS Stepfunctions
Configuration will be availble throughout the pipeline, if that can be stored in the AWS Stepfunctions. Generally congiruation should be stored in the SSM parameter store. How to access the SSM parameter store from the AWS Stepfunction?
Glue Development using Jupyter
Developing and testing the Glue job in the viscose IDE is one of the best development opportunities because Jupyter doesn’t support IDE features. In this blog, I set up a Glue docker instance in the EC2 and use the vscode Jupyter notebook feature to develop Glue jobs. If you want to create more customized your own Docker image, please see AWS Glue run locally.