Amazon EMR Studio Is Now Generally Available

The GA release also adds new features including the ability to use the EMR console and AWS CloudFormation to create and configure a new EMR Studio for your team, and support for Microsoft Active Directory as an identity provider.

Recently, AWS announced the general availability of Amazon EMR Studio, which is an integrated development environment that makes it easy to develop, visualize, and debug applications written in R, Python, Scala, and PySpark. 

The GA release also adds new features including the ability to use the Amazon EMR console and AWS CloudFormation to create and configure a new EMR Studio for your team, and support for Microsoft Active Directory (AD) as an identity provider.

The release also features a new quick start notebook experience, and the ability to launch the live Apache Spark UI directly from an EMR Studio notebook. The GA release also supports private Git repositories.

Well, EMR Studio offers fully managed Jupyter notebooks, and tools like Spark UI and YARN Timeline Service to help debugging. The solution uses AWS Single Sign-On and lets you log in directly with your corporate credentials without signing in to the AWS Management Console. Users can install custom kernels and libraries, and collaborate with peers using code repositories like GitHub and Bitbucket. User can run parameterized notebooks as part of scheduled workflows using orchestration services like Apache Airflow and Amazon MWAA.

Source: AWS

EMR Studio allows you to run notebook code on Amazon EMR on Amazon EC2 or Amazon EMR on Amazon EKS, and leaverage the performance-optimized EMR runtime for Apache Spark. 

With this GA release, users can now use the EMR console, AWS CloudFormation, or the AWS CLI to create a new EMR Studio for their team. They can also automate Studio creation in AWS CloudFormation by specifying the configurations and dependencies in a CloudFormation template.  And added support for Microsoft AD as an identity source enables users to use it with EMR Studio via AWS SSO.

Amazon has extended the collaboration features of EMR Studio to include connecting from notebooks in EMR Studio to GitHub,GitLab, AWS CodeCommit, and Bitbucket, repositories on private networks like on-premises and customer VPCs. 

The release also makes application debugging easier by enabling users to launch the live Apache Spark UI directly from their notebooks within EMR Studio. 

To getting started with the Amazon EMR Studio Interface you can check out this tutorial.