28

GitHub - aws-samples/aws-glue-samples: AWS Glue code samples

 4 years ago
source link: https://github.com/aws-samples/aws-glue-samples
Go to the source link to view the article. You can view the picture content, updated content and better typesetting reading experience. If the link is broken, please click the button below to view the snapshot at that time.

README.md

AWS Glue ETL Code Samples

This repository has samples that demonstrate various aspects of the new AWS Glue service, as well as various AWS Glue utilities.

You can find the AWS Glue open-source Python libraries in a separate repository at: awslabs/aws-glue-libs.

Content

  • FAQ and How-to

    Helps you get started using the many ETL capabilities of AWS Glue, and answers some of the more common questions people have.

  • Join and Relationalize Data in S3

    This sample ETL script shows you how to use AWS Glue to load, transform, and rewrite data in AWS S3 so that it can easily and efficiently be queried and analyzed.

  • Clean and Process

    This sample ETL script shows you how to take advantage of both Spark and AWS Glue features to clean and transform data for efficient analysis.

  • The resolveChoice Method

    This sample explores all four of the ways you can resolve choice types in a dataset using DynamicFrame's resolveChoice method.

  • Hive metastore migration

    This utility can help you migrate your Hive metastore to the AWS Glue Data Catalog.

  • Crawler undo and redo

    These scripts can undo or redo the results of a crawl under some circumstances.

License Summary

This sample code is made available under the MIT-0 license. See the LICENSE file.


About Joyk


Aggregate valuable and interesting links.
Joyk means Joy of geeK