Skip to content

An example that shows how to periodically launch a Dataflow analytics pipeline from GAE Flex, that reads from Datastore.

License

Notifications You must be signed in to change notification settings

amygdala/gae-dataflow

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Running Dataflow jobs from Google App Engine

This directory contains two different examples that show how you can run Cloud Dataflow pipelines from App Engine apps, as a replacement for the older GAE Python MapReduce libraries, as well as do much more.

The examples show how to periodically launch a Python Dataflow pipeline from GAE, to analyze data stored in Cloud Datastore; in this case, tweets from Twitter.

The example in sdk_launch shows how to launch Dataflow jobs via the Dataflow SDK. This requires the use of an App Engine Flex service to launch the pipeline.

The example in job_template_launch shows how to launch Dataflow jobs via job Templates. This can be done using only App Engine Standard. Prior to deploying the app, you create a pipeline template (via your local command line, in this example) for the app to use.

Contributions

Contributions are not currently accepted. This is not an official Google product.

About

An example that shows how to periodically launch a Dataflow analytics pipeline from GAE Flex, that reads from Datastore.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages