Sentiment Analysis with Kubeflow Pipelines, Cloud Dataflow, and Cloud Natural Language API

Included code will build a Kubeflow Pipelines component and pipeline. The pipeline uses Cloud Dataflow to do sentiment analysis on New York Times headlines. Cloud DAtaflow uses Apache Beam (Java) to extract front page headlines from NYTimes Archive API JSON files, perform sentiment analysis on them by calling Cloud Natural Language API and then aggregates the sentiment scores and magnitudes.

Pre-requisites

Enable Natural Language API
Enable Dataflow
Deploy a Kubeflow cluster

Create Docker image

To create Sentiment Analysis docker image, go to ./sentiment-analysis folder and run the following command replacing PROJECT_ID with your actual Project ID.

bash build_image.sh PROJECT_ID

Build pipeline

To create Sentiment Analysis pipeline, go to ./pipeline folder and run the following command.

python build_sentiment_analysis.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Sentiment Analysis with Kubeflow Pipelines, Cloud Dataflow, and Cloud Natural Language API

Pre-requisites

Create Docker image

Build pipeline

Files

README.md

Latest commit

History

README.md

File metadata and controls

Sentiment Analysis with Kubeflow Pipelines, Cloud Dataflow, and Cloud Natural Language API

Pre-requisites

Create Docker image

Build pipeline