This blog is writed to simply introduce the assignment 2 project of COMP90024.

This project was implemented by members in group 55

EMO’s Blog

The github of this project is here

COMP90024-A2-Group55

Team Members:

PowerPoint

https://docs.google.com/presentation/d/141WuuAyn-erbEmaW-RVfLsY1yr2QnDdpmTDvFowC9Ms/edit?usp=sharing

Demonstration Video links

Ansible

Part 1: https://youtu.be/_D38VT2fbOc
Part 2: https://youtu.be/Xdya6DEyqS0

Tweet harvesting and CouchDB utilization

Part 1: https://www.youtube.com/watch?v=baOaeuTgMh0

System Structure

Frontend (http://172.26.132.195:8080/)

For visualizing analyzed data, we choose Vue which is a components-based development framework to build our web application. Four main packages were used here, the first one is Vue-ElementUI which offers a set of created components and UIs. We have used it to construct the main structure and style of web pages like Navigation Side Bar and Message Box. The second is Vue-Echart which can be used to draw charts like histogram, pie chart and line chart. The third one is Vue-GoogleMaps, we used it to show data distribution on specific areas. For example, we distribute areas as seven levels based on the num of negative job related tweets and label these areas by seven different colors. The last one is Axios which can be used to communicate with backend servers by HTTP Requests.

Backend

Harvester Server
Data Server
Semantic Analysis Server
Trigger Server

Harvester server

We have built APIs for stream tweets and search tweets. The user can simply call these two APIs to add new thread for harvest tweets.

Semantic Analysis Server

In this server, we use the SentimentIntensityAnalyzer model under the NLTK package of python. It is a pre-trained natural language processing model which can divide emotions expressed in words into negative or positive. This server can be automatically triggered per hour, and then it will pull tweets from created views on couchdb and label these tweets as negative or positive. Finally it will count the num of negative tweets and positive tweets and the statistical results will be stored or updated into couchdb. This server is listening on port 5001.

Data Server

This server provides main api services for the web application. The Flask which is a lightweight development framework of Python is used here. The couchdb package is used to communicate with the couchdb database. There are two types of services which are Aurin Data Services and Couchdb Data Services. The Aurin Data Services will be listening for requests, once accepting requests, they will read json files which download from Aurin and format them and send the formatted data back as response. The Couchdb Data Services are similar to The Aurin Data Service, but they will return data which come from the couchdb database. This server is listening on port 5000.

Trigger Server

If we want to scale up the container of Harvester Server and Semantic Analysis Server to raise system efficiency, we have to make sure each container operates different tasks. So, we have built a trigger that will automatically allocate tasks by sending requests to the Harvester Server cluster and Semantic Analysis Server cluster. After the requests arrive the docker swarm load balance of both servers, the load balancers can evenly distribute tasks into each container. The trigger will allocate new tasks for collecting latest tweets to Harvester Server per day. And it will assign tasks of analyzing tweets for scenarios to Semantic Analysis Server per hour. The trigger also provides APIs for webapp to manually edit the tasks lists. Generally, users can add new tasks into the task list and also delete tasks from the task list through the web app.

Twitter Analysis Project Design

Briefly Introduce for COMP90024-Assignment 2

COMP90024-A2-Group55

PowerPoint

Demonstration Video links

Ansible

Tweet harvesting and CouchDB utilization

System Structure

Frontend (http://172.26.132.195:8080/)

Backend

Harvester server

Semantic Analysis Server

Data Server

Trigger Server

Database

Specfic Description

Twitter harvester

Machine learning algorithm

Deployment Operation

Dockerfile

Images on DockerHub

Instances information

CATALOG

FEATURED TAGS