GE Healthcare Challenge

SHADE is an engine that detects a person's emotions from their social media activity and recommends measures to improve their mood.

Problem Statement

Help me with my Mood with Social-media Health Analysis and Display Engine (SHADE) is a software solution that analyses your current emotions based on the content you share on different social media websites. With advances in technology, it has become easy to detect the emotions a user is going through by applying NLP to the shared text. Combined with visual recognition on shared images, this gives a concrete basis for suggesting calming measures before the user takes any drastic action.

Key Value Proposition

  • Data is fetched not only from Twitter but also from Medium.com and Instagram.
  • Emotion detection is applied to photos shared on Instagram and Twitter, in addition to text.
  • Personality insights are derived from blogs written on Medium.
  • Graphical representation of data using Google Charts (bar graph, donut pie chart, 3D chart, etc.).
  • Recommendation engine that provides tips from doctors and psychiatrists based on the detected emotion.
  • AWS Translate is used to translate all tweets to English so that the other AWS services can process them effectively.
  • Yoga positions are displayed based on the detected emotion.
  • If the user is sad or feeling disgusted, leaving them alone can be dangerous. The app therefore shows nearby places of worship, restaurants and natural spots using Google Nearby APIs, encouraging the user to step out, explore, and be around other people.
  • To show the nearby places, the application uses the current location of the user.
  • SMS Service : If the aggregated sadness score crosses 60%, an SMS is sent to the user's emergency number.
  • Medicine Aggregator : Given a medicine name, returns links to different e-commerce websites where the product is available for sale.
  • Nearby Psychiatrists : Uses the browser's location service to find nearby psychiatrist clinics.
  • Alexa Skill : Apart from the web app, we have an Alexa Skill with the same functionality, which gives a voice response as well as recommendations in the Alexa app.
  • Chatbot for CBT (Cognitive Behavioral Therapy) : A chatbot powered by AI services that improves over time as the patient interacts with it. It plays the role of a CBT expert, encouraging positive thoughts in the patient's mind.
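
The SMS rule in the list above can be sketched roughly as follows. This is only an illustration: the README does not say how sadness is aggregated, so the plain average used here is an assumption, and `aggregated_sadness`, `maybe_send_alert` and the `send_sms` callback are hypothetical names rather than parts of the actual service.

```python
from typing import Callable, Dict, List

# The 60% limit comes from the feature list; everything else is assumed.
SADNESS_THRESHOLD = 0.60


def aggregated_sadness(tweet_scores: List[Dict[str, float]]) -> float:
    """Average the 'sadness' score over all analysed tweets (assumed aggregation)."""
    if not tweet_scores:
        return 0.0
    total = sum(scores.get("sadness", 0.0) for scores in tweet_scores)
    return total / len(tweet_scores)


def maybe_send_alert(
    tweet_scores: List[Dict[str, float]],
    emergency_number: str,
    send_sms: Callable[[str, str], None],
) -> bool:
    """Call send_sms when aggregated sadness crosses the threshold.

    send_sms is injected so that any SMS provider can be plugged in.
    Returns True if an alert was sent.
    """
    level = aggregated_sadness(tweet_scores)
    if level > SADNESS_THRESHOLD:
        send_sms(emergency_number, f"SHADE alert: aggregated sadness at {level:.0%}")
        return True
    return False
```

Injecting the sender keeps the threshold check testable without touching a real SMS gateway.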

Proposed Solution

  • The social media websites that we target to understand the user's current emotions are :
    • Twitter : We see Twitter as a place where people express their instant emotions about named entities (name, place, product, organisation, etc.).
    • Instagram : We see Instagram as a rich source of data because of the hashtags and images a user shares, which point to their current mental well-being.
    • Medium : The earlier two websites are used to express momentary emotions, whereas analysing a user's blogs on a popular blogging website like Medium gives deep insights into their interests, personality and state of mind.
  • The software solution has the following components :
    • Data Aggregator : A REST server that aggregates data from different social media websites and stores it in a NoSQL DB.
    • Data Analyser : It uses the previously aggregated data and applies AWS AI services (Translate, Comprehend, and Rekognition calls such as detectText, detectLabels and recognizeCelebrities), Watson components (Tone Analyser and Personality Insights), and custom models to understand the user's state of mind.
    • Suggested Measures : Once the user's state of mind has been understood, we provide data visualizations to help them understand themselves better, along with songs, videos, articles, yoga asanas and nearby places to eat, worship or enjoy natural beauty, depending on the most prominent emotion detected.

Architectures and Flow Diagram

aws-rec-DB5


aws-rec-DB3




aws-rec-DB4


Technology Stack used

  • aws-aggregator :
  • aws-analyser :
    • Server-Type : NodeJS Project (Web-APP + REST Server)
    • Programming Language : NodeJS with Express Framework
    • App : NodeJS App
    • Hosted : AWS Lambda
    • Database : MongoDB hosted on MLAB
    • Major API Endpoint : https://aws-analyser.amazonapp.com/readProfile/valid_twitterID/valid_mediumID/valid_instagramID
    • 3rd Party APIs used : AWS APIs (Translate, Comprehend, Rekognition and Polly), YouTube APIs, Nearby Places APIs, Google Chart APIs, Algorithmia models, custom model APIs, IBM Tone Analyser APIs, IBM Personality Insights APIs
  • aws-recommender :
  • For Instagram, business accounts had a 15-day approval period to get API access. Since we did not have that much time, we used a workaround: the Instagram data of any user can be obtained from this URL : https://www.instagram.com/iam_niks026/?__a=1 where iam_niks026 is the "Instagram Handle". Currently we keep that JSON response as a file on the server and have built a JSON parser on top of it. When the solution goes live, the file reading will be replaced by API calls.
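
As a rough idea of what the JSON parser for the saved Instagram response might do, here is a minimal sketch. The layout of the real Instagram response is not reproduced here: the sketch assumes the posts have already been flattened into dictionaries with the hypothetical keys `caption`, `image_url` and `likes`, and `parse_posts` is an illustrative name only.

```python
import re
from typing import Any, Dict, List

# Matches '#word' tokens inside a caption.
HASHTAG_RE = re.compile(r"#(\w+)")


def parse_posts(raw_posts: List[Dict[str, Any]]) -> List[Dict[str, Any]]:
    """Reduce each post to the fields the aggregator stores:
    hashtags, image URL and number of likes.

    The input keys are assumptions made for this sketch.
    """
    parsed = []
    for post in raw_posts:
        caption = post.get("caption") or ""
        parsed.append(
            {
                "hashtags": [tag.lower() for tag in HASHTAG_RE.findall(caption)],
                "image_url": post.get("image_url"),
                "likes": int(post.get("likes", 0)),
            }
        )
    return parsed
```

Keeping the parser independent of where the JSON comes from (file today, API later) means only the loading step has to change when the solution goes live.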

Implementation Details

Module 1 : aws-aggregator

  • Python Libraries Used : Flask,tweepy,json,csv,requests,xml.etree.ElementTree
  • For each user we have a single document in the collection "aggregate" of DB "awsai" in MongoDB. When the user enters the system, aws-aggregator checks whether this set of social-media IDs exists in the DB. If not, the user has come to the web app for the first time and a new document is created. If it does exist, the previous document is deleted and a new one is created.
  • Twitter Sub-Module :
    • Motivation : To get the user's tweets from the past 7 days to understand the instantaneous emotions they went through.
    • Data collected : For each tweet, we collect the tweet text, time and language. We also collect the image associated with the tweet, if any.
  • Instagram Sub-Module :
    • Motivation : To get the hashtags of each post, which convey the user's current state of mind, and perform visual recognition on images.
    • Data collected : For each post, we collect the post's hashtags, image URL and number of likes and store them in the DB.
  • Medium Sub-Module :
    • Motivation : To get insights into the user's personality and interests.
    • Data Collected : For each blog, the blog content and date of publication.
  • Process Flow :

aws-agg-DB1
aws-agg-DB2
aws-agg-DB3
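
The "one document per user, replaced on every visit" rule described in this module can be sketched as below. To stay self-contained the sketch uses a plain dictionary in place of the MongoDB "aggregate" collection; the function name `reset_user_document` and the field names of the fresh document are assumptions, not the project's actual schema.

```python
from typing import Any, Dict, Tuple

# A user is identified by the triple (twitter_id, medium_id, instagram_id).
UserKey = Tuple[str, str, str]


def reset_user_document(
    store: Dict[UserKey, Dict[str, Any]],
    twitter_id: str,
    medium_id: str,
    instagram_id: str,
) -> bool:
    """Delete any previous document for this set of IDs and create a fresh one.

    Returns True if a previous document existed (returning user),
    False if this is the user's first visit.
    """
    key = (twitter_id, medium_id, instagram_id)
    existed = store.pop(key, None) is not None
    store[key] = {
        "twitter_id": twitter_id,
        "medium_id": medium_id,
        "instagram_id": instagram_id,
        "tweets": [],
        "instagram_posts": [],
        "blogs": [],
    }
    return existed
```

With a real MongoDB collection the same flow would be a delete followed by an insert on the same ID triple.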



Module 2 : aws-analyser

  • NodeJS Modules Used : aws-sdk, watson-developer-cloud, image-downloader, request, async, body-parser, fs, algorithmia etc
  • When aws-aggregator has successfully aggregated the data, the UI calls aws-analyser to enrich the data with emotion intelligence.
  • AWS Translate Sub-Module :
    • Motivation : Using the tweet language field, convert tweets in other languages to English.
    • Outcome : All tweets are now in English.
  • AWS Rekognition Sub-Module :
    • Motivation : If the user has tweeted an image in the tweet then,
      • We use detectText() to extract text in the image, such as slogans, messages or memes.
      • We use detectLabels() to identify the objects in an image or the activities going on in it.
      • We use recognizeCelebrities() to find whether any celebrities appear in the image (indicative of a fan/follower).
    • Outcome : The textual tweet content is enriched, and these insights are later used for personalised recommendations.
  • Watson - Tone Analyser Sub-Module :
    • Motivation : To analyse the tone behind the set of words tweeted by the user.
    • Outcome : Scores for different emotions (sad, happy, etc.) are added to each tweet object, and the emotion with the highest score is chosen as the prominent emotion.
  • AWS Comprehend for getting Named Entities and Keyphrases :
    • Motivation : It is applied to tweets to get the keywords and entities the user is influenced by.
    • Outcome : A keywords array and an entities array are added to the user's document.
  • AWS Polly acting as a personalised bot
    • Motivation : To narrate to the user the findings from their social media activity.
    • Outcome : A human voice that calms the user down and explains the entire SHADE engine's dashboard.
  • Watson - Personality insights Sub-Module :
    • Motivation : It is applied to blogs and Instagram hashtags to learn the personality attributes of the user.
    • Outcome : A personality insights array is added to the user's document, with each parent quality and its corresponding child qualities.
  • Custom ML Model and Algorithmia Model Sub-Module :
    • Motivation : To create a customised 2-layer network of models to understand the emotions involved in the image of each Instagram post.
    • First Layer Model : Algorithmia model, used to find whether the image contains any human faces. If yes, the facial emotions are extracted and associated with the image.
    • Second Layer Model : When the first layer model confirms that there is no human being in the image, we use the aesthetics of the image to determine the emotion, with a model trained on two sets of images: one characterized by colors like black, grey and dark shades of blue, and the other by much more vibrant colors. The first set signifies that the user is feeling low, while the second is an indicator of joy.
    • Outcome : Each instagram post object is updated with the associated emotion.
  • Process Flow :
    • When aws-aggregator confirms that the data has been aggregated in the DB, aws-analyser comes into action, performs all the analysis and updates the user document with the enriched analytical results.
    • Typical API Call looks like : aws-analyser.herokuapp.com/users/readProfile/amitabhtiwari3/oldirony/pandey_amita
    • Now the DB Document created for this user looks like:

aws-ana-DB1
aws-ana-DB2
aws-ana-DB3
aws-ana-DB4
aws-ana-DB5
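
Two pieces of decision logic from this module, choosing the prominent emotion of a tweet and falling back from the face model to the aesthetics model for an Instagram image, can be sketched as follows. The analyser itself is a NodeJS project; this Python sketch only illustrates the control flow, and `prominent_emotion`, `image_emotion` and the two model callbacks are hypothetical stand-ins for the real Tone Analyser output, Algorithmia model and custom model.

```python
from typing import Callable, Dict, Optional


def prominent_emotion(scores: Dict[str, float]) -> Optional[str]:
    """Return the emotion with the highest score, or None if there are no scores."""
    if not scores:
        return None
    return max(scores, key=scores.get)


def image_emotion(
    image_url: str,
    face_emotion: Callable[[str], Optional[str]],
    aesthetic_emotion: Callable[[str], str],
) -> str:
    """Two-layer lookup for the emotion of an image.

    Layer 1: face_emotion returns an emotion if a human face is found,
             otherwise None.
    Layer 2: aesthetic_emotion classifies the image by its colours and is
             only consulted when layer 1 finds no face.
    """
    from_face = face_emotion(image_url)
    if from_face is not None:
        return from_face
    return aesthetic_emotion(image_url)
```

Passing the models in as callables keeps the fallback order explicit and lets either layer be swapped without touching the other.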



Module 3 : aws-recommender

  • Python packages Used : numpy, pandas, gensim, nltk, sklearn, pyLDAvis, datetime, kmodes, pickle etc
  • After aws-analyser has updated the user document, the UI calls the recommender, based on the detected emotion, for pro tips from doctors and psychologists on fighting the adverse effects of emotions. This REST server acts as the recommendation engine and returns a set of 5 articles obtained from the LDA and TF-IDF models.
  • LDA - Latent Dirichlet Allocation Model :
    • Motivation : The emotion param of the API input is a piece of text describing the user's state. The LDA model applies topic modelling to its corpus of articles to find articles related to that text.
    • Outcome : 2 articles from LDA are added to the JSON response.
  • TF-IDF - TermFrequency - InverseDocumentFrequency Model :
    • Motivation : It represents articles by their term weights and uses cosine similarity between the input param and the vector representation of each article in the corpus, returning the top 3 articles with the highest matching scores.
    • Outcome : 3 articles from TF-IDF are added to the JSON response.
  • Process Flow :
    • Both models have been pickled, and at run time their pickled representations are used to come up with a set of 5 articles from the corpus that are most similar to the input param text received at the API endpoint.
    • Typical API Call looks like : http://aws-recommender.herokuapp.com/recommendations/fear
    • Snapshots : Recommendation Engine and REST-Response

aws-rec-DB1
aws-rec-DB2

Block Diagram of LDA Model

aws-rec-DB3

Block Diagram of TF-IDF Model

aws-rec-DB4
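
The TF-IDF step described above (cosine similarity between the input text and each article, keeping the top 3) can be illustrated with a small standard-library sketch. It is not the pickled model used by aws-recommender: the tokenizer, the smoothed IDF formula and the function names are all assumptions chosen to keep the example self-contained.

```python
import math
import re
from collections import Counter
from typing import Dict, List, Tuple


def tokenize(text: str) -> List[str]:
    """Lowercase the text and keep alphabetic tokens only."""
    return re.findall(r"[a-z]+", text.lower())


def build_idf(docs: List[List[str]]) -> Dict[str, float]:
    """Smoothed inverse document frequency for every term in the corpus."""
    n = len(docs)
    doc_freq = Counter(term for doc in docs for term in set(doc))
    return {t: math.log((1 + n) / (1 + df)) + 1.0 for t, df in doc_freq.items()}


def tfidf_vector(tokens: List[str], idf: Dict[str, float]) -> Dict[str, float]:
    """Sparse TF-IDF vector; terms missing from the corpus are ignored."""
    counts = Counter(t for t in tokens if t in idf)
    return {t: c * idf[t] for t, c in counts.items()}


def cosine(a: Dict[str, float], b: Dict[str, float]) -> float:
    """Cosine similarity between two sparse vectors."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def top_articles(query: str, articles: List[str], k: int = 3) -> List[Tuple[int, float]]:
    """Return (article index, similarity) for the k articles closest to the query."""
    docs = [tokenize(a) for a in articles]
    idf = build_idf(docs)
    query_vec = tfidf_vector(tokenize(query), idf)
    scored = [(i, cosine(query_vec, tfidf_vector(d, idf))) for i, d in enumerate(docs)]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]
```

Calling `top_articles("fear", corpus)` mirrors a request to the recommendations endpoint with `fear` as the emotion param, with the corpus supplied by the caller.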



Module 4 : Web APP : User Interface

  • Motivation : To provide a dashboard with data visualisations that help the user understand themselves and the real-world entities influencing their state of mind, to recommend videos, songs and articles based on the prominent emotion, and to suggest nearby places to explore and yoga asanas to practice to overcome difficult times.
  • After aws-analyser has updated the user document, the UI calls the recommender for articles and the YouTube APIs for songs and videos based on the user's current emotion. Google Chart APIs are called for data visualisations and Nearby APIs for places to eat, worship or explore natural beauty. Yoga asana carousels are created at run time depending on the user's state of mind.
  • Youtube APIs for videos and songs:
    • Motivation : To get funny content and relaxing music.
    • Outcome : 3 results each for videos and songs. Clicking on a result redirects the user to the YouTube site to play it.
  • aws-recommender APIs for articles:
    • Motivation : To get professional tips on how to deal with certain kind of emotions.
    • Outcome : 5 tips are presented on the dashboard.
  • Google Nearby APIs for Nearby places to eat,worship or of natural beauty:
    • Motivation : To cheer up people with depression, psychologists say it is important to get out of the house and visit places with good vibes to calm the mind. The app asks for permission to read the user's current location, then calls the Nearby APIs to get a list of restaurants, holy places and natural spots around that location.
    • Outcome : Lists of places. Clicking on any list item redirects to Google Maps with your current location as the source and the latitude and longitude of the clicked item as the destination, giving you a glimpse of how to reach it.
  • Carousels of images showing Yoga Asanas :
    • Motivation : There are different Yoga positions to practice depending on the emotion you are feeling.
    • Outcome : The matching positions are presented to the user depending on whether they are feeling happy, sad or disgusted.
  • Data Visualisations using Google Chart APIs
    • Motivation : To provide a dashboard that helps the user understand themselves better.
    • Outcome : Different kinds of graphs and pie-charts depicted below.
  • Process Flow :
    • UI does API calls and gets the dashboard populated with data from DB and external 3rd party APIs.
    • Few Snapshots : UI elements are as follows.
SHADE Dashboard

aws-rec-DB1
aws-rec-DB1
aws-rec-DB1
aws-rec-DB1
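
The Google Maps redirect described under the Nearby places feature (current location as source, clicked place as destination) could be built along these lines. The README does not show how the app constructs the link, so this is only one possible approach using the Google Maps URLs directions format; `directions_url` is a hypothetical helper name.

```python
from typing import Tuple
from urllib.parse import urlencode

# A coordinate is a (latitude, longitude) pair.
LatLng = Tuple[float, float]


def directions_url(origin: LatLng, destination: LatLng) -> str:
    """Build a Google Maps directions link from the user's location to a place."""
    params = {
        "api": 1,
        "origin": f"{origin[0]},{origin[1]}",
        "destination": f"{destination[0]},{destination[1]}",
    }
    # urlencode percent-encodes the comma between latitude and longitude.
    return "https://www.google.com/maps/dir/?" + urlencode(params)
```

The origin would come from the browser's location permission and the destination from the latitude and longitude of the list item returned by the Nearby APIs.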

SMS Service

aws-rec-DB1

Product Aggregator Service

aws-rec-DB1

Nearby Psychiatrist

aws-rec-DB1

Alexa Service

aws-rec-DB1

Chatbot Service

aws-rec-DB1