Project Specification
COMP7025 Social Media Intelligence
Aim
The Project requires us to analyse...

Question

Project Specification
COMP7025 Social Media Intelligence 
Aim
The Project requires us to analyse social media data using the knowledge obtained from this unit with assistance from a computer based statistical package. For this project, we will focus on Twitter.
Method
To complete this project:
1. Read through this specification
2. Complete the data analysis required by the specification
3. Write up your analysis using your favourite word processing/typesetting program, making sure that all of the working is shown and that is it presented well.
4. Include the student declaration text on the front page of your report. Please make sure that your name
and student number are clearly displayed on the front page.
5. Submit the report as a PDF by the due date.
Report Format
Once the required analysis is performed, write up the analysis as a report. Remember that the assessor will only see the report and will be marking the analysis based on your report. Therefore the report should contain a clear and concise description of the procedures carried out, the analysis of results, and any conclusions reached from the analysis.
The required analysis in this specification covers material presented in lectures and labs. Students should use the computer software R to carry out the required analysis and then present the results from the analysis in the report.
1
Marks
This project is worth 30 % of your final grade, and so the project will be marked out of 30. The project consists of six parts where each part contributes equally to your final mark.
There are five parts to the project, each will be marked using the following criteria:
Marks	Criteria Satisfied
0 The method does not lead to insightful analysis.
1 The method is flawed, but the analysis would have provided insight had the method been correct.
2 The correct method leads to partially correct results and analysis.
3 The correct method leads to correct results and analysis.
4 The correct method leads to correct results and analysis, with an insightful aim and conclusion.
5 The correct method leads to correct results and analysis, with an insightful aim and conclusion. Limitations of the analysis are identified and suggestions for further analysis are provided.
If a report is submitted late, the maximum mark it can achieve will be reduced by 10% (3 marks) per day. E.g., if a report is submitted five days late, it can receive at most 15 marks.
Declaration
The following declaration must be included in a clearly visible and readable place on the first page of the report.
By including this statement, I the author of this work, verify that:
· I hold a copy of this assignment that I can produce if the original is lost or damaged.
· I hereby certify that no part of this assignment/product has been copied from any other student’s work or from any other source except where due acknowledgement is made in the assignment.
· No part of this assignment/product has been written/produced for us by another person except where such collaboration has been authorised by the subject lecturer/tutor concerned.
· I am aware that this work may be reproduced and submitted to plagiarism detection software programs for the purpose of detecting possible plagiarism (which may retain a copy on its database for future plagiarism checking).
· I hereby certify that I have read and understand what the School of Computing and Mathematics defines as minor and substantial breaches of misconduct as outlined in the learning guide for this unit.
Note: An examiner or lecturer/tutor has the right not to mark this project report if the above declaration has not been added to the cover of the report.
Project Description
A social and behavioural research group at Western Sydney University is studying social activists. They have consulted you to investigate the flow of information regarding environmental activist Greta Thunberg on Twit- ter.  Researchers have provided a set of tasks below that need completion.  The results are to be presented at the International Social and Behaviour Change Communication (SBCC) Summit.
Perform this analysis using R with the rtweet and igraph libraries. Use the rtweet documentation to find functions that will assist your analysis:
· https://cran.r-project.org/web/packages/rtweet/vignettes/intro.html
· https://cran.r-project.org/web/packages/rtweet/rtweet.pdf
1 Followed by Greta
Find 12 people followed by Greta that have the most followers. Use only people, not any company’s twitter handles. Examine the twitter accounts and summarise the types of people.
2 Followers of Greta
Find the 12 people who follow Greta and have the most followers and examine if they have a positive or negative relationship with Greta based on their tweets. Examine their twitter accounts and summarise the types of people.
3 Bypassing Greta
Plot the graph containing people followed by Greta and 12 followers. Identify if any of the found following or followers are friends with each other and add these edges to the graph. Then determine if any of the following and followers should be friends, based on their background, and add those edges to the graph.
4 Graph Statistics
Compute the diameter and density of the graph, and neighbourhood overlap of each edge and determine which nodes have the greatest social capital. State if the results are obvious from the graph structure and why.
5 Graph Homophily
Compute if there is homophily in the graph. To do this, label each node as either a supporter or non-supporter of Greta using the information gathered in parts 1, 2 and 3. Write out the hypotheses, the test statistic and a conclusions of the test. Use a significance level of α = 0.05.
6 Structural Balance
Finally, determine if the signed network is weakly balanced (using hierarchical clustering) and identify if any within or between signed relationships are not as expected.   To perform this analysis,  first label all existing edges as either positive or negative, based on their association to Greta.
Write up a report containing your code and analysis of the data with each section clearly labelled. Clearly annotate your code and make sure to state any conclusions you make from each piece of analysis. The report is being marked using the marking criteria, so make sure that each piece of analysis covers all of the criteria. Remember that you are examining the relationship of twitter users to Greta, so make sure that the conclusion of each section refers back to this.
 
##ASSIGNMENT SOCIAL MEDIA INTELLIGENCE COMP7025
##STUDENT_NAME : SUHAS THOTA
##STUDENT_ID : XXXXXXXXXX
version
install.packages("rtweet")
install.packages("base64enc")
install.packages("httpuv")
install.packages("rtweet")
install.packages("dplyr")
install.packages("tidytext")
install.packages("tidyr")
install.packages("textdata")
library("rtweet")
library("base64enc")
library("httpuv")
library("magrittr")
library("dplyr")
library("textdata")
#app="1657696929873301504suhasthota1"
#api_key="1ag4NiBTizl4S5vRf40jsYFhH"
#api_secret_key="kNPoy4r1spzb7ZaZaB7RoDjrTWucPHxiDdjZDDEDjwGgYR3v9f"
#acc_token=" XXXXXXXXXXYcpXyvRhjzdELJwDxUWPBXwYkwgEME6u2afVMbc"
#acc_secret_token="4Yutcn8OaSvn6i7xPEZaVTqurWKmeRzVcWH7Vv6pH184t"
### Using the above keys resulted in an API error [403] from Twitter; to avoid this, 
##I used the keys supplied in the 6a solutions. Twitterkeys.txt
#Authenticating with Twitter API Credentials
app='SMIProject_2023'
api_key='AagjVq96hOMojkDdc0fz8OJPI'
api_secret_key='DWrqQZWe2QDabVKDT5nVped8jqDk6UrPGAmJM74xX1xMIVL6Cf'
acc_token=' XXXXXXXXXX1fvDtoNyoah7sq92QWFZ8GGsAkmmSl1xWBSgb3E3'
acc_secret_token='N29dRKpzRSgt7vCcVj8AFCuwfHUROGStK15X7HMeBWvg4'
#generate token
create_token(
  app=app,
  consumer_key=api_key,
  consumer_secret=api_secret_key,
  access_token=acc_token,
  access_secret=acc_secret_token
)
#Retrieving tweets
tweets=search_tweets("Greta Thunberg",n=5,include_rts=FALSE)
print(tweets)
#####################################################################################################################################################################################################
#######################Q1.)Followed by Greta Thunberg ###############################################################################################################################################
#####################################################################################################################################################################################################
# Get Greta Thunberg's friends (people followed by Greta)
friends_data %
  group_by(name, location, screen_name, description) %>%
  summarize(Count = n()) %>%
  ungroup()
print(summary_friends)
################################################################################################################################################################
################################ Q2.) Greta Thunberg Followers #################################################################################################
################################################################################################################################################################
library(tidytext)
#Loads the tidytext package, which provides functions for text mining and analysis.
library(dplyr)
# Loads the dplyr package, which provides tools for data manipulation and transformation.
library(tidyr)
#Loads the tidyr package, which provides functions for data tidying and reshaping.
#list of Greta Thunberg's followers
follower_ids %
  select(in_reply_to_screen_name,text)%>%
  unnest_tokens(word, text) %>%
  inner_join(get_sentiments("bing")) %>%
  count(in_reply_to_screen_name, sentiment) %>%
  spread(sentiment, n, fill = 0)
View(follower_sentiments)
#Performs sentiment analysis on the follower tweets. It selects the relevant columns 
#(in_reply_to_screen_name and text), tokenizes the text using unnest_tokens, joins the sentiment lexicon using inner_join and
#calculates the count of each sentiment for each follower.
#Finally, it spreads the sentiment counts into separate columns using spread.
summary_followers_1

Pratibha · Accepted Answer

Text scraping and Analysis
Text scraping and Analysis
2023-06-10
API Setup
library("rtweet")
## Warning: package 'rtweet' was built under R version 4.2.3
library("base64enc")
library("httpuv")
## Warning: package 'httpuv' was built under R version 4.2.3
library("magrittr")
library("dplyr")
## Warning: package 'dplyr' was built under R version 4.2.3
## 
## Attaching package: 'dplyr'
## The following objects are masked from 'package:stats':
## 
##     filter, lag
## The following objects are masked from 'package:base':
## 
##     intersect, setdiff, setequal, union
library("textdata")
## Warning: package 'textdata' was built under R version 4.2.3
#Authenticating with Twitter API Credentials
app='GretaProject_2023'
api_key='AagjVq96hOMojkDdc0fz8OJPI'
api_secret_key='DWrqQZWe2QDabVKDT5nVped8jqDk6UrPGAmJM74xX1xMIVL6Cf'
acc_token='124194957-1fvDtoNyoah7sq92QWFZ8GGsAkmmSl1xWBSgb3E3'
acc_secret_token='N29dRKpzRSgt7vCcVj8AFCuwfHUROGStK15X7HMeBWvg4'
#generate token
create_token(
  app=app,
  consumer_key=api_key,
  consumer_secret=api_secret_key,
  access_token=acc_token,
  access_secret=acc_secret_token
)
## Warning: `create_token()` was deprecated in rtweet 1.0.0.
## ℹ See vignette('auth') for details
## This warning is displayed once every 8 hours.
## Call `lifecycle::last_lifecycle_warnings()` to see where this warning was
## generated.
## Saving auth to
## 'C:\Users\Pratibha\AppData\Roaming/R/config/R/rtweet/create_token.rds'
1. Followed By Greta
# Get the friends (people followed) by Greta Thunberg
set.seed(123)
greta_friends                        
## 1 GretaThunberg 42643305           
## 2 GretaThunberg 1450363558483709954
## 3 GretaThunberg 1663643377215127553
## 4 GretaThunberg 1513242630217519104
## 5 GretaThunberg 1645750061438205952
## 6 GretaThunberg 1461716693437214722
# Extract the friend IDs
friend_ids                             
##  1 4.26e 7 42643… Hong… honghoangc… "Ho Chi…   http… "Environme… FALSE    
##  2 1.45e18 14503… RePl… letsreplan… "Europe"   http… "We’re a c… FALSE    
##  3 1.66e18 16636… Peop… PeopleFFut… ""           ""          FALSE    
##  4 1.51e18 15132… Scie… SR_Netherl… ""         http… "Scientist… FALSE    
##  5 1.65e18 16457… Frid… F4F_ROSA    "Nepal"    http… "FFF_South… FALSE    
##  6 1.46e18 14617… Nich… OmonukN     "Planet…   http… "A Climate… FALSE    
##  7 1.36e18 13632… Ende… ende_gelan… "Brunsb…   http… "Climate j… FALSE    
##  8 1.60e18 16023… Kari… k_nuttipil… ""           ""          FALSE    
##  9 1.66e18 16571… XR M… XRMothersUg "Uganda"     "We refuse… FALSE    
## 10 8.31e 8 83100… Dr A… PerrinAbi   "York, …     "Molecular… FALSE    
## # ℹ 2,855 more rows
## # ℹ 14 more variables: verified , followers_count ,
## #   friends_count , listed_count , favourites_count ,
## #   statuses_count , created_at , profile_banner_url ,
## #   profile_image_url_https , default_profile ,
## #   default_profile_image , withheld_in_countries , entities ,
## #   withheld_scope 
## ℹ Tweets data at tweets_data()
# Sort the friends based on their follower counts
top_friends                             
##  1  8.13e5 813286 Bara… BarackObama "Washin…   http… "Dad, husb… FALSE    
##  2  1.88e7 18839… Nare… narendramo… "India"    http… "Prime Min… FALSE    
##  3  1.58e7 15846… Elle… EllenDeGen… "Califo…   http… "Comedian,… FALSE    
##  4  7.59e5 759251 CNN   CNN         ""         http… "It’s our … FALSE    
##  5  8.07e5 807095 The … nytimes     "New Yo…   http… "News tips… FALSE    
##  6  4.72e8 47174… PMO … PMOIndia    "India"    http… "Office of… FALSE    
##  7  1.94e7 19397… Opra… Oprah       ""         http… ""          FALSE    
##  8  7.42e5 742143 BBC … BBCWorld    "London…   http… "News, fea… FALSE    
##  9  1.81e8 18050… Inst… instagram   ""         http… "Discover … FALSE    
## 10  1.34e9 13398… Hill… HillaryCli… "New Yo…   http… "2016 Demo… FALSE    
## 11  2.87e7 28706… P!nk  Pink        "los an…   http… "My new al… FALSE    
## 12  1.

Solution

Answer To This Question Is Available To Download

Related Questions & Answers

Submit New Assignment