Social Network Analysis
Statistician
Statistical Modeling of Networks
R enthusiast
and you?
Tuesdays 13:30-15:00
Lecture, discussions and live coding
Wednesdays 10:00-11:30
Lab, work sheets and self-study
Research Design
Data Collection
Methodology
(and some theory)
conventional research methods are often individual based and our models tend to model relations between variables
but nature and culture is structured as networks
Position within a network is important for predicting outcomes
Simmel, 1908/1971:
Society exists where a number of individuals enter into interaction
Durkheim, 1974:
Society has for its substratum the mass of associated individuals. The system which they form by uniting together […] their channels of communication [are] the basis from which social life is raised
Marx, 1973:
Society does not consist of individuals, but expresses the sum of interrelations, the relations within which these individual stand
…but conventional research methods are often individual based and our models tend to model relations between variables, not people
David eats predominantly vegetarian food
individual-based:
network-based:
…but conventional research methods are often individual based and our models tend to model relations between variables, not people
Someone close to you is unhappy…
…will you remain unaffected?
…but conventional research methods are often individual based and our models tend to model relations between variables, not people
equal opportunities based on our individual qualities…
…or on our personal networks?
atomic data
individuals or entities
dyadic data
dependent pairs of individuals (e.g. couples)
but treated as independent entities
networks
interdependent and overlapping dyads
usual (statistical) independence assumptions do not hold
Social network analysis is motivated by a structural intuition based on ties linking social actors
It is grounded in systematic empirical data
It draws heavily on graphic imagery
It relies on the use of mathematical and/or computational models
Georg Simmel (1858–1918)
If there is to be a science whose subject matter is society and nothing else, it must exclusively investigate these interactions, these kinds and forms of sociation.
Jacob Moreno (1889–1974)
Jacob Moreno (1889–1974)
Alex Bavelas (1913-1993)
Elizabeth Bott (1924–2016)
Bott hypothesis
the density of a husband’s and wife’s separate social networks is positively associated with marital role segregation
Barabási/Watts & Strogatz
I expressed the pious hope that […] our colleagues from physics would simply join in the collective enterprise. That hope, however, was not immediately realized. These physicists, new to social network analysis, did not read our literature; they acted as if our sixty years of effort amounted to nothing… (L. Freeman)
dyad level
Fundamental unit of network data collection
(“Does sharing offices lead to friendship?”)
node level
Aggregation of dyad level measurement
(“Do actors with more friends have a stronger immune system?”)
network level
Assessing overall structure of a network
(“Do well connected networks diffuse ideas faster?”)
more levels are possible (triads, groups, …)
Relational states
Relational events
undirected
symmetric relation
directed
asymmetric relation, but can be bi-directional
valued
strength of relation, frequency of contact, etc.
signed
positive and negative relations
or a mixture thereof
Network variables as independent/explanatory
Using network theory to explain the consequences of network properties
social capital, brokerage, adoption of innovation
Network variables as dependent/outcomes
Using ______ theory to explain the antecendents of a network
homophily, balance theory
type | independent | dependent | ex. hypotheses |
---|---|---|---|
network theory | node level network property | actor attribute | centrality ⟹ performance |
theory of networks | actor attribute | node level network property | good looks ⟹ centrality |
type | independent | dependent | ex. hypotheses |
---|---|---|---|
network theory | network tie | attribute similarity | friends ⟹ similar interest |
theory of networks | attribute similarity | network tie | smoking ⟹ friendship |
Strong ties have redundant information for individuals
Weak ties spread information between groups
A person’s chances of becoming obese increased by 57% if he or she had a friend who became obese in a given interval […] These effects were not seen among neighbors in the immediate geographic location.
The correlation between the connectivity and indispensability of a given protein confirms that, despite the importance of individual biochemical function and genetic redundancy, the robustness against mutations in yeast is also derived from the organization of interactions and the topological positions of individual proteins.
open source
cross-platform
CRAN
reproducibility
more than SNA
community
igraph
sna
CRAN packages that depend on igraph, network, and graph
use igraph
if
use sna
if
Social licking among cows
Link to paper
Most methods rely on concepts from graph theory