Data analysis and modeling pipelines for controlled networked social science experiments
Artificial intelligence
Information Systems and Management
Scientific Workflows
Science
Data analysis
Social Sciences
Environmental engineering
Experimental and Cognitive Psychology
02 engineering and technology
Decision Sciences
Data science
Data model (GIS)
Engineering
0202 electrical engineering, electronic engineering, information engineering
Humans
Network Analysis of Psychopathology and Mental Disorders
Psychology
Social Behavior
Data mining
Electronic Data Processing
Management and Reproducibility of Scientific Workflows
Software engineering
Q
R
FOS: Environmental engineering
Statistical and Nonlinear Physics
Models, Theoretical
Scripting language
Computer science
Programming language
FOS: Psychology
Physics and Astronomy
Physical Sciences
Medicine
Statistical Mechanics of Complex Networks
Pipeline (software)
Pipeline transport
Computational Research
Algorithms
Software
Research Article
Data modeling
DOI:
10.1371/journal.pone.0242453
Publication Date:
2020-11-24T21:28:08Z
AUTHORS (16)
ABSTRACT
There is large interest in networked social science experiments for understanding human behavior at-scale. Significant effort is required to perform data analytics on experimental outputs and for computational modeling of custom experiments. Moreover, experiments and modeling are often performed in a cycle, enabling iterative experimental refinement and data modeling to uncover interesting insights and to generate/refute hypotheses about social behaviors. The current practice for social analysts is to develop tailor-made computer programs and analytical scripts for experiments and modeling. This often leads to inefficiencies and duplication of effort. In this work, we propose a pipeline framework to take a significant step towards overcoming these challenges. Our contribution is to describe the design and implementation of a software system to automate many of the steps involved in analyzing social science experimental data, building models to capture the behavior of human subjects, and providing data to test hypotheses. The proposed pipeline framework consists of formal models, formal algorithms, and theoretical models as the basis for the design and implementation. We propose a formal data model, such that if an experiment can be described in terms of this model, then our pipeline software can be used to analyze data efficiently. The merits of the proposed pipeline framework is elaborated by several case studies of networked social science experiments.
SUPPLEMENTAL MATERIAL
Coming soon ....
REFERENCES (108)
CITATIONS (2)
EXTERNAL LINKS
PlumX Metrics
RECOMMENDATIONS
FAIR ASSESSMENT
Coming soon ....
JUPYTER LAB
Coming soon ....