Published on 01 January 2016
Supplement 1. Raw survey results and data cleaning/processing scripts for R.
File List
DataCleaning_20120105.R (md5:7c3b115561b5c687e34c7ffa63896a82)
DataTrimming_20120105.R (md5:77f0bbe669da955c81d9b5fb2d3e2d3e)
AdvancedSpreadsheet_20110526.csv (md5:db607c6c3238ffb45effa2c97ec40b9a)
VariableRenaming_20110711.csv (md5:ac7519d78d3eac3d2ccf82ef94f1af80)
ParamTable.csv (md5:ff3fa18d83a3a6b860462c2955a5c8c9)

Description

How the files should be used:

Scripts (2)

DataCleaning_20120105.R is an R script that cleans the data prior to analysis. It removes spaces, substitutes text for codes, removes duplicate schools, and converts the survey questions and answers into simpler parameter names without numbers, spaces, or symbols. The script is heavily annotated to help the user understand what is being done to the data files. It produces the file cleandata_[date].Rdata, which is loaded by DataTrimming_20120105.R.

DataTrimming_20120105.R is an R script that trims extraneous variables not used in the final analyses. Some variables are combined as needed, and NAs (no answers) are removed. The file is heavily annotated. It produces trimdata_[date].Rdata, which was imported into Excel for summary statistics.

Data files (3)

AdvancedSpreadsheet_20110526.csv is the output file from the SurveyMonkey online survey tool used for this project. It is a .csv sheet with the complete set of survey data, although some data (e.g., open-ended responses, institution names) have been removed to prevent schools and/or instructors from being identifiable. This file is read into DataCleaning_20120105.R for cleaning and editing.

VariableRenaming_20110711.csv is read by the DataCleaning_20120105.R script to convert the survey questions and answers into simple parameter names without numbers, spaces, or symbols.

ParamTable.csv lists the parameter names used for analysis and their value codes. It can be used to interpret the outputs of the scripts above (cleandata_[date].Rdata and trimdata_[date].Rdata).
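
Before running the scripts, it may be worth confirming that the downloaded files match the MD5 checksums given in the file list. The following is a minimal sketch in R, assuming the five files sit in the current working directory. The helper verify_md5() is a hypothetical name introduced here for illustration and is not part of the supplement; tools::md5sum() ships with base R.

```r
# Expected MD5 checksums, copied from the file list of this supplement.
expected_md5 <- c(
  "DataCleaning_20120105.R"          = "7c3b115561b5c687e34c7ffa63896a82",
  "DataTrimming_20120105.R"          = "77f0bbe669da955c81d9b5fb2d3e2d3e",
  "AdvancedSpreadsheet_20110526.csv" = "db607c6c3238ffb45effa2c97ec40b9a",
  "VariableRenaming_20110711.csv"    = "ac7519d78d3eac3d2ccf82ef94f1af80",
  "ParamTable.csv"                   = "ff3fa18d83a3a6b860462c2955a5c8c9"
)

# Hypothetical helper (not part of the supplement): compares each file's
# MD5 checksum against the expected value and reports one status per file.
verify_md5 <- function(expected, dir = ".") {
  paths  <- file.path(dir, names(expected))
  # md5sum() returns NA for files that do not exist or cannot be read.
  actual <- unname(tools::md5sum(paths))
  status <- ifelse(
    is.na(actual), "missing",
    ifelse(tolower(actual) == tolower(unname(expected)), "ok", "mismatch")
  )
  data.frame(file = names(expected), status = unname(status),
             stringsAsFactors = FALSE)
}

# Assumption: the files were downloaded into the working directory.
print(verify_md5(expected_md5))
```

A status of "mismatch" would suggest a corrupted or altered download, while "missing" only means the file was not found in the directory that was checked.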
Publication Details
Subfield: Artificial Intelligence
Field: Computer Science
Domain: Physical Sciences
Confidence Score: 96%
Source: OpenAlex