You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Using existing data for re-analysis or new synthesis research is challenging because of the large amounts of time and effort needed to clean the data. Having best practices guidelines would streamline this process for data users and reduce the time spent cleaning data before analysis can proceed. These guidelines would ideally include a script that addresses the most common data cleaning tasks/problems/issues, as well as a document (or maybe another script?) that gives more details of specific problems (ex: scraping data from the web, using NETCDF files, extracting data from Excel workbook). This would all be done in R.
Common issues include:
Capitalization, misspelling, white space, dots, missing values not represented by NA, abbreviated text, different names for the same sites/species, metadata in the data table, etc.
The text was updated successfully, but these errors were encountered:
Using existing data for re-analysis or new synthesis research is challenging because of the large amounts of time and effort needed to clean the data. Having best practices guidelines would streamline this process for data users and reduce the time spent cleaning data before analysis can proceed. These guidelines would ideally include a script that addresses the most common data cleaning tasks/problems/issues, as well as a document (or maybe another script?) that gives more details of specific problems (ex: scraping data from the web, using NETCDF files, extracting data from Excel workbook). This would all be done in R.
Common issues include:
Capitalization, misspelling, white space, dots, missing values not represented by NA, abbreviated text, different names for the same sites/species, metadata in the data table, etc.
The text was updated successfully, but these errors were encountered: