[CODATA-international] Cost of Data Wrangling

Chris Hunter chris at gigasciencejournal.com
Fri Dec 11 05:01:11 EST 2020

Hi Ernie 
I don't know if this is the source you are referring to, but it does give some evidence for the claim that 80% of the work is data preparation. It appears to be based on a survey of just 80 data scientists, though, so a pretty small 'n', and it's over 4 years old. 



Chris Hunter 
Lead BioCurator, GigaDB 
GigaScience, BGI-HK 
Tel: (44)07429063514 
ORCID: 0000-0002-1335-0881 
Web: www.gigadb.org 

----- Original Message -----

From: "Ernie Boyko" <boykern at yahoo.com> 
To: "CODATA International" <codata-international at lists.codata.org> 
Sent: Friday, 11 December, 2020 4:57:54 AM 
Subject: [CODATA-international] Cost of Data Wrangling 

Hi all 
A study conducted for the EU (?) is often quoted as the source of a statement along the lines of: 

* 80% of effort in data intensive research is used on data wrangling; conservative estimate of 10.2 Bn Euro. 
Can anyone on this list point me to this study? 

Many thanks in advance. I am trying to make the case for the benefits of developing a career stream for data wranglers/data stewards. 

Cheers, Ernie 

“Data is the new oil.” — Clive Humby 
“Data really powers everything that we do.” – Jeff Weiner 

CODATA-international mailing list 
CODATA-international at lists.codata.org 

The CODATA International list is for announcement of activities, events and outputs by CODATA and by other organisations and initiatives. It is also for discussion of all issues related to data. It is an open subscription list with only lightweight moderation to remove spam. Messages posted on the list by third parties do not necessarily imply endorsement by CODATA.