<div dir="ltr"><p style="box-sizing:border-box;margin:0px;padding:0px 0px 1em;border:0px;outline:0px;font-size:14px;vertical-align:baseline;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial;color:rgb(102,102,102);font-family:"Open Sans",Arial,sans-serif"><img class="gmail-alignright" src="https://codata.org/wp-content/uploads/2021/05/sparklyr.png" style="box-sizing: border-box; margin: 0px 0px 0px 15px; padding: 0px; border: 0px; outline: 0px; text-size-adjust: 100%; vertical-align: baseline; background: transparent; max-width: 100%; height: auto; float: right; display: inline;">An interactive workshop on the basics of Spark in R and on how to include lazy evaluation in a data analysis workflow. The session covers the basic concepts of Spark, MapReduce, lazy evaluation, and distributed processing, and shows how they can be applied in data science projects using local resources, such as a personal computer. The workshop will include live exercises with the participants. 
The focus of the workshop is to share a series of easily applicable tips for including distributed processing in participants' work with the resources they have available.</p><h3 style="box-sizing:border-box;margin:0px;padding:0px 0px 10px;border:0px;outline:0px;font-size:22px;vertical-align:baseline;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial;color:rgb(51,51,51);line-height:1em;font-weight:500;font-family:"Open Sans",Arial,sans-serif"><b style="box-sizing:border-box;margin:0px;padding:0px;border:0px;outline:0px;vertical-align:baseline;background:transparent">Abstract:</b></h3><p style="box-sizing:border-box;margin:0px;padding:0px 0px 1em;border:0px;outline:0px;font-size:14px;vertical-align:baseline;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial;color:rgb(102,102,102);font-family:"Open Sans",Arial,sans-serif">Big Data tools become useful when the scale of the data preparation or analysis exceeds the capacity of a regular computer running a sequential workflow. This workshop will explore questions such as how Big Data and the use of Spark have helped improve the general dynamics of data science in our institution. 
Data science projects are using increasingly large data sources, and Big Data tools such as Spark are developing efficient interfaces to the most popular programming languages in the field, such as R and Python.</p><p style="box-sizing:border-box;margin:0px;padding:0px 0px 1em;border:0px;outline:0px;font-size:14px;vertical-align:baseline;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial;color:rgb(102,102,102);font-family:"Open Sans",Arial,sans-serif">Integrating these tools can be natural and easy for users. The sparklyr package makes distributed processing and lazy evaluation convenient, letting users optimize the available computational resources while still following a natural and simple data analysis workflow, from storage through exploratory data analysis to modeling. This workshop aims to share the basics of including Spark in R to make better use of available resources.</p><p style="box-sizing:border-box;margin:0px;padding:0px 0px 1em;border:0px;outline:0px;font-size:14px;vertical-align:baseline;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial;color:rgb(102,102,102);font-family:"Open Sans",Arial,sans-serif"><b style="box-sizing:border-box;margin:0px;padding:0px;border:0px;outline:0px;vertical-align:baseline;background:transparent">Session objective:</b> Share practical examples of how to use Spark with R in a data science project and show how it can make a large process simpler and more efficient.</p><h3 style="box-sizing:border-box;margin:0px;padding:0px 0px 
10px;border:0px;outline:0px;font-size:22px;vertical-align:baseline;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial;color:rgb(51,51,51);line-height:1em;font-weight:500;font-family:"Open Sans",Arial,sans-serif"><b style="box-sizing:border-box;margin:0px;padding:0px;border:0px;outline:0px;vertical-align:baseline;background:transparent">Date and time:</b></h3><ul style="box-sizing:border-box;margin:0px;padding:0px 0px 23px 1em;border:0px;outline:0px;font-size:14px;vertical-align:baseline;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial;list-style-position:initial;line-height:26px;color:rgb(102,102,102);font-family:"Open Sans",Arial,sans-serif"><li style="box-sizing:border-box;margin:0px;padding:0px;border:0px;outline:0px;vertical-align:baseline;background:transparent">Session 1: 1 hour, 18 June 2022; 6:00 pm IST / 6:30 am Costa Rica (GMT-6)</li><li style="box-sizing:border-box;margin:0px;padding:0px;border:0px;outline:0px;vertical-align:baseline;background:transparent">Session 2: 1 hour, 25 June 2022; 6:00 pm IST / 6:30 am Costa Rica (GMT-6)</li><li style="box-sizing:border-box;margin:0px;padding:0px;border:0px;outline:0px;vertical-align:baseline;background:transparent">Session 3: 1 hour, 02 July 2022; 6:00 pm IST / 6:30 am Costa Rica (GMT-6)</li></ul><p style="box-sizing:border-box;margin:0px;padding:0px 0px 1em;border:0px;outline:0px;font-size:14px;vertical-align:baseline;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial;color:rgb(102,102,102);font-family:"Open Sans",Arial,sans-serif"><b style="box-sizing:border-box;margin:0px;padding:0px;border:0px;outline:0px;vertical-align:baseline;background:transparent">Intended Audience:</b> This session will focus on early career researchers (ECRs) in 
the CODATA Connect and community pipeline. The audience will be drawn from different data and research ecological flows. Experience with R or basic programming is preferred.</p><p style="box-sizing:border-box;margin:0px;padding:0px 0px 1em;border:0px;outline:0px;font-size:14px;vertical-align:baseline;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial;color:rgb(102,102,102);font-family:"Open Sans",Arial,sans-serif"><b style="box-sizing:border-box;margin:0px;padding:0px;border:0px;outline:0px;vertical-align:baseline;background:transparent">Prerequisite: </b>A computer with R installed.</p><p style="box-sizing:border-box;margin:0px;padding:0px 0px 1em;border:0px;outline:0px;font-size:14px;vertical-align:baseline;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial;color:rgb(102,102,102);font-family:"Open Sans",Arial,sans-serif"><b style="box-sizing:border-box;margin:0px;padding:0px;border:0px;outline:0px;vertical-align:baseline;background:transparent">Topic Organizer: </b>CODATA Connect (Mariana, Shaily, Felix)</p><p style="box-sizing:border-box;margin:0px;padding:0px 0px 1em;border:0px;outline:0px;font-size:14px;vertical-align:baseline;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial;color:rgb(102,102,102);font-family:"Open Sans",Arial,sans-serif"><b style="box-sizing:border-box;margin:0px;padding:0px;border:0px;outline:0px;vertical-align:baseline;background:transparent">Number of Participants:</b> This will be an online workshop.</p><p style="box-sizing:border-box;margin:0px;padding:0px 0px 
1em;border:0px;outline:0px;font-size:14px;vertical-align:baseline;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial;color:rgb(102,102,102);font-family:"Open Sans",Arial,sans-serif">We will have 10 to 20 participants.</p><p style="box-sizing:border-box;margin:0px;padding:0px 0px 1em;border:0px;outline:0px;font-size:14px;vertical-align:baseline;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial;color:rgb(102,102,102);font-family:"Open Sans",Arial,sans-serif">To register, please fill in the Google Form: <a href="https://forms.gle/PxXrbSsPGnFfi6K56" target="_blank" rel="noopener" style="box-sizing:border-box;margin:0px;padding:0px;border:0px;outline:0px;vertical-align:baseline;background:transparent;color:rgb(46,163,242);text-decoration-line:none">https://forms.gle/PxXrbSsPGnFfi6K56</a></p><p style="box-sizing:border-box;margin:0px;padding:0px 0px 1em;border:0px;outline:0px;font-size:14px;vertical-align:baseline;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial;color:rgb(102,102,102);font-family:"Open Sans",Arial,sans-serif">Or send a statement of interest by email to <a href="mailto:codataconnect@codata.org" target="_blank" rel="noopener" style="box-sizing:border-box;margin:0px;padding:0px;border:0px;outline:0px;vertical-align:baseline;background:transparent;color:rgb(46,163,242);text-decoration-line:none">codataconnect@codata.org</a> with the email subject: <b style="box-sizing:border-box;margin:0px;padding:0px;border:0px;outline:0px;vertical-align:baseline;background:transparent">Application for Workshop on </b><b style="box-sizing:border-box;margin:0px;padding:0px;border:0px;outline:0px;vertical-align:baseline;background:transparent">Introduction to Spark with R 
and Lazy evaluation </b></p><p style="box-sizing:border-box;margin:0px;padding:0px 0px 1em;border:0px;outline:0px;font-size:14px;vertical-align:baseline;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial;color:rgb(102,102,102);font-family:"Open Sans",Arial,sans-serif"><b style="box-sizing:border-box;margin:0px;padding:0px;border:0px;outline:0px;vertical-align:baseline;background:transparent">The application should include your name, date of birth, country, city, highest level of education, area of interest, a statement of why you would like to participate in the workshop (200 words), and whether you will attend all 3 sessions.</b></p><p style="box-sizing:border-box;margin:0px;padding:0px;border:0px;outline:0px;font-size:14px;vertical-align:baseline;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial;color:rgb(102,102,102);font-family:"Open Sans",Arial,sans-serif"><b style="box-sizing:border-box;margin:0px;padding:0px;border:0px;outline:0px;vertical-align:baseline;background:transparent">Send your application by 5 June 2022.</b></p><div><br></div>-- <br><div dir="ltr" class="gmail_signature" data-smartmail="gmail_signature"><div dir="ltr"><div><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr" style="font-size:small"><div style="font-size:12.8px"><div style="color:rgb(0,0,0)">___________________________<br></div></div></div><div dir="ltr" style="font-size:small"><br></div><div dir="ltr" style="font-size:small">Asha Law | Program Assistant, CODATA | <a href="http://www.codata.org/" style="color:rgb(17,85,204)" target="_blank">http://www.codata.org</a></div><div dir="ltr" style="font-size:small"><br></div><div dir="ltr" style="font-size:small">E-Mail: <a href="mailto:asha@codata.org" style="color:rgb(17,85,204)" 
target="_blank">asha@codata.org</a><br></div><div dir="ltr" style="font-size:small">Tel (Office): +33 1 45 25 04 96</div><div dir="ltr" style="font-size:small"><br></div><div dir="ltr" style="font-size:small">CODATA (Committee on Data of the International Council for Science), 5 rue Auguste Vacquerie, 75016 Paris, FRANCE</div></div></div></div></div></div></div></div></div></div></div></div>