{"id":2633,"date":"2018-03-28T09:50:43","date_gmt":"2018-03-28T09:50:43","guid":{"rendered":"https:\/\/devbloglavaprotocols.nityo.in\/5-skills-every-data-scientist-will-need-for-their-job-in-2018\/"},"modified":"2018-03-28T09:50:43","modified_gmt":"2018-03-28T09:50:43","slug":"5-skills-every-data-scientist-will-need-for-their-job-in-2018","status":"publish","type":"post","link":"https:\/\/www.lavaprotocols.com\/the-cloud-blog\/5-skills-every-data-scientist-will-need-for-their-job-in-2018\/","title":{"rendered":"5 Skills Every Data Scientist Will Need For Their Job in 2018"},"content":{"rendered":"<div style=\"margin-top: 0px; margin-bottom: 0px;\" class=\"sharethis-inline-share-buttons\" ><\/div><p><img decoding=\"async\" alt=\"\" aperture\":\"0\",\"credit\":\"\",\"camera\":\"\",\"caption\":\"\",\"created_timestamp\":\"0\",\"copyright\":\"\",\"focal_length\":\"0\",\"iso\":\"0\",\"shutter_speed\":\"0\",\"title\":\"\",\"orientation\":\"0\"}\"=\"\" class=\"size-full wp-image-9389 aligncenter\" data-attachment-id=\"9389\" data-comments-opened=\"0\" data-image-caption=\"\" data-image-description=\"\" data-image-meta=\"{\" data-image-title=\"5skillsdatascientist2018_1022x457\" data-large-file=\"https:\/\/i0.wp.com\/www.lavaprotocols.com\/wp-content\/uploads\/2018\/03\/5skillsdatascientist2018_1022x457.jpg?fit=1022%2C457&amp;ssl=1\" data-medium-file=\"https:\/\/i0.wp.com\/www.lavaprotocols.com\/wp-content\/uploads\/2018\/03\/5skillsdatascientist2018_1022x457.jpg?fit=300%2C134&amp;ssl=1\" data-orig-file=\"https:\/\/i0.wp.com\/www.lavaprotocols.com\/wp-content\/uploads\/2018\/03\/5skillsdatascientist2018_1022x457.jpg?fit=1022%2C457&amp;ssl=1\" data-orig-size=\"1022,457\" data-permalink=\"https:\/\/www.lavaprotocols.com\/2018\/03\/28\/5-skills-every-data-scientist-will-need-job-2018\/5skillsdatascientist2018_1022x457\/\" data-recalc-dims=\"1\" height=\"457\" loading=\"lazy\" sizes=\"auto, (max-width: 1022px) 100vw, 1022px\" src=\"https:\/\/www.lavaprotocols.com\/the-cloud-blog\/wp-content\/uploads\/imported-from-hubspot\/Imported_Blog_Media\/5skillsdatascientist2018_1022x457-1.jpg\" srcset=\" 1022w,  300w,  768w,  610w\" width=\"1022\"\/><\/p>\n<p>by <em>Devon Hopkins<\/em><\/p>\n<p><!--more--><\/p>\n<p>From <a href=\"https:\/\/www.reddit.com\/r\/dataisbeautiful\/\" rel=\"noopener\" target=\"_blank\">Reddit<\/a> to the <a href=\"https:\/\/www.nytimes.com\/interactive\/2017\/08\/30\/us\/houston-flood-rescue-cries-for-help.html?action=click&amp;pgtype=Homepage&amp;clickSource=g-artboard%20g-artboard-v3&amp;module=span-abc-region\u00aeion=span-abc-region&amp;WT.nav=span-abc-region&amp;_r=0\" rel=\"noopener\" target=\"_blank\">New York Times<\/a>, data scientists are in hot demand. Many want to break into the field, but the available career advice can be overwhelming: Which coding languages should you know? Do you have to be an expert in machine learning? Is it better to beef up on your technical skills or to nail down your design principles?<\/p>\n<p>To answer these questions (and more), CARTO\u2019s\u00a0newest addition to the Data &amp; Research team,\u00a0Wenfei Xu, is here to tell us about the journey that ultimately led her to <a href=\"http:\/\/carto.com\/location-intelligence\/\">CARTO<\/a>. Below, she shares her five secrets for success.<\/p>\n<h3 id=\"1-a-foundation-in-statistics-helps-you-understand-large-datasets\"><span style=\"color: #993366;\"><strong>1. A foundation in statistics helps you understand large\u00a0<\/strong><\/span><span style=\"color: #993366;\"><b>datasets<\/b><\/span><\/h3>\n<p>At the University of Chicago, Xu studied economics, including single and multi-linear regressions, time series econometrics, and forecast modeling. Though she\u2019s no longer trying to model the price of fixed income instruments, she still relies on her undergrad training to guide her data analysis<\/p>\n<p>For example, Xu is currently working on a project to better understand how people use public parks in New York City through mobile phone data. Working with terabytes of data (like the latitude and longitude where the app was opened, the timestamp, the length of use), she was able to choose the right distribution method (logarithmic) to evaluate the data.<\/p>\n<h3 id=\"2-study-design-principles-to-better-understand-how-to-visualize-data\"><strong><span style=\"color: #993366;\">2. Study design principles to better understand how to visualize data<\/span><\/strong><\/h3>\n<p>After three years working in finance, Xu decided to go back to school and study architecture and urban planning. While at MIT, she learned how to use design principles to prioritize certain information. Xu worked on a project last month about the different communities living in Williamsburg, Brooklyn. As a former resident of the neighborhood, she wanted to depict the co-habitation of those communities using visual design principles. <a href=\"https:\/\/carto.com\/blog\/using-location-data-identify-communities-williamsburg-ny\/\" rel=\"noopener\" target=\"_blank\">In her analysis, she used pick-up and drop-off data from New York City taxis to identify different sub-groups.<\/a><\/p>\n<div class=\"Wrap\">\n<a href=\"https:\/\/carto.com\/blog\/using-location-data-identify-communities-williamsburg-ny\/\" rel=\"noopener\" target=\"_blank\"><img decoding=\"async\" alt=\"Williamsburg Communities\" data-recalc-dims=\"1\" src=\"https:\/\/static.hsstatic.net\/BlogImporterAssetsUI\/ex\/missing-image.png\"\/><\/a>\n<\/div>\n<p>She found that a substantial number of people, nicknamed \u201cpartiers,\u201d take taxis from Lower Manhattan and other parts of Brooklyn to Williamsburg, generally pretty late at night and on the weekend. To make that message stick out, she used a simple black-and-white map with street boundaries and overlaid it with bright, primary colors for each of the pick-ups and drop-offs.<\/p>\n<h3 id=\"3-practice-telling-a-story-with-your-data\"><strong><span style=\"color: #993366;\">3. Practice telling a story with your data<\/span><\/strong><\/h3>\n<p>According to Xu, the best statistical models and sharpest design principles should ultimately come together to tell a narrative. When she created the <a href=\"https:\/\/carto.com\/blog\/using-location-data-identify-communities-williamsburg-ny\/\" rel=\"noopener\" target=\"_blank\">Williamsburg taxi map<\/a>, Xu discovered that there were 75 potential communities she could include, far too many to for one map. But, because she had a clear story in mind \u2014 <strong>that demographically disparate communities co-exist in Williamsburg, often in overlapping space<\/strong> \u2014 she was able to best support her argument by whittling down the options to the best five groups, even if they weren\u2019t necessarily the largest.<\/p>\n<h3 id=\"4-find-a-community-or-group-to-bounce-ideas-off-of\"><strong><span style=\"color: #993366;\">4. Find a community or group to bounce ideas off of<\/span><\/strong><\/h3>\n<p>Wenfei says she\u2019s lucky because CARTO sits \u201cat the intersection of industry and academia,\u201d meaning she has access to the best minds in both. She can also count on her coworkers for help. For example, many of the parallel processing and visualization tools she\u2019s currently using were introduced to her (or made) by her colleagues. If you don\u2019t yet have your own data science squad yet, you\u2019re in luck. <a href=\"http:\/\/carto.us16.list-manage.com\/subscribe?u=87735c84b9558c86ea525094e&amp;id=e500ada257\" rel=\"noopener\" target=\"_blank\">Wenfei writes a newsletter that you can join.<\/a><\/p>\n<h3 id=\"5-pay-attention-to-these-tools-concepts-and-programming-languages\"><strong><span style=\"color: #993366;\">5. Pay attention to these tools, concepts, and programming languages<\/span><\/strong><\/h3>\n<p>Hard skills are also important, especially when it comes to landing your data scientist dream job. Wenfei recommends the following:<\/p>\n<p><strong>Concepts to know<\/strong><\/p>\n<ul>\n<li>a solid foundation in statistics<\/li>\n<li>hypothesis testing<\/li>\n<li>linear regressions<\/li>\n<li>machine learning<\/li>\n<li>visualization principles<\/li>\n<\/ul>\n<p><strong>Skills to have<\/strong><\/p>\n<ul>\n<li>python or R<\/li>\n<li>spatial analysis<\/li>\n<li>cloud computing and distributing computing methods<\/li>\n<li>database skills such as PostgreSQL<\/li>\n<\/ul>\n<p><strong>Tools to be familiar with<\/strong><\/p>\n<ul>\n<li>the iPython\/Jupyter environment<\/li>\n<li>Matplotlib<\/li>\n<li>Pandas<\/li>\n<li>NumPy<\/li>\n<li>Dask<\/li>\n<li>Bokeh<\/li>\n<li>scikit-learn<\/li>\n<\/ul>\n<p><i><span style=\"font-weight: 400;\"><br \/> Lava is an authorised Salesforce Partner in Malaysia and has more than a decade of experience in cloud solutions which includes marketing automation, CRM implementation, change management, and consultation. We pride ourselves in not just being a CRM partner but in also understanding the needs of our customers and taking their business to the next level<\/span><\/i><\/p>\n<p><span class=\"et_bloom_bottom_trigger\"><\/span><\/p>\n","protected":false},"excerpt":{"rendered":"<p>by Devon Hopkins From Reddit to the New York Times, data scientists are in hot demand. Many want to break into the field, but the available career advice can be overwhelming: Which coding languages should you know? Do you have to be an expert in machine learning? Is it better to beef up on your [\u2026]<\/p>\n","protected":false},"author":1,"featured_media":2634,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[18,59],"class_list":["post-2633","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-uncategorized","tag-blog","tag-cloud-horizon"],"jetpack_featured_media_url":"https:\/\/www.lavaprotocols.com\/the-cloud-blog\/wp-content\/uploads\/2024\/10\/5skillsdatascientist2018_1022x457.jpg","_links":{"self":[{"href":"https:\/\/www.lavaprotocols.com\/the-cloud-blog\/wp-json\/wp\/v2\/posts\/2633","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.lavaprotocols.com\/the-cloud-blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.lavaprotocols.com\/the-cloud-blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.lavaprotocols.com\/the-cloud-blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.lavaprotocols.com\/the-cloud-blog\/wp-json\/wp\/v2\/comments?post=2633"}],"version-history":[{"count":0,"href":"https:\/\/www.lavaprotocols.com\/the-cloud-blog\/wp-json\/wp\/v2\/posts\/2633\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.lavaprotocols.com\/the-cloud-blog\/wp-json\/wp\/v2\/media\/2634"}],"wp:attachment":[{"href":"https:\/\/www.lavaprotocols.com\/the-cloud-blog\/wp-json\/wp\/v2\/media?parent=2633"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.lavaprotocols.com\/the-cloud-blog\/wp-json\/wp\/v2\/categories?post=2633"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.lavaprotocols.com\/the-cloud-blog\/wp-json\/wp\/v2\/tags?post=2633"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}