{"id":2890,"date":"2025-01-06T23:29:31","date_gmt":"2025-01-06T23:29:31","guid":{"rendered":"https:\/\/dataninjasinc.com\/?p=2890"},"modified":"2025-01-06T23:29:31","modified_gmt":"2025-01-06T23:29:31","slug":"data-science-for-everyone","status":"publish","type":"post","link":"https:\/\/dataninjasinc.com\/2025\/01\/data-science-for-everyone\/","title":{"rendered":"Data Science for Everyone"},"content":{"rendered":"

Data science seems like a brand new term but isn\u2019t so. We have always had data science \u2013 typically defined as principles, processes and techniques to understand the world around us through analysis of data.<\/p>\n

Sometimes, data analysis does not necessarily result into decision making. So what do we need to do to get become a data driven decision making organization? First step is to understand what is generally involved in data science and data driven decision making.<\/p>\n

I would have to say that there are two types of data based decisions groups generally identified \u2013<\/p>\n

    \n
  1. \u201cDiscover\u201d or understand data: This group is often ignored or is not identified as a key element by most organization. This probably comes from a place of hubris \u2013 \u201cwell, we know our data well!\u201d. However, the new norm (and the fact that more data are available) is to continuously discover data.<\/li>\n
  2. Decisions that repeat: This group is very popular candidate when it comes to data driven decisions. Customer churn is an age-old problem that has haunted even the best marketer.<\/li>\n<\/ol>\n

    During the past few years, we have seen tremendous improvements in technology and the natural rise of \u201cBig Data\u201d. So how can we make use of these advances, think analytically at a massive scale and process giant volumes of data on a daily basis?<\/p>\n

    The answer is mostly related to data processing. It is important to understand that data processing and data science are two separate yet related entities. Data processing is almost critical to maturation of data science.<\/p>\n

    We previously identified two separate classes of data based decisions.<\/p>\n

      \n
    1. \u201cDiscover\u201d or understand data: This group requires somewhat traditional approaches to data processing. Generally speaking, data have to be sourced from a wide variety of applications and\/or systems. These data tend to be in a wide array of formats (but tends to be mostly structured data). These formats make it difficult to process data. In the past, data warehouses were typically used for data discovery. Now with Big Data, a wider variety of toolsets are available for data processing.<\/li>\n
    2. Decisions that repeat: This type of decision requires slightly different approach to data processing. Generally reporting\/monitoring and alerting tools are required and should be used for repeating decisions based on well understood data. However, data warehouses\/data lakes or other architectural approaches can be used as well. These type of decisions are also based on data in motion (as opposed to data at rest).<\/li>\n<\/ol>\n

      With this basic difference in data processing and data science in mind, it will be interesting to figure out data science approaches and what can be done to fulfill the promise of pure data based decision making.<\/p>\n

      Now that we have reviewed the basics of data driven decision making categories and have discussed a few differences about how data science will require data processing, we are ready to jump into smaller subset of data mining techniques that are foundational to the data science process.<\/p>\n

      Following are brief descriptions of data mining techniques:<\/p>\n

        \n
      • Regression or Estimation: Generally you would use regression to predict value of a variable (such as readmission probability for a patient). This technique is quite useful when you are trying to predict one trustworthy value for a variable.<\/li>\n
      • Similarity matching: Often used to match an individual or group with another individual or group given a finite set of dimensional and measurable attributes. A lot of times organizations can use this to identify customer groups or peer groups<\/li>\n
      • Classification: This technique is useful when you are attempting to segment or categorize a population of candidates\/things. Generally used by marketers to identify positioning and targeting of segments.<\/li>\n
      • Clustering: There is a fundamental difference between similarity matching (which is for a specific purpose) and clustering (typically used for identifying \u201cnatural\u201d groups)<\/li>\n<\/ul>\n","protected":false},"excerpt":{"rendered":"

        Data science, while often seen as a new concept, has always been about analyzing data to better understand the world. However, merely analyzing data doesn\u2019t always lead to actionable decisions. To become a data-driven decision-making organization, it\u2019s essential to grasp the core of data science and its role in decision-making.<\/p>\n","protected":false},"author":1,"featured_media":2655,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"_coblocks_attr":"","_coblocks_dimensions":"","_coblocks_responsive_height":"","_coblocks_accordion_ie_support":"","footnotes":""},"categories":[23],"tags":[19,35,51,54],"class_list":["post-2890","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-blogs","tag-best-practices","tag-data-compliance","tag-data-management","tag-data-science"],"acf":[],"yoast_head":"\nData Science for Everyone - Data Ninjas inc.<\/title>\n<meta name=\"description\" content=\"Data science, while often seen as a new concept, has always been about analyzing data to better understand the world. However, merely analyzing data doesn\u2019t always lead to actionable decisions. To become a data-driven decision-making organization, it\u2019s essential to grasp the core of data science and its role in decision-making.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/dataninjasinc.com\/2025\/01\/data-science-for-everyone\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Data Science for Everyone - Data Ninjas inc.\" \/>\n<meta property=\"og:description\" content=\"Data science, while often seen as a new concept, has always been about analyzing data to better understand the world. However, merely analyzing data doesn\u2019t always lead to actionable decisions. To become a data-driven decision-making organization, it\u2019s essential to grasp the core of data science and its role in decision-making.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/dataninjasinc.com\/2025\/01\/data-science-for-everyone\/\" \/>\n<meta property=\"og:site_name\" content=\"Data Ninjas inc.\" \/>\n<meta property=\"article:published_time\" content=\"2025-01-06T23:29:31+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/dataninjasinc.com\/wp-content\/uploads\/2024\/07\/Customer-story-banner-top-background-scaled.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"2560\" \/>\n\t<meta property=\"og:image:height\" content=\"1321\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"ninjas4\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"ninjas4\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"3 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/dataninjasinc.com\/2025\/01\/data-science-for-everyone\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/dataninjasinc.com\/2025\/01\/data-science-for-everyone\/\"},\"author\":{\"name\":\"ninjas4\",\"@id\":\"https:\/\/dataninjasinc.com\/#\/schema\/person\/eff8e408f91472d4f752e1554b4e325f\"},\"headline\":\"Data Science for Everyone\",\"datePublished\":\"2025-01-06T23:29:31+00:00\",\"dateModified\":\"2025-01-06T23:29:31+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/dataninjasinc.com\/2025\/01\/data-science-for-everyone\/\"},\"wordCount\":634,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\/\/dataninjasinc.com\/#organization\"},\"keywords\":[\"Best Practices\",\"Data Compliance\",\"data management\",\"data science\"],\"articleSection\":[\"Blogs\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\/\/dataninjasinc.com\/2025\/01\/data-science-for-everyone\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/dataninjasinc.com\/2025\/01\/data-science-for-everyone\/\",\"url\":\"https:\/\/dataninjasinc.com\/2025\/01\/data-science-for-everyone\/\",\"name\":\"Data Science for Everyone - Data Ninjas inc.\",\"isPartOf\":{\"@id\":\"https:\/\/dataninjasinc.com\/#website\"},\"datePublished\":\"2025-01-06T23:29:31+00:00\",\"dateModified\":\"2025-01-06T23:29:31+00:00\",\"description\":\"Data science, while often seen as a new concept, has always been about analyzing data to better understand the world. However, merely analyzing data doesn\u2019t always lead to actionable decisions. To become a data-driven decision-making organization, it\u2019s essential to grasp the core of data science and its role in decision-making.\",\"breadcrumb\":{\"@id\":\"https:\/\/dataninjasinc.com\/2025\/01\/data-science-for-everyone\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/dataninjasinc.com\/2025\/01\/data-science-for-everyone\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/dataninjasinc.com\/2025\/01\/data-science-for-everyone\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/dataninjasinc.com\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Data Science for Everyone\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/dataninjasinc.com\/#website\",\"url\":\"https:\/\/dataninjasinc.com\/\",\"name\":\"Data Ninjas inc.\",\"description\":\"For all your data projects, your trusted partner!\",\"publisher\":{\"@id\":\"https:\/\/dataninjasinc.com\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/dataninjasinc.com\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/dataninjasinc.com\/#organization\",\"name\":\"Data Ninjas inc.\",\"url\":\"https:\/\/dataninjasinc.com\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/dataninjasinc.com\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/dataninjasinc.com\/wp-content\/uploads\/2020\/03\/datan-ninjas-logo.png\",\"contentUrl\":\"https:\/\/dataninjasinc.com\/wp-content\/uploads\/2020\/03\/datan-ninjas-logo.png\",\"width\":1500,\"height\":400,\"caption\":\"Data Ninjas inc.\"},\"image\":{\"@id\":\"https:\/\/dataninjasinc.com\/#\/schema\/logo\/image\/\"}},{\"@type\":\"Person\",\"@id\":\"https:\/\/dataninjasinc.com\/#\/schema\/person\/eff8e408f91472d4f752e1554b4e325f\",\"name\":\"ninjas4\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/dataninjasinc.com\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/0bc515d79f6cbf148edfe842cf492034?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/0bc515d79f6cbf148edfe842cf492034?s=96&d=mm&r=g\",\"caption\":\"ninjas4\"},\"sameAs\":[\"https:\/\/dataninjasinc.com\"],\"url\":\"https:\/\/dataninjasinc.com\/author\/ninjas4\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Data Science for Everyone - Data Ninjas inc.","description":"Data science, while often seen as a new concept, has always been about analyzing data to better understand the world. However, merely analyzing data doesn\u2019t always lead to actionable decisions. To become a data-driven decision-making organization, it\u2019s essential to grasp the core of data science and its role in decision-making.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/dataninjasinc.com\/2025\/01\/data-science-for-everyone\/","og_locale":"en_US","og_type":"article","og_title":"Data Science for Everyone - Data Ninjas inc.","og_description":"Data science, while often seen as a new concept, has always been about analyzing data to better understand the world. However, merely analyzing data doesn\u2019t always lead to actionable decisions. To become a data-driven decision-making organization, it\u2019s essential to grasp the core of data science and its role in decision-making.","og_url":"https:\/\/dataninjasinc.com\/2025\/01\/data-science-for-everyone\/","og_site_name":"Data Ninjas inc.","article_published_time":"2025-01-06T23:29:31+00:00","og_image":[{"width":2560,"height":1321,"url":"https:\/\/dataninjasinc.com\/wp-content\/uploads\/2024\/07\/Customer-story-banner-top-background-scaled.jpg","type":"image\/jpeg"}],"author":"ninjas4","twitter_card":"summary_large_image","twitter_misc":{"Written by":"ninjas4","Est. reading time":"3 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/dataninjasinc.com\/2025\/01\/data-science-for-everyone\/#article","isPartOf":{"@id":"https:\/\/dataninjasinc.com\/2025\/01\/data-science-for-everyone\/"},"author":{"name":"ninjas4","@id":"https:\/\/dataninjasinc.com\/#\/schema\/person\/eff8e408f91472d4f752e1554b4e325f"},"headline":"Data Science for Everyone","datePublished":"2025-01-06T23:29:31+00:00","dateModified":"2025-01-06T23:29:31+00:00","mainEntityOfPage":{"@id":"https:\/\/dataninjasinc.com\/2025\/01\/data-science-for-everyone\/"},"wordCount":634,"commentCount":0,"publisher":{"@id":"https:\/\/dataninjasinc.com\/#organization"},"keywords":["Best Practices","Data Compliance","data management","data science"],"articleSection":["Blogs"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/dataninjasinc.com\/2025\/01\/data-science-for-everyone\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/dataninjasinc.com\/2025\/01\/data-science-for-everyone\/","url":"https:\/\/dataninjasinc.com\/2025\/01\/data-science-for-everyone\/","name":"Data Science for Everyone - Data Ninjas inc.","isPartOf":{"@id":"https:\/\/dataninjasinc.com\/#website"},"datePublished":"2025-01-06T23:29:31+00:00","dateModified":"2025-01-06T23:29:31+00:00","description":"Data science, while often seen as a new concept, has always been about analyzing data to better understand the world. However, merely analyzing data doesn\u2019t always lead to actionable decisions. To become a data-driven decision-making organization, it\u2019s essential to grasp the core of data science and its role in decision-making.","breadcrumb":{"@id":"https:\/\/dataninjasinc.com\/2025\/01\/data-science-for-everyone\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/dataninjasinc.com\/2025\/01\/data-science-for-everyone\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/dataninjasinc.com\/2025\/01\/data-science-for-everyone\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/dataninjasinc.com\/"},{"@type":"ListItem","position":2,"name":"Data Science for Everyone"}]},{"@type":"WebSite","@id":"https:\/\/dataninjasinc.com\/#website","url":"https:\/\/dataninjasinc.com\/","name":"Data Ninjas inc.","description":"For all your data projects, your trusted partner!","publisher":{"@id":"https:\/\/dataninjasinc.com\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/dataninjasinc.com\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/dataninjasinc.com\/#organization","name":"Data Ninjas inc.","url":"https:\/\/dataninjasinc.com\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/dataninjasinc.com\/#\/schema\/logo\/image\/","url":"https:\/\/dataninjasinc.com\/wp-content\/uploads\/2020\/03\/datan-ninjas-logo.png","contentUrl":"https:\/\/dataninjasinc.com\/wp-content\/uploads\/2020\/03\/datan-ninjas-logo.png","width":1500,"height":400,"caption":"Data Ninjas inc."},"image":{"@id":"https:\/\/dataninjasinc.com\/#\/schema\/logo\/image\/"}},{"@type":"Person","@id":"https:\/\/dataninjasinc.com\/#\/schema\/person\/eff8e408f91472d4f752e1554b4e325f","name":"ninjas4","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/dataninjasinc.com\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/0bc515d79f6cbf148edfe842cf492034?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/0bc515d79f6cbf148edfe842cf492034?s=96&d=mm&r=g","caption":"ninjas4"},"sameAs":["https:\/\/dataninjasinc.com"],"url":"https:\/\/dataninjasinc.com\/author\/ninjas4\/"}]}},"_links":{"self":[{"href":"https:\/\/dataninjasinc.com\/wp-json\/wp\/v2\/posts\/2890","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/dataninjasinc.com\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/dataninjasinc.com\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/dataninjasinc.com\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/dataninjasinc.com\/wp-json\/wp\/v2\/comments?post=2890"}],"version-history":[{"count":1,"href":"https:\/\/dataninjasinc.com\/wp-json\/wp\/v2\/posts\/2890\/revisions"}],"predecessor-version":[{"id":2891,"href":"https:\/\/dataninjasinc.com\/wp-json\/wp\/v2\/posts\/2890\/revisions\/2891"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/dataninjasinc.com\/wp-json\/wp\/v2\/media\/2655"}],"wp:attachment":[{"href":"https:\/\/dataninjasinc.com\/wp-json\/wp\/v2\/media?parent=2890"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/dataninjasinc.com\/wp-json\/wp\/v2\/categories?post=2890"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/dataninjasinc.com\/wp-json\/wp\/v2\/tags?post=2890"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}