{"id":15114,"date":"2020-03-14T01:11:32","date_gmt":"2020-03-13T20:11:32","guid":{"rendered":"http:\/\/quality-spectrum.com\/?p=15114"},"modified":"2020-03-14T01:16:27","modified_gmt":"2020-03-13T20:16:27","slug":"onlinetestconf-2019-big-data-how-to-test-it","status":"publish","type":"post","link":"http:\/\/quality-spectrum.com\/onlinetestconf-2019-big-data-how-to-test-it\/","title":{"rendered":"OnlineTestConf 2019 \u2013 Big data & how to test it"},"content":{"rendered":"

The subject of big data fascinates people and businesses. While the term might be just a buzzword, big data is immensely helpful for unearthing important information, and in the 21st century, information is power.<\/p>\n

In this post, I\u2019ll summarize my talk at OnlineTestConf 2019, titled \u201cWhy we call it big data and how to test it\u201d. The talk was also recorded and can be watched on YouTube here<\/b><\/u><\/i><\/a>.<\/p>\n<\/p>\n

An intro to big data<\/h2>\n

I start with a little of the history of big data and the factors that fueled growth and innovation in this industry.
\nNext, we put \u2018Big\u2019 into perspective to help understand the sheer size of the data and the challenge of processing it.<\/p>\n<\/div><\/span>

Defining Big data<\/h2>\n

When does a project qualify as big data? Is it only the size of the data? This slide explains the different ways we have tried to classify it and the most commonly used method.<\/p>\n<\/div><\/span>

The Hadoop platform<\/h2>\n

Hadoop is the most widely used big data platform, and it is open source. I talk about its widely used MapReduce process and the different products within it, such as HDFS, HBase and HiveQL.<\/p>\n<\/div><\/span>
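
To make the MapReduce idea concrete, here is a minimal sketch of the map, shuffle and reduce steps in plain Python, using the classic word-count example. It is not Hadoop's API; the function names (`map_phase`, `shuffle_phase`, `reduce_phase`, `word_count`) are made up for illustration, and everything runs in one process instead of across a cluster.

```python
from collections import defaultdict


def map_phase(line):
    """Map: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle_phase(pairs):
    """Shuffle: group all emitted values by their key."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reduce: collapse the values for one key into a single result."""
    return key, sum(values)


def word_count(lines):
    """Run the three steps in sequence (hypothetical helper, single process)."""
    pairs = [pair for line in lines for pair in map_phase(line)]
    grouped = shuffle_phase(pairs)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


counts = word_count(["big data is big", "data is power"])
print(counts)
```

On a real cluster the map and reduce steps would run in parallel on many machines, with the framework handling the shuffle between them; the sketch only shows the shape of the computation.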

The Data Pipeline<\/h2>\n

All we are doing in a big data project is collecting data from different sources, combining it into meaningful big tables and generating insights from it. There are three main phases you might have in a big data project.<\/p>\n<\/div><\/span>
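
As a rough sketch of that flow, the snippet below chains three small functions: collect, combine and generate insights. The phase names follow the sentence above and are not necessarily the labels used on the slide; the record fields (`customer_id`, `name`, `order_total`) and the two sample sources are hypothetical.

```python
def collect(sources):
    """Phase 1: pull raw records from every source into one list."""
    return [record for source in sources for record in source]


def combine(records):
    """Phase 2: merge records sharing a customer_id into one wide row.

    Assumes at most one record per customer per source; later fields
    overwrite earlier ones with the same name.
    """
    table = {}
    for record in records:
        row = table.setdefault(record["customer_id"], {})
        row.update(record)
    return list(table.values())


def generate_insights(table):
    """Phase 3: derive summary figures from the combined table."""
    revenue = sum(row.get("order_total", 0) for row in table)
    return {"customers": len(table), "revenue": revenue}


# Two made-up sources feeding the pipeline.
crm = [{"customer_id": 1, "name": "Ada"}, {"customer_id": 2, "name": "Lin"}]
orders = [{"customer_id": 1, "order_total": 40}, {"customer_id": 2, "order_total": 60}]

insights = generate_insights(combine(collect([crm, orders])))
print(insights)
```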

Testing stages<\/h2>\n

At the end, we quickly skim through the different types of tests we perform across the pipeline. The tests differ at each stage, depending on the activities being performed there.<\/p>\n<\/div><\/span>
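
To illustrate how the checks change from stage to stage, here is a small sketch with one check per stage. The stage names (ingestion, transformation, reporting) and the specific checks are assumptions chosen for the example, not the list from the talk.

```python
def check_ingestion(source_rows, landed_rows):
    """Ingestion: no rows should be lost between source and landing zone."""
    return len(source_rows) == len(landed_rows)


def check_transformation(rows, required_fields):
    """Transformation: every row should carry each required field, non-null."""
    return all(row.get(field) is not None for row in rows for field in required_fields)


def check_report(rows, field, reported_total):
    """Reporting: the published figure should match a recomputation."""
    return sum(row[field] for row in rows) == reported_total


# Made-up sample data for the three checks.
source = [{"id": 1, "amount": 40}, {"id": 2, "amount": 60}]
landed = list(source)
transformed = [{"id": 1, "amount": 40, "region": "EU"}, {"id": 2, "amount": 60, "region": None}]

results = {
    "ingestion": check_ingestion(source, landed),
    "transformation": check_transformation(transformed, ["id", "amount", "region"]),
    "reporting": check_report(transformed, "amount", 100),
}
print(results)
```

Here the transformation check fails because one row has a null `region`, while the row count and the reported total are still correct. That is the kind of defect a check at only one stage would miss.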

<\/p>\n

Due to lack of time, I couldn\u2019t go into the details of the Quality Dimensions and sample tests for these dimensions across the pipeline. My talk at AutomationGuild 2020<\/b><\/i><\/u><\/a> explains them in more detail.<\/p>\n<\/p>\n

Summary<\/h2>\n

Katjya drew a very good sketch summarizing the talk, which she shared in her tweet.<\/p>\n<\/div><\/span>

<\/div><\/div><\/div><\/div><\/div>\n","protected":false},"excerpt":{"rendered":"","protected":false},"author":3,"featured_media":15122,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":[],"categories":[1],"tags":[],"yoast_head":"\nOnlineTestConf 2019 \u2013 Big data & how to test it - Quality Spectrum<\/title>\n<meta name=\"robots\" content=\"index, follow\" \/>\n<meta name=\"googlebot\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<meta name=\"bingbot\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"http:\/\/quality-spectrum.com\/onlinetestconf-2019-big-data-how-to-test-it\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"OnlineTestConf 2019 \u2013 Big data & how to test it - Quality Spectrum\" \/>\n<meta property=\"og:url\" content=\"http:\/\/quality-spectrum.com\/onlinetestconf-2019-big-data-how-to-test-it\/\" \/>\n<meta property=\"og:site_name\" content=\"Quality Spectrum\" \/>\n<meta property=\"article:published_time\" content=\"2020-03-13T20:11:32+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2020-03-13T20:16:27+00:00\" \/>\n<meta property=\"og:image\" content=\"http:\/\/quality-spectrum.com\/wp-content\/uploads\/2020\/03\/post-image.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"1200\" \/>\n\t<meta property=\"og:image:height\" content=\"400\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@aali_khalid\" \/>\n<meta name=\"twitter:site\" content=\"@aali_khalid\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Organization\",\"@id\":\"http:\/\/quality-spectrum.com\/#organization\",\"name\":\"Quality 
Spectrum\",\"url\":\"http:\/\/quality-spectrum.com\/\",\"sameAs\":[\"https:\/\/www.linkedin.com\/in\/alikhalid\/\",\"https:\/\/www.youtube.com\/c\/QualitySpectrum\",\"https:\/\/twitter.com\/aali_khalid\"],\"logo\":{\"@type\":\"ImageObject\",\"@id\":\"http:\/\/quality-spectrum.com\/#logo\",\"inLanguage\":\"en-US\",\"url\":\"http:\/\/quality-spectrum.com\/wp-content\/uploads\/2019\/11\/QS-logo-mobile-e1574510459832.png\",\"width\":40,\"height\":40,\"caption\":\"Quality Spectrum\"},\"image\":{\"@id\":\"http:\/\/quality-spectrum.com\/#logo\"}},{\"@type\":\"WebSite\",\"@id\":\"http:\/\/quality-spectrum.com\/#website\",\"url\":\"http:\/\/quality-spectrum.com\/\",\"name\":\"Quality Spectrum\",\"description\":\"Redefining software quality\",\"publisher\":{\"@id\":\"http:\/\/quality-spectrum.com\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":\"http:\/\/quality-spectrum.com\/?s={search_term_string}\",\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"en-US\"},{\"@type\":\"ImageObject\",\"@id\":\"http:\/\/quality-spectrum.com\/onlinetestconf-2019-big-data-how-to-test-it\/#primaryimage\",\"inLanguage\":\"en-US\",\"url\":\"http:\/\/quality-spectrum.com\/wp-content\/uploads\/2020\/03\/post-image.jpg\",\"width\":1200,\"height\":400},{\"@type\":\"WebPage\",\"@id\":\"http:\/\/quality-spectrum.com\/onlinetestconf-2019-big-data-how-to-test-it\/#webpage\",\"url\":\"http:\/\/quality-spectrum.com\/onlinetestconf-2019-big-data-how-to-test-it\/\",\"name\":\"OnlineTestConf 2019 \\u2013 Big data & how to test it - Quality 
Spectrum\",\"isPartOf\":{\"@id\":\"http:\/\/quality-spectrum.com\/#website\"},\"primaryImageOfPage\":{\"@id\":\"http:\/\/quality-spectrum.com\/onlinetestconf-2019-big-data-how-to-test-it\/#primaryimage\"},\"datePublished\":\"2020-03-13T20:11:32+00:00\",\"dateModified\":\"2020-03-13T20:16:27+00:00\",\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"http:\/\/quality-spectrum.com\/onlinetestconf-2019-big-data-how-to-test-it\/\"]}]},{\"@type\":\"Article\",\"@id\":\"http:\/\/quality-spectrum.com\/onlinetestconf-2019-big-data-how-to-test-it\/#article\",\"isPartOf\":{\"@id\":\"http:\/\/quality-spectrum.com\/onlinetestconf-2019-big-data-how-to-test-it\/#webpage\"},\"author\":{\"@id\":\"http:\/\/quality-spectrum.com\/#\/schema\/person\/4805a00d7139e111ea9430e17cc8f28c\"},\"headline\":\"OnlineTestConf 2019 \\u2013 Big data & how to test it\",\"datePublished\":\"2020-03-13T20:11:32+00:00\",\"dateModified\":\"2020-03-13T20:16:27+00:00\",\"mainEntityOfPage\":{\"@id\":\"http:\/\/quality-spectrum.com\/onlinetestconf-2019-big-data-how-to-test-it\/#webpage\"},\"commentCount\":0,\"publisher\":{\"@id\":\"http:\/\/quality-spectrum.com\/#organization\"},\"image\":{\"@id\":\"http:\/\/quality-spectrum.com\/onlinetestconf-2019-big-data-how-to-test-it\/#primaryimage\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"http:\/\/quality-spectrum.com\/onlinetestconf-2019-big-data-how-to-test-it\/#respond\"]}]},{\"@type\":[\"Person\"],\"@id\":\"http:\/\/quality-spectrum.com\/#\/schema\/person\/4805a00d7139e111ea9430e17cc8f28c\",\"name\":\"Ali Khalid\",\"image\":{\"@type\":\"ImageObject\",\"@id\":\"http:\/\/quality-spectrum.com\/#personlogo\",\"inLanguage\":\"en-US\",\"url\":\"http:\/\/1.gravatar.com\/avatar\/70cbf539f218f275a77959dd2e56bddb?s=96&d=mm&r=g\",\"caption\":\"Ali Khalid\"}}]}<\/script>\n<!-- \/ Yoast SEO plugin. 
-->","_links":{"self":[{"href":"http:\/\/quality-spectrum.com\/wp-json\/wp\/v2\/posts\/15114"}],"collection":[{"href":"http:\/\/quality-spectrum.com\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"http:\/\/quality-spectrum.com\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"http:\/\/quality-spectrum.com\/wp-json\/wp\/v2\/users\/3"}],"replies":[{"embeddable":true,"href":"http:\/\/quality-spectrum.com\/wp-json\/wp\/v2\/comments?post=15114"}],"version-history":[{"count":11,"href":"http:\/\/quality-spectrum.com\/wp-json\/wp\/v2\/posts\/15114\/revisions"}],"predecessor-version":[{"id":15129,"href":"http:\/\/quality-spectrum.com\/wp-json\/wp\/v2\/posts\/15114\/revisions\/15129"}],"wp:featuredmedia":[{"embeddable":true,"href":"http:\/\/quality-spectrum.com\/wp-json\/wp\/v2\/media\/15122"}],"wp:attachment":[{"href":"http:\/\/quality-spectrum.com\/wp-json\/wp\/v2\/media?parent=15114"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"http:\/\/quality-spectrum.com\/wp-json\/wp\/v2\/categories?post=15114"},{"taxonomy":"post_tag","embeddable":true,"href":"http:\/\/quality-spectrum.com\/wp-json\/wp\/v2\/tags?post=15114"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}