Category: QA

  • Near duplicate detection using Locality Sensitivity Hashing (LSH)

    Locality sensitive hashing (LSH) is a method for finding similar pairs in a large dataset. For a dataset of size N, the brute force method of comparing every possible pair would take N!/(2!(N-2)!) ~ N²/2 = O(N²) time. The LSH method aims to cut this down to O(N) time. In this article we will show…

  • Test Case

    A test case might be created either manually or automated to verify the functionality per the original acceptance criteria. High Level Steps Pre-Requisites Check connection details of source and target systems User access to various tools and access rights etc. Checks Check the ETL script name(s) are aligning with naming conventions Tables/Files are present in landing zone…

  • Test Plan

    An example of a less rigorous test plan. SCOPE The Scope of the Test Plan is to test the User Stories associated with each sprint The QAs will testing the user stories of one sprint behind The User Stories with complete Acceptance Criteria will be tested Developers should provide a complete walk-through of the User…

  • Test Strategy document

    A good test strategy document may contains the following. Table of Contents GLOSSARY OF TERMS PURPOSE GUIDING PRINCIPLES CONFORMANCE WITH ORGANISATIONAL TEST STRATEGY PROJECT BACKGROUND PROCESS OVERVIEW DIAGRAM PROJECT SCOPE IN SCOPE (HIGH. LEVEL) OUT OF SCOPE (HIGH LEVEL) TEST APPROACH TEST DELIVERABLES TEST RESOURCES/SUPPORT TEST PHASES AND ENVIRONMENTS TEST ENVIRONMENTS SYSTEM TEST SYSTEM INTEGRATION…

  • Unit Test

    Test description – check data changes captured correctly Test Data (SQL or PySpark code) – test_companieshouse_data.py Expected Result – capture the relevant changes in source data by comparing with the previous dataset Actual Result – verify the test result Test Result – output result after running test script – may be screen shot

  • Sprint Test Plan

    A sprint test plan may contains the following information: Title i.e. Extracting companies house data and loading them to a master table (MT load) Introduction – Sprint x will consist of backlog items that need to be added as well as any enhancements needed. We will begin to look at the requirements and start test…