About Us

BroadOakData is a data company. We want to work with you to get the best out of your data. We provide the following services:

  • Data Integration – distributed ETL, Spark and PySpark on different platforms
  • Data Migration – moving data to cloud (AWS, Google Cloud, Azure) or private cloud
  • Pentaho Data Integration / Apache Hop
  • WordPress
  • SugarCRM
  • BigQuery, Databricks, Google Colab and Data Studio

We are a group of Business Intelligence consultants who have worked across different market sectors. Our main focus is on ETL processes and providing data migration solutions. We have more than 30 years of experience architecting software solutions and managing systems in various technical leadership roles mainly in a scientific research establishment and tele communications.

We have years of experience in the following:

  • Apache Spark (including Spark SQL, SparkML) and Kafka
  • SQL,  Scala, Java and Python (PySpark)
  • ElasticSearch, MongoDB, SQL Server, Oracle,  MySQL and PostgreSQL
  • Hadoop – HDFS, Hive and Accumulo (with GeoMesa)
  • Pentaho Data Integration Tool  and Apache Hop
  • Linux – Ubuntu and  CentOS (Scientific Linux)
  • Google Cloud Platform, AWS and Azure
  • Data migrations and developing data lakes

Please contact us if you need any help with data migration work.