About Me

Selver Basic

Experienced software and data engineer with a strong background in consulting across multiple industries, primarily in Germany. I began my career as a C++ and SQL developer and later worked as a testing consultant and embedded systems developer. I spent three years at Audi before moving to cloud technologies in 2017, specializing in the AWS and Azure platforms.

I am fluent in German, English, and Bosnian/Croatian/Serbian, and I rely on Emacs to stay productive. Since January 2020, I have been freelancing, focusing on data engineering projects on the Microsoft Azure stack. My project experience is described in detail below.

Contact

  • selver at basix dot dev
  • selver dot basic at gmail dot com

Current Work

I work as a freelance data engineer, mainly on the Microsoft Azure stack.

Projects

06/2023 - Present Data Onboarding on Azure using Databricks

Technology Stack

  • Azure: Databricks, Azure Synapse, Azure Log Analytics
  • Tools: GitHub, GitHub Actions

Languages and Frameworks

  • Python
  • Scripting: Bash
  • Spark SQL, PySpark

Description

In this project, I led the data ingestion initiative, pulling data from multiple sources and integrating it into the Azure ecosystem using Databricks. I implemented an encryption mechanism based on customer-provided keys to ensure the security and integrity of the data. The project improved data processing efficiency and met stringent regulatory compliance requirements.

07/2021 - 06/2023 MS Purview Data Catalog

Technology Stack

  • Azure: Purview, Service Bus, Event Hubs, Storage Accounts
  • Tools: Apache Atlas

Languages and Frameworks

  • Python
  • Scripting: Bash
  • pyapacheatlas, azure-purview-*

Description

I collaborated with Microsoft to integrate multiple disparate data catalog solutions within an enterprise into a unified framework using MS Purview. The project involved designing and executing workflows to enhance data discoverability, governance, and regulatory compliance, significantly improving the organization’s data management capabilities.

12/2021 - 07/2022 ETL/ELT with Databricks

Technology Stack

  • Azure: Databricks, Data Lake, Data Factory, DevOps
  • Database: SQL Server

Languages and Frameworks

  • Python, PySpark
  • SQL: Spark SQL, T-SQL (MS SQL Server)
  • Scripting: Bash, PowerShell

Description

I led the creation and optimization of ETL/ELT pipelines using Databricks and SQL Server. I focused on enhancing data aggregation and transformation processes, ensuring high data quality and faster analytics. This project resulted in significant improvements in business intelligence and decision-making capabilities.

12/2021 - 03/2022 Data API Development

Technology Stack

  • Azure: API Management Service, Container Registry, App Services
  • Database: SQL Server

Languages and Frameworks

  • Python, FastAPI, MkDocs
  • Scripting: Bash

Description

I developed a blueprint for API development using FastAPI and Azure services. This blueprint, adopted by multiple teams within the organization, standardized API creation and deployment processes, enhancing development efficiency and API consistency across various projects.

03/2020 - 12/2021 Data Lake and Master Data Management

Technology Stack

  • Azure: Databricks, Data Lake, Data Factory, Logic Apps, DevOps
  • Database: SQL Server

Languages and Frameworks

  • Python, PySpark
  • SQL: Spark SQL, T-SQL (MS SQL Server)
  • Scripting: Bash, PowerShell

Description

I engineered data lakes and managed the migration of projects from Azure Data Lake Storage Gen1 to Gen2. My role covered both design and development work, ensuring efficient data storage and access. This project enhanced the organization’s capability to perform comprehensive data analytics and reporting.

10/2019 - 02/2020 Knowledge Management System

Technology Stack

  • Azure: Cognitive Search, WebApps, Batch Accounts, Container Registries
  • Tools: Docker, GitLab CI/CD

Languages and Frameworks

  • Python, Django
  • Scripting: Bash

Description

I designed and implemented an internal knowledge management system using Django, with search functionality powered by Azure Cognitive Search. The system significantly improved information retrieval and knowledge sharing within the organization.

04/2019 - 09/2019 Web Data Crawling for Sentiment Analysis

Technology Stack

  • Azure: Bing Search v7, WebApps, Data Factory, Batch Accounts, Container Registries
  • Tools: Docker, GitLab CI/CD

Languages and Frameworks

  • Python, Scrapy, Tornado, Click, Selenium
  • Scripting: Bash

Description

I automated the crawling of various web sources and stored the data (text, CSV, JSON) for sentiment analysis. Utilizing Azure services, the project streamlined the collection, storage, and processing of web data, enabling effective sentiment analysis.

10/2018 - 03/2019 Energy Usage Disaggregation

Technology Stack

  • Azure: Databricks, WebApps, Batch Accounts, Container Registries
  • Tools: Docker, GitLab CI/CD

Languages and Frameworks

  • Python, Tornado, Click
  • SQL

Description

I created endpoints allowing customers to query predictions of their electricity usage based on sampling statistics. The project delivered valuable insights into energy consumption, enhancing customer engagement through precise usage predictions.

04/2018 - 09/2018 Internal Weather Collector System

Technology Stack

  • Azure: Databricks, WebApps, API Management, Data Factory, Container Registries
  • Tools: Docker, Redis, GitLab CI/CD

Languages and Frameworks

  • Python, Tornado, Flask, Cookiecutter, Click, OpenAPI
  • SQL

Description

I developed a web-based internal weather application by scraping data from multiple sources (Germany’s DWD and the UK’s Met Office). The application provided real-time weather data through various endpoints, supporting internal operations that depend on weather information.

Work Experience as an Employee

05/2017 - 03/2019 Cognotekt

Position: Senior Back-End Developer

Description

As a senior back-end developer, I developed and maintained core applications for AI-based services in the insurance industry. My role involved writing new modules and integrating them into existing C++ applications. I also prototyped a recommender system in Python using frequent itemset mining, contributing to improved customer recommendation strategies.

Technology Stack

  • AWS Elastic Beanstalk, AWS S3, Rancher, Grafana, Prometheus, Logz.io, GitLab CI/CD

Languages and Frameworks

  • C++, Python, PostgreSQL

03/2016 - 04/2017 Syniverse

Position: Senior Developer

Description

At Syniverse, I focused on enhancing legacy code for Fraud Detection and Revenue Assurance software. I led technical efforts on a project for Roshan Telecom, using C++, Oracle SQL, and PL/SQL. This project earned industry awards for its successful implementation.

Technology Stack

  • On-premise Linux servers, Oracle Database Servers

Languages and Frameworks

  • C++, Oracle SQL, PL/SQL

03/2013 - 02/2016 e.Solutions

Position: Systems Applications Developer

Description

I developed system applications for Audi/Volkswagen’s MMI products, focusing on modules for device updates. Working in C++ on the QNX operating system, my contributions ensured the reliability and performance of software used in millions of VW group vehicles.

Technology Stack

  • Hardware MMI Units

Languages and Frameworks

  • C++, C, QNX, Perforce, CMake, Bash/ksh, Python, Qt Creator

10/2008 - 02/2013 ZIRA

Description

At ZIRA, I worked on both rating and billing systems, developing core applications and modules. I significantly contributed to the fraud detection system, specifically the Profiling Engine, which was successfully deployed in major telecom companies.

Technology Stack

  • Oracle DB, On-premise Linux servers

Languages and Frameworks

  • C++, Oracle SQL, PL/SQL

Education

  • 05/2021: Databricks Data Engineering and Delta Optimization (6-day course)
  • 2003 - 2009: Dipl. Math-Inf., University of Sarajevo, Faculty of Sciences, Department of Mathematics
  • 1999 - 2003: Gymnasium Marko Marulic, Tomislavgrad