About Me

Name: Selver Basic

I am an experienced software and data engineer with a comprehensive background in consulting across various industries, predominantly in Germany. My professional journey began as a C++ and SQL developer, advancing through roles such as a testing consultant and embedded systems developer. After three years at Audi, I shifted my focus to cloud technologies in 2017, gaining expertise in AWS and Azure platforms.

Proficient in German, English, and Bosnian/Croatian/Serbian, I leverage tools like Emacs to enhance productivity. Since end of 2019, I have successfully operated as a freelancer, concentrating on the Microsoft Azure stack and Databricks for data engineering projects. Additionally, I possess substantial experience in DevOps, utilizing Azure DevOps and GitHub, with prior experience in GitLab. Below, you will find detailed descriptions of my project experiences.

Contact

  • selver at basix dot dev
  • selver dot basic at gmail dot com

Projects

06/2023 - now: Data Onboarding on Azure using Databricks

Client: RTL Deutschland and Bertelsmann

Implemented ELT pipelines for a large media client to streamline data ingestion from various sources (cloud storage, REST APIs, BigQuery etc,). Utilized Azure Synapse to orchestrate data flows, Databricks for data transformation and processing, and Azure SQL Server for data storage. Enhanced data reliability and performance through automated monitoring and alerting, ensuring timely insights for business intelligence reports. Using Databricks bundle deployments for configuring and deploying workflows.

Technology Stack

  • Databricks, Azure Synapse, Azure Log Analytics

Languages and Frameworks

  • Python
  • Bash
  • SparkSQL, PySpark
  • Github Actions

Description

In this project, I led the data ingestion initiative to pull data from multiple sources and integrate it into the Azure ecosystem using Databricks. I implemented a complex encryption mechanism using customer-provided keys to ensure the security and integrity of the data. This project improved data processing efficiency and adhered to stringent regulatory compliance requirements.

07/2021 - 06/2023 MS Purview Data Catalog (Bertelsmann)

Client: Bertelsmann

Technology Stack

  • Azure: Purview, Service Bus, Event Hubs, Storage Accounts
  • Tools: Apache Atlas

Languages and Frameworks

  • Python
  • Scripting: Bash
  • pyapacheatlas, azure-purview-*

Description

I collaborated with Microsoft to integrate multiple disparate data catalog solutions within an enterprise into a unified framework using MS Purview. The project involved designing and executing workflows to enhance data discoverability, governance, and regulatory compliance, significantly improving the organization’s data management capabilities.

12/2021 - 07/2022 ETL/ELT with Databricks

Client: BASF

Technology Stack

  • Azure: Databricks, Data Lake, Data Factory, DevOps
  • Database: SQL Server

Languages and Frameworks

  • Python, PySpark
  • SQL: Spark SQL, MS SQLServer SQL
  • Scripting: Bash, PowerShell

Description

I led the creation and optimization of ETL/ELT pipelines using Databricks and SQL Server. I focused on enhancing data aggregation and transformation processes, ensuring high data quality and faster analytics. This project resulted in significant improvements in business intelligence and decision-making capabilities.

12/2021 - 03/2022 Data API Development

Client: Vattenfall

Technology Stack

  • Azure: API Management Service, Container Registry, App Services
  • Database: SQL Server

Languages and Frameworks

  • Python, FastAPI, MkDocs
  • Scripting: Bash

Description

I developed a blueprint for API development using FastAPI and Azure services. This blueprint, adopted by multiple teams within the organization, standardized API creation and deployment processes, enhancing development efficiency and API consistency across various projects.

03/2020 - 12/2021 Data Lake and Master Data Management

Client: Essity

Technology Stack

  • Azure: Databricks, Data Lake, Data Factory, Logic Apps, DevOps
  • Database: SQL Server

Languages and Frameworks

  • Python, PySpark
  • SQL: Spark SQL, MS SQLServer SQL
  • Scripting: Bash, PowerShell

Description

I engineered data lakes and managed the migration of projects from MS Azure DL1 to DL2. My role involved both development and design work, ensuring efficient data storage and access. This project enhanced the organization’s capability to perform comprehensive data analytics and reporting.

10/2019 - 02/2020 Knowledge Management System

Client: E.On (onsite as freelancer)

Technology Stack

  • Azure: Cognitive Search, WebApps, Batch Accounts, Container Registries
  • Tools: Docker, GitLab CI/CD

Languages and Frameworks

  • Python, Django
  • Scripting: Bash

Description

I designed and implemented an internal knowledge management system using Django, with search functionality powered by Azure Cognitive Search. The system significantly improved information retrieval and knowledge sharing within the organization.

04/2019 - 09/2019 Web Data Crawling for Sentiment Analysis

Client: E.On (onsite as senior consultant)

Technology Stack

  • Azure: Bing Search v7, WebApps, Data Factory, Batch Accounts, Container Registries
  • Tools: Docker, GitLab CI/CD

Languages and Frameworks

  • Python, Scrapy, Tornado, Click, Selenium
  • Scripting: Bash

Description

I automated the crawling of various web sources and stored the data (text, CSV, JSON) for sentiment analysis. Utilizing Azure services, the project streamlined the collection, storage, and processing of web data, enabling effective sentiment analysis.

10/2018 - 03/2019 Energy Usage Disaggregation

Client: E.On (onsite as senior consultant)

Technology Stack

  • Azure: Databricks, WebApps, Batch Accounts, Container Registries
  • Tools: Docker, GitLab CI/CD

Languages and Frameworks

  • Python, Tornado, Click
  • SQL

Description

I created endpoints allowing customers to query predictions of their electricity usage based on sampling statistics. The project delivered valuable insights into energy consumption, enhancing customer engagement through precise usage predictions.

04/2018 - 09/2018 Internal Weather Collector System

Client: E.On (onsite as senior consultant)

Technology Stack

  • Azure: Databricks, WebApps, API Management, Data Factory, Container Registries
  • Tools: Docker, Redis, GitLab CI/CD

Languages and Frameworks

  • Python, Tornado, Flask, Cookiecutter, Click, OpenAPI
  • SQL

Description

I developed a web-based internal weather application by scraping data from multiple sources (DWD and UK’s Met Office). The application provided real-time weather data, accessible through various endpoints, optimizing internal operations requiring weather information.

Work Experience as an Employee

05/2017 - 03/2019 Cognotekt

Position: Senior Back-End Developer

Description

As a senior back-end developer, I developed and maintained core applications for AI-based services in the insurance industry. My role involved writing new modules and integrating them into existing C++ applications. I also prototyped a recommender system in Python using Frequent Items Mining, contributing to improved customer recommendation strategies.

Technology Stack

  • AWS Elastic Beanstalk, AWS S3, Rancher, Grafana, Prometheus, Logz.io, GitLab CI/CD

Languages and Frameworks

  • C++, Python, PostgreSQL

03/2016 - 04/2017 Syniverse

Position: Senior Developer

Description

At Syniverse, I focused on enhancing legacy code for Fraud Detection and Revenue Assurance software. I led technical efforts on a project for Roshan Telecom, using C++ and Oracle SQL/PLSQL. This project earned industry awards for its successful implementation.

Technology Stack

  • On-premise Linux servers, Oracle Database Servers

Languages and Frameworks

  • C++, Oracle SQL, PL/SQL

03/2013 - 02/2016 e.Solutions

Position: Systems Applications Developer

Description

I developed system applications for Audi/Volkswagen’s MMI products, focusing on modules for device updates. Working in C++ on the QNX operating system, my contributions ensured the reliability and performance of software used in millions of VW group vehicles.

Technology Stack

  • Hardware MMI Units

Languages and Frameworks

  • C++, C, QNX, Perforce, CMake, bash/ksh, Python, QtCreator

10/2008 - 02/2013 ZIRA

Overview

At ZIRA, I worked on both rating and billing systems, developing core applications and modules. I significantly contributed to the fraud detection system, specifically the Profiling Engine, which was successfully deployed in major telecom companies.

Technology Stack

  • Oracle DB, On-premise Linux servers

Languages and Frameworks

  • C++, Oracle SQL, PL/SQL

Education

  • 05/2021: Databricks Data Engineering and Delta Optimization (6-day course)
  • 2003 - 2009: Dipl. Math-Inf., University of Sarajevo, Faculty of Sciences, Department of Mathematics
  • 1999 - 2003: Gymnasium Marko Marulic, Tomislavgrad