Icon picture

SELVER BASIC

Data Architect and Engineer

Landsbergerstr. 107 - Munich, Germany selver.basic@gmail.com +49 1515 0284707

Profile

Experienced software engineer and data architect with a background in consulting across various industries in Germany. Heavy focus on Python.
Started as a C++ and SQL developer, moving through roles such as testing consultant and embedded systems developer. Shifted focus to cloud technologies in 2017, specializing in Azure.
Using Emacs.
Freelancing since late 2019, focusing on Microsoft Azure and Databricks for data engineering, with significant DevOps experience using Azure DevOps, GitHub, and GitLab.

Languages

  • German
  • English
  • Bosnian
  • Croatian
  • Serbian

Freelance Projects

Data architect

Designing a new data platform 04.2024 - present Client: RTL Deutschalnd & Bertelsmann

Architecting a new solution for data storage and processing using Azure SQL Server for storage and Databricks for processing and orchestration. Demonstrating the same solution using Delta Lake together with Unity Catalog enabled features like Delta Sharing, column masking, row-level security and access management (ACL).

Databricks Unity Catalog Azure SQL Server PostgreSQL Data Lake Delta Sharing Log Analytics Workspace Github Github Actions Alembic SQLAlchemy bash

Data engineer and lead developer

Designing and managing ELT pipelines with Azure Databricks 06.2023 - 04.2024 Client: RTL Deutschalnd & Bertelsmann

Implemented ELT pipelines for a large media client to streamline data ingestion from various sources (cloud storage, REST APIs, BigQuery etc). Utilized Azure Synapse to orchestrate data flows, Databricks for data transformation and processing, and Azure SQL Server for data storage. Enhanced data reliability and performance through automated monitoring and alerting, ensuring timely insights for business intelligence reports. Later on we changed the system and dropped Azure Synapse for Databricks Workflows which was a great move. Used Databricks bundle deployments for configuring and deploying workflows and configuring clusters.

python Databricks Databricks asset bundles Azure Synapse PostgreSQL Data Lake Log Analytics Workspace Github Github Actions Alembic pydantic bash

Data architect and engineer

Microsoft collaboration on designing a system for multiple Data Catalog integration 07.2022 - 06.2023 Client: Bertelsmann

This project involved the integration of various data catalogs, including Acryl (based on DataHub), Microsoft Purview, and custom catalogs sourced from databases, into a unified system — Microsoft Purview. The tasks included synchronizing entities, glossaries, and term hierarchies from external catalogs using the underlying Apache Atlas API. The solution was designed to be event-driven, leveraging Azure Event Hub and Azure Service Bus. The orchestration was implemented using Azure Durable Functions. Azure Table Storage was employed as the state store.

Microsoft Purview Azure Service Bus Azure Event Hub Azure Table Services Application Insights python Github Github Actions Azure Functions bash Apache Atlas

Data engineer

Improving a Databricks based solution 01.2022 - 07.2022 Client: BASF

Utilized the Enterprise Data Lake Platform to prepare data for the Data Science Platform. The primary data sources were SAP systems, which provided significant daily data volumes. Pipeline orchestration was managed through a custom solution implemented in SQL Server, responsible for state management of jobs, tasks and records.

Databricks Lakehouse Data Lake Data Factory Azure DevOps SQL Server PowerShell bash SAP B4

API developer

Blueprint for API Development: Standardizing Processes with FastAPI and Azure Services 11.2021 - 01.2022 Client: vattenfall

I designed a comprehensive blueprint for API development utilizing FastAPI and Azure services, which has been adopted by multiple teams within the organization. This blueprint standardizes the creation and deployment processes for APIs, thereby improving development efficiency and ensuring consistency across various projects. Tools used are Azure API Management, Azure App Services ans FastAPI.

Azure API Management Azure App Services FastAPI OpenAPI Container Registry

Data Engineer

Creating a Delta Lake and Masterdata management platform 02.2020 - 12.2021 Client: essity

The project involved the harmonization of data to establish master data, the creation of a data lake, and the development of data pipelines utilizing Data Factory. Data processing tasks are handled using Databricks, while Azure DevOps is employed for the deployment of both pipelines and databases. Additionally, SQL Server is utilized to enhance analytics performance, providing improved integration with PowerBI. Also, managed the migration of projects from MS Azure DL1 to DL2. My role involved both development and design work, ensuring efficient data storage and access.

Databricks Azure SQL Server Masterdata Data Factory Logic Apps Azure DevOps

Application Developer and Data engineer

Creating a knowledge management platform 10.2019 - 02.2020 Client: eon

Implemented a robust internal knowledge management system leveraging Django and Azure Cognitive Search, which facilitated efficient information retrieval and enhanced organizational knowledge sharing. Utilized Docker for containerization and GitLab CI/CD for streamlined deployment. Additionally, employed Bash scripting for automation tasks.

Azure Cognitive Search Azure App Services Azure Batch Accounts Container Registries Docker Gitlab CI/CD Django

Employment

Senior Consultant

Promantis GmbH 04.2018 - 12.2019

Was the Senior Consultant for Data questions at Promantis. Worked only for one client - e.On. My focus was on Data Engineering and application development. Once the data was in the Data Lake there was a need for different platforms to be created in order to make that data most accessible for internal users. Those users were Data Science teams as well as the management.

Bing Search v7 Azure App Services Azure Batch Accounts Container Registries Data Factory Docker Gitlab CI/CD Redis Tornado Flask OpenAPI Click

System developer and Architect

Cognotekt GmbH 05.2017 - 04.2018

As a senior back-end developer, I developed and maintained core applications for AI-based services in the insurance industry. My role involved writing new modules and integrating them into existing C++ applications. I also prototyped a recommender system in Python using Frequent Items Mining, contributing to improved customer recommendation strategies.

C++ PostgreSQL AWS Elastic Beanstalk AWS S3 Rencher Grafana Prometheus Logz.io Gitlab CI/CD python

Senior Developer

Syniverse 03.2016 - 05.2017

At Syniverse, I focused on enhancing legacy code for Fraud Detection and Revenue Assurance software. I led technical efforts on a project for Roshan Telecom, using C++ and Oracle SQL/PLSQL. This project earned industry awards for its successful implementation.

C++ Oracle PL/SQL Telecommunication Linux

Systems Applications Developer

E.Solutions GmbH 03.2013 - 03.2016

I developed system applications for Audi/Volkswagen's MMI products, focusing on modules for device updates. Working in C++ on the QNX operating system, my contributions ensured the reliability and performance of software used in millions of VW group vehicles.

C++ C QNX CMake bash

Developer and System Designer

ZIRA 10.2008 - 02.2013

At ZIRA, I worked on both rating and billing systems, developing core applications and modules. I significantly contributed to the fraud detection system, specifically the Profiling Engine, which was successfully deployed in major telecom companies.

C++ Oracle SQL PL/SQL bash Linux

Certificates

07.2024

Databricks Certified Data Engineer Professional RTL Deutschalnd

05.2024

Advanced Cryptocurrency Trading Course - With Strategies

05.2021

Data Engineering with Databricks

05.2021

Optimizing Apache Spark on Databricks

09.2014

R Programming

Formal Education

Diploma in Mathematics and Computer Science

University of Sarajevo, Faculty of Sciences, Department of Mathematics 2003 - 2009

High School Diploma

Gymnasium Marko Marulic, Tomislavgrad 1999 - 2003

Interests

Programming
Chess
Reading
Trading
Emacs