Data Scientist/Lead Data Engineer
- Developed and implemented statistical analysis, data mining, and other aspects of data analytics to collect, explore, and extract insights from structured and unstructured data using Spark and Python.Developed and implemented statistical analysis, data mining, and other aspects of data analytics to collect, explore, and extract insights from structured and unstructured data using SQL, Spark and Python.
- Automate manual reports using SQL, business intelligence tools, and Python, improving the completion time of reports.
- Built data warehousing solution with custom ETL scripts in Python to extract and house data from multiple sources.
- Developed and maintained tools and pipelines for automating big data processes using SQL.
- Constructed end-to-end custom pipelines using object-oriented programming in Python in order to inherit and implement Scikit-Learn API functionality.
© 2022 Masum Rumi, All rights reserved