Python Developer with expertise in data engineering and ETL processes. Strong background in Python development using pandas and NumPy for data transformation. Experienced in API development with FastAPI and database management with SQL/NoSQL. Proven track record of successfully migrating legacy systems to efficient Python-based implementations with improved performance. You can find my curriculum vitae here or download the PDF version here.
Developed a real-time customer service call management system using Python, FastAPI, and WebSockets, building end-to-end data processing pipelines for voice communication. Implemented a prototype application for voice memo transcription with structured data extraction and automated task management based on content severity. Optimized multi-class text classification through prompt engineering and k-fold validation, significantly improving model performance.
Successfully migrated a Unity-based Android application to WebGL, implementing asynchronous connection management and data handling, demonstrating strong system migration capabilities. Integrated Python-based analytics pipelines with cloud services, enhancing real-time data processing through optimized ETL workflows. Developed chatbot APIs across multiple platforms using Python and RESTful services, improving user engagement through data-driven insights.
Delivered comprehensive Python programming workshops focusing on data manipulation, pandas, and ETL concepts for real-world applications. Guided students in implementing ER diagrams and SQL for practical database solutions, emphasizing data integrity and efficient query optimization.
Developed a Python-based data generation and transformation pipeline, optimizing ETL processes for deep learning models in research applications. Designed custom data preprocessing workflows and performance metrics for efficient data analysis and model evaluation.
Developed a lightweight data processing pipeline for Retrieval-Augmented Generation (RAG) using Python, enhancing information retrieval from domain-specific datasets. Implemented efficient data extraction and transformation techniques to optimize contextual search, demonstrating expertise in ETL processes for NLP applications.
Built a search engine from scratch in Python, implementing data extraction, cleaning, and transformation pipelines for efficient text processing. Utilized pandas for data manipulation and designed an indexing system to improve search accuracy for large document collections.
Developed a Python-based ETL pipeline for processing textual data using pandas and NumPy. Created vectorized operations for efficient text feature extraction, achieving 93.61% accuracy in sentiment classification through optimized data transformation.