Core Skills
Machine Learning & Modeling: Python, machine learning algorithms, PyTorch, scikit-learn, LightGBM, feature engineering, gradient boosting, regression, classification, clustering, neural networks, Natural Language Processing (NLP), embeddings, fastText, sentence-transformers, metric learning, active learning, AutoML, Optuna, calibration, model evaluation
Production ML & MLOps: model deployment, model monitoring, scheduled retraining, validation gates, fallback strategies, model versioning, data quality control, CI/CD, GitLab CI, automated testing, pytest, Docker, FastAPI, Google Cloud Platform (GCP), PostgreSQL, ClickHouse, SQL
Search, Matching & Ranking: product matching, duplicate detection, e-commerce search and catalog, information retrieval, semantic search, similarity search, vector search, candidate generation, approximate nearest neighbor search, reranking, pairwise features, fuzzy matching, FAISS, Elasticsearch, ranking metrics, precision-recall analysis
LLM / RAG & AI Data Workflows: Large Language Models (LLMs), Retrieval-Augmented Generation (RAG), prompt engineering, structured extraction, JSON schema, tool calling, embedding-based retrieval, automated data labeling, human-in-the-loop workflows, LLM evaluation, OpenAI API, Google Vertex AI
Anomaly Detection, Pricing & Forecasting: price anomaly detection, anomaly scoring, price regression, uncertainty estimation, confidence scoring, calibration, quantile regression, forecasting, delivery time forecasting, time-aware validation, temporal features
Data Engineering & Analytics: Pandas, NumPy, Parquet, scheduled pipelines, Celery, batching, sharding, in-database computation, Apache Superset, Power BI, data quality analysis
Software Engineering & Observability: backend engineering, Python application architecture, computer science fundamentals, API design, REST APIs, system integration, Pydantic, type hints, mypy, automated testing, code review, Sentry, Grafana, C++, Ruby on Rails
Technical Ownership & Delivery: requirements clarification, metric definition, ML pipeline architecture, technical decision-making, scalable solution design, business trade-off analysis, stakeholder communication