Are you a skilled Data Scientist with a passion for applying machine learning (ML) techniques to solve real-world challenges? We are seeking an innovative and experienced individual to join our team and help build cutting-edge ML solutions. As a Data Scientist, you’ll be at the forefront of transforming data into actionable insights that drive business decisions and impact.
In this role, you will collaborate with cross-functional teams to design, implement, and optimize scalable data pipelines and machine learning models that tackle both generalized and specialized business use cases. You'll work with large, complex datasets and apply statistical analysis, feature engineering (FE), and model training to extract valuable insights and create efficient, production-ready solutions.
Key Responsibilities:
- Build Scalable Pipelines: Design and develop data collection, transformation, and integration pipelines that are efficient and reliable, ensuring the seamless flow of data for ML applications.
- Data Preprocessing & Feature Engineering: Collect, clean, preprocess, and conduct feature engineering (FE) to ensure high-quality input for machine learning models.
- Exploratory Data Analysis (EDA): Perform statistical analysis and EDA to uncover hidden patterns and validate findings in complex datasets, transforming data into actionable insights.
- Model Development & Deployment: Select, train, validate, and deploy machine learning models for both generalized and specialized business problems.
- Transfer Learning & Fine-Tuning: Implement transfer learning pipelines with pretrained transformers, optimizing models for specific use cases.
- End-to-End Pipeline Optimization: Develop, optimize, and maintain end-to-end ETL and ML pipelines for seamless deployment in production environments.
- Collaboration with Teams: Work closely with product, engineering, and business teams to implement data-driven solutions that address strategic challenges and create measurable impact.
Who You Are:
You are a proactive problem solver with a strong foundation in machine learning and data science. You have hands-on experience with Python and popular ML libraries, and you're comfortable working with large-scale datasets. You thrive in collaborative environments, translating complex, technical insights into clear and actionable recommendations for business stakeholders.
Requirements:
- Education: Master’s or Ph.D. in Computer Science, Statistics, Mathematics, or a related field.
- Technical Expertise: Proficiency in Python, with hands-on experience in libraries such as PySpark, Pandas, and PyTorch.
- Data Skills: Strong expertise in data preprocessing, transformation, and manipulation techniques to work with complex datasets.
- Modeling Skills: In-depth understanding of machine learning algorithms, particularly GBT, MLP/RNN/Transformer architectures.
- Model Interpretability: Experience implementing techniques to improve model interpretability and explainability.
- Big Data Knowledge: Familiarity with big data platforms and distributed computing to handle large-scale datasets.
- Communication Skills: Strong communication abilities, with a knack for presenting complex, technical insights to business stakeholders in a clear and actionable way.
Why Join Us?
- Innovative Work: Be part of an innovative team using the latest advancements in machine learning to solve real-world business challenges.
- Career Growth: You'll have the opportunity to continuously learn and develop your technical and business skills while making a measurable impact.
- Collaborative Culture: Work in a supportive and collaborative environment with cross-functional teams to solve complex problems.
- Competitive Compensation: Enjoy a competitive salary and a comprehensive benefits package designed to support your well-being and career development.