Talentium
Open Nationwide: Philippines
Language Requirement: English
Scope of Work/Responsibilities:
1. Data Pipeline Development:
· Design, implement, and optimize end-to-end data pipelines using Databricks and related technologies.
· Build workflows to handle large-scale data ingestion, transformation, and storage.
2. Data Preparation for LLMs:
· Preprocess, clean, and structure diverse datasets (text, structured, and unstructured) for LLM training and fine-tuning.
· Implement feature engineering, tokenization, and vectorization techniques to support NLP models.
3. Performance Optimization:
· Use Databricks features, including Delta Lake and MLflow, to streamline data workflows.
· Optimize data infrastructure for high availability, scalability, and cost-efficiency.
4. Collaboration with Teams:
· Work closely with data scientists, ML engineers, and other stakeholders to understand the data requirements of LLM technology.
· Ensure alignment between engineering pipelines and machine learning goals.
5. Data Quality & Governance:
· Implement processes to ensure data quality, consistency, and compliance with governance policies.
· Monitor and maintain data integrity throughout the pipeline lifecycle.
6. Emerging Technology Adoption:
· Stay updated on advancements in Databricks, generative AI, and LLM technologies.
· Contribute to the adoption of innovative tools and practices to improve workflows.
Requirements and Qualifications (Education & Work Experience):
Experience:
· 7+ years of experience in data engineering roles, including at least 2 years in a leadership role and on projects involving Databricks.
· Proven expertise in data pipelines, feature engineering, and dataset preparation for machine learning, specifically LLMs.
· Experience building enterprise-grade applications with GenAI or AI/ML integrations.
Technical Skills:
· Expertise in Databricks, Apache Spark, and Delta Lake.
· Strong programming skills in Python and SQL; knowledge of libraries like pandas, NumPy, or PyTorch is a plus.
· Understanding of state management libraries like Redux, Recoil, or Zustand, testing tools (Cypress), and version control (Git).
· Understanding of web security principles and compliance requirements for enterprise applications.
Soft Skills:
· Exceptional problem-solving and decision-making abilities.
· Excellent communication and leadership skills, with the ability to guide technical discussions and mentor team members.
· Strong focus on delivering quality.
Boss
HR Manager, Talentium
Posted on 24 April 2025