Senior Machine Learning Engineer (Spain)
Quick Summary
At RemoteStar, we're currently hiring for one of our client based in Spain. 9-month fixed-term contract | Hybrid (3 days/week onsite) | Location: Barcelona or Madrid About client : Well-funded and fast-growing deep-tech company founded in 2019.
Master’s, or Ph.D. in Computer Science, AI, Data Science, Physics, Math, or a related field. Or equivalent industry experience. 4+ years of experience in data science, machine learning, or related roles, with demonstrated experience with NLP or LLMs.
At RemoteStar, we're currently hiring for one of our client based in Spain.
Requirements
~1 min read- Master’s, or Ph.D. in Computer Science, AI, Data Science, Physics, Math, or a related field. Or equivalent industry experience.
- 4+ years of experience in data science, machine learning, or related roles, with demonstrated experience with NLP or LLMs.
- In-depth knowledge of large foundational model architectures (language and multimodal models) and their lifecycle: training, fine-tuning, alignment, and evaluation.
- Proficient in Python and data tooling ecosystems (Pandas, NumPy, Hugging Face Datasets & Transformers libraries).
- Hands-on experience with text data collection from diverse sources: web scraping, APIs, proprietary corpora, etc.
- Strong understanding of data quality metrics including bias detection, toxicity, and readability.
- Experience working in large shared distributed computing environments, familiarity with relevant tools for hardware optimization (vLLM, TensorRT, NeMo, etc.).
- Experience with version control (git), unit testing, and other fundamental aspects of software development.
- Effective communication and interpersonal abilities.
Requirements
~1 min read- Experience building or contributing to datasets used in LLM pretraining or supervised fine-tuning.
- Experience building foundational LLMs from the ground up
- Familiarity with alignment techniques (e.g., reinforcement learning, preference modeling, reward modeling).
- Exposure to multilingual and low-resource language datasets.
- Contributions to open-source datasets, tools, or publications in dataset-centric research.
- Knowledge of ethical AI, data governance, privacy laws (e.g., GDPR), and responsible data use.
- Familiarity with the software development lifecycle and agile methodologies
- Design and implement strategies for creating, sourcing, and augmenting datasets tailored for LLM training and fine-tuning.
- Develop scalable pipelines to collect, clean, filter, annotate, and validate large volumes of text data, ensuring quality, ethical compliance, etc.
- Collaborate with ML engineers, researchers, and software engineers to achieve ambitious goals in the preparation of LLMs and complementary work (preparing datasets, model evaluation, model serving, etc.).
- Develop and integrate new routines for modifying and enhancing LLMs, and extending their functionality.
- Make effective use of distributed compute resources and clusters (GPU’s), identify opportunities for further optimization.
- End-to-end preparation of compressed and specialized LLMs for use in production.
- Keep up to date with research trends in LLM foundation models, dataset curation, LLM pretraining data, and benchmarking.
- Contribute to building documentation, development standards, and a healthy shared code base.
- Mentor other engineers and provide knowledge sharing of cutting-edge techniques.
What We Offer
~1 min readLocation & Eligibility
Listing Details
- First seen
- May 6, 2026
- Last seen
- May 8, 2026
Posting Health
- Days active
- 0
- Repost count
- 0
- Trust Level
- 59%
- Scored at
- May 6, 2026
Signal breakdown
Please let remotestar-team know you found this job on Jobera.
4 other jobs at remotestar-team
View all →Explore open roles at remotestar-team.
Similar Machine Learning Engineer jobs
View all →Stay ahead of the market
Get the latest job openings, salary trends, and hiring insights delivered to your inbox every week.
No spam. Unsubscribe at any time.