featherlessai
New

Machine Learning Engineer — AI Architecture Research

(world)Remotefull-timemid
Machine Learning EngineerData
0 views0 saves0 applied

Quick Summary

Overview

About the Role We’re looking for a Machine Learning Engineer focused on AI architecture research to help design, prototype, and validate next-generation model architectures. You’ll work at the intersection of research and production — turning new ideas into scalable, real-world systems.

Requirements Summary

Experience with non-Transformer architectures (RNN variants, SSMs, long-context models) Background in research-driven startups or open-source ML projects Experience with large-scale training or custom training loops Publications, preprints, or…

Technical Tools
pytorchab-testingdeep-learningmachine-learning

About the Role

~1 min read

We’re looking for a Machine Learning Engineer focused on AI architecture research to help design, prototype, and validate next-generation model architectures. You’ll work at the intersection of research and production — turning new ideas into scalable, real-world systems.

This role is ideal for someone who enjoys questioning architectural assumptions, experimenting with novel model designs, and pushing beyond standard Transformer-style approaches.

  • Research and develop new neural network architectures (e.g. alternatives or extensions to Transformers, recurrent / hybrid models, long-context systems)

  • Design and run architecture-level experiments (scaling laws, memory mechanisms, compute trade-offs)

  • Prototype models end-to-end — from research code to training-ready implementations

  • Collaborate with inference and systems engineers to ensure architectures are deployable and efficient

  • Analyze model behavior, failure modes, and inductive biases

  • Read, reproduce, and extend cutting-edge research papers

  • Contribute to internal research notes, benchmarks, and open-source efforts (where applicable)

  • Strong background in machine learning fundamentals and deep learning

  • Hands-on experience implementing model architectures from scratch

  • Solid understanding of:

    • Attention mechanisms, RNNs, state-space models, or hybrid architectures

    • Training dynamics, scaling behavior, and optimization

    • Memory, latency, and compute constraints at the model level

  • Comfortable working in PyTorch or JAX

  • Ability to move fluidly between theory, experimentation, and engineering

  • Clear communicator who can explain architectural trade-offs

Nice to Have

~1 min read
  • Experience with non-Transformer architectures (RNN variants, SSMs, long-context models)

  • Background in research-driven startups or open-source ML projects

  • Experience with large-scale training or custom training loops

  • Publications, preprints, or notable research contributions

  • Familiarity with inference optimization and deployment constraints

What We Offer

~1 min read
Work on core model architecture, not just fine-tuning
Direct influence on the technical direction of a Series-A company
Small, high-caliber team with fast feedback loops
Opportunity to ship research into production
Competitive compensation + meaningful equity

Location & Eligibility

Where is the job
Worldwide
Fully remote, anywhere in the world
Who can apply
Same as job location

Listing Details

Posted
January 22, 2026
First seen
May 6, 2026
Last seen
May 8, 2026

Posting Health

Days active
0
Repost count
0
Trust Level
23%
Scored at
May 6, 2026

Signal breakdown

freshnesssource trustcontent trustemployer trust
Newsletter

Stay ahead of the market

Get the latest job openings, salary trends, and hiring insights delivered to your inbox every week.

A
B
C
D
Join 12,000+ marketers

No spam. Unsubscribe at any time.

featherlessaiMachine Learning Engineer — AI Architecture Research