
AI Senior Engineer - Vision
Quick Summary
Unlocking Visual Data: Building pipelines that can "read" complex documents, understanding layout, charts, and visual context using Vision-Language Models (GPT-4V, Claude 3.5) and Layout Analysis.
- Work 40 hours per week and be available during normal business hours as needed
- Payments made in USD
- 18 days of PTO per year, plus observance of local holidays
Back in 2012, we were a group of engineers and designers who decided we wanted to build things—so we did. Able started as an engineering and product hub building for a portfolio of early-stage startups. We built many relationships while developing products that were thoughtful, effective, and genuinely useful. But, since then, we’ve grown… and so has our ambition.
Now, we’re entering our next chapter, one defined by applied AI. AI is a powerful force in the end-to-end software development cycle, and we’re creating practices that allow us to deliver software faster and more effectively than traditional approaches, creating meaningful value for our partners. Today, our builder mindset is driving us to become an AI-native organization across every function. We’re still evolving, and that’s part of the opportunity. If you want to build, learn, and tackle challenges alongside an ambitious team, let’s build together.
Responsibilities
We are seeking someone who enjoys working at the cutting edge where Computer Vision meets Logic. You will be responsible for the "eyes" and the "brain" of our system: extracting complex data from visual documents and then orchestrating how that data is used by Large Language Models.
In short, someone who likes:
- Unlocking Visual Data: Building pipelines that can "read" complex documents, understanding layout, charts, and visual context using Vision-Language Models (GPT-4V, Claude 3.5) and Layout Analysis.
- Orchestrating Intelligence: Owning the application logic layer. You will use LangChain or LangGraph to build the agents and chains that query our data, reason about it, and generate responses.
- Native PDF Handling: Handling the messy reality of PDF processing (PyMuPDF, layout parsing) to preserve structure before the AI even sees it.
- Prompt Engineering & Logic: Crafting complex prompts and control flows to ensure models interpret financial charts and layouts accurately without hallucinating.
- Cost & Scale: Applying a cost-optimization mindset (batch processing, model selection) to ensure our vision and orchestration layers are economically viable.
We want to work with people who have a passion for collaborating with their teams and building software while nurturing inclusive and respectful relationships with their coworkers. We value those who are open about their shortcomings and what they do not yet know, but who remain eager to keep growing and close those gaps.
Ideally, they would also have:
- LLM Orchestration (Must Have): Deep experience with LangChain, LangGraph, or similar frameworks. You know how to manage context windows, tool calling, and agentic workflows.
- Multimodal AI Experience: Hands-on experience integrating state-of-the-art vision models (GPT-4V, Claude 3.5 Sonnet) and embedding models (CLIP).
- Document Intelligence Specialist: Familiarity with specialized models (e.g., Donut, Pix2Struct) and tools like Unstructured.io or Docling.
- PDF Processing Mastery: Command of tools like PyMuPDF or pdfplumber for native element extraction.
- Python ML Stack: Strong proficiency in PyTorch or TensorFlow.
- Fine-Tuning: Experience fine-tuning vision or language models, specifically to improve accuracy on domain-specific artifacts like financial charts or tables.
- Domain Knowledge: Prior experience handling documents in the Real Estate or Finance sectors.
Able is powered by curious, thoughtful people who care about what they build and how they build it. We’re actively investing in our team through AI training, knowledge-sharing, and hands-on experimentation to ensure everyone grows alongside the technology.
Listing Details
- Posted: April 8, 2026
- First seen: March 25, 2026
- Last seen: April 11, 2026
Posting Health
- Days active: 16
- Repost count: 0
- Trust Level: 58%
- Scored at: April 11, 2026
Able is a product acceleration studio that designs, develops, and deploys technology solutions for startups, established companies, and non-profits, with a remote-first team distributed across the Americas.