AI Senior Engineer - Vision
WFA Digital Insight
As demand for AI and machine learning specialists continues to grow, with a 25% increase in job postings in 2025, companies like Able are at the forefront of innovation. With the rise of remote work, professionals with expertise in Computer Vision and Logic are in high demand. Able's commitment to becoming an AI-native organization sets it apart, and this role offers engineers an opportunity to work on cutting-edge projects. Before applying, candidates should know that the role emphasizes collaboration, continuous learning, and a passion for building and delivering software effectively.
Job Description
About the Role
Able is seeking an AI Senior Engineer - Vision to join their team in a 100% remote position within Latin America. As a senior engineer, you will be responsible for developing and implementing Computer Vision solutions that extract complex data from visual documents. Your work will be crucial in driving the company's ambition to become an AI-native organization. You will be working closely with a team of engineers and designers who share a passion for building and delivering software that is thoughtful, effective, and genuinely useful.
The role entails working at the cutting edge of Computer Vision and Logic, where you will be responsible for building pipelines that can 'read' complex documents, understanding layout, charts, and visual context using Vision-Language Models. Your expertise in orchestrating intelligence, native PDF handling, prompt engineering, and cost optimization will be essential in delivering high-quality solutions.
Able's team is driven by a builder mindset, and as such, you will be expected to collaborate with your teammates, share knowledge, and continuously learn and grow. The company is committed to investing in its team through AI training, knowledge-sharing, and hands-on experimentation.
What You Will Do
- Develop and implement Computer Vision solutions to extract complex data from visual documents
- Build pipelines that can 'read' complex documents, understanding layout, charts, and visual context using Vision-Language Models
- Own the application logic layer, using LangChain or LangGraph to build agents and chains that query data, reason about it, and generate responses
- Handle native PDF processing, preserving structure before the AI even sees it
- Craft complex prompts and control flows to ensure models interpret financial charts and layouts accurately without hallucinating
- Apply a cost-optimization mindset to ensure vision and orchestration layers are economically viable
- Collaborate with the team to deliver high-quality software solutions
- Continuously learn and grow, sharing knowledge and expertise with the team
What We Are Looking For
- Deep experience with LangChain, LangGraph, or similar frameworks
- Hands-on experience integrating state-of-the-art vision models and embedding models
- Familiarity with specialized models and tools like Unstructured.io or Docling
- Mastery of tools like PyMuPDF or pdfplumber for native element extraction
- Strong proficiency in PyTorch or TensorFlow
- Experience with multimodal AI and document intelligence
- Strong verbal and written communication skills in English
- Ability to work 40 hours per week and be available during normal business hours as needed
Nice to Have
- Experience fine-tuning vision or language models
- Prior experience handling documents in the Real Estate or Finance sectors
- Knowledge of other AI frameworks and tools
- Experience working in a remote team environment
Benefits and Perks
- 100% remote work within Latin America
- Payments made in USD
- 18 days of PTO per year, observance of local holidays, and an annual bonus
- Opportunity to work with a team of curious, thoughtful people who care about what they build and how they build it
- Access to AI training, knowledge-sharing, and hands-on experimentation to ensure continuous growth and development
- Strong emphasis on collaboration, inclusivity, and respect in the workplace
- Opportunity to work on cutting-edge projects and contribute to the company's ambition to become an AI-native organization
How to Stand Out
- Tip: Ensure you have a strong portfolio that showcases your experience with Computer Vision and Logic, including any relevant projects or certifications.
- Tip: Practice your communication skills, as strong verbal and written communication in English is a requirement for this role.
- Tip: Be prepared to discuss your experience with LangChain, LangGraph, or similar frameworks, and how you have applied them in previous roles.
- Tip: Highlight your ability to work independently and collaboratively in a remote team environment.
- Tip: Research the company culture and values, and be prepared to discuss how you align with them.
- Tip: Be prepared to discuss your experience with cost optimization and how you have applied it in previous roles.
This is a remote position listed on WFA Digital, the platform for professionals who work from anywhere.