C4ADS
C4ADS21d ago

Data Scientist

Washington Dcmid
Data ScienceData ScientistDataData & AI
0 views0 saves0 applied

Quick Summary

Overview

Data Scientist Position Summary The Data Scientist will research, design, and evaluate AI and machine learning solutions that advance C4ADS's investigative mission.

Technical Tools
Data ScienceData ScientistDataData & AI

The Data Scientist will research, design, and evaluate AI and machine learning solutions that advance C4ADS's investigative mission. This role is centered on applied research and experimentation, with a focus on designing and testing LLM-based workflows, building retrieval and agentic pipelines, and translating complex analytical needs into working tools that investigators and analysts can actually use. 

The ideal candidate combines strong technical depth in NLP and generative AI paired with genuine familiarity with global security, illicit networks, or open-source research. This person should be equally comfortable experimenting with emerging technologies, translating complex analytical needs into workable technical solutions, and communicating limitations and tradeoffs clearly to non-technical audiences.

Responsibilities

~1 min read
  • Research, prototype, and evaluate LLM-based approaches to investigative data challenges
  • Design and test RAG (retrieval-augmented generation) pipelines for document-heavy research workflows
  • Develop and maintain MCP-based agentic tooling that supports analyst workflows
  • Conduct prompt engineering and model evaluation to assess output quality, reliability, and limitations
  • Stay current with developments in generative AI, NLP, and vector-based retrieval and bring relevant advances into organizational practice
  • Apply advanced NLP techniques to multilingual, unstructured text data including entity extraction, classification, clustering, and semantic search
  • Work with vector embeddings and embedding models to support similarity search and knowledge retrieval
  • Evaluate and select appropriate models and frameworks for specific research tasks, balancing performance, cost, and interpretability
  • Maintain awareness of ML fundamentals and apply sound experimental design when testing approaches
  • Partner closely with research and program teams to understand investigative data needs
  • Translate complex analytical requirements into tractable ML or AI problem framings
  • Build and document tools and workflows that empower non-technical investigators to work with AI outputs effectively
  • Communicate model limitations, uncertainty, and appropriate use clearly to non-technical stakeholders
  • Develop clear visualizations and summaries that communicate analytical findings to internal and external audiences
  • Contribute to written outputs including technical documentation, research memos, and methodology notes
  • Present findings and methodologies to program teams and organizational leadership
  • Participate in technical demos and engagements with external partners and clients
  • Advise external stakeholders on AI-assisted research approaches and appropriate use of generative AI tools
  • Represent C4ADS's technical capabilities in partnership and grant contexts where relevant

Requirements

~1 min read
  • 2-4 years of experience in data science, applied ML, or NLP research (professional, academic, or a strong combination)
  • Strong understanding of NLP fundamentals and generative AI concepts including transformers, embeddings, RAG, and prompt engineering
  • Proficiency in Python for data processing, modeling, and pipeline development
  • Working knowledge of SQL and relational data
  • Hands-on experience with LLM application frameworks such as LangChain, LlamaIndex, or similar
  • Experience working with vector databases or embedding-based retrieval (Pinecone, Weaviate, pgvector, or similar)
  • Familiarity with AWS cloud services for storing and processing data
  • Experience working with government datasets, legal documents, or public records is a plus
  • Proficiency with Git and collaborative development practices
  • Background in international affairs, global security, conflict, sanctions, or illicit finance through coursework, research, or professional experience in the public sector, journalism, or other similar investigative and research fields.
  • Familiarity with OSINT methods or open-source research practices
  • Bachelor's or advanced degree in Data Science, Computer Science, Mathematics, Statistics, or a related technical field preferred, equivalent demonstrated experience considered
  • Ability to independently scope and execute research projects, including shaping deliverables in collaboration with non-technical analytical teams
  • Strong analytical rigor with comfort questioning model outputs, stress-testing assumptions, and designing evaluations that surface failure modes
  • Thoughtful approach to AI ethics and responsible use, including awareness of bias, hallucination risks, and the stakes of applying AI in sensitive investigative contexts
  • Strong written and verbal communication skills, including ability to explain technical concepts to non-technical audiences
  • Collaborative working style with comfort operating across research and technical teams
  • Intellectual curiosity and genuine interest in the organization's mission

Requirements

~1 min read
  • Familiarity with Palantir Foundry or similar enterprise data platforms
  • Experience with graph databases or network analysis (Neo4j or similar)
  • Background in a nonprofit, policy, journalism, or national security research context
  • Experience with multilingual NLP or working across non-English language datasets
  • Hybrid role based in Washington, DC, with a minimum of 2 in-office days per week
  • Standard business hours with occasional flexibility required based on project needs
  • Highly collaborative team environment with regular cross-functional engagement across research data, and operations teams

What We Offer

~1 min read
Salary: $95,000
Benefits: Medical, Dental, and Vision insurance
Retirement: Voluntary 401(k) with 4% employer match
Time Off: Unlimited vacation and flex time, emphasizing outcomes over hours
Learning: $4,500 annual education stipend to grow your skills and language capabilities
Development: Access to customizable professional development resources
Culture: A mission-driven team that values hard work, creativity, and a healthy, supportive workplace

Location & Eligibility

Where is the job
Washington Dc
On-site at the office
Who can apply
Same as job location
Listed under
Worldwide

Listing Details

Posted
April 13, 2026
First seen
April 19, 2026
Last seen
May 5, 2026

Posting Health

Days active
16
Repost count
0
Trust Level
23%
Scored at
May 5, 2026

Signal breakdown

freshnesssource trustcontent trustemployer trust
C4ADS
C4ADS
greenhouse
Employees
125
Founded
2002
Domain
c4ads.org
View company profile

1 other job at C4ADS

View all →

Explore open roles at C4ADS.

Newsletter

Stay ahead of the market

Get the latest job openings, salary trends, and hiring insights delivered to your inbox every week.

A
B
C
D
Join 12,000+ marketers

No spam. Unsubscribe at any time.

C4ADSData Scientist