Applied Methods

Trust & Safety

Specialists in this role develop detection systems and enforcement strategies to identify and mitigate emerging abuse patterns across AI products, working at the intersection of data science, policy, and operations. They balance competing priorities—detecting sophisticated threat actors while maintaining platform usability—by building scalable detection pipelines, conducting rapid investigations, and collaborating with policy and engineering teams to implement mitigations. Unlike policy-focused roles, these positions emphasize technical implementation and quantitative analysis; unlike pure engineering roles, they require deep domain expertise in specific abuse vectors and threat actor behavior. These analysts typically sit within dedicated Trust & Safety or Safeguards teams that operate cross-functionally with research, product, and legal to stay ahead of evolving misuse techniques.

$ titles --canonical
Trust & Safety Operations Analyst
Content Integrity Analyst
Abuse Investigator
AI Safety Manager
Product Safety Manager
AI Safety & Responsibility Manager
Responsible AI Manager

Open Jobs: 39
Companies Hiring: 5
$02

Skills

What companies are looking for in this role.

$ skills --core

Identifying, investigating, and analyzing abuse patterns and harmful behaviors across digital platforms and ecosystems

95%

Designing and maintaining analytical tools, evaluation methodologies, and quality metrics for intelligence workflows

95%

Developing detection signals, tracking strategies, and classification systems to identify malicious activity and policy violations at scale

92%

Monitoring and analyzing emerging threats, trends, and attack vectors to inform proactive risk mitigation strategies

90%

Querying, organizing, and transforming complex datasets using structured data analysis techniques

90%

Collaborating with cross-functional teams including engineering, product, policy, and leadership to shape safety decisions

88%

Designing and executing capability evaluations and threat modeling to assess AI system vulnerabilities and misuse potential

88%

Creating comprehensive policy frameworks, taxonomies, and enforcement workflows for content moderation and safety

88%

Translating domain expertise into actionable safety requirements, guardrails, and technical safeguards for AI systems

85%

Building reusable playbooks, frameworks, templates, and lightweight tools to operationalize and scale recurring analyses

85%

Conducting rapid response investigations into escalations and high-impact safety incidents in ambiguous situations

82%
$ skills --emerging

Designing and refining safety systems to detect harmful AI model outputs and prevent misuse by sophisticated threat actors

78%

Analyzing behavioral and psychological signals to understand user vulnerability, risk patterns, and human-AI interaction risks

75%

Detecting and investigating autonomous, agentic, or self-improving behaviors in AI systems that may introduce safety risks

70%

Applying frontier risk scanning, horizon scanning, and strategic foresight methodologies to identify emerging threats

68%

Developing account-level signals, identity-linking strategies, and graph-based approaches to detect coordinated abuse

65%

Conducting open-source research, dark web monitoring, and cross-platform threat analysis to understand threat actor behavior

62%
$ skills --soft

Producing decision-ready intelligence briefs, risk assessments, and narratives for technical and non-technical stakeholders

90%

Communicating complex technical findings and domain expertise clearly to diverse teams and leadership

88%

Working cross-functionally and collaboratively with diverse technical and non-technical partners

85%

Synthesizing raw signals, multiple data sources, and qualitative information into structured, actionable insights

82%

Operating with strong judgment and resilience in high-pressure environments with sensitive or disturbing content

80%

Identifying gaps in existing safeguards, evaluations, and monitoring systems and proposing improvements

78%

Leading, mentoring, and managing teams focused on safety operations and content moderation

70%
$03

Technology

The tools and technologies that define this role.

$ tech --language
Python: very high
SQL: very high
$ tech --platform
Content Moderation Systems: high
$ tech --tool
Dashboard Tools: moderate
Graph Database: moderate
Jupyter Notebooks: moderate
$ tech --concept
Anomaly Detection: high
Data Analysis: high
Evaluation Frameworks: high
Large Language Models: high
Machine Learning: high
Policy Enforcement Automation: high
Statistical Analysis: high
Threat Modeling: high
Open Source Intelligence: moderate
Dark Web Monitoring: low
$04

Open Jobs

39 open Trust & Safety jobs across 5 companies.

Replit · 21h
Staff Software Engineer, Trust & Safety
Foster City, CA · Security

Replit · 21h
Senior Software Engineer, Trust & Safety
Foster City, CA · Security

OpenAI · 1w
Data Scientist, Safety
London, UK · Security

OpenAI · 1w
Data Scientist, Safety
San Francisco · Security

Anthropic · 1w
Incident Response Manager, Enforcement
San Francisco, CA | New York City, NY | Washington, DC · Security

OpenAI · 3w
Protection Scientist Engineer, Integrity
San Francisco · Security

Anthropic · 3w
Content Moderation Specialist
San Francisco, CA | New York City, NY | Washington, DC · Security

OpenAI · 3w
Technical Intelligence Analyst
San Francisco · Security

OpenAI · 1mo
AI Emerging Risks Analyst
San Francisco · Security

OpenAI · 1mo
Abuse Investigator (AI Self-Improvement Risk)
San Francisco · Security

OpenAI · 1mo
Strategic Risk Analyst, Behavioral & Psychological Risk
San Francisco · Security

Replit · 1mo
Trust & Safety Specialist
Foster City, CA · Security

Anthropic · 1mo
Safeguards Policy Analyst, Fraud & Scams
Remote-Friendly (Travel-Required) | San Francisco, CA | New York City, NY · Security

xAI · 1mo
Senior Analyst - Safety Operations (Child Safety)
Palo Alto, CA · Security

xAI · 1mo
Senior Analyst - Safety Operations (Child Safety)
Bastrop, TX · Security

xAI · 1mo
Senior Analyst, Safety Operations
Bastrop, TX · Security

Nscale · 1mo
Staff Engineer, Customer Trust
AMER · Security

Anthropic · 2mo
Technical CBRN-E Threat Investigator
Remote-Friendly (Travel-Required) | San Francisco, CA | Washington, DC · Security

Anthropic · 2mo
Technical Cyber Threat Investigator
Remote-Friendly (Travel-Required) | San Francisco, CA | Washington, DC · Security

Anthropic · 2mo
Software Engineer, Safeguards Infrastructure
London, UK · Security