Trust & Safety

Security

Specialists in this role develop detection systems and enforcement strategies to identify and mitigate emerging abuse patterns across AI products, working at the intersection of data science, policy, and operations. They balance two competing priorities, catching sophisticated threat actors and keeping the platform usable, by building scalable detection pipelines, conducting rapid investigations, and collaborating with policy and engineering teams to implement mitigations. Unlike policy-focused roles, these positions emphasize technical implementation and quantitative analysis; unlike pure engineering roles, they require deep domain expertise in specific abuse vectors and threat actor behavior. They typically sit within dedicated Trust & Safety or Safeguards teams that work cross-functionally with research, product, and legal to stay ahead of evolving misuse techniques.

$ titles --canonical

Trust & Safety Operations Analyst

Content Integrity Analyst

Abuse Investigator

AI Safety Manager

Product Safety Manager

AI Safety & Responsibility Manager

Responsible AI Manager

Open Jobs: 38

Companies Hiring: 9

$02_

Skills

What companies are looking for in this role.

$ skills --core

Designing and implementing machine learning-based detection systems for abuse, fraud, and policy violations

95%

Analyzing attack patterns and emerging abuse trends to identify novel threat vectors and behavioral anomalies

92%

Developing and maintaining enforceable policies, rules systems, and classification frameworks for platform safety

88%

Conducting investigations into complex misuse cases involving suspicious user behavior and coordinated harm

87%

Building automated response mechanisms and enforcement workflows that operate without manual intervention

85%

Querying, transforming, and analyzing large datasets using SQL and data manipulation techniques

84%

Writing and maintaining Python code for data analysis, system testing, and detection logic implementation

83%

Translating ambiguous safety risks and complex threat landscapes into measurable, evidence-based problems

81%

Building and operating systems to detect phishing, cryptomining, account takeovers, and financial fraud at scale

80%

Scoping and implementing abuse monitoring systems for new product launches and existing platforms

78%

Building dashboards, monitoring systems, and prevalence estimators for safety metrics and trends

76%

Designing experiments and conducting causal inference analyses to understand safety intervention impacts

75%

Building zero-to-one analytical systems and transforming prototypes into scalable, reusable tools

74%

Designing threat taxonomies and harm classification frameworks for emerging and frontier risks

70%

Performing high-volume content review and data labeling tasks with accuracy and attention to detail

65%

Integrating and tuning security scanning tools in continuous integration pipelines

58%

$ skills --emerging

Designing large-language-model guardrails to detect abuse scenarios in AI-generated content and interactions

82%

Detecting and analyzing agentic and autonomous behavior patterns in AI systems for safety risks

68%

Using large language models as defensive tools to identify malicious patterns and automate threat classification

65%

Conducting horizon scanning, competitive benchmarking, and external narrative analysis for risk sense-making

62%

Applying prompt injection attack detection and mitigation techniques at production scale

58%

Developing compliance programs aligned with global online safety and content moderation regulations

55%

Conducting regulatory risk assessments and translating legal obligations into enforceable safeguards

52%

Conducting behavioral and psychological analysis of user interactions with AI systems in high-risk contexts

48%

Developing domain-specific expertise in chemical, biological, radiological, nuclear, and explosives threat detection

42%

$ skills --soft

Communicating technical findings and risk assessments clearly to both technical and non-technical stakeholders

85%

Coordinating cross-functional teams across Policy, Legal, Engineering, and Communications during high-stakes situations

80%

Operating independently with high ownership in ambiguous, rapidly evolving problem domains

79%

Identifying gaps in existing safety systems and proposing improvements based on investigation findings

77%

Managing escalation procedures and on-call incident response operations for sensitive enforcement decisions

72%

$03_

Technology

The tools and technologies that define this role.

$ tech --language

Python: very high

SQL: very high

$ tech --platform

BigQuery: high

Google Suite: low

$ tech --tool

Hex: moderate

SAST: moderate

SCA: moderate

Netwatch: low

Slurper: low

Zoom: low

$ tech --concept

LLM: very high

Machine Learning: very high

Anomaly Detection: high

Data Science: high

Statistical Analysis: high

CI/CD: moderate

$04_

Open Jobs

38 open Trust & Safety jobs across 9 companies.

xAI

Senior Analyst, Safety Operations (Child Safety)

Security

Palo Alto, CA

Posted: 3d ago

Anthropic

Threat Intel Manager, Model Exploitation & Fraud

Security

San Francisco, CA

Posted: 3d ago

Anthropic

Threat Intel Manager, CBRN-E & Advanced Weapons

Security

San Francisco, CA | New York City, NY | Washington, DC

Posted: 3d ago

Anthropic

Threat Intel Manager, Influence Operations & Surveillance

Security

San Francisco, CA

Posted: 3d ago

Isomorphic Labs

Senior Security Engineer (AI Safety), London, Lausanne

Security

Lausanne; London

Posted: 1w ago

Palantir

Privacy & Civil Liberties Engineer - New Grad

Security

New York, NY

Posted: 2w ago

OpenAI

Trust & Safety Operations Analyst, Ads

Security

San Francisco

Posted: 2w ago

Runway

Member of Technical Staff, Trust & Safety Engineer

Security

Remote

Posted: 1mo ago

OpenAI

Abuse Investigator - Child Safety

Security

San Francisco

Posted: 1mo ago

xAI

Member of Technical Staff - Imagine Safety

Security

Palo Alto, CA

Posted: 1mo ago

Anthropic

Data Scientist, Safeguards

Security

New York City, NY; San Francisco, CA; Seattle, WA

Posted: 1mo ago

OpenAI

Product Policy, Biosecurity Policy Manager

Security

San Francisco

Posted: 1mo ago

Lovable

Trust and Safety Support Specialist

Security

Stockholm

Posted: 1mo ago

Replit

Staff Software Engineer, Trust & Safety

Security

Foster City, CA

Posted: 1mo ago

Replit

Senior Software Engineer, Trust & Safety

Security

Foster City, CA

Posted: 1mo ago

OpenAI

Data Scientist, Safety

Security

London, UK

Posted: 1mo ago

OpenAI

Data Scientist, Safety

Security

San Francisco

Posted: 1mo ago

Anthropic

Incident Response Manager, Enforcement

Security

San Francisco, CA | New York City, NY | Washington, DC

Posted: 1mo ago

$ roles --related --function=security

Other Security roles

Application Security Engineer

Identifies and mitigates security vulnerabilities in applications and products.

Infrastructure & Cloud Security Engineer

Secures cloud infrastructure, networks, and systems.

Security Engineer

Generalist security engineering role spanning multiple security domains. For security engineers who work across application, infrastructure, and cloud security without a single dominant specialization. The default home for "Security Engineer" titles when the function is clearly Security.

Detection & Incident Response

Builds detection systems, investigates security incidents, and leads incident response efforts.

Offensive Security & Red Team

Conducts offensive security assessments including red teaming, penetration testing, and adversarial simulation.