Applied Methods

Trust & Safety

Specialists in this role develop detection systems and enforcement strategies to identify and mitigate emerging abuse patterns across AI products, working at the intersection of data science, policy, and operations. They balance competing priorities—detecting sophisticated threat actors while maintaining platform usability—by building scalable detection pipelines, conducting rapid investigations, and collaborating with policy and engineering teams to implement mitigations. Unlike policy-focused roles, these positions emphasize technical implementation and quantitative analysis; unlike pure engineering roles, they require deep domain expertise in specific abuse vectors and threat actor behavior. These analysts typically sit within dedicated Trust & Safety or Safeguards teams that operate cross-functionally with research, product, and legal to stay ahead of evolving misuse techniques.

$ titles --canonical
Trust & Safety Operations Analyst
Content Integrity Analyst
Abuse Investigator
37 open jobs
6 companies hiring
$02

Skills

What companies are looking for in this role.

$ skills --core

Monitoring and investigating content and behavior that violates terms of service: 100%
Detecting, investigating, and disrupting malicious use of AI platforms: 95%
Developing abuse signals and tracking strategies to proactively detect harmful activities: 90%
Providing data labeling, annotations, and inputs for safety protocols: 85%
Designing and implementing enforcement workflows and review processes: 80%
Analyzing large datasets to identify patterns and coordinated networks: 80%
Building and scaling detection systems for fraud and abuse: 80%
Processing appeals and auditing automated systems: 80%
Responding to urgent escalations and participating in on-call rotations: 80%
Conducting threat intelligence analysis and threat actor investigations: 75%
Conducting root cause analyses and deep-dive investigations: 70%
Creating monitoring dashboards, alerts, and internal administrative interfaces: 65%
Leading safety assessments and threat modeling for new products: 60%
Managing vendor relationships and third-party content moderation services: 60%
$ skills --emerging

Training and refining large language models for safety and policy enforcement: 85%
Building multi-layered defenses and real-time safety mechanisms for AI systems: 75%
Conducting safety evaluations and assessments of AI models: 70%
Developing AI-specific detection capabilities and behavioral clustering techniques: 70%
Creating automated enforcement systems that scale with AI platform growth: 65%
$ skills --soft

Collaborating across cross-functional teams including engineering, policy, and legal: 95%
Communicating complex technical concepts to non-technical stakeholders: 85%
Leading and mentoring teams of safety operations analysts: 50%
$03

Technology

The tools and technologies that define this role.

$ tech --language
Python: high
SQL: moderate
$ tech --platform
Claude: low
GPT: low
Grok: low
$ tech --tool
Dark web monitoring: low
$ tech --concept
LLMs: very high
Machine Learning: moderate
A/B testing: low
Embeddings: low
Fine-tuning: low
Graph-based data infrastructure: low
$04

Open Jobs

37 open Trust & Safety jobs across 6 companies.

Anthropic · 5d
Safeguards Policy Analyst, Fraud & Scams
Remote-Friendly (Travel-Required) | San Francisco, CA | New York City, NY · Security
xAI · 1w
Senior Analyst - Safety Operations (CSE)
Palo Alto, CA · Security
xAI · 1w
Senior Analyst - Safety Operations (CSE)
Bastrop, TX · Security
xAI · 1w
Senior Analyst, Safety Operations
Bastrop, TX · Security
Nscale · 1w
Staff Engineer, Customer Trust
AMER · Security
Anthropic · 2w
Technical Policy Manager, Cyber Harms
Remote-Friendly (Travel-Required) | San Francisco, CA | Washington, DC · Security
Anthropic · 2w
Technical Influence Operations Threat Investigator
Remote-Friendly, United States · Security
Anthropic · 2w
Technical Cyber Threat Investigator
Remote-Friendly (Travel-Required) | San Francisco, CA | Washington, DC · Security
Anthropic · 2w
Technical CBRN-E Threat Investigator
Remote-Friendly (Travel-Required) | San Francisco, CA | Washington, DC · Security
Anthropic · 2w
Software Engineer, Safeguards Infrastructure
London, UK · Security
Anthropic · 2w
Software Engineer, Account Abuse
San Francisco, CA | New York City, NY · Security
xAI · 2w
Manager, Safety Operations
Bastrop, TX · Security
Anthropic · 2w
Biological Safety Research Scientist
San Francisco, CA | New York City, NY · Security
Anthropic · 2w
Safeguards Analyst, Human Exploitation & Abuse
Remote-Friendly, United States · Security
Anthropic · 2w
Safeguards Enforcement Lead, Frontier Abuse Enforcement
San Francisco, CA | New York City, NY | Washington, DC · Security
Anthropic · 3w
Safeguards Analyst, Account Abuse
San Francisco, CA | New York City, NY · Security
Anthropic · 3w
Enforcement Operations Lead
San Francisco, CA | New York City, NY | Washington, DC · Security
Anthropic · 3w
Safeguards Enforcement Analyst, Safety Evaluations
Remote-Friendly (Travel-Required) | San Francisco, CA | Washington, DC; San Francisco, CA | New York City, NY · Security
Anthropic · 3w
Policy Manager, Chemical Weapons and High Yield Explosives
San Francisco, CA | New York City, NY · Security
OpenAI · 1mo
Data Scientist, Integrity
San Francisco · Security