Tag: chatgpt
-

Guardrails That Actually Help: Datadog’s Practical Playbook for Shipping Safer LLM Apps
The guidance: Datadog published a practitioner-oriented guide to designing, implementing, and monitoring LLM guardrails in production systems. The piece addresses where guardrails live in typical LLM app architectures, what threats they mitigate, how to detect and neutralize injection attempts, how to enforce domain boundaries and least privilege for tools/agents, and how to evaluate and monitor…
-

Poisoning at Scale: New Research Shows ~250 Documents Can Corrupt LLMs—Regardless of Model Size
The finding: A new study from Anthropic, conducted with the UK AI Security Institute and the Alan Turing Institute, demonstrates that a surprisingly small, nearly constant number of poisoned samples (on the order of 100–500, with ~250 as a representative threshold) can reliably corrupt models ranging from hundreds of millions to 13B parameters. In controlled training runs, the…
