HuggingFace Daily Papers ★ 25 1 min

SABER: Benchmarking Operational Safety of LLM Coding Agents in Stateful Project Workspaces

🔗 https://huggingface.co/papers/2606.01317

google/gemma-4-31b-it:free auto-generated