Publications
2025
Mura, R., Piras, G., Lukošiūtė, K., Pintor, M., Karbasi, A., & Biggio, B. (2025). LatentBreak: Jailbreaking Large Language Models through Latent Space Feedback. arXiv:2510.08604.
Piras, G., Mura, R., Brau, F., Oneto, L., Roli, F., & Biggio, B. (2025). SOM Directions are Better than One: Multi-Directional Refusal Suppression in Language Models. AAAI 2026.
Brau, F., Pintor, M., Cinà, A. E., Mura, R., Scionis, L., Oneto, L., et al. (2025). TransferBench: Benchmarking Ensemble-based Black-box Transfer Attacks. NeurIPS 2025 (Datasets and Benchmarks Track).
Lazzaro, D., Mura, R., Cinà, A. E., Laurita, G., Vercelli, G., Oneto, L., et al. (2025). Poison Once, Fool Many: Practical Poisoning Attacks against Text-to-Image Retrieval Systems. Knowledge-Based Systems.
Mura, R., Floris, G., Scionis, L., Piras, G., Pintor, M., Demontis, A., et al. (2025). HO-FMN: Hyperparameter Optimization for Fast Minimum-Norm Attacks. Neurocomputing, 616, 128918.
2024
Su, E., Vellore, A., Chang, A., Mura, R., Nelson, B., Kassianik, P., & Karbasi, A. (2024). Extracting Memorized Training Data via Decomposition. arXiv:2409.12367.
2023
Floris, G., Mura, R., Scionis, L., Piras, G., Pintor, M., Demontis, A., & Biggio, B. (2023). Improving Fast Minimum-Norm Attacks with Hyperparameter Optimization. arXiv:2310.08177.
