Publications

16 results for Ateret Anaby-Tavor

Towards Enforcing Company Policy Adherence in Agentic Workflows
- - Naama Zwerdling
  - David Boaz
  - et al.
- 2025
- EMNLP 2025
Effective Red-Teaming of Policy-Adherent Agents
- - Itay Nakash
  - George Kour
  - et al.
- 2025
- EMNLP 2025
Think Again! The Effect of Test-Time Compute on Preferences, Opinions, and Beliefs of Large Language Models
- - George Kour
  - Itay Nakash
  - et al.
- 2025
- ACL 2025
Breaking ReAct Agents: Foot-in-the-Door Attack Will Get You In
- - Itay Nakash
  - George Kour
  - et al.
- 2025
- NAACL 2025
Exploring Straightforward Methods for Automatic Conversational Red-Teaming
- - George Kour
  - Naama Zwerdling
  - et al.
- 2025
- NAACL 2025
A Novel Metric for Measuring the Robustness of Large Language Models in Non-adversarial Scenarios
- - Samuel Ackerman
  - Ella Rabinovich
  - et al.
- 2024
- EMNLP 2024
From Zero to Hero: Cold-Start Anomaly Detection
- - Tal Reiss
  - George Kour
  - et al.
- 2024
- ACL 2024
Predicting Question-Answering Performance of Large Language Models through Semantic Consistency
- - Ella Rabinovich
  - Samuel Ackerman
  - et al.
- 2023
- EMNLP 2023
Unveiling Safety Vulnerabilities of Large Language Models
- - George Kour
  - Marcel Zalmanovici
  - et al.
- 2023
- EMNLP 2023
Text Augmentation Using Dataset Reconstruction for Low-Resource Classification
- - Adir Rahamim
  - Guy Uziel
  - et al.
- 2023
- ACL 2023