Building and evaluating robust conversational interfaces using LLMs