The main goal of e-learning systems is just-in-time knowledge acquisition. Rule-based elearning systems, however, suffer from the mesa effect and the cold start problem, which both result in low user acceptance. E-learning systems suffer a further drawback in rendering the implementation of a natural interface in humanoids difficult. To address these concerns, even exceptional questions of the learner must be answerable. This paper aims to propose a method that can understand the learner’s verbal cues and then intelligently explore additional domains of knowledge based on crowd data sources such as Wikipedia and social media, ultimately allowing for better answers in real-time. A prototype system was implemented using the NAO platform.