Beyond Shallow Heuristics: Leveraging Human Intuition for Curriculum Learning

Despite its intuitive appeal, the effectiveness of data-level curriculum learning (CL) remains debated, mainly due to the absence of unambiguous notions of sample difficulty in real-world tasks. As a step towards a better understanding of the effective use of different curriculum strategies in natural language learning, we study CL in the context of regular languages, where both ground truth and sample difficulty can be precisely defined using deterministic finite automata. We consider two natural measures of difficulty: a data-driven metric based on input length and a task-specific metric derived from the automaton’s structure. Training RNNs and LSTMs across ten regular language classification tasks, we find that CL is not just beneficial but, in some cases, essential for generalisation. Surprisingly, straightforward data-driven curricula outperform more complex task-specific strategies, with the most successful approaches oversampling the shorter lengths early in training.

  • Published in:
    Proceedings of the 8th International Conference on Natural Language and Speech Processing ({ICNLSP}-2025)
  • Type:
    Inproceedings
  • Authors:
    Toborek, Vanessa; Müller, Sebastian; Selbach, Tim; Horváth, Tamás; Bauckhage, Christian
  • Year:
    2025
  • Source:
    https://aclanthology.org/2025.icnlsp-1.10/

Citation information

Toborek, Vanessa; Müller, Sebastian; Selbach, Tim; Horváth, Tamás; Bauckhage, Christian: Beyond Shallow Heuristics: Leveraging Human Intuition for Curriculum Learning, Proceedings of the 8th International Conference on Natural Language and Speech Processing ({ICNLSP}-2025), 2025, 87--92, August, Association for Computational Linguistics, https://aclanthology.org/2025.icnlsp-1.10/, Toborek.etal.2025b,

Associated Lamarr Researchers

- Lamarr Institute for Machine Learning (ML) and Artificial Intelligence (AI)

Vanessa Toborek

Author to the profile
lamarr institute person Mueller Sebastian e1663925309673 - Lamarr Institute for Machine Learning (ML) and Artificial Intelligence (AI)

Sebastian Müller

Scientist to the profile
Kopie von LAMARR Person 500x500 1 - Lamarr Institute for Machine Learning (ML) and Artificial Intelligence (AI)

Prof. Dr. Christian Bauckhage

Director to the profile