It is all in the [MASK]: Simple instruction-tuning enables BERT-like masked language models as generative classifiers

While encoder-only models such as BERT and ModernBERT are ubiquitous in real-world NLP applications, their conventional reliance on task-specific classification heads can limit their applicability compared to decoder-based large language models (LLMs). In this work, we introduce ModernBERT-Large-Instruct…
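As a rough illustration of the idea named in the title — reading the classification "answer" directly off the [MASK] position with the masked language modelling (MLM) head, instead of attaching a task-specific classification head — the following Python sketch uses the Hugging Face transformers API. The checkpoint id and the prompt template below are assumptions for illustration, not necessarily the authors' exact setup; consult the paper and its released artifacts for the real recipe.

```python
import torch
from transformers import AutoTokenizer, AutoModelForMaskedLM

# Assumed checkpoint id; check the authors' release for the exact name.
model_id = "answerdotai/ModernBERT-Large-Instruct"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForMaskedLM.from_pretrained(model_id)
model.eval()

review = "This movie was a complete waste of time."
# Illustrative template: the answer is read off the [MASK] slot.
prompt = f"{review} The sentiment of this review is {tokenizer.mask_token}."
labels = ["positive", "negative"]

inputs = tokenizer(prompt, return_tensors="pt")
with torch.no_grad():
    logits = model(**inputs).logits  # shape: (1, seq_len, vocab_size)

# Locate the [MASK] position, then compare the logits of the candidate
# answer tokens; the highest-scoring verbalizer is the prediction.
mask_index = (inputs["input_ids"][0] == tokenizer.mask_token_id).nonzero().item()
label_ids = [tokenizer.encode(" " + label, add_special_tokens=False)[0]
             for label in labels]
scores = logits[0, mask_index, label_ids]
print(labels[scores.argmax().item()])  # e.g. "negative"
```

Because the prediction is just an argmax over verbalizer tokens at the mask position, the same model can serve many classification tasks by swapping the template and label words, with no new head or fine-tuning per task.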


Bibliographic Details
Main Authors: Benjamin Clavié, Nathan Cooper, Benjamin Warner
Format: Article
Language: English
Published: Elsevier 2025-06-01
Series: Natural Language Processing Journal
Online Access: http://www.sciencedirect.com/science/article/pii/S2949719125000263
