RoBERTa achieved 93.02% accuracy on IMDb sentiment classification, highest among 7 models?

RoBERTa achieved 93.02% accuracy on IMDb sentiment classification, highest among 7 models.

Soft voting ensemble of all models (Naive Bayes to RoBERTa) further improved performance over any single model?

Soft voting ensemble of all models (Naive Bayes to RoBERTa) further improved performance over any single model.

Study compared legacy TF-IDF-based models with transformers, confirming RoBERTa's superiority for context understanding?

Study compared legacy TF-IDF-based models with transformers, confirming RoBERTa's superiority for context understanding.

Research & Papers

RoBERTa beats classic ML with 93% accuracy in sentiment analysis study

arXiv cs.IR May 22, 2026

⚡Transformers vs. TF-IDF: New benchmark shows RoBERTa dominating with ensemble boost.

Deep Dive

A new research paper, ‘From TF-IDF to Transformers: A Comparative and Ensemble Approach to Sentiment Classification’, systematically evaluates seven machine learning and deep learning models for binary sentiment analysis on the IMDb movie review dataset. The study, accepted at the International Conference on Intelligent Computing, Networks and Security (IC-ICNS 2026), covers traditional models like Naive Bayes, Logistic Regression, and SVM, alongside tree-based LightGBM, sequential LSTM, and transformer-based DistilBERT and RoBERTa. Using standard NLP preprocessing, the team measured accuracy, precision, recall, F1-score, and ROC-AUC to identify the best performer.

RoBERTa emerged as the top individual model with 93.02% accuracy, significantly outperforming classical approaches (SVM at ~89%) and lightweight transformers like DistilBERT (~91%). Notably, the authors implemented a soft voting ensemble that combined predictions from all models, which yielded even better classification metrics, demonstrating that ensemble strategies can squeeze additional performance from diverse architectures. The paper underscores that while transformer models excel at capturing contextual nuances, combining them with simpler models can create more robust systems for real-world opinion mining tasks. This work provides a practical benchmark for developers choosing between efficiency and accuracy in sentiment pipelines.

Key Points

RoBERTa achieved 93.02% accuracy on IMDb sentiment classification, highest among 7 models.
Soft voting ensemble of all models (Naive Bayes to RoBERTa) further improved performance over any single model.
Study compared legacy TF-IDF-based models with transformers, confirming RoBERTa's superiority for context understanding.

Why It Matters

Provides a clear benchmark for choosing sentiment models—shows ensembles beat individual transformers, guiding production NLP choices.

Read Original Article

RoBERTa beats classic ML with 93% accuracy in sentiment analysis study

Why It Matters

Related Articles

🚀 Stay Ahead in AI