TEJAS Journal of Technologies and Humanitarian Science

ISSN : 2583-5599

Open Access | Quarterly | Peer Reviewed Journal


Detection of Fake Job Descriptions Using NLP – An Intelligent NLP Framework for Identifying Fraudulent Recruitment Posts


Rahul Mishra

Department of Computer Science, Shri Ramswaroop Memorial University, Lucknow, India

Jasika Awasthi

Department of Computer Science, Shri Ramswaroop Memorial University, Lucknow, India

Mansi Yadav

Department of Computer Science, Shri Ramswaroop Memorial University, Lucknow, India

Homa Rizvi

Assistant Professor, Department of Computer Science, Shri Ramswaroop Memorial University, Lucknow, India


📌 DOI: https://doi.org/10.63920/tjths.52014

🔑 Keywords: Fake Job Detection, Natural Language Processing, Machine Learning, TF-IDF, Recruitment Fraud

📅 Publication Date: 11 April 2026

📜 License:

  • Share — Copy and Redistribute the material
  • Adapt — Remix, Transform, and build upon the material
  • The licensor cannot revoke these freedoms as long as you follow the license terms.

Abstract:

This paper proposes an intelligent Natural Language Processing (NLP) based framework for detecting fraudulent job descriptions posted on online recruitment platforms. The system applies preprocessing techniques including tokenization, stop word removal, stemming, and lemmatization. Feature extraction methods such as Bag of Words and TF-IDF are used to convert textual job descriptions into numerical form. Multiple machine learning models including Logistic Regression, Naïve Bayes, Random Forest, and Support Vector Machine are trained to classify job postings as genuine or fake.

Download Full PDF Paper


📖 How to Cite

Rahul M., Jasika A., Mansi Y., and Homa R. (2026). Detection of Fake Job Descriptions Using NLP – An Intelligent NLP Framework for Identifying Fraudulent Recruitment Posts. TEJAS J. Technol. Humanit. Sci.,, Vol. 05, Issue 02. https://doi.org/10.63920/tjths.52014

📊 Article Metrics

👁️ Views: 5
📥 Downloads: 8

References

[1] K. J. Lee and J. Kim, “Detecting fraudulent job postings using machine learning techniques,” IEEE Access, vol. 8, pp. 12345–12356, 2020.

[2] R. Feldman and J. Sanger, The Text Mining Handbook: Advanced Approaches in Analyzing Unstructured Data. Cambridge University Press, 2007.

[3] T. Mikolov, K. Chen, G. Corrado, and J. Dean, “Efficient estimation of word representations in vector space,” in Proc. Int. Conf. Learning Representations (ICLR), 2013.

[4] J. Ramos, “Using TF-IDF to determine word relevance in document queries,” in Proc. First Instructional Conf. Machine Learning, 2003.

[5] C. D. Manning, P. Raghavan, and H. Schütze, Introduction to Information Retrieval. Cambridge University Press, 2008.

[6] S. Bird, E. Klein, and E. Loper, Natural Language Processing with Python. O’Reilly Media, 2009.

[7] I. Goodfellow, Y. Bengio, and A. Courville, Deep Learning. MIT Press, 2016.

[8] L. Breiman, “Random forests,” Machine Learning, vol. 45, no. 1, pp. 5–32, 2001.

[9] K. P. Murphy, Machine Learning: A Probabilistic Perspective. MIT Press, 2012.

[10] Kaggle, “Fake Job Postings Dataset.” [Online]. Available: Kaggle Link