This card presents Pyplexity, a text preprocessing tool. It is available as a Python package on PyPI and can be used to preprocess text for different downstream tasks:
pip install pyplexity
You can also check the documentation here. We have also created a web demo.
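As the title of the paper cited in this card indicates, the underlying method is based on perplexity. The sketch below only illustrates that general idea: score each sentence with a language model and discard the sentences the model finds too surprising. It does not use or reproduce the pyplexity API. The toy unigram model, the function names (`train_unigram`, `perplexity`, `filter_by_perplexity`), the sample sentences, and the threshold are all assumptions made for illustration.

```python
import math
from collections import Counter


def train_unigram(clean_sentences):
    """Count word frequencies in a small reference corpus of clean text."""
    counts = Counter(
        word for sentence in clean_sentences for word in sentence.lower().split()
    )
    return counts, sum(counts.values())


def perplexity(sentence, counts, total):
    """Perplexity of a sentence under an add-one smoothed unigram model."""
    words = sentence.lower().split()
    if not words:
        return float("inf")
    # One extra vocabulary slot reserves probability mass for unseen words.
    vocab_size = len(counts) + 1
    log_prob = sum(
        math.log((counts[word] + 1) / (total + vocab_size)) for word in words
    )
    return math.exp(-log_prob / len(words))


def filter_by_perplexity(sentences, counts, total, threshold):
    """Keep only sentences whose perplexity does not exceed the threshold."""
    return [s for s in sentences if perplexity(s, counts, total) <= threshold]


# Hypothetical data: a tiny "clean" corpus and a page mixing content and boilerplate.
counts, total = train_unigram(["the cat sat on the mat", "the dog sat on the rug"])
page = ["the cat sat on the mat", "click here subscribe now"]
kept = filter_by_perplexity(page, counts, total, threshold=10.0)
```

With this toy corpus, the sentence made entirely of unseen words receives a much higher perplexity than the in-domain sentence, so only the latter is kept. A realistic setup would rely on a stronger language model trained on a large corpus and on a threshold tuned for the target data.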
Please cite our research:
Fernández-Pichel, M., Prada-Corral, M., Losada, D. E., Pichel, J. C., & Gamallo, P. (2023). An unsupervised perplexity-based method for boilerplate removal. Natural Language Engineering, 1-18.