Repository created for the purposes of the iConference 2023
Check out our Google Colab:
In this tutorial, we apply the YAKE! keyword extraction Python package to extract relevant keyphrases from the programmes of 16 political parties running in the Portuguese legislative elections held on January 30th, 2022. We refer to nine political parties with parliamentary representation in the last legislature (2019–2021):
- PS: Partido Socialista
- PSD: Partido Social Democrata
- BE: Bloco de Esquerda
- CDU: Coligação Democrática Unitária PCP-PEV
- CDS-PP: Partido do Centro Democrático e Social
- PAN: Partido Pessoas-Animais-Natureza
- Chega
- IL: Iniciativa Liberal
- Livre
and to seven political parties without parliamentary representation:
- ADN: Alternativa Democrática Nacional
- Ergue-te
- MAS: Movimento Alternativo Socialista
- MPT: Partido da Terra
- Nós Cidadãos
- RIR: Reagir Incluir Reciclar
- Volt Portugal
Four other political parties (PTP, PCTP/MRPP, JPP, Aliança) also ran in the elections but did not make their programmes available online.
YAKE! is an unsupervised keyword extraction algorithm that relies on statistical features of the text to extract relevant keyphrases from single documents. Its plug-and-play nature and adaptability to different domains and languages, together with a good compromise between effectiveness and efficiency, make it a good fit for this use case.
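As a rough illustration of how the package can be applied to a single programme, here is a minimal sketch. It assumes a recent release of the `yake` package (installed with `pip install yake`) in which `extract_keywords` returns `(keyphrase, score)` pairs. The helper names `extract_keyphrases` and `most_relevant` and the parameter values are illustrative choices, not the settings used in the tutorial notebook.

```python
from typing import List, Tuple

ScoredKeyphrase = Tuple[str, float]


def extract_keyphrases(text: str, language: str = "pt", max_ngram: int = 3,
                       top: int = 20) -> List[ScoredKeyphrase]:
    """Run YAKE! on one document and return (keyphrase, score) pairs.

    The parameter values are illustrative assumptions, not the tutorial's.
    """
    # Third-party dependency, imported here so the helper below can be
    # used on precomputed results without the package installed.
    import yake

    extractor = yake.KeywordExtractor(lan=language, n=max_ngram, top=top)
    return extractor.extract_keywords(text)


def most_relevant(scored: List[ScoredKeyphrase], k: int = 5) -> List[str]:
    """Return the k best keyphrases; in YAKE! a lower score means more relevant."""
    ranked = sorted(scored, key=lambda pair: pair[1])
    return [phrase for phrase, _ in ranked[:k]]
```

Calling `most_relevant(extract_keyphrases(text))` on the plain text of one party programme would then give its five highest-ranked keyphrases. No training corpus is involved, which is what makes the approach easy to apply to each of the 16 programmes independently.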
- Campos, R., Jatowt, A. and Jorge, A. (2023). Keyword Extraction from Political Party Programmes - Portuguese Legislative Elections 2022. In Proceedings of the iConference 2023. Barcelona, Spain. March 27 - 29, 2023.
Please cite the following works when using YAKE:
In-depth journal paper at Information Sciences Journal
- Campos, R., Mangaravite, V., Pasquali, A., Jorge, A., Nunes, C. and Jatowt, A. (2020). YAKE! Keyword Extraction from Single Documents using Multiple Local Features. In Information Sciences Journal. Elsevier, Vol 509, pp 257-289.
ECIR'18 Best Short Paper
- Campos R., Mangaravite V., Pasquali A., Jorge A.M., Nunes C., and Jatowt A. (2018). A Text Feature Based Automatic Keyword Extraction Method for Single Documents. In: Pasi G., Piwowarski B., Azzopardi L., Hanbury A. (eds). Advances in Information Retrieval. ECIR 2018 (Grenoble, France. March 26 – 29). Lecture Notes in Computer Science, vol 10772, pp. 684 - 691.
- Campos R., Mangaravite V., Pasquali A., Jorge A.M., Nunes C., and Jatowt A. (2018). YAKE! Collection-independent Automatic Keyword Extractor. In: Pasi G., Piwowarski B., Azzopardi L., Hanbury A. (eds). Advances in Information Retrieval. ECIR 2018 (Grenoble, France. March 26 – 29). Lecture Notes in Computer Science, vol 10772, pp. 806 - 810.