NOTE: This repository is unmaintained. Looking for an API to retrieve information about Spanish companies, with enterprise support? Check out LibreBOR and the LibreBOR API Documentation.
bormeparser is a Python library for parsing BORME files (Boletín Oficial del Registro Mercantil in Spain).
The Boletín Oficial del Registro Mercantil (Spanish for Official Mercantile Register Bulletin) is a document published daily by the Registro Mercantil Central (RMC) in Spain. It lists newly created companies, dissolved companies, and other data that companies are required to disclose.
This library takes advantage of the electronic format in which BORMEs have been published since 2009, as mandated by Spanish law.
BORMEs are published at https://boe.es/diario_borme/.
Unfortunately, due to agreements with the Mercantile Register, they are not allowed to publish all the data in a useful format such as XML, and the most interesting information is only available in PDF files.
You can read more about it on:
The lxml package has some parts that need to be compiled, which requires the following dependencies:
sudo apt-get install python3-dev libxslt1-dev
TBD...
You can install it by typing:
python setup.py install
Or you can get it from PyPI using pip:
pip install bormeparser
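To confirm that the installation worked, you can check which versions of the packages are visible to your Python interpreter. The following is a minimal sketch that uses only the standard library (`importlib.metadata`, Python 3.8+). The helper name `installed_version` is hypothetical and is not part of bormeparser.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(dist_name):
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return version(dist_name)
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    # Distribution names as used on PyPI; lxml is the compiled dependency.
    for name in ("bormeparser", "lxml"):
        found = installed_version(name)
        print(f"{name}: {found if found else 'not installed'}")
```

If either package is reported as not installed, check that the build dependencies are present and that pip ran against the same interpreter.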
This package contains unit tests. You can run them by typing:
python setup.py test
python -m unittest bormeparser.tests.test_bormeregex
The code is licensed under GPLv3+.