-
Notifications
You must be signed in to change notification settings - Fork 25
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Memory error #10
Comments
It seems there's a problem parsing the file 2007-February.txt from metrics-grimoire mailinglists. When mlstats parses it, loops forever. |
Dave Neaty report the same error almost two years ago... |
Maybe related to #1 mlstats does not handle correctly attachments, or for that matter it does not handle MIME objects. IIUC, some points of the a rant on mime parsers in http://jeffreystedfast.blogspot.ca/2013/09/time-for-rant-on-mime-parsers.html applies to mlstats (no, mlstats is not the target of the rant, but some things seems to apply the way mlstats parses mbox files). |
I have a branch where this problem is partially solved. The diff is here: and the branch is https://github.com/gpoo/MailingListStats/tree/strictmbox I said partially because in some messages my branch might not consider an extra (empty) line that is in the message. I have not looked in detail, and I wrote it a couple of months ago to remember :-) |
FWIW, in the source code of mailbox, with respect to the old classes, there is the following comment:
Even though the algorithm changed later, I don't think it changed in a way that solves this issue. Just to keep it in mind. |
$ python mlstats --db-user=root --db-password=root --db-name=mlstats_innodb --db-admin-user=root --db-admin-password=root https://lists.libresoft.es/pipermail/metrics-grimoire/ &> report.log
The text was updated successfully, but these errors were encountered: