Parler Parser

The parler parser is used to parse parler HTML posts and user profiles. Parler post dumps can be found from here.

Parsed Entities:

Refer to here

Example Post Parser:

import glob

from parler.parser.postParser import PostParser
from parler.dataType.post import Post

files = glob.glob('posts/*')

data = []
for file in files:
  post = PostParser(file).parse()
  if (post is not None):
    data.append(post.convert())

print(data)

Example Profile Page Parser:

from parler.parser.profilePageParser import ProfilePageParser

file = r".\profile\00KimPossible00\posts\index.html"
timestamp = 20201124075219

profilePage = ProfilePageParser(file, timestamp)

user, posts = profilePage.parse()

print(user.convert())
print()

for post in posts:
    print(post.convert())
    print()

Sample Output

You should get the same results as shown in sample_output.

Parsing Logic

Determine what type of post we are dealing with:
- New Post
- Echoed Post
- Echoed Post with Reply
- Echoed Post with Root Echo and No Reply
- Echoed Post with Root Echo and Reply
If new post, parse the only post as main post else parse the reply post as main post.
If not new post, parse the echoed post.
If echoed post or echoed post with root echo and no reply:
- Use the "Echoed by ... " line to fill out main post info with the user and created_at
- Grab username from the meta information stored in the header.
- No profile badge can be found in the post this way.
- The comment_count, echo_count, upvote_count belongs to the echoed post.
Else:
- The comment_count, echo_count, upvote_count belongs to the main post.
If Echoed Post with Root Echo and No Reply or Echoed Post with Root Echo and Reply:
- Parse the first post for the root echo.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Parler Parser

Parsed Entities:

Example Post Parser:

Example Profile Page Parser:

Sample Output

Parsing Logic

Files

README.md

Latest commit

History

README.md

File metadata and controls

Parler Parser

Parsed Entities:

Example Post Parser:

Example Profile Page Parser:

Sample Output

Parsing Logic