Skip to content

Latest commit

 

History

History
40 lines (29 loc) · 1.66 KB

README.md

File metadata and controls

40 lines (29 loc) · 1.66 KB

PowerSearch Assistant Quickstart


Python3 JS/HTML/CSS

bot react datastax astra - apache cassandra

Search single url

A search command for a single url takes the regex expression below:

!(https?:\/\/)?[a-z].*\.[a-z]+(\s+(scrape|entities|summarize|topics|raw))*

For an easier breakdown without regex, this may be more intuitive:

required: ![url]

optional:

scrape - webscrapes the url and captures only the text from important tags e.g. head, h1-h4, p -
entities - lists meaningful topic terms and links each associated Wikipedia article reference -
summarize - uses BERT summarizer to shorten scraped text to only the most important parts -
topics - displays a word cloud of most frequent and thus weighted terms -
raw - displays the output of preceding command as text directly in chat; default is save as a text file and upload

Example: !www.tesla.com summarize entities topics

Search multiple key words

Querying Google for keywords follows a slighltly different format:

required: !keyword1 keyword2 keyword3...

optional: same commands as above, except "raw" since the output is very large and sent as a single text file

Example: !tom jerry cat mouse entities