Bug: Redevelopment of the Quora scrapper #918

Saurabh254 · 2024-05-12T16:10:38Z

Describe the feature

As an GSSoC'24 contributer, I want to enhance my developing skills into this scrape-up.
also I'll be working in this issue,
point to be noted I'm the contributor of the python package pyquora (quora scrapper).

I'm working on this because pyquora lacks some features like fetch get Answers by search query.

also I would like to make the scrap-up quora scrapper better.

the part I'll be covering will be

fetch user Answers
Get answers by a search query
fetch UserProfile

Add ScreenShots

will cover every details from the bellow image

will also cover the top answers

Record

I agree to follow this project's Code of Conduct
I'm a GSSoC'24 contributor
I want to work on this issue

The text was updated successfully, but these errors were encountered:

viththagi · 2024-05-12T17:34:18Z

hi @Saurabh254 i would like to work on this issue my steps would be:

1.web scraping using libraries such as beautifulsoup,selenium
2.Understand the Website Structure to inspect the HTML of the comments section.
3.efficiency consideration:
Add delays between requests to avoid overloading the website's server.
Handle pagination
4.sentiment analysis: using libraries like TextBlob or NLTK.

Saurabh254 · 2024-05-12T17:49:11Z

@nikhil25803 you can assign me this task. :)

Saurabh254 · 2024-05-12T17:50:49Z

hi @Saurabh254 i would like to work on this issue my steps would be:

1.web scraping using libraries such as beautifulsoup,selenium 2.Understand the Website Structure to inspect the HTML of the comments section. 3.efficiency consideration: Add delays between requests to avoid overloading the website's server. Handle pagination 4.sentiment analysis: using libraries like TextBlob or NLTK.

we don't have to use selenium because not every system supports it.
I rather be using regex to scrap the json.

nikhil25803 · 2024-05-13T03:40:53Z

Go ahead @Saurabh254

Note

Please create a separate module for this, as in the folder and project structure (if it is already created, just add your features as functions in the same module).
Do not use the `selenium web driver as it is incompatible with all devices and cloud platforms.
Before making any changes, please check whether the module you want to add exists. If yes, then you can add your functionality as a method only make a separate module and class for it.

All the best 👨‍💻

nikhil25803 assigned Saurabh254 May 13, 2024

nikhil25803 added the gssoc GSSoC 2024 label May 13, 2024

nikhil25803 closed this as completed Aug 9, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bug: Redevelopment of the Quora scrapper #918

Bug: Redevelopment of the Quora scrapper #918

Saurabh254 commented May 12, 2024 •

edited

Loading

viththagi commented May 12, 2024

Saurabh254 commented May 12, 2024

Saurabh254 commented May 12, 2024

nikhil25803 commented May 13, 2024

Bug: Redevelopment of the Quora scrapper #918

Bug: Redevelopment of the Quora scrapper #918

Comments

Saurabh254 commented May 12, 2024 • edited Loading

Describe the feature

Add ScreenShots

Record

viththagi commented May 12, 2024

Saurabh254 commented May 12, 2024

Saurabh254 commented May 12, 2024

nikhil25803 commented May 13, 2024

Saurabh254 commented May 12, 2024 •

edited

Loading