Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feat: Add Module for Myntra #1062

Closed
3 tasks done
aRUsh-codes opened this issue May 25, 2024 · 3 comments
Closed
3 tasks done

Feat: Add Module for Myntra #1062

aRUsh-codes opened this issue May 25, 2024 · 3 comments
Assignees
Labels
gssoc GSSoC 2024

Comments

@aRUsh-codes
Copy link
Contributor

Describe the feature

As a part of GSSOC 24, I'd like to add a Myntra module with the following functionalities:
https://www.myntra.com/

Add features like

  • product page
  • product details
  • Search for
    • categories
    • genders
    • brands

Add ScreenShots

Screenshot (130)

Record

  • I agree to follow this project's Code of Conduct
  • I'm a GSSoC'24 contributor
  • I want to work on this issue
@nikhil25803
Copy link
Member

Go ahead @aRUsh-codes

Note

  • Please create a separate module for this, as in the folder and project structure (if it is already created, just add your features as functions in the same module).
  • Do not use the `selenium web driver as it is incompatible with all devices and cloud platforms.
  • Before making any changes, please check whether the module you want to add exists. If yes, then you can add your functionality as a method only make a separate module and class for it.

All the best 👨‍💻

@aRUsh-codes
Copy link
Contributor Author

@nikhil25803 Thank you for assigning me. Can you also add the apt. level label to this.

@aRUsh-codes
Copy link
Contributor Author

Hi @nikhil25803
After careful consideration and trying out different ways I don't think Myntra's data can be scraped and I suggest you to close this issue.
Here are my findings:-

  • Myntra uploads its data dynamically into the webpage using javascript which makes it hard to catch using only BeautifulSoup and requests method (A solution to this problem is to use Selenium but we can't do that either)
  • I tried taking advantage of the fact that Myntra stores its data in json format on the webiste, which is a seperate gateway url( eg- https://www.myntra.com/gateway/v2/search/men-sweaters?rows=50&o=49&plaEnabled=false&xdEnabled=false&pincode=110001) but even that didn't work because Myntra is strict with authentication(which means a request would need a solid authentication like product_id,name, description in case of products which won't be feasible to be provided by users)

Screenshot (135)

Image Source


The official documentation and the robotx.txt suggest the same strictness.

If anyone else finds(or knows) a way I would be happy to learn more.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
gssoc GSSoC 2024
Projects
None yet
Development

No branches or pull requests

2 participants