Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

NSFW Detection #74

Open
harshita-srivastava-yral opened this issue Oct 9, 2024 · 23 comments
Open

NSFW Detection #74

harshita-srivastava-yral opened this issue Oct 9, 2024 · 23 comments
Assignees
Labels
enhancement New feature or request

Comments

@harshita-srivastava-yral
Copy link

harshita-srivastava-yral commented Oct 9, 2024

  • Implement it in the Token creation image (Get the image icon there)
  • Non NSFW and more engaging with the videos we don't show them the NSFW
@harshita-srivastava-yral
Copy link
Author

  • Fly token is required from Saikat to deploy it on production

@harshita-srivastava-yral harshita-srivastava-yral added the enhancement New feature or request label Oct 9, 2024
@harshita-srivastava-yral
Copy link
Author

@harshita-srivastava-yral
Copy link
Author

  • Connected with @komal-sai-yral
  • Need to qrite few queries but that has dependency on Komal's end for a task to be done

@harshita-srivastava-yral
Copy link
Author

  • Integration task to be done with @komal-sai-yral today
  • Post that ETA to be shared on NSFW task

@harshita-srivastava-yral
Copy link
Author

  • Waiting for Komal to get back with the pipeline

@harshita-srivastava-yral
Copy link
Author

  • Made the changes for NSFW and working on local
  • Prod key wasn't working as FLY token was failing

@harshita-srivastava-yral
Copy link
Author

  • Pushed things to main fly account.

@harshita-srivastava-yral
Copy link
Author

  • We would want to have cleaner feed on Yral domain
  • We would want to work on NSFW first on Yral before the comments section

@harshita-srivastava-yral
Copy link
Author

  • Completed with NSFW implementation on the pumpAI
  • Need a fly token for the production deployment
  • Post this, connect with @komal-sai-yral
  • Pick up Yral feed as next project

@harshita-srivastava-yral harshita-srivastava-yral removed their assignment Nov 18, 2024
@harshita-srivastava-yral
Copy link
Author

harshita-srivastava-yral commented Nov 19, 2024

  • @komal-sai-yral is done with the pipeline
  • @jay-dhanwant-yral to pick it up and integrate it with ML feed
  • Post this expectation is we will not be able to see NSFW content post this
  • Toggle off means we will inject the very moderate amount of it
  • Negative signal loop to be closed but we have clarity on the positive loop
  • Its expected to take around 3 days to close the implementation from Jay's end

@harshita-srivastava-yral
Copy link
Author

  • We will have NSFW feed only in hotornot.wtf domain
  • We need a clean version of the app where only cleaner videos are shown and doesn't have any toggle
  • This app will be used for app store submission and interaction with financial institution

@harshita-srivastava-yral
Copy link
Author

  • Further enhancement of meta data is being done on the current feed
  • Provocative tagged and popularity as metrics were considered
  • Selectively give more importance to certain dimensions

@harshita-srivastava-yral
Copy link
Author

harshita-srivastava-yral commented Dec 4, 2024

  • 2 Exercises done - Softmax function used to test the video embedding gave 2-3% higher lift. Post training the model we were able to get 95% accuracy. Recommendation is to go ahead with this
  • 70% accuracy for the grey area provocative NSFW videos
  • We should be 90% accuracy overall
  • Disclaimer - Training data is less currently. We don't have rich annotation
  • We might have to train again at a large data.
  • We should do retraining on entire data.

@harshita-srivastava-yral
Copy link
Author

  • We are half way there

@harshita-srivastava-yral
Copy link
Author

  • We have the annotations done
  • We have testing done on new model and its trained
  • We got some validation and we have report to share

@harshita-srivastava-yral
Copy link
Author

  • Prod readiness of NSFW model
  • Once Komal is back, we will backfill all the data

@harshita-srivastava-yral
Copy link
Author

@jay-dhanwant-yral
Copy link

  • I checked the nsfw videos that were popping up in the platform. They were false negatives from the previous vision based models.
  • I ran the same videos with the embedding based model which is going to be live soon, and I didn't get any false negatives.

@jay-dhanwant-yral
Copy link

jay-dhanwant-yral commented Jan 6, 2025

TODOs for embedding deployment:

  • Make the data pipeline live through dataproc
  • Alerts for model degradation
  • Take the model live if the benchmarking_df has a good enough accuracy

@harshita-srivastava-yral
Copy link
Author

  • Embedding based model we are on track

@harshita-srivastava-yral
Copy link
Author

  • Deployment to be done today

@harshita-srivastava-yral
Copy link
Author

  • DS side we are done we need to connect with @komal-sai-yral on how to integrate that into system

@siyara-m-yral
Copy link

  • Video embedding should be done via offchain to avoid expensive Joins. @komal-sai-yral to pick related task

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

3 participants