Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

docs: Better Facial Recognition Clusters #14911

Open
wants to merge 4 commits into
base: main
Choose a base branch
from
Open
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
2 changes: 2 additions & 0 deletions docs/docs/features/facial-recognition.md
Original file line number Diff line number Diff line change
Expand Up @@ -69,6 +69,8 @@ Navigating to Administration > Settings > Machine Learning Settings > Facial Rec

:::tip
It's better to only tweak the parameters here than to set them to something very different unless you're ready to test a variety of options. If you do need to set a parameter to a strict setting, relaxing other settings can be a good option to compensate, and vice versa.

You can learn how the tune the result in this [Guide](/docs/guides/better-facial-clusters)
:::

### Facial recognition model
Expand Down
76 changes: 76 additions & 0 deletions docs/docs/guides/better-facial-clusters.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,76 @@
# Better Facial Recognition Clusters

## Purpose

This guide explains how to optimize facial recognition in systems with large image libraries. By following these steps, you'll achieve better clustering of faces, reducing the need for manual merging.

---

## Important Notes

- **Best Suited For:** Large image libraries after importing a significant number of images.
- **Warning:** This method deletes all previously assigned names.
- **Tip:** **Always take a [backup](/docs/administration/backup-and-restore#database) before proceeding!**

---

## Step-by-Step Instructions

### Objective

To enhance face clustering and ensure the model effectively identifies faces using qualitative initial data.

---

### Steps

#### 1. Adjust Machine Learning Settings

Navigate to:
**Admin → Administration → Settings → Machine Learning Settings**

Make the following changes:

- **Minimum detection score:**
Set this to **0.5**.
> This helps recognize more faces while avoiding false positives (non-faces).
Comment on lines +35 to +36
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Recommending a different detection score is out of scope for this guide. A lower score risks false positives that are difficult to deal with once they exist.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I can understand the reason you say this, I decided to point it out because in the test I did that once I obtained high-quality clusters even faces that are a little covered (and when the value is 0.6 they are not even recognized at all) manage to enter the high-quality cluster that is created and then there is no risk in faces that are not assigned to anyone.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It might not have caused an issue for your library, but I've seen it cause non-faces to come up for others and things get very messy. A guide that shows how to get higher quality results shouldn't suggest things that can often cause lower quality results.

- **Maximum recognition distance (Optional):**
Lower this value, e.g., to **0.4**, if the library contains people with similar facial features.
- **Minimum recognized faces:**
Set this to a **high value** (e.g., 10).
> A high value ensures clusters only include faces that appear at least 10 times in the library, improving the initial clustering process.
---

#### 2. Run Reset Jobs

Go to:
**Admin → Administration → Settings → Jobs**

Perform the following:

1. **FACE DETECTION → Reset**
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Just resetting facial recognition is enough without the detection score change. If you do change the detection score, then you should only queue face detection and let it queue faces for recognition automatically.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes, in fact it should run the job automatically as you mentioned, it's a bit old guide I wrote, and if I'm not mistaken at the time there was a "bug" that didn't run this job automatically. (If this is no longer the case then I will remove this part)

2. **FACIAL RECOGNITION → Reset**

> These reset jobs rebuild the recognition model based on the new settings.
---

#### 3. Refine Recognition with Lower Thresholds

Once the reset jobs are complete, refine the recognition as follows:

- **Step 1:**
Return to **Minimum recognized faces** in Machine Learning Settings and lower the value to **8**.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

10 -> 8 might get the right result for your library, but I think it's too subtle to work as a general recommendation. Maybe 20 -> 10 -> 3?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

From a test I did, setting the value too high to start with causes many faces to not be recognized (because 20 faces per person is too many), which causes people who look similar to other person to merge (false positives).

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

But this is going to vary by library, right? Some people will have large libraries where 10 vs 8 doesn't matter, some people will have burst photos, etc. You can adjust the text to recommend different numbers based on their library.


> Run the job: **FACIAL RECOGNITION → MISSING Mode**
- **Step 2:**
Lower the value again to **3**.
> Run the job: **FACIAL RECOGNITION → MISSING Mode**
---

## Outcome

This method creates high-quality face clusters for core faces and ensures the model can reliably add new faces as they appear in your library.
Comment on lines +74 to +76
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This seems unnecessary.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

👍I will remove this part soon.

Loading