Feedback #12

Jin-SukKim · 2024-09-30T02:25:41Z

No description provided.

Merge pull request #1 from boostcampaitech7/main

organize validation fucntion

…ir in inference.py

…geclassification-cv-06 into main

- add csv_path - change result_path

…geclassification-cv-06 into main

Train optim

final Merge

Test_feature_streamlit

aandyjeon

comments

aandyjeon · 2024-10-09T14:01:27Z

scripts/moveFiles.py

+        os.makedirs(dest_path)
+
+    # 파일 이동
+    for file in files:


여기에 try, except로 이동이 안 되었을 경우에는 어떤 오류때문인지 어떤 파일이 문제인지 출력해주면 좋을 거 같습니다. logging package를 활용하여 남겨주시면 좋습니다 (logging.info, logging.error).

aandyjeon · 2024-10-09T14:10:25Z

scripts/diffusionImage.py

+new_train_info = train_info.copy()
+new_train_info['converted_image_path'] = ""
+
+pin = 1


이 부분이 조금 직관적이지는 않습니다. 왜 "n01774384/sketch_10.JPEG"에서 멈추려고 하지? 라는 느낌이 들어 조금 더 general하게 코드를 만들 수도 있지 않을까란 생각이 들었습니다.

aandyjeon · 2024-10-09T14:16:27Z

scripts/diffusionImage.py

@@ -0,0 +1,152 @@
+import os


전반적으로 조금 산만하다는 생각이 들어서 모듈화가 잘 되었다고 가정할 때의 main함수만 구성을 해봤습니다. 먼저, 지금 짜신 거처럼 코드를 짜고 동작이 되면 한 번 이런식으로 정리하는 습관이 좋은 거 같습니다.

def main(config):
set_seed(42)
device = "cuda" if torch.cuda.is_available() else "cpu"
pipe = initialize_pipeline(device)
preprocess = get_preprocessing_pipeline()
mapping = load_mapping(config['mapping_file'])
train_info = pd.read_csv(config['train_csv']).sort_values(by='image_path').reset_index(drop=True)
train_info['converted_image_path'] = ""
process_sketch_images(train_info, mapping, config, pipe, preprocess)
train_info.to_csv(config['new_train_csv'], index=False)
print(f"csv file saved")

config = {
"train_csv": "./data/train.csv",
"converted_data_dir": "./data/converted_images",
"sketch_data_dir": "./data/train",
"new_train_csv": "./data/new_train.csv",
"mapping_file": "./data/imagenet_synset_to_definition.txt"
}

if name == "main":
main(config)

aandyjeon · 2024-10-09T14:22:55Z

src/sketch_transforms.py

@@ -0,0 +1,105 @@
+import torch


transformer.py의 class를 상속받아 코드를 짜면 중복을 줄일 수 있을 거 같습니다. 저는 많은 파일을 만드는 것을 좋아하지 않아, 보통 transformer.py 아래에 하나의 클래스를 더 만들어 상속형식으로 정의를 합니다.

aandyjeon · 2024-10-09T14:25:38Z

README.md

+  - `train.py`: Script to train the model
+
+
+## Usage


이 부분 좋기는 한데 저는 보통 잠을 자는 시간에 많은 코드를 돌려놓고 아침에 확인하는 편이라서, train.sh을 만들고 이 안에서 다양한 python train.py -parameter = argument의 형식으로 여러 parameter의 조합을 돌리고 ablation study를 합니다. config.json파일을 직접 고쳐서 수정하는 것은 여러 실험을 할 때 적합하지 않은 거 같습니다.

aandyjeon · 2024-10-09T14:30:48Z

eda/augmentation_viewer.py

+    cam = np.mean(cam, axis=0)  # 여러 채널이 있는 경우 평균
+
+    # CAM 크기를 입력 이미지 크기에 맞게 조정
+    cam = cv2.resize(cam, (image.shape[3], image.shape[2]))


image는 (batch, channel, height, width) 형식이므로 image.shape[2] (height)과 image.shape[3] (width)이 되어야 하지 않나요?

aandyjeon · 2024-10-09T14:34:47Z

eda/augmentation_viewer.py

+            status_text.text(f"Processing {image_type} image {i+1}/{total_images}")
+
+            # Streamlit이 업데이트를 표시할 시간을 주기 위해 잠시 대기
+            time.sleep(0.01)


보통은 st.progress()와 st.text()를 사용하여 업데이트합니다. time.sleep이 실행속도를 늦출 수도 있습니다.

aandyjeon · 2024-10-09T14:37:01Z

eda/augmentation_viewer.py

@@ -0,0 +1,412 @@
+import streamlit as st


전반적으로 st.session_state에 저장되는 코드가 중복되고 있는 거 같은데, 아래와 같이 모듈화하시면 어떨까 싶습니다.

def initialize_session_state():
session_keys = ['original_dataset', 'sketch_dataset', 'info_df', 'model_loaded',
'model_load_success', 'misclassified', 'misclassified_dataset',
'misclassified_original_images', 'current_misclassified_index']
for key in session_keys:
if key not in st.session_state:
st.session_state[key] = None if 'index' not in key else 0

aandyjeon · 2024-10-09T14:39:14Z

README.md

+
+1. Prepare your data in the `data/` directory.
+2. Adjust the configuration in `configs/config.json` if needed.
+3. Run training:


diffusion model로 만든 이미지를 저장하고, 학습에 활용하시는 거 같은데 그 과정에 대해 실행하는 측면에서 알려주는 부분이 있으면 좋을 거 같습니다.

aandyjeon · 2024-10-09T14:40:37Z

scripts/moveFiles.py

@@ -0,0 +1,26 @@
+import os


매우 큰 데이터셋의 경우, 파일을 하나씩 처리하는 것은 시간이 오래 걸릴 수 있습니다. 다중 스레드나 프로세스를 사용하여 성능을 개선할 수 있습니다.

…ᅵ포트(06조).pdf

jhuni17 and others added 30 commits September 13, 2024 00:10

file_refactoring

2bba532

Merge pull request #2 from boostcampaitech7/feedback

2ac3712

Merge pull request #1 from boostcampaitech7/main

Merge branch 'master' into main

d1548dd

add project_root in inference.py

c2b4ce6

fix readme.md

d2851ac

fix .gitignore

f8e48cc

show Validation Acc

7b508b4

organize validation fucntion

f8828be

Merge pull request #4 from boostcampaitech7/feature

bcb19a8

organize validation fucntion

fix directory in config.json

2cc496b

fix dataset root_dir in train.py, inference.py / fix transform_type d…

4c22518

…ir in inference.py

Merge branch 'main' of https://github.com/boostcampaitech7/level1-ima…

f83903d

…geclassification-cv-06 into main

fix epochs in config.json

1a89799

revise cofing

da05636

- add csv_path - change result_path

Merge branch 'main' of https://github.com/boostcampaitech7/level1-ima…

ef2fb3b

…geclassification-cv-06 into main

fix save_model func in trainer.py

338e477

fix save_model func in trainer.py

e3678f7

fix config.json

68b79c4

Contrastive Loss Test

632eebd

trainer optim

30aa499

add set_seed func in train.py

e2b04aa

Diffusion Base Code

dd15a30

add num_workers param

db89155

add FocalLoss, LabelSmoothingLoss, WeightedCrossEntropyLoss

c1e80a8

remove contrastive loss, dataset

1d4a0b3

Merge pull request #5 from boostcampaitech7/train_optim

2e1cd7c

Train optim

make augmentation_viewer with streamlit

a8e5c7b

make sketch_transfroms (0-1 scaling instead normalized)

99d3496

add get_original_image func for augmentation_viewer.py

eb4d801

test train convnextv2_large with sketch_transform

85bd59b

jung0228 and others added 24 commits September 25, 2024 19:12

Diffusion Aug code

4c45dcd

add AugMix

1d72cd1

Update README.md

afedaec

control diffusion aug by cofing

a5ef91c

Update README.md

99f0f23

idx2class txt file

4db33e4

Merge branch 'main' into train_optim

6db24cb

Merge pull request #9 from boostcampaitech7/train_optim

3e8b141

final Merge

add train script file with diffusion data

b5ef677

add converted train csv dir

2892e9a

minor fix in augmentation viewer for simplifying

77c27d6

Merge remote-tracking branch 'origin/main' into test_feature_streamlit

f38c421

restore base code augmentation

e6723df

fix gui text and add divider

3db379a

minor fix in Grad-CAM

8d6b1f5

Merge pull request #11 from boostcampaitech7/test_feature_streamlit

5fc58f4

Test_feature_streamlit

delete train_with_diffuisiondata.py

7a8365f

Clean Up Code & Add Comments

cfc808a

Clean Up code & Add Commnets

f4ef35d

Add comments

6b07223

delete misclassified_analysis.py

3631086

Update README.md

0a2e62c

Update README.md

1bc581e

Update README.md

bd0fdf6

aandyjeon reviewed Oct 9, 2024

View reviewed changes

jung0228 added 5 commits November 8, 2024 16:14

Add files via upload

32bea4a

Rename CV기초대회_CV_팀 리포트(06조).pdf to docs/CV기초대회_CV_팀 ᄅ…

59e78e6

…ᅵ포트(06조).pdf

Add files via upload

0ddbfb3

Rename CV기초대회_CV_팀 리포트(06조).pdf to project_report.pdf

00e6956

Add files via upload

5f07233

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feedback #12

Feedback #12

Jin-SukKim commented Sep 30, 2024

aandyjeon left a comment

aandyjeon Oct 9, 2024

aandyjeon Oct 9, 2024

aandyjeon Oct 9, 2024

aandyjeon Oct 9, 2024

aandyjeon Oct 9, 2024

aandyjeon Oct 9, 2024

aandyjeon Oct 9, 2024

aandyjeon Oct 9, 2024

aandyjeon Oct 9, 2024

aandyjeon Oct 9, 2024

Feedback #12

Are you sure you want to change the base?

Feedback #12

Conversation

Jin-SukKim commented Sep 30, 2024

aandyjeon left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment