
OutOfMemoryError with PrototypicalCalibrationBlock #73

Open
gladdduck opened this issue Apr 10, 2024 · 5 comments

Comments

@gladdduck

Hello, when I train my dataset using DeFRCN, I encountered an issue. The base training process goes smoothly, but when I attempt K-shot finetuning, I keep getting an OutOfMemoryError.

I tried to solve it and found that when setting PCB_ENABLE to False, this issue doesn't occur.

However, when PCB_ENABLE is set to True, I still hit the OutOfMemoryError even after reducing IMS_PER_BATCH to 1 on an A100-40G.

Has anyone else experienced a similar issue? How was it resolved?

@cnjhh

cnjhh commented Apr 11, 2024

The solution is to locate the PCB module in `/path/defrcn/defrcn/evaluation/calibration_layer.py`. In the `build_prototypes` function, after the line `all_features.append(features.cpu().data)`, add `features = None`.
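In other words, the fix drops the GPU-side reference as soon as the CPU copy has been stored, so the allocator can reuse that memory on the next iteration instead of holding one extra image's features alive throughout the loop. A minimal sketch of the pattern, where `extract` and `to_cpu` are hypothetical stand-ins for `self.extract_roi_features` and `.cpu().data` (not the actual DeFRCN API):

```python
def build_prototypes_loop(batches, extract, to_cpu):
    """Accumulate CPU copies of per-image features, releasing each
    (GPU-side, in DeFRCN) result before the next iteration."""
    all_features = []
    for img, boxes in batches:
        features = extract(img, boxes)        # large GPU tensor in the real code
        all_features.append(to_cpu(features)) # keep only the CPU copy
        features = None                       # drop the reference so the memory is reusable
    return all_features
```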

@gladdduck

> The solution is to locate the PCB module in `/path/defrcn/defrcn/evaluation/calibration_layer.py`. In the `build_prototypes` function, after the line `all_features.append(features.cpu().data)`, add `features = None`.

Thanks for your reply! This works!

@gladdduck

> The solution is to locate the PCB module in `/path/defrcn/defrcn/evaluation/calibration_layer.py`. In the `build_prototypes` function, after the line `all_features.append(features.cpu().data)`, add `features = None`.

However, this error still occurs from time to time.
The code in question is in the `build_prototypes` function of `calibration_layer.py`:

features = self.extract_roi_features(img, boxes)

and in the `extract_roi_features` function:

conv_feature = self.imagenet_model(images.tensor[:, [2, 1, 0]])[

I'm very confused about this, even though I used gc.collect() and torch.cuda.empty_cache().
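This is expected behavior for both calls: `gc.collect()` only frees objects that are unreachable, and `torch.cuda.empty_cache()` only returns cached blocks that no live tensor still references. If a reference to a large tensor survives (for example, held in a list or a still-bound local variable), neither call can reclaim it. A pure-Python illustration of the reachability rule, using a plain class as a stand-in for a large tensor:

```python
import gc
import weakref

class Tensor:
    """Stand-in for a large GPU tensor."""
    pass

keep = []
t = Tensor()
keep.append(t)              # a container still references the object
ref = weakref.ref(t)
t = None                    # clearing the local is not enough
gc.collect()
assert ref() is not None    # still reachable via `keep`, so not collected

keep.clear()                # drop the last reference
gc.collect()
assert ref() is None        # now unreachable, so it has been reclaimed
```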

@cnjhh

cnjhh commented Apr 19, 2024

features = self.extract_roi_features(img, boxes)
boxes = None
img = None
all_features.append(features.cpu().data)
features = None

The features are created from your custom dataset for the novel classes, so you can solve this by reducing the number of novel classes, or by generating the features offline: instead of computing them from the novel data every time the model is validated, save them once via the pickle module, then modify the code to load the saved features directly during validation.
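The offline approach above can be sketched as a small cache-or-build helper. The function name and cache path are hypothetical, not part of DeFRCN; `build_fn` stands in for whatever currently computes the prototypes during validation:

```python
import os
import pickle

def load_or_build_prototypes(cache_path, build_fn):
    """Load prototypes from a pickle cache if present; otherwise build
    them once and save the result for later validation runs."""
    if os.path.exists(cache_path):
        with open(cache_path, "rb") as f:
            return pickle.load(f)
    prototypes = build_fn()
    with open(cache_path, "wb") as f:
        pickle.dump(prototypes, f)
    return prototypes
```

On the second and later validation runs the expensive (and memory-hungry) feature extraction is skipped entirely, since the prototypes come straight from disk.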

@cnjhh

cnjhh commented Apr 19, 2024

The device I use is an A800 80G, and the novel data I set is 10-shot with 13 classes. When the model is loaded with the PCB module, GPU memory usage reaches 53G; before the modification, even 80G was not enough.
