Compression of deep learning models through global weight pruning using ADMM

Sunghun Hwangbo; Dongwook Yang; Geonseok Lee; Kichun Lee

논문 상세보기

Compression of deep learning models through global weight pruning using ADMM

ADMM 기반 전역 가중치 제거를 통한 딥러닝 모델의 압축

Sunghun Hwangbo, Dongwook Yang, Geonseok Lee, Kichun Lee

언어KOR
URLhttps://db.koreascholar.com/Article/Detail/407092

구독 기관 인증 시 무료 이용이 가능합니다. 4,000원

한국산업경영시스템학회 학술대회

2021년 한국산업경영시스템학회 춘계학술대회 (2021.05)
pp.313-319

한국산업경영시스템학회 (Society of Korea Industrial and Systems Engineering)

초록

Deep learning, which has recently shown excellent performance, has a problem that the amount of computation and required memory are large. Model compression is very useful because it saves memory and reduces storage size while maintaining model performance. Model compression methods reduce the number of edges by pruning weights that are deemed unnecessary in the calculation. Existing weight pruning methods using ADMM construct an optimization problem by a layer-by-layer addition of pre-defined removal-ratio constraints. Decomposing into two subproblems through the ADMM process, one can solve them through gradient descent and projection. However, the layer-by-layer removal ratios must be structurally specified, causing a sharp increase in training time due to a large number of parameters, and hardly feasible to use for large models that actually require weight pruning. Our proposed method performs weight pruning, producing similar performance, by setting a global removal ratio for the entire model without prior knowledge of structural characteristics in order to solve the shortcomings of the existing ADMM weight-pruning methods. To effectively avoid performance degradation, the method removes a relatively small number of previous layers in charge of feature extraction. Experiments show high-quality performance, not necessarily setting layer-by-layer removal ratios. Additionally, experiments increasing layers yield an insight for feature extraction in pruned layers. The experiment of the proposed method to the LeNet-5 model using MNIST data results in a higher compression ratio of 99.3% outperforming those of other existing algorithms. We also demonstrate the effectiveness of the proposed method in YOLOv4, an object detection model requiring substantial computation.

키워드

Weight Pruning ADMM Removal ratio

1. 개요
2. 배경
    2.1 관련 연구
    2.2 Alternating Direction Method of Multipliers(ADMM)
3. 모델
    3.1. 모델 구조
    3.2 제안 알고리즘
4. 실험
    4.1 점진적으로 레이어를 추가한 모델
    4.2 LeNet
    4.3 활용
5. 결론
References

저자

Sunghun Hwangbo(한양대학교 산업공학과) | 황보성훈
Dongwook Yang(한양대학교 산업공학과) | 양동욱
Geonseok Lee(한양대학교 산업공학과) | 이건석
Kichun Lee(한양대학교 산업공학과) | 이기천 Corresponding Author

같은 권호 다른 논문