Skip to content

Latest commit

ย 

History

History
21 lines (14 loc) ยท 888 Bytes

README.md

File metadata and controls

21 lines (14 loc) ยท 888 Bytes

LLM

LLM ํ•™์Šต์‹œํ‚ค๋Š” ์ฝ”๋“œ ๋ชจ์™€๋‘” repo

packing

packing attention_mask
๋Œ€๋žต packingํ•ด์„œ ๋“ค์–ด๊ฐ€๋ฉด attention_mask๊ฐ€ ์ด๋Ÿฐ์‹์œผ๋กœ ๋“ค์–ด๊ฐ€๊ฒŒ ๋จ.
๊ทผ๋ฐ flash_attention์€ attention_mask ๋”ฐ๋กœ ์•ˆ์ฃผ๊ณ , position_ids๋กœ ๋ถ„๊ฐ„ํ•จ.

LogicKor

์•„์ง ์ œ์ž‘ ์ค‘ ๋Œ€์ถฉ ํ•™์Šต ๋๋‚œ ๋’ค LogicKor ๋Œ๋ฆฌ๊ธฐ ๊ท€์ฐฎ์•„์„œ ๋งŒ๋“  ์ฝ”๋“œ
ํ•™์Šต ์ค‘ chekcpoint saveํ•˜๊ณ  ๋‚œ ๋’ค ์ˆ˜ํ–‰ํ•จ.
๊ทผ๋ฐ zero-3์—์„  config ์„ค์ •์— ๋”ฐ๋ผ eval ํ•˜๋Š”๋ฐ 4์‹œ๊ฐ„ ๊ฑธ๋ฆฌ๋”๋ผ

refer