RoPE and Length Scaling
Introduce some basic concepts of Position Encoding, RoPE and length extrapolation related it.
Introduce some basic concepts of Position Encoding, RoPE and length extrapolation related it.
一个偏综述的文章,总结 codeLLM 相关 paper 从 data collection 到 training 中间的一些细节
介绍一下 continued pre-train