├── GRPO.py ├── GRPO_From_Scratch_Multi_GPU_DataParallel_Qwen_2_5_1_5B_Instruct.ipynb ├── GRPO_Qwen_0_5_Instruct.ipynb ├── README.md ├── byte_pair_encoding.ipynb ├── count_language_model.ipynb ├── document_classifier_with_LLMs_as_labelers.ipynb ├── embedding_vs_linear.py ├── emotion_GPT2_as_classifier.ipynb ├── emotion_GPT2_as_text_generator.ipynb ├── emotion_GPT2_as_text_generator_LoRA.ipynb ├── emotion_classifier_CNN.ipynb ├── emotion_classifier_LR.ipynb ├── instruct_GPT2.ipynb ├── news_RNN_language_model.ipynb ├── news_decoder_language_model.ipynb ├── quadratic_loss.py ├── sampling_method.ipynb ├── spotify_gemini_playlist.py └── wiki ├── GPU-rental.md ├── MoE.md ├── PyTorch.md ├── VLM.md ├── alignment.md ├── colabs.md ├── compression.md ├── corrections.md ├── deployment.md ├── distributed.md ├── embeddings.md ├── encoder-decoder.md ├── encoder.md ├── evaluation.md ├── function-calling.md ├── index.md ├── inference.md ├── math.md ├── merging.md ├── non-transformer.md ├── notebook-services.md ├── online-finetuning.md ├── overfitting.md ├── prompting.md ├── scaling.md ├── scripts.md ├── security.md ├── test.md └── tokenization.md /GRPO.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/aburkov/theLMbook/HEAD/GRPO.py -------------------------------------------------------------------------------- /GRPO_From_Scratch_Multi_GPU_DataParallel_Qwen_2_5_1_5B_Instruct.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/aburkov/theLMbook/HEAD/GRPO_From_Scratch_Multi_GPU_DataParallel_Qwen_2_5_1_5B_Instruct.ipynb -------------------------------------------------------------------------------- /GRPO_Qwen_0_5_Instruct.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/aburkov/theLMbook/HEAD/GRPO_Qwen_0_5_Instruct.ipynb -------------------------------------------------------------------------------- /README.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/aburkov/theLMbook/HEAD/README.md -------------------------------------------------------------------------------- /byte_pair_encoding.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/aburkov/theLMbook/HEAD/byte_pair_encoding.ipynb -------------------------------------------------------------------------------- /count_language_model.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/aburkov/theLMbook/HEAD/count_language_model.ipynb -------------------------------------------------------------------------------- /document_classifier_with_LLMs_as_labelers.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/aburkov/theLMbook/HEAD/document_classifier_with_LLMs_as_labelers.ipynb -------------------------------------------------------------------------------- /embedding_vs_linear.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/aburkov/theLMbook/HEAD/embedding_vs_linear.py -------------------------------------------------------------------------------- /emotion_GPT2_as_classifier.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/aburkov/theLMbook/HEAD/emotion_GPT2_as_classifier.ipynb -------------------------------------------------------------------------------- /emotion_GPT2_as_text_generator.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/aburkov/theLMbook/HEAD/emotion_GPT2_as_text_generator.ipynb -------------------------------------------------------------------------------- /emotion_GPT2_as_text_generator_LoRA.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/aburkov/theLMbook/HEAD/emotion_GPT2_as_text_generator_LoRA.ipynb -------------------------------------------------------------------------------- /emotion_classifier_CNN.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/aburkov/theLMbook/HEAD/emotion_classifier_CNN.ipynb -------------------------------------------------------------------------------- /emotion_classifier_LR.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/aburkov/theLMbook/HEAD/emotion_classifier_LR.ipynb -------------------------------------------------------------------------------- /instruct_GPT2.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/aburkov/theLMbook/HEAD/instruct_GPT2.ipynb -------------------------------------------------------------------------------- /news_RNN_language_model.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/aburkov/theLMbook/HEAD/news_RNN_language_model.ipynb -------------------------------------------------------------------------------- /news_decoder_language_model.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/aburkov/theLMbook/HEAD/news_decoder_language_model.ipynb -------------------------------------------------------------------------------- /quadratic_loss.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/aburkov/theLMbook/HEAD/quadratic_loss.py -------------------------------------------------------------------------------- /sampling_method.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/aburkov/theLMbook/HEAD/sampling_method.ipynb -------------------------------------------------------------------------------- /spotify_gemini_playlist.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/aburkov/theLMbook/HEAD/spotify_gemini_playlist.py -------------------------------------------------------------------------------- /wiki/GPU-rental.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/aburkov/theLMbook/HEAD/wiki/GPU-rental.md -------------------------------------------------------------------------------- /wiki/MoE.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/aburkov/theLMbook/HEAD/wiki/MoE.md -------------------------------------------------------------------------------- /wiki/PyTorch.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/aburkov/theLMbook/HEAD/wiki/PyTorch.md -------------------------------------------------------------------------------- /wiki/VLM.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/aburkov/theLMbook/HEAD/wiki/VLM.md -------------------------------------------------------------------------------- /wiki/alignment.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/aburkov/theLMbook/HEAD/wiki/alignment.md -------------------------------------------------------------------------------- /wiki/colabs.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/aburkov/theLMbook/HEAD/wiki/colabs.md -------------------------------------------------------------------------------- /wiki/compression.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/aburkov/theLMbook/HEAD/wiki/compression.md -------------------------------------------------------------------------------- /wiki/corrections.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/aburkov/theLMbook/HEAD/wiki/corrections.md -------------------------------------------------------------------------------- /wiki/deployment.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/aburkov/theLMbook/HEAD/wiki/deployment.md -------------------------------------------------------------------------------- /wiki/distributed.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/aburkov/theLMbook/HEAD/wiki/distributed.md -------------------------------------------------------------------------------- /wiki/embeddings.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/aburkov/theLMbook/HEAD/wiki/embeddings.md -------------------------------------------------------------------------------- /wiki/encoder-decoder.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/aburkov/theLMbook/HEAD/wiki/encoder-decoder.md -------------------------------------------------------------------------------- /wiki/encoder.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/aburkov/theLMbook/HEAD/wiki/encoder.md -------------------------------------------------------------------------------- /wiki/evaluation.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/aburkov/theLMbook/HEAD/wiki/evaluation.md -------------------------------------------------------------------------------- /wiki/function-calling.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/aburkov/theLMbook/HEAD/wiki/function-calling.md -------------------------------------------------------------------------------- /wiki/index.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/aburkov/theLMbook/HEAD/wiki/index.md -------------------------------------------------------------------------------- /wiki/inference.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/aburkov/theLMbook/HEAD/wiki/inference.md -------------------------------------------------------------------------------- /wiki/math.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/aburkov/theLMbook/HEAD/wiki/math.md -------------------------------------------------------------------------------- /wiki/merging.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/aburkov/theLMbook/HEAD/wiki/merging.md -------------------------------------------------------------------------------- /wiki/non-transformer.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/aburkov/theLMbook/HEAD/wiki/non-transformer.md -------------------------------------------------------------------------------- /wiki/notebook-services.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/aburkov/theLMbook/HEAD/wiki/notebook-services.md -------------------------------------------------------------------------------- /wiki/online-finetuning.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/aburkov/theLMbook/HEAD/wiki/online-finetuning.md -------------------------------------------------------------------------------- /wiki/overfitting.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/aburkov/theLMbook/HEAD/wiki/overfitting.md -------------------------------------------------------------------------------- /wiki/prompting.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/aburkov/theLMbook/HEAD/wiki/prompting.md -------------------------------------------------------------------------------- /wiki/scaling.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/aburkov/theLMbook/HEAD/wiki/scaling.md -------------------------------------------------------------------------------- /wiki/scripts.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/aburkov/theLMbook/HEAD/wiki/scripts.md -------------------------------------------------------------------------------- /wiki/security.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/aburkov/theLMbook/HEAD/wiki/security.md -------------------------------------------------------------------------------- /wiki/test.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/aburkov/theLMbook/HEAD/wiki/test.md -------------------------------------------------------------------------------- /wiki/tokenization.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/aburkov/theLMbook/HEAD/wiki/tokenization.md --------------------------------------------------------------------------------