├── .github
    └── workflows
    │   └── ExportPluto.yml
├── .gitignore
├── Byte_Pair_Encoding_tokenization.ipynb
├── ChemicalData.ipynb
├── GenSLM_Downstream.ipynb
├── LICENSE
├── README.md
├── SciFMRAGTutorial.ipynb
├── Training_Tokenizers.ipynb
├── ViT_SciFM.ipynb
├── data
    ├── .gitignore
    ├── example.fasta
    ├── realspace_tiny.zip
    └── scaling_law.csv
├── img
    ├── central_dogma.jpg
    ├── chinchilla_model.png
    ├── compute_optimal_scaling.png
    ├── dna.jpg
    ├── dugong-image.jpg
    ├── ena_embl_growth.png
    ├── genetic-code.jpg
    ├── gpt4_scaling.png
    ├── kaplan_scaling.png
    ├── model_shape.png
    ├── msa.gif
    ├── omics-bpe.png
    ├── omics-overview.png
    ├── protein.jpg
    ├── repo_qr.svg
    ├── ribosome.jpg
    ├── rna.jpg
    ├── tree-of-life.png
    └── woese-paper.png
├── neural_scaling_laws.jl
├── omics.ipynb
└── protein_gene_dpo.ipynb

/.github/workflows/ExportPluto.yml:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/scifm/summer-school-2024/HEAD/.github/workflows/ExportPluto.yml
--------------------------------------------------------------------------------
/.gitignore:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/scifm/summer-school-2024/HEAD/.gitignore
--------------------------------------------------------------------------------
/Byte_Pair_Encoding_tokenization.ipynb:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/scifm/summer-school-2024/HEAD/Byte_Pair_Encoding_tokenization.ipynb
--------------------------------------------------------------------------------
/ChemicalData.ipynb:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/scifm/summer-school-2024/HEAD/ChemicalData.ipynb
--------------------------------------------------------------------------------
/GenSLM_Downstream.ipynb:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/scifm/summer-school-2024/HEAD/GenSLM_Downstream.ipynb
--------------------------------------------------------------------------------
/LICENSE:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/scifm/summer-school-2024/HEAD/LICENSE
--------------------------------------------------------------------------------
/README.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/scifm/summer-school-2024/HEAD/README.md
--------------------------------------------------------------------------------
/SciFMRAGTutorial.ipynb:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/scifm/summer-school-2024/HEAD/SciFMRAGTutorial.ipynb
--------------------------------------------------------------------------------
/Training_Tokenizers.ipynb:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/scifm/summer-school-2024/HEAD/Training_Tokenizers.ipynb
--------------------------------------------------------------------------------
/ViT_SciFM.ipynb:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/scifm/summer-school-2024/HEAD/ViT_SciFM.ipynb
--------------------------------------------------------------------------------
/data/.gitignore:
--------------------------------------------------------------------------------
realspace_tiny/
--------------------------------------------------------------------------------
/data/example.fasta:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/scifm/summer-school-2024/HEAD/data/example.fasta
--------------------------------------------------------------------------------
/data/realspace_tiny.zip:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/scifm/summer-school-2024/HEAD/data/realspace_tiny.zip
--------------------------------------------------------------------------------
/data/scaling_law.csv:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/scifm/summer-school-2024/HEAD/data/scaling_law.csv
--------------------------------------------------------------------------------
/img/central_dogma.jpg:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/scifm/summer-school-2024/HEAD/img/central_dogma.jpg
--------------------------------------------------------------------------------
/img/chinchilla_model.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/scifm/summer-school-2024/HEAD/img/chinchilla_model.png
--------------------------------------------------------------------------------
/img/compute_optimal_scaling.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/scifm/summer-school-2024/HEAD/img/compute_optimal_scaling.png
--------------------------------------------------------------------------------
/img/dna.jpg:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/scifm/summer-school-2024/HEAD/img/dna.jpg
--------------------------------------------------------------------------------
/img/dugong-image.jpg:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/scifm/summer-school-2024/HEAD/img/dugong-image.jpg
--------------------------------------------------------------------------------
/img/ena_embl_growth.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/scifm/summer-school-2024/HEAD/img/ena_embl_growth.png
--------------------------------------------------------------------------------
/img/genetic-code.jpg:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/scifm/summer-school-2024/HEAD/img/genetic-code.jpg
--------------------------------------------------------------------------------
/img/gpt4_scaling.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/scifm/summer-school-2024/HEAD/img/gpt4_scaling.png
--------------------------------------------------------------------------------
/img/kaplan_scaling.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/scifm/summer-school-2024/HEAD/img/kaplan_scaling.png
--------------------------------------------------------------------------------
/img/model_shape.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/scifm/summer-school-2024/HEAD/img/model_shape.png
--------------------------------------------------------------------------------
/img/msa.gif:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/scifm/summer-school-2024/HEAD/img/msa.gif
--------------------------------------------------------------------------------
/img/omics-bpe.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/scifm/summer-school-2024/HEAD/img/omics-bpe.png
--------------------------------------------------------------------------------
/img/omics-overview.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/scifm/summer-school-2024/HEAD/img/omics-overview.png
--------------------------------------------------------------------------------
/img/protein.jpg:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/scifm/summer-school-2024/HEAD/img/protein.jpg
--------------------------------------------------------------------------------
/img/repo_qr.svg:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/scifm/summer-school-2024/HEAD/img/repo_qr.svg
--------------------------------------------------------------------------------
/img/ribosome.jpg:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/scifm/summer-school-2024/HEAD/img/ribosome.jpg
--------------------------------------------------------------------------------
/img/rna.jpg:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/scifm/summer-school-2024/HEAD/img/rna.jpg
--------------------------------------------------------------------------------
/img/tree-of-life.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/scifm/summer-school-2024/HEAD/img/tree-of-life.png
--------------------------------------------------------------------------------
/img/woese-paper.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/scifm/summer-school-2024/HEAD/img/woese-paper.png
--------------------------------------------------------------------------------
/neural_scaling_laws.jl:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/scifm/summer-school-2024/HEAD/neural_scaling_laws.jl
--------------------------------------------------------------------------------
/omics.ipynb:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/scifm/summer-school-2024/HEAD/omics.ipynb
--------------------------------------------------------------------------------
/protein_gene_dpo.ipynb:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/scifm/summer-school-2024/HEAD/protein_gene_dpo.ipynb
--------------------------------------------------------------------------------