├── 2021HDSI-SD-MLMeetupArunKumar.pdf
├── Bernard Lee Presentation for SDML (Distribution Copy).pdf
├── Explaining_by_Removing.pdf
├── Hack COVID slides.pdf
├── README.md
├── SDML-RapidMiner_Overview.pdf
├── SDML_genomics_FNL.pdf
├── Voytek-SD-meetup.pdf
├── cd_reasoning_presentation_1016.pdf
└── papers
├── AM-RADIO_Agglomerative Vision Foundation Model Reduce All Domains Into One.pdf
├── AURORA_a_foundation_model_of_the_atmosphere.pdf
├── Applications_of_ML_in_Astronomy.pdf
├── DeepSeek-R1_Incentivizing_Reasoning_Capability_in_LLMs_via_Reinforcement_Learning.pdf
├── Foundation_Models_for_Generalist_Geospatial_Artificial_Intelligence.pdf
├── GAIA.pdf
├── Interesting Papers 202311.pdf
├── MAGVIT_Masked_Generative_Video_Transformer.pdf
├── ML_Paper_Review_202311.pdf
├── ML_Paper_Review_202312.pdf
├── ML_Paper_Review_202401_Jailbroken.pdf
├── ML_Paper_Review_202403_Generalization.pdf
├── ML_Paper_Review_202404_DoRA.pdf
├── ML_Paper_Review_202405_long_context.pdf
├── ML_Paper_Review_202406_multi_token_pred.pdf
├── ML_Paper_Review_202408_stealing_part_of_llm.pdf
├── ML_Paper_Review_202410_writing_in_the_margins.pdf
├── ML_Paper_Review_202411_multi-head_latent_attn.pdf
├── ML_Paper_Review_202412_segment_anything.pdf
├── ML_Paper_Review_202502_visual_autoregressive.pdf
├── ML_Paper_Review_202504_biology_of_a_LLM.pdf
├── ML_Paper_Review_202505_describe_anything.pdf
├── Machine_Learning_Key_Concepts.pdf
├── Mixture of Depths.pdf
├── Should We Fine-Tune or RAG.pdf
├── SimMIM.pdf
├── Sonali Tamhankar DSS SF.pdf
└── TextGrad_Automatic_Differentiation_via_Text.pdf
/2021HDSI-SD-MLMeetupArunKumar.pdf:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/SanDiegoMachineLearning/talks/31edc1eb06ed2646ca6bcdb0605dae2c7998eda2/2021HDSI-SD-MLMeetupArunKumar.pdf
--------------------------------------------------------------------------------
/Bernard Lee Presentation for SDML (Distribution Copy).pdf:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/SanDiegoMachineLearning/talks/31edc1eb06ed2646ca6bcdb0605dae2c7998eda2/Bernard Lee Presentation for SDML (Distribution Copy).pdf
--------------------------------------------------------------------------------
/Explaining_by_Removing.pdf:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/SanDiegoMachineLearning/talks/31edc1eb06ed2646ca6bcdb0605dae2c7998eda2/Explaining_by_Removing.pdf
--------------------------------------------------------------------------------
/Hack COVID slides.pdf:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/SanDiegoMachineLearning/talks/31edc1eb06ed2646ca6bcdb0605dae2c7998eda2/Hack COVID slides.pdf
--------------------------------------------------------------------------------
/README.md:
--------------------------------------------------------------------------------
1 | Below are the links for SDML presentations and videos.
2 | For SDML book club notes and videos, please see the **[SDML book club repo](https://github.com/SanDiegoMachineLearning/bookclub)**.
3 |
4 |
5 | ### 2025
6 |
7 | **ML Papers and Lightning Talks** \
8 | Monthly discussions of interesting ML papers and other ML lightning talks
9 | * **February**: \
10 | Ted discussed *Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction* (https://arxiv.org/abs/2404.02905). \
11 | Ryan discussed *DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning* (https://arxiv.org/abs/2501.12948). \
12 | [Video](https://youtu.be/j4O3g3DxvKE) and link to [Ted's slides](./papers/ML_Paper_Review_202502_visual_autoregressive.pdf)
13 | and [Ryan's slides](./papers/DeepSeek-R1_Incentivizing_Reasoning_Capability_in_LLMs_via_Reinforcement_Learning.pdf).
14 |
15 | * **March**: \
16 | Ryan discussed *GAIA: a benchmark for General AI Assistants* (https://arxiv.org/abs/2311.12983). \
17 | Sophin Liu shared a fun probability problem for the game Rock, paper, scissors, minus one. \
18 | [Video](https://youtu.be/cb-5al2__wA) and [Ryan's slides](./papers/GAIA.pdf).
19 |
20 | * **April**: \
21 | Alex Chaushev discussed An Application of ML in Astronomy: Exoplanet Identification. \
22 | Ted discussed *On the Biology of a Large Language Model* (https://transformer-circuits.pub/2025/attribution-graphs/biology.html). \
23 | [Video](https://youtu.be/fE84sZnrLJA) and link to [Alex's slides](./papers/Applications_of_ML_in_Astronomy.pdf)
24 | and [Ted's slides](./papers/ML_Paper_Review_202504_biology_of_a_LLM.pdf)
25 |
26 | Please let us know if you have a topic you would like to share about in either a long (\~30 minute) or short (\~5-10 minutes) talk.
27 |
28 | ### 2024
29 |
30 | **ML Papers and Lightning Talks** \
31 | Monthly discussions of interesting ML papers and other ML lightning talks
32 | * **January**: \
33 | Ryan discussed *PowerInfer: Fast Large Language Model Serving with a Consumer-grade GPU* (https://arxiv.org/abs/2312.12456). \
34 | Ted discussed *Jailbroken: How Does LLM Safety Training Fail?* (https://arxiv.org/abs/2307.02483). \
35 | [Video](https://youtu.be/YZWV8s4K4zw) and link to [Ted's slides](./papers/ML_Paper_Review_202401_Jailbroken.pdf).
36 | * **February**: \
37 | Shibo Hao discussed his paper *ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings* (https://arxiv.org/abs/2305.11554). \
38 | [Video recording of the session](https://youtu.be/_THNEMzG-wM)
39 | * **March**: \
40 | Ryan discussed *Kosmos-2.5: A Multimodal Literate Model* (https://arxiv.org/abs/2309.11419). \
41 | Ted discussed *Understanding Generalization through Visualizations* (https://arxiv.org/abs/1906.03291). \
42 | [Video](https://youtu.be/zlfpfmel6O8) and link to [Ted's slides](./papers/ML_Paper_Review_202403_Generalization.pdf).
43 | * **April**: \
44 | Ted discussed parameter efficient fine-tuning (PEFT) and *DoRA: Weight-Decomposed Low-Rank Adaptation* (https://arxiv.org/abs/2402.09353). \
45 | Ryan discussed *Mixture-of-Depths: Dynamically allocating compute in transformer-based language models* (https://arxiv.org/abs/2404.02258). \
46 | [Video](https://youtu.be/DnlvLbDPR6c) and links to [Ted's slides](./papers/ML_Paper_Review_202404_DoRA.pdf) and [Ryan's slides](./papers/Mixture%20of%20Depths.pdf).
47 | * **May**: \
48 | Ted discussed *Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention* (https://arxiv.org/abs/2404.07143). \
49 | Ryan discussed *AM-RADIO: Agglomerative Vision Foundation Model -- Reduce All Domains Into One* (https://arxiv.org/abs/2312.06709). \
50 | [Video](https://youtu.be/6xXchgULI0c) and link to [Ted's slides](./papers/ML_Paper_Review_202405_long_context.pdf)
51 | * **June**: \
52 | Ted discussed *Better & Faster Large Language Models via Multi-token Prediction* (https://arxiv.org/abs/2404.19737). \
53 | Ryan discussed *RemoteCLIP: A Vision Language Foundation Model for Remote Sensing* (https://arxiv.org/abs/2306.11029). \
54 | [Video](https://youtu.be/MFOb9sh9geY) and links to [Ted's slides](./papers/ML_Paper_Review_202406_multi_token_pred.pdf).
55 | * **July**: \
56 | Sonali Tamhankar discussed Machine learning in healthcare. \
57 | Ryan discussed *Should We Fine-Tune or RAG? Evaluating Different Techniques to Adapt LLMs for Dialogue* (https://arxiv.org/abs/2406.06399). \
58 | [Video](https://youtu.be/_V67S3Jzwvg) and links to [Sonali's slides](./papers/Sonali%20Tamhankar%20DSS%20SF.pdf) and [Ryan's slides](./papers/Should%20We%20Fine-Tune%20or%20RAG.pdf).
59 | * **August**: \
60 | Ted discussed *Stealing Part of a Production Language Model* (https://arxiv.org/abs/2403.06634). \
61 | Liam Barstad discussed Key Machine Learning Concepts. \
62 | [Video](https://youtu.be/nM-aCn52fQE) and links to [Ted's slides](./papers/ML_Paper_Review_202408_stealing_part_of_llm.pdf) and [Liam's slides](./papers/Machine_Learning_Key_Concepts.pdf).
63 | * **September**: \
64 | Anna Malyarenko and Fae Gaze discussed the SDML-KIP work they did on the recent LMSYS competition on Kaggle. \
65 | Ryan discussed *TextGrad: Automatic "Differentiation" via Text* (https://arxiv.org/abs/2406.07496). \
66 | [Video](https://youtu.be/JFYHEihKCJo) and links to [Ryan's slides](./papers/TextGrad_Automatic_Differentiation_via_Text.pdf).
67 | * **October**: \
68 | Ted discussed *Writing in the Margins: Better Inference Pattern for Long Context Retrieval* (https://www.arxiv.org/abs/2408.14906). \
69 | Ryan discussed *Aurora: A Foundation Model of the Atmosphere* (https://arxiv.org/abs/2405.13063). \
70 | [Video](https://youtu.be/s-mEjnRE6JY) and links to [Ted's slides](./papers/ML_Paper_Review_202410_writing_in_the_margins.pdf) and [Ryan's slides](./papers/AURORA_a_foundation_model_of_the_atmosphere.pdf).
71 | * **November**: \
72 | Ryan discussed *Foundation Models for Generalist Geospatial Artificial Intelligence* (https://arxiv.org/abs/2310.18660). \
73 | Ted discussed Multi-head Latent Attention from *DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model* (https://arxiv.org/abs/2405.04434). \
74 | [Video](https://youtu.be/xPGvxE_hGd4) and links to [Ryan's slides](./papers/Foundation_Models_for_Generalist_Geospatial_Artificial_Intelligence.pdf) and [Ted's slides](./papers/ML_Paper_Review_202411_multi-head_latent_attn.pdf).
75 | * **December**: \
76 | Ryan discussed *MAGVIT: Masked Generative Video Transformer* (https://arxiv.org/abs/2212.05199). \
77 | Ted discussed *SAM 2: Segment Anything in Images and Videos* (https://arxiv.org/abs/2408.00714). \
78 | [Video](https://youtu.be/Ni62_K1t9HY) and links to [Ryan's slides](./papers/MAGVIT_Masked_Generative_Video_Transformer.pdf) and [Ted's slides](./papers/ML_Paper_Review_202412_segment_anything.pdf).
79 |
80 | ### 2023
81 |
82 | **ML Papers and Lightning Talks** \
83 | Monthly discussions of interesting ML papers and other ML lightning talks
84 | * **November**: \
85 | Ryan discussed *PDFTriage* (https://arxiv.org/abs/2309.08872) and *PseudoSeg* (https://arxiv.org/abs/2010.09713). \
86 | Ted discussed *Phi-1.5* (https://arxiv.org/abs/2309.05463) and *3D Gaussian Splatting* (https://arxiv.org/abs/2308.04079). \
87 | [Video](https://youtu.be/pi_yaKUlhSU) and links to [Ryan's slides](./papers/Interesting%20Papers%20202311.pdf) and [Ted's slides](./papers/ML_Paper_Review_202311.pdf).
88 | * **December**: \
89 | Ted discussed *Efficient Streaming Language Models with Attention Sinks* (https://arxiv.org/abs/2309.17453). \
90 | Ryan discussed *SimMIM: A Simple Framework for Masked Image Modeling* (https://arxiv.org/abs/2111.09886). \
91 | [Video](https://youtu.be/jHy-cv5C0MI) and links to [Ted's slides](./papers/ML_Paper_Review_202312.pdf) and [Ryan's slides](./papers/SimMIM.pdf).
92 |
93 | **State of the Art in Knowledge Editing** by Alex Loftus \
94 | *October 21, 2023*
95 | * Work across many subfields of machine learning has become increasingly reliant on billion parameter-scale models.
96 | If we can find methods to update these models without spending the time and computation necessary to run full fine-tuning sessions,
97 | we unlock customization previously only possible with industry-scale compute power.
98 | A parallel and related question is in interpretability:
99 | Where does knowledge live in these giant models? How is it stored, and can we find interpretable directions in their weight-space?
100 | * Join us for an exploration of the research surrounding these questions.
101 | We'll explore work coming out of Jacob Andreas' lab at MIT, Christopher Manning at Stanford, Jacob Steinhardt's lab at Berkeley,
102 | and Ludwig Schmidt's group at the University of Washington.
103 | We'll also look at some open-source work being done by EleutherAI as well as
104 | knowledge-editing work in natural language processing done by the Allen Institute for AI's Israeli team.
105 | * Alex is a data scientist working at a company using machine learning to speed up the drug discovery process. He holds an undergraduate degree in neuroscience with minors in chemistry and philosophy at Western Washington University, and a master's in biomedical engineering with a data science concentration from Johns Hopkins University.
106 | * [Video](https://youtu.be/q9mC3T2aBL8)
107 |
108 | **Contrastive Inference Methods** by Sean O'Brien \
109 | *October 16, 2023*
110 | * The rapidly increasing scale of large language models (LLMs) and their training corpora has led to remarkable improvements in fluency, reasoning, and information recall. Still, these models are prone to hallucination and fundamental reasoning errors, and reliably eliciting desired behaviors from them can be challenging. The development of strategies like chain-of-thought and self-consistency have since demonstrated that training-free techniques can extract better behavior from existing models, launching a wave of research into prompting techniques. In this talk, we will examine a separate, emerging class of training-free techniques also intended to better control LLM behavior, which can be called contrastive inference methods. These techniques achieve improvements across a number of tasks by exploiting the behavioral differences between two different inference processes; in the case of contrastive decoding, between a large “expert” model and a smaller “amateur” model. While this talk will mostly focus on contrastive decoding, it will also introduce similar methods and discuss applications beyond the domain of text generation.
111 | * Sean O’Brien is a first-year Ph.D. student at UC San Diego, advised by Julian McAuley. In the past, he has researched dark matter search methods, chess strategy, and language model reasoning at UC Berkeley and Meta AI. Now he is interested in developing more reliable decoding methods for large language models, as well as exploring the application of these techniques to goal-oriented dialogue systems.
112 | * [Slides](./cd_reasoning_presentation_1016.pdf) and [video](https://youtu.be/nsDPF48eSmE)
113 |
114 | **ICML Conference Highlights** by Alex Loftus \
115 | *September 2, 2023*
116 | * Alex Loftus shares from his experience at this year's [ICML Conference](https://icml.cc/).
117 | This talk will be about new machine learning techniques in drug discovery and medicine presented at ICML. The general themes were molecule generation with diffusion; retrosynthesis, e.g., chemical synthesis planning; molecule representation learning; molecular property prediction; and LLMs for sequence infilling.
118 | * Alex is a data scientist working at a company using machine learning to speed up the drug discovery process. He holds an undergraduate degree in neuroscience with minors in chemistry and philosophy at Western Washington University, and a master's in biomedical engineering with a data science concentration from Johns Hopkins University.
119 | * [Slides](https://docs.google.com/presentation/d/1t7qfAU5TXDpYLqNx3tRbk2khwNygyGt9CTkHqY6W0Fw/edit?usp=sharing)
120 |
121 | **Kaggle Competition Recap -- Identify Contrails to Reduce Global Warming** by Ryan Chesler \
122 | *August 19, 2023*
123 | * Come hear about the recently completed [Google Research - Identify Contrails to Reduce Global Warming](https://www.kaggle.com/competitions/google-research-identify-contrails-reduce-global-warming) competition on Kaggle.
124 | This free, virtual meetup will discuss the successful approaches used in this image segmentation competition.
125 | * From Kaggle:
126 | Contrails are clouds of ice crystals that form in aircraft engine exhaust.
127 | They can contribute to global warming by trapping heat in the atmosphere.
128 | Researchers have developed models to predict when contrails will form and how much warming they will cause.
129 | However, they need to validate these models with satellite imagery.
130 | Your work will help researchers improve the accuracy of their contrail models.
131 | This will help airlines avoid creating contrails and reduce their impact on climate change.
132 | * Ryan Chesler is a 2x Kaggle Grandmaster. He is an organizer of the San Diego Machine Learning meetup and a Data Scientist working for H2O.ai.
133 |
134 | **Kaggle Vesuvius Ink Detection -- Winning Solutions** \
135 | *June 17, 2023*
136 | * Come hear from the winning teams from the recently completed [Vesuvius Ink Detection competition on Kaggle](https://www.kaggle.com/competitions/vesuvius-challenge-ink-detection). The first place team will share about their solution and discuss a meta analysis of what other teams tried and seemed to work. Additional teams will be added to present as their schedules permit.
137 | * From Kaggle:
138 | Join the $1,000,000+ [Vesuvius Challenge](https://scrollprize.org/) to resurrect an ancient library from the ashes of a volcano. In this competition you are tasked with detecting ink from 3D X-ray scans and reading the contents. Thousands of scrolls were part of a library located in a Roman villa in Herculaneum, a town next to Pompeii. This villa was buried by the Vesuvius eruption nearly 2000 years ago. Due to the heat of the volcano, the scrolls were carbonized, and are now impossible to open without breaking them. There is a $700,000 grand prize available to the first team that can read these scrolls from a 3D X-ray scan.
139 | * The members of the winning team, Ryan Chesler, Aina Tersol, Alex Loftus, and Ted Kyi, are all San Diego Machine Learning regulars.
140 | * [Video recording](https://youtu.be/IWySc8s00P0) of the session
141 | * [Intro slides](https://github.com/tedkyi/talks/blob/master/Vesuvius%20Intro.pdf)
142 | * [Technical slides](https://docs.google.com/presentation/d/1YZ_KfjSm9tbzXisOwVBalJ-WVlvOF65iugZa73I8MuI/edit?usp=drivesdk)
143 |
144 | **Language Model Foundations** by Ted Kyi \
145 | *April 1, 2023*
146 | * With all the current hype surrounding large language models, we are going to take a step back in time and look at the GPT-3 paper Language Models are Few-Shot Learners (https://arxiv.org/abs/2005.14165). We will utilize video paper reviews, blogs, and architectural diagrams to provide a clear understanding of how GPT-style language models work. Come for a solid foundation how decoder-only transformers are used in language models, and to see how current discussions about LLMs echo the same debates that arose when GPT-3 came out.
147 | * [Notes doc](https://docs.google.com/document/d/1EiiljsglzVRb0RfMlHI7Mm1DdTnNw4uKomN45LkkRrc/edit?usp=sharing)
148 |
149 | **Introduction to Transformers** by Ted Kyi \
150 | *February 18, 2023*
151 | * After learning the architecture of the transformer, you know the parts and how they work. Through a coding example, pattern matching with a 1-D CNN will be compared to pattern matching using attention, with the goal of providing intuition as to why the self-attention mechanism makes neural networks so powerful.
152 | * [Video](https://youtu.be/mluRoOf-OTk) and [Jupyter notebook](https://colab.research.google.com/drive/1tQrG6IFw8B3xMv9lCZOFS23OTt2YuTXV?usp=sharing)
153 |
154 | ### 2022
155 |
156 | **Hands-On Workshop on Training and Using Transformers** by Ryan Chesler \
157 | *Starting December 3, 2022 and running about 5 Saturdays (skipping the holidays)*
158 | * The transformer architecture that is taking over all of machine learning was initially used for NLP. This series will introduce the transformer and use Jupyter notebooks to walk through gathering data, loading models from HuggingFace, training, and fine-tuning transformer models for language tasks.
159 | * The first session reviewed the roadmap for the series, discussed why transformers are important, and went through a notebook showing the high level process of loading and training a transformer model.
160 | * [Video for session 1](https://youtu.be/QxNHoVqPly8) and [Week 1 Colab notebook](https://colab.research.google.com/drive/1bSQwJ8HiyJh_v_TnBCJl_FoI7WWhzqI4?usp=sharing)
161 | * The second session discussed how text data gets used and walked through loading datasets in a pretraining notebook.
162 | * [Video for session 2](https://youtu.be/oXtq_mLFmeU) and [Week 2 Colab notebook](https://colab.research.google.com/drive/1HCwrZ93CeKJtAE4JfqA_IHLkVSJ8J8Nj?usp=sharing)
163 | * Model pretraining [Video for session 3](https://youtu.be/UhvRu6GthAs)
164 | * Model fine-tuning [Video for session 4](https://youtu.be/k69527MtjV4)
165 | and [Week 4 Colab notebook](https://colab.research.google.com/drive/1HCwrZ93CeKJtAE4JfqA_IHLkVSJ8J8Nj?usp=sharing)
166 | * Inference and deployment [Video for session 5](https://youtu.be/zhW995EJ--0)
167 | and [Week 5 Colab notebook](https://colab.research.google.com/drive/1jXbapHzKQJWaWcXCnRNsN7xlXtYVUbtT?usp=sharing)
168 |
169 | **Rigorous Statistics for Academics and Practitioners** by Michal Fabinger \
170 | Michal is continuing Thursday evening lectures on statistics
171 | * Fill out this form to receive the notes and quizes by email (you only need to do this once,
172 | but it is different from the initial 4-part series): https://form.typeform.com/to/rep1RuEc
173 | * The first series was **Shapes and Moments of Probability Distributions**, on March 31 and April 7, 2022.
174 | The MLT recordings of this series are [Part 1](https://youtu.be/aR-kh5O5fOg) and [Part 2](https://youtu.be/BWn8tGO-noI).
175 | * The second series is **Dependence of random variables and conditional distributions**, three parts starting April 14, 2022.
176 | Recordings from the Tokyo sessions are [Part 1](https://youtu.be/-nTYm21I_gM), [Part 2](https://youtu.be/14h5uSvoOv4), and [Part 3](https://youtu.be/OLxrtGk4tMg).
177 | * The third series will be **Estimators, asymptotic theory, and types of convergence of random variables**, in two parts starting May 5, 2022.
178 | The session recordings from MLT are now available: [Part 1](https://youtu.be/hWGzEnYaPf8) and [Part 2](https://youtu.be/mT3WGyDDzhI).
179 | * The next series will be on **Hypothesis testing**, in two parts starting May 19, 2022.
180 | * The fifth series covers **Linear regression models**, in four parts starting June 2, 2022.
181 |
182 |
183 | **Rigorous Probability and Statistics** by Michal Fabinger \
184 | *February 8, February 15, February 22, and March 1, 2022*
185 | * A 4-part series of lectures on Probability and Statistics starting from the beginning and proceeding in an intuitive, but mathematically rigorous way. (Similar Machine Learning lectures could also be scheduled.) The lectures should help Machine Learning practitioners and researchers to understand academic papers and to implement their methods. They should also help people pursuing academic paths in various scientific disciplines.
186 | * These lectures a being jointly hosted by Silicon Valley Hands On Programming Events (https://www.meetup.com/HandsOnProgrammingEvents/).
187 | They were also delivered to Machine Learning Tokyo.
188 | * Fill out this form to receive the notes and quizes by email: https://form.typeform.com/to/tYqIEqGN
189 | * The topics are:
190 | 1. *Feb. 8 [meetup](https://www.meetup.com/San-Diego-Machine-Learning/events/283748116/)* - Types of probability distributions and the need for a rigorous mathematical framework. Probability spaces, sample spaces, event spaces, and probability measures. Examples of probability spaces.
191 | Here is the [video recording of the MLT lecture](https://youtu.be/MxZN5F8iYLM).
192 | 1. *Feb. 15 [meetup](https://www.meetup.com/San-Diego-Machine-Learning/events/283844144/)* - Sigma-algebras for events. Borel sigma-algebras for events corresponding to continuous sample spaces. Random variables. Examples of random variables.
193 | Here is the [video of the MLT session](https://youtu.be/G9Tmp_82esA).
194 | 1. *Feb. 22* - Distributions of random variables. Cumulative distribution functions, probability mass functions, and probability density functions. Examples of distributions.
195 | The [video recording of the MLT lecture](https://youtu.be/Y6dPDJvcZrg).
196 | 1. *Mar. 1* - Transformations of random variables. Transformations of cumulative distribution functions, probability mass functions, and probability density functions. Examples of usage of transformed random variables.
197 | Here's the [video of the MLT meetup](https://youtu.be/kdqYQM--rKM).
198 |
199 |
200 | ### 2021
201 | **State of the Art Machine Learning Algorithms for Tabular Data** by Ryan Chesler \
202 | *November 20, 2021*
203 | * In this talk, Kaggle Grandmaster Ryan Chesler will discuss state of the art models used for tabular machine learning. He will explain the data preprocessing steps as well as the algorithms and how the two interact. He will discuss the nuts and bolts of the algorithms as well as some benchmarks showing the performance of the various different methods.
204 | * [Video](https://youtu.be/8Tag8uEH_ds)
205 |
206 | **Evaluating Robustness of Neural Networks** by Lily Weng \
207 | *October 28, 2021*
208 | * The robustness of neural networks to adversarial examples has received great attention due to security implications. Despite various attack approaches to crafting visually imperceptible adversarial examples, little has been developed towards a comprehensive measure of robustness. In this talk, Weng will introduce several robustness quantification frameworks for deep neural networks against both adversarial and non-adversarial input perturbations, including the first robustness score CLEVER, efficient certification algorithms Fast-Lin, CROWN, CNN-Cert, and probabilistic robustness verification algorithm PROVEN. The proposed approaches are computationally efficient and provide a good quality of robustness estimate/certificate as demonstrated by extensive experiments on MNIST, CIFAR and ImageNet.
209 | * Lily Weng is an Assistant Professor in the Halicioglu Data Science Institute at UC San Diego with an affiliation to the CSE department. She has broad research interest in the intersection of machine learning, optimization and reinforcement learning, with applications in cybersecurity and healthcare.
210 | * [Video](https://youtu.be/kB0r5KZdvx8)
211 |
212 | **The New DBfication of ML/AI** by Arun Kumar \
213 | *September 22, 2021*
214 | * The recent boom in ML/AI applications has brought into sharp focus the pressing need for tackling the concerns of scalability, usability, and manageability across the entire lifecycle of ML/AI applications. The ML/AI world has long studied the concerns of accuracy, automation, etc. from theoretical and algorithmic vantage points. But to truly democratize ML/AI, the vantage point of building and deploying practical systems is equally critical. \
215 | In this talk, Professor Kumar will make the case that it is high time to bridge the gap between the ML/AI world and a world that exemplifies successful democratization of data technology: databases. He will show how new bridges rooted in the principles, techniques, and tools of the database world are helping tackle the above pressing concerns and in turn, posing new research questions to the world of ML/AI. As case studies of such bridges, he will describe two lines of work from his group: query optimization for ML systems and benchmarking data preparation in AutoML platforms. He will conclude with his thoughts on community mechanisms to foster more such bridges between research worlds and between research and practice.
216 | * Arun Kumar is an Assistant Professor in the Department of Computer Science and Engineering and the Halicioglu Data Science Institute and an HDSI Faculty Fellow at the University of California, San Diego. He is a member of the Database Lab and Center for Networked Systems and an affiliate member of the AI Group.
217 | * [Slides](./2021HDSI-SD-MLMeetupArunKumar.pdf) and [video](https://youtu.be/z4VpmRv3Ylo)
218 |
219 | **Learning with Distributed Data** by Arya Mazumdar \
220 | *August 25, 2021*
221 | * In recent years, large-scale training for machine learning models mandatorily takes place in a massively large distributed system composed of individual computational nodes (e.g., ranges from GPUs to low-end commodity hardware). Such distributed systems are inherently constrained by communication. For example, a computational node 1) may not be able to communicate local information due to limited bandwidth 2) may not want to share information to maintain privacy 2) may intentionally corrupt information as an adversarial attack. What is the compromise in the convergence rate of an optimization algorithm due to these communication constraints? We explore these trade-offs and show that first and second order methods of optimization still work under all such heavy information constraints.
222 | * Arya Mazumdar is an associate professor in the Halicioglu Data Science Institute of University of California San Diego, with additional affiliation to the Departments of Computer Science and Electrical Engineering.
223 | * [Video](https://youtu.be/90AueFwajAs)
224 |
225 | **Discrete Morse-based Graph Skeletonization and Data Analysis** by Yusu Wang \
226 | *July 28, 2021*
227 | * In recent years, topological and geometric data analysis (TGDA) has emerged as a new and promising field for processing, analyzing and understanding complex data. Indeed, geometry and topology form natural platforms for data analysis, with geometry describing the ”shape” and ”structure” behind data; and topology characterizing / summarizing both the domain where data are sampled from, as well as functions and maps associated to them. In this talk, Yusu will show how the topological objects and ideas can be combined with algorithmic developments to lead to new approaches for inferring hidden graph skeleton structure behind (low and high dimensional) data; as well as how they can be combined with machine learning pipelines for further data analysis tasks (e.g., to neuroscience and to material science). This talk is based on multiple projects with multiple collaborators and references will be given during the talk.
228 | * Yusu Wang is currently a Professor in the Halicioglu Data Science Institute at University of California, San Diego. Prior to joining UCSD, she was a Professor in the Computer Science and Engineering Department at the Ohio State University, where she also co-directed the Foundation of Data Science Research CoP at Translational Data Analytics Institute (TDAI)@OSU from 2018-2020.
229 | * [Video](https://youtu.be/h2b6CnCK-x8)
230 |
231 | **A friendly introduction to PySpark MLlib (and a taste of MLFlow)** by Michelle Hoogenhout \
232 | *July 17, 2021*
233 | * Doing data science at scale? PySpark and MLlib bring the power of Spark's distributed processing to python users so that you can train machine learning models on massive datasets. MLlib provides tools for data extraction, transformation and loading, common ML algorithms, and model evaluation. And with the addition of MLFlow, it's easier than ever to log, reproduce and deploy your ML models. This walkthrough is aimed at those new to MLflow, and will take you through the ML lifecycle with PySpark's ML toolset.
234 | * Michelle Hoogenhout is a data scientist with a background in cognitive neuroscience and experimental design. She is a senior data science and analytics instructor at Galvanize and co-founder of Ingane Health, a data science consulting firm. Michelle holds a PhD (Psychology) from the University of Cape Town, South Africa and has published on topics such as statistics and data management, data science training methods, ethics, and cognitive and physiological assessment.
235 | * The recording is available at this [video link](https://youtu.be/spDmZAwBLAo)
236 |
237 | **Causal Algorithmic Fairness and Transparency** by Babak Salimi \
238 | *May 26, 2021*
239 | * Scaling and democratizing access to big data promises to provide meaningful and actionable information that supports decision-making. Today, data-driven decisions profoundly affect the course of our lives, such as whether to admit applicants to a particular school, offer them a job, or grant them a mortgage. Unfair, inconsistent, or faulty decision-making raises serious concerns about ethics and responsibility. For example, we may know that our training data is biased, but how do we avoid propagating discrimination when we use this data? How do we avoid incorrect, spurious and non-reproducible findings? How can we curate and expose existing data to make it "safe" for informed decision-making? \
240 | In this talk, Babak will describe how we can combine techniques from causal inference and data management to develop systems and algorithms that help answer questions about fairness and transparency of algorithmic systems. First, he will present a new notion of fairness that subsumes and improves upon previous definitions and correctly distinguishes between fairness violations and non-violations. Further, he will discuss how we can leverage techniques from data management to remove historical discrimination from data. Second he will present a novel declarative framework that enables reasoning about fairness and discrimination from complex relational data. Finally, he will present his most recent work that exploits counterfactual reasoning for explaining black-box decision-making algorithms.
241 | * Babak Salimi is an assistant professor in HDSI at UC San Diego. Before joining UC San Diego, he was a postdoctoral research associate in the Department of Computer Science and Engineering, University of Washington where he worked with Prof. Dan Suciu and the database group.
242 | * [Video](https://youtu.be/DJBAzvZDEgs)
243 |
244 | **Take a Hack at COVID!** by Benjamin Smarr \
245 | *April 21, 2021*
246 | * Professor Smarr is the technical lead on TemPredict, an international research effort aimed at building deployable algorithms for COVID detection and health monitoring. TemPredict gathered wearable and survey data from ~65,000 global participants. Professor Smarr will share some early insights, and highlight opportunities for interested hackers to get involved in future analyses.
247 | * Benjamin Smarr is an assistant professor at the Halicioğlu Data Science Institute and the Department of Bioengineering at the University of California, San Diego. As an NIH fellow at UC Berkeley he developed techniques for extracting health and performance predictors from repeated, longitudinal physiological measurements. Historically his work has focused on neuroendocrine control and women’s health, including demonstrations of pregnancy detection and outcome prediction, neural control of ovulation, and the importance of circadian rhythms in healthy in utero development.
248 | * [Slides](Hack%20COVID%20slides.pdf) and [video](https://vimeo.com/569978555)
249 |
250 | **Explaining by Removing: A Unified Framework for Model Explanation** by Ian Covert \
251 | *April 13, 2021*
252 | * Researchers have proposed a wide variety of model explanation approaches, but it remains unclear how most methods are related or when one method is preferable to another. In this talk we'll discuss a new class of methods, removal-based explanations, that are based on the principle of simulating feature removal to quantify each feature’s influence. These methods vary in several respects, so we develop a framework that characterizes each method along three dimensions: 1) how the method removes features, 2) what model behavior the method explains, and 3) how the method summarizes each feature’s influence. Our framework unifies 25 existing methods, including several of the most widely used approaches (SHAP, LIME, Meaningful Perturbations, permutation tests), and it helps us reveal underlying connections with fields such as cognitive psychology, game theory and information theory.
253 | * Ian Covert is a PhD student at the University of Washington, where he is advised by Su-In Lee and collaborates with Scott Lundberg. His research focuses on explainable machine learning and the applications of these tools to problems in biology and medicine. Previously, he was a student researcher at Google Brain and completed his bachelors degree at Columbia University.
254 | * [Slides](./Explaining_by_Removing.pdf) and [video](https://youtu.be/ZbOSjimCZm0)
255 |
256 | **AutoNet: Automated Network Construction from Massive Text Corpora** by Jingbo Shang
257 |
258 | *March 24, 2021*
259 | * Mining structured knowledge from massive unstructured text data is a key challenge in data science. In this talk, I will discuss my proposed framework, AutoNet, that transforms unstructured text data into structured heterogeneous information networks, on which actionable knowledge can be further uncovered flexibly and effectively. AutoNet is a data-driven approach using distant supervision instead of human curation and labeling. It consists of four essential steps: (1) quality phrase mining; (2) entity recognition and typing; (3) relation extraction; and (4) taxonomy construction.
260 | * Jingbo Shang is an Assistant Professor in Computer Science Engineering and Halıcıoğlu Data Science Institute at UC San Diego. His research has been recognized by many prestigious awards, including the Grand Prize of Yelp Dataset Challenge in 2015, Google Ph.D. Fellowship in Structured Data and Database Management in 2017, SIGKDD Dissertation Award Runner-up in 2020.
261 | * [Video](https://youtu.be/E69GTjnXjf0) has been uploaded to YouTube
262 |
263 | **Rainforest Connection Species Audio Detection - A Kaggle Competition Recap** by Ryan Chesler
264 |
265 | *March 13, 2021*
266 | * Come hear about the Rainforest Connection Species Audio Detection competition on Kaggle (https://www.kaggle.com/c/rfcx-species-audio-detection). This free, virtual meetup will discuss the recently completed competition.
267 | * From Kaggle:
268 | In this competition, you’ll automate the detection of bird and frog species in tropical soundscape recordings. You'll create your models with limited, acoustically complex training data. Rich in more than bird and frog noises, expect to hear an insect or two, which your model will need to filter out.
269 | * Ryan is an organizer of the San Diego Machine Learning meetup and a Data Scientist working for H2O.ai developing automated machine learning systems. He is a Kaggle grand master and self-driving car enthusiast.
270 | * [Slides](https://docs.google.com/presentation/d/1Pi36eq4NnFeQ19gTNYkyVEXMBs0PyW8LtM1RtuKenKQ/edit#slide=id.p) and [video](https://youtu.be/Vf_FIMDVs00)
271 |
272 | ### 2020
273 |
274 | **Investment Analytics in the Dawn of Artificial Intelligence** by Bernard Lee \
275 | *December 12, 2020*
276 | * This talk attempts to answer a few questions:
277 | 1) Why artificial intelligence/machine learning (AI/ML) is helpful to certain fintech apps?
278 | 2) When AI/ML works well in fintech? When it may not work so well?
279 | 3) Apart from interesting theories, what are the practical prerequisites for AI/ML applications to perform well in real-life finance, to the point of replacing jobs?
280 | 4) Where society may end up if trusted AI/ML applications in fintech become widely available?
281 | 5) What may be the potential implications for me, my financial future, and my career?
282 | * Bernard Lee is the founder/CEO of HedgeSPA, whose mission is to revolutionize the landscape of professional investment analytics by democratizing access to the most sophisticated B2B investment analytics tool. HedgeSPA's core investment platform is unique in its utilization of Artificial Intelligence, Big Data, and High-Performance/Quantum Computing.
283 | * [Slides](./Bernard%20Lee%20Presentation%20for%20SDML%20(Distribution%20Copy).pdf) can be downloaded, however please contact Ted, the meetup organizer, for the password.
284 |
285 | **Neuroscience in the data science age** by Bradley Voytek \
286 | *December 8, 2020*
287 | * The brain is often likened to a symphony, where 86 billion neurons are coordinating in an unfathomably complex electrochemical orchestra. However, our brains are more like a symphony without a conductor: there is no leader orchestrating those 86 billion neurons! Despite this apparent chaos, our brains usually just work (if we're lucky!). My research lab leverages a data science approach to neuroscience in order to understand how these 86 billion neurons communicate with one another, and to figure out when, why, and how that process breaks down.
288 | * Bradley Voytek is an Associate Professor in the Department of Cognitive Science, the Halıcıoğlu Data Science Institute, and the Neurosciences Graduate Program at UC San Diego. He is both an Alfred P. Sloan Neuroscience Research Fellow and a Kavli Fellow of the National Academies of Sciences, as well as a founding faculty member of the UC San Diego Halıcıoğlu Data Science Institute and the Undergraduate Data Science program, where he serves as Vice-Chair.
289 | * [Slides](./Voytek-SD-meetup.pdf) and [video](https://youtu.be/vCxHGuQb5b0)
290 |
291 | **Lyft Motion Prediction for Autonomous Vehicles - A Kaggle Competition Recap** by Ryan Chesler
292 |
293 | *November 28, 2020*
294 | * Come hear about the Lyft Motion Prediction for Autonomous Vehicles competition on Kaggle (https://www.kaggle.com/c/lyft-motion-prediction-autonomous-vehicles). This virtual meetup is scheduled just days after the competition closes.
295 | * From Kaggle:
296 | In this competition, you’ll apply your data science skills to build motion prediction models for self-driving vehicles. You'll have access to the largest Prediction Dataset ever released to train and test your models. Your knowledge of machine learning will then be required to predict how cars, cyclists, and pedestrians move in the AV's environment.
297 | * Ryan is an organizer of the San Diego Machine Learning meetup and a Data Scientist working for H2O.ai developing automated machine learning systems. He is a Kaggle triple master and self-driving car enthusiast.
298 | * [Slides](https://docs.google.com/presentation/d/1U3smjkPcGiit6EzqfyYnSxGdoCxj31EGdKyfDA2a0RI/edit#slide=id.p) and [video](https://youtu.be/ScjnDwghHGM)
299 |
300 | **A RapidMiner Overview** by Steven Fouskarinis
301 |
302 | *November 14, 2020*
303 | * RapidMiner is a platform that is used by over 4,000 universities to teach machine learning concepts to students while not presuming those students can program. However, the platform is also used by over 40,000 organizations worldwide for commercial applications. Come and take a brief but informative guided tour of the RapidMiner coal mines from a user not a salesperson.
304 | * Steven Fouskarinis is an electrical engineer and computer scientist by training who works on scaling execution at a company level. Past projects include using NLP to infer ICDs & CPTs from clinical notes, automating medical bill creation and submission, predicting infusion pump misuse, and using a chatbot as a non-visual interface for research paper discovery.
305 | * [Slides](./SDML-RapidMiner_Overview.pdf) and [video](https://youtu.be/xJqkhWZ-e3A)
306 |
307 | **Genetics & Genomics** by Chris Keown
308 |
309 | *June 20, 2020*
310 | * Most people have a superficial understanding of genetics, as it plays a causal role in who we are. In the first half of this talk, I will explain what the field of genetics is, how we do experiments and analyses, and the recent learnings and challenges in the field. In the second half of my talk, I will focus on genomics and epigenomics. These are two very hot areas in research, but are lesser known to non-experts. Again, I will explain the field, how we do experiments and analyses, and how machine learning has been and can be applied in this domain.
311 | * [Slides](./SDML_genomics_FNL.pdf) and [video](https://www.youtube.com/watch?v=NT9hBuBAexg)
312 |
313 | **Productizing ML** by Ryan Chesler
314 |
315 | *June 13, 2020*
316 | * A discussion how to make a product out of a machine learning idea.
317 | Walking briefly through all of the things that need to be considered while building a machine learning project, beyond just building models.
318 | * Ryan is an organizer of the San Diego Machine Learning meetup and a Data Scientist working for H2O.ai developing automated machine learning systems. He is a Kaggle triple master and has taken several systems to production all the way from idea to deployment.
319 | * [Video](https://www.youtube.com/watch?v=hAtiPsShYMk)
320 |
321 | **Intro to Text Summarization and Topic Segmentation** by Vibhu Sapra
322 |
323 | *June 13, 2020*
324 | * An introduction to text summarization and to different approaches towards topic segmentation.
325 | The talk will cover the basics of how machine summarization works, applications of text summarization, code examples, and explore current state of the art models.
326 | * Vibhu is the founder of Byrd.ai and has been developing various NLP models for text summarization in a production setting.
327 | * [Video](https://www.youtube.com/watch?v=hAtiPsShYMk)
328 |
329 | **Self-Driving Cars** by Ryan Chesler
330 |
331 | *June 6, 2020*
332 | * A discussion of the mechanisms and challenges involved in self-driving cars and what part machine learning plays in it.
333 | * Ryan is an organizer of the San Diego Machine Learning meetup and a Data Scientist working for H2O.ai developing automated machine learning systems. He is a Kaggle triple master and self-driving car enthusiast.
334 | * [Video](https://www.youtube.com/watch?v=yAMygd-HZTU)
335 |
336 | **A Beginner's Overview of Spark and PySpark** by Ted Kyi
337 |
338 | *May 30, 2020*
339 | * A discussion of what Spark is, why you would use it, as well as details, key features, code examples and how to learn more about it.
340 | * Ted is vice president of analytics for Matrix Medical Network. https://www.linkedin.com/in/tedkyi/
341 | * The slides and PySpark notebook are available in Ted's GitHub repo https://github.com/tedkyi/spark_talk, unfortunately video not avaialable
342 |
343 | ### 2019 and earlier
344 |
345 | Prior to the global pandemic, we met in person and regrettably did not record any of our presentations.
346 |
347 | ### Additional information
348 |
349 | For SDML book club notes and videos, please see the [book club repo](https://github.com/SanDiegoMachineLearning/bookclub).
350 |
351 | To stay in touch with San Diego Machine Learning and receive announcements of all of our events, join our Meetup group https://www.meetup.com/San-Diego-Machine-Learning.
352 |
353 | For more events, job postings, and discussion of machine learning, join our slack channel
354 | https://join.slack.com/t/sdmachinelearning/shared_invite/zt-2b2207qhg-Iyys1g0Ot6iErTYMioV9Mg
355 |
356 |
357 |
358 |
--------------------------------------------------------------------------------
/SDML-RapidMiner_Overview.pdf:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/SanDiegoMachineLearning/talks/31edc1eb06ed2646ca6bcdb0605dae2c7998eda2/SDML-RapidMiner_Overview.pdf
--------------------------------------------------------------------------------
/SDML_genomics_FNL.pdf:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/SanDiegoMachineLearning/talks/31edc1eb06ed2646ca6bcdb0605dae2c7998eda2/SDML_genomics_FNL.pdf
--------------------------------------------------------------------------------
/Voytek-SD-meetup.pdf:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/SanDiegoMachineLearning/talks/31edc1eb06ed2646ca6bcdb0605dae2c7998eda2/Voytek-SD-meetup.pdf
--------------------------------------------------------------------------------
/cd_reasoning_presentation_1016.pdf:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/SanDiegoMachineLearning/talks/31edc1eb06ed2646ca6bcdb0605dae2c7998eda2/cd_reasoning_presentation_1016.pdf
--------------------------------------------------------------------------------
/papers/AM-RADIO_Agglomerative Vision Foundation Model Reduce All Domains Into One.pdf:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/SanDiegoMachineLearning/talks/31edc1eb06ed2646ca6bcdb0605dae2c7998eda2/papers/AM-RADIO_Agglomerative Vision Foundation Model Reduce All Domains Into One.pdf
--------------------------------------------------------------------------------
/papers/AURORA_a_foundation_model_of_the_atmosphere.pdf:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/SanDiegoMachineLearning/talks/31edc1eb06ed2646ca6bcdb0605dae2c7998eda2/papers/AURORA_a_foundation_model_of_the_atmosphere.pdf
--------------------------------------------------------------------------------
/papers/Applications_of_ML_in_Astronomy.pdf:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/SanDiegoMachineLearning/talks/31edc1eb06ed2646ca6bcdb0605dae2c7998eda2/papers/Applications_of_ML_in_Astronomy.pdf
--------------------------------------------------------------------------------
/papers/DeepSeek-R1_Incentivizing_Reasoning_Capability_in_LLMs_via_Reinforcement_Learning.pdf:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/SanDiegoMachineLearning/talks/31edc1eb06ed2646ca6bcdb0605dae2c7998eda2/papers/DeepSeek-R1_Incentivizing_Reasoning_Capability_in_LLMs_via_Reinforcement_Learning.pdf
--------------------------------------------------------------------------------
/papers/Foundation_Models_for_Generalist_Geospatial_Artificial_Intelligence.pdf:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/SanDiegoMachineLearning/talks/31edc1eb06ed2646ca6bcdb0605dae2c7998eda2/papers/Foundation_Models_for_Generalist_Geospatial_Artificial_Intelligence.pdf
--------------------------------------------------------------------------------
/papers/GAIA.pdf:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/SanDiegoMachineLearning/talks/31edc1eb06ed2646ca6bcdb0605dae2c7998eda2/papers/GAIA.pdf
--------------------------------------------------------------------------------
/papers/Interesting Papers 202311.pdf:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/SanDiegoMachineLearning/talks/31edc1eb06ed2646ca6bcdb0605dae2c7998eda2/papers/Interesting Papers 202311.pdf
--------------------------------------------------------------------------------
/papers/MAGVIT_Masked_Generative_Video_Transformer.pdf:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/SanDiegoMachineLearning/talks/31edc1eb06ed2646ca6bcdb0605dae2c7998eda2/papers/MAGVIT_Masked_Generative_Video_Transformer.pdf
--------------------------------------------------------------------------------
/papers/ML_Paper_Review_202311.pdf:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/SanDiegoMachineLearning/talks/31edc1eb06ed2646ca6bcdb0605dae2c7998eda2/papers/ML_Paper_Review_202311.pdf
--------------------------------------------------------------------------------
/papers/ML_Paper_Review_202312.pdf:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/SanDiegoMachineLearning/talks/31edc1eb06ed2646ca6bcdb0605dae2c7998eda2/papers/ML_Paper_Review_202312.pdf
--------------------------------------------------------------------------------
/papers/ML_Paper_Review_202401_Jailbroken.pdf:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/SanDiegoMachineLearning/talks/31edc1eb06ed2646ca6bcdb0605dae2c7998eda2/papers/ML_Paper_Review_202401_Jailbroken.pdf
--------------------------------------------------------------------------------
/papers/ML_Paper_Review_202403_Generalization.pdf:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/SanDiegoMachineLearning/talks/31edc1eb06ed2646ca6bcdb0605dae2c7998eda2/papers/ML_Paper_Review_202403_Generalization.pdf
--------------------------------------------------------------------------------
/papers/ML_Paper_Review_202404_DoRA.pdf:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/SanDiegoMachineLearning/talks/31edc1eb06ed2646ca6bcdb0605dae2c7998eda2/papers/ML_Paper_Review_202404_DoRA.pdf
--------------------------------------------------------------------------------
/papers/ML_Paper_Review_202405_long_context.pdf:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/SanDiegoMachineLearning/talks/31edc1eb06ed2646ca6bcdb0605dae2c7998eda2/papers/ML_Paper_Review_202405_long_context.pdf
--------------------------------------------------------------------------------
/papers/ML_Paper_Review_202406_multi_token_pred.pdf:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/SanDiegoMachineLearning/talks/31edc1eb06ed2646ca6bcdb0605dae2c7998eda2/papers/ML_Paper_Review_202406_multi_token_pred.pdf
--------------------------------------------------------------------------------
/papers/ML_Paper_Review_202408_stealing_part_of_llm.pdf:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/SanDiegoMachineLearning/talks/31edc1eb06ed2646ca6bcdb0605dae2c7998eda2/papers/ML_Paper_Review_202408_stealing_part_of_llm.pdf
--------------------------------------------------------------------------------
/papers/ML_Paper_Review_202410_writing_in_the_margins.pdf:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/SanDiegoMachineLearning/talks/31edc1eb06ed2646ca6bcdb0605dae2c7998eda2/papers/ML_Paper_Review_202410_writing_in_the_margins.pdf
--------------------------------------------------------------------------------
/papers/ML_Paper_Review_202411_multi-head_latent_attn.pdf:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/SanDiegoMachineLearning/talks/31edc1eb06ed2646ca6bcdb0605dae2c7998eda2/papers/ML_Paper_Review_202411_multi-head_latent_attn.pdf
--------------------------------------------------------------------------------
/papers/ML_Paper_Review_202412_segment_anything.pdf:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/SanDiegoMachineLearning/talks/31edc1eb06ed2646ca6bcdb0605dae2c7998eda2/papers/ML_Paper_Review_202412_segment_anything.pdf
--------------------------------------------------------------------------------
/papers/ML_Paper_Review_202502_visual_autoregressive.pdf:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/SanDiegoMachineLearning/talks/31edc1eb06ed2646ca6bcdb0605dae2c7998eda2/papers/ML_Paper_Review_202502_visual_autoregressive.pdf
--------------------------------------------------------------------------------
/papers/ML_Paper_Review_202504_biology_of_a_LLM.pdf:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/SanDiegoMachineLearning/talks/31edc1eb06ed2646ca6bcdb0605dae2c7998eda2/papers/ML_Paper_Review_202504_biology_of_a_LLM.pdf
--------------------------------------------------------------------------------
/papers/ML_Paper_Review_202505_describe_anything.pdf:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/SanDiegoMachineLearning/talks/31edc1eb06ed2646ca6bcdb0605dae2c7998eda2/papers/ML_Paper_Review_202505_describe_anything.pdf
--------------------------------------------------------------------------------
/papers/Machine_Learning_Key_Concepts.pdf:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/SanDiegoMachineLearning/talks/31edc1eb06ed2646ca6bcdb0605dae2c7998eda2/papers/Machine_Learning_Key_Concepts.pdf
--------------------------------------------------------------------------------
/papers/Mixture of Depths.pdf:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/SanDiegoMachineLearning/talks/31edc1eb06ed2646ca6bcdb0605dae2c7998eda2/papers/Mixture of Depths.pdf
--------------------------------------------------------------------------------
/papers/Should We Fine-Tune or RAG.pdf:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/SanDiegoMachineLearning/talks/31edc1eb06ed2646ca6bcdb0605dae2c7998eda2/papers/Should We Fine-Tune or RAG.pdf
--------------------------------------------------------------------------------
/papers/SimMIM.pdf:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/SanDiegoMachineLearning/talks/31edc1eb06ed2646ca6bcdb0605dae2c7998eda2/papers/SimMIM.pdf
--------------------------------------------------------------------------------
/papers/Sonali Tamhankar DSS SF.pdf:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/SanDiegoMachineLearning/talks/31edc1eb06ed2646ca6bcdb0605dae2c7998eda2/papers/Sonali Tamhankar DSS SF.pdf
--------------------------------------------------------------------------------
/papers/TextGrad_Automatic_Differentiation_via_Text.pdf:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/SanDiegoMachineLearning/talks/31edc1eb06ed2646ca6bcdb0605dae2c7998eda2/papers/TextGrad_Automatic_Differentiation_via_Text.pdf
--------------------------------------------------------------------------------