├── requirements.txt
├── README.md
├── LICENSE
├── .devcontainer
    ├── Dockerfile
    └── devcontainer.json
├── examples
    ├── mlops-wikipedia.txt
    ├── download.ipynb
    └── mlflow.txt
├── .gitignore
└── notebooks
    ├── try-datasets.ipynb
    └── try-transformers.ipynb


/requirements.txt:
--------------------------------------------------------------------------------
1 | ipywidgets==8.0.1
2 | ipykernel==6.15.1
3 | transformers==4.21.1
4 | datasets==2.4.0
5 | ipykernel==6.15.1
6 | tensorflow==2.10
7 | 


--------------------------------------------------------------------------------
/README.md:
--------------------------------------------------------------------------------
 1 | # Try 🤗 HuggingFace!
 2 | 
 3 | Examples to try HuggingFace datasets and transformers
 4 | 
 5 | * Search the [model hub](https://huggingface.co/models) for existing models
 6 | * Install the [Datasets](https://github.com/huggingface/datasets/) Python package from the [requirements.txt](./requirements.txt) file
 7 | * Install the [Transformers](https://github.com/huggingface/transformers) Python package from the [requirements.txt](./requirements.txt) file
 8 | 
 9 | Browse through the [Jupyter Notebook examples](./notebooks).
10 | 


--------------------------------------------------------------------------------
/LICENSE:
--------------------------------------------------------------------------------
 1 | MIT License
 2 | 
 3 | Copyright (c) 2022 Alfredo Deza
 4 | 
 5 | Permission is hereby granted, free of charge, to any person obtaining a copy
 6 | of this software and associated documentation files (the "Software"), to deal
 7 | in the Software without restriction, including without limitation the rights
 8 | to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
 9 | copies of the Software, and to permit persons to whom the Software is
10 | furnished to do so, subject to the following conditions:
11 | 
12 | The above copyright notice and this permission notice shall be included in all
13 | copies or substantial portions of the Software.
14 | 
15 | THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
16 | IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
17 | FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
18 | AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
19 | LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
20 | OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
21 | SOFTWARE.
22 | 


--------------------------------------------------------------------------------
/.devcontainer/Dockerfile:
--------------------------------------------------------------------------------
 1 | # See here for image contents: https://github.com/microsoft/vscode-dev-containers/tree/v0.245.2/containers/python-3/.devcontainer/base.Dockerfile
 2 | 
 3 | # [Choice] Python version (use -bullseye variants on local arm64/Apple Silicon): 3, 3.10, 3.9, 3.8, 3.7, 3.6, 3-bullseye, 3.10-bullseye, 3.9-bullseye, 3.8-bullseye, 3.7-bullseye, 3.6-bullseye, 3-buster, 3.10-buster, 3.9-buster, 3.8-buster, 3.7-buster, 3.6-buster
 4 | ARG VARIANT="3.10-bullseye"
 5 | FROM mcr.microsoft.com/vscode/devcontainers/python:0-${VARIANT}
 6 | 
 7 | 
 8 | # [Optional] If your pip requirements rarely change, uncomment this section to add them to the image.
 9 | COPY requirements.txt /tmp/pip-tmp/
10 | RUN pip3 --disable-pip-version-check --no-cache-dir install -r /tmp/pip-tmp/requirements.txt \
11 |    && rm -rf /tmp/pip-tmp
12 | 
13 | # [Optional] Uncomment this section to install additional OS packages.
14 | # RUN apt-get update && export DEBIAN_FRONTEND=noninteractive \
15 | #     && apt-get -y install --no-install-recommends <your-package-list-here>
16 | 
17 | # [Optional] Uncomment this line to install global node packages.
18 | # RUN su vscode -c "source /usr/local/share/nvm/nvm.sh && npm install -g <your-package-here>" 2>&1


--------------------------------------------------------------------------------
/examples/mlops-wikipedia.txt:
--------------------------------------------------------------------------------
1 | MLOps or ML Ops is a set of practices that aims to deploy and maintain machine learning models in production reliably and efficiently.[1] The word is a compound of "machine learning" and the continuous development practice of DevOps in the software field. Machine learning models are tested and developed in isolated experimental systems. When an algorithm is ready to be launched, MLOps is practiced between Data Scientists, DevOps, and Machine Learning engineers to transition the algorithm to production systems.[2] Similar to DevOps or DataOps approaches, MLOps seeks to increase automation and improve the quality of production models, while also focusing on business and regulatory requirements. While MLOps started as a set of best practices, it is slowly evolving into an independent approach to ML lifecycle management. MLOps applies to the entire lifecycle - from integrating with model generation (software development lifecycle, continuous integration/continuous delivery), orchestration, and deployment, to health, diagnostics, governance, and business metrics. According to Gartner, MLOps is a subset of ModelOps. MLOps is focused on the operationalization of ML models, while ModelOps covers the operationalization of all types of AI models.[3]
2 | 


--------------------------------------------------------------------------------
/.devcontainer/devcontainer.json:
--------------------------------------------------------------------------------
 1 | // For format details, see https://aka.ms/devcontainer.json. For config options, see the README at:
 2 | // https://github.com/microsoft/vscode-dev-containers/tree/v0.245.2/containers/python-3
 3 | {
 4 | 	"name": "Python 3",
 5 | 	"build": {
 6 | 		"dockerfile": "Dockerfile",
 7 | 		"context": "..",
 8 | 		"args": { 
 9 | 			// Update 'VARIANT' to pick a Python version: 3, 3.10, 3.9, 3.8, 3.7, 3.6
10 | 			// Append -bullseye or -buster to pin to an OS version.
11 | 			// Use -bullseye variants on local on arm64/Apple Silicon.
12 | 			"VARIANT": "3.8",
13 | 			// Options
14 | 			"NODE_VERSION": "none"
15 | 		}
16 | 	},
17 | 
18 | 	// Configure tool-specific properties.
19 | 	"customizations": {
20 | 		// Configure properties specific to VS Code.
21 | 		"vscode": {
22 | 			// Set *default* container specific settings.json values on container create.
23 | 			"settings": { 
24 | 				"python.defaultInterpreterPath": "/usr/local/bin/python",
25 | 				"python.linting.enabled": true,
26 | 				"python.linting.pylintEnabled": true,
27 | 				"python.formatting.autopep8Path": "/usr/local/py-utils/bin/autopep8",
28 | 				"python.formatting.blackPath": "/usr/local/py-utils/bin/black",
29 | 				"python.formatting.yapfPath": "/usr/local/py-utils/bin/yapf",
30 | 				"python.linting.banditPath": "/usr/local/py-utils/bin/bandit",
31 | 				"python.linting.flake8Path": "/usr/local/py-utils/bin/flake8",
32 | 				"python.linting.mypyPath": "/usr/local/py-utils/bin/mypy",
33 | 				"python.linting.pycodestylePath": "/usr/local/py-utils/bin/pycodestyle",
34 | 				"python.linting.pydocstylePath": "/usr/local/py-utils/bin/pydocstyle",
35 | 				"python.linting.pylintPath": "/usr/local/py-utils/bin/pylint"
36 | 			},
37 | 			
38 | 			// Add the IDs of extensions you want installed when the container is created.
39 | 			"extensions": [
40 | 				"ms-python.python",
41 | 				"ms-python.vscode-pylance"
42 | 			]
43 | 		}
44 | 	},
45 | 
46 | 	// Use 'forwardPorts' to make a list of ports inside the container available locally.
47 | 	// "forwardPorts": [],
48 | 
49 | 	// Use 'postCreateCommand' to run commands after the container is created.
50 | 	// "postCreateCommand": "pip3 install --user -r requirements.txt",
51 | 
52 | 	// Comment out to connect as root instead. More info: https://aka.ms/vscode-remote/containers/non-root.
53 | 	"remoteUser": "vscode"
54 | }
55 | 


--------------------------------------------------------------------------------
/.gitignore:
--------------------------------------------------------------------------------
  1 | # Byte-compiled / optimized / DLL files
  2 | __pycache__/
  3 | *.py[cod]
  4 | *$py.class
  5 | 
  6 | # C extensions
  7 | *.so
  8 | 
  9 | # Distribution / packaging
 10 | .Python
 11 | build/
 12 | develop-eggs/
 13 | dist/
 14 | downloads/
 15 | eggs/
 16 | .eggs/
 17 | lib/
 18 | lib64/
 19 | parts/
 20 | sdist/
 21 | var/
 22 | wheels/
 23 | pip-wheel-metadata/
 24 | share/python-wheels/
 25 | *.egg-info/
 26 | .installed.cfg
 27 | *.egg
 28 | MANIFEST
 29 | 
 30 | # PyInstaller
 31 | #  Usually these files are written by a python script from a template
 32 | #  before PyInstaller builds the exe, so as to inject date/other infos into it.
 33 | *.manifest
 34 | *.spec
 35 | 
 36 | # Installer logs
 37 | pip-log.txt
 38 | pip-delete-this-directory.txt
 39 | 
 40 | # Unit test / coverage reports
 41 | htmlcov/
 42 | .tox/
 43 | .nox/
 44 | .coverage
 45 | .coverage.*
 46 | .cache
 47 | nosetests.xml
 48 | coverage.xml
 49 | *.cover
 50 | *.py,cover
 51 | .hypothesis/
 52 | .pytest_cache/
 53 | 
 54 | # Translations
 55 | *.mo
 56 | *.pot
 57 | 
 58 | # Django stuff:
 59 | *.log
 60 | local_settings.py
 61 | db.sqlite3
 62 | db.sqlite3-journal
 63 | 
 64 | # Flask stuff:
 65 | instance/
 66 | .webassets-cache
 67 | 
 68 | # Scrapy stuff:
 69 | .scrapy
 70 | 
 71 | # Sphinx documentation
 72 | docs/_build/
 73 | 
 74 | # PyBuilder
 75 | target/
 76 | 
 77 | # Jupyter Notebook
 78 | .ipynb_checkpoints
 79 | 
 80 | # IPython
 81 | profile_default/
 82 | ipython_config.py
 83 | 
 84 | # pyenv
 85 | .python-version
 86 | 
 87 | # pipenv
 88 | #   According to pypa/pipenv#598, it is recommended to include Pipfile.lock in version control.
 89 | #   However, in case of collaboration, if having platform-specific dependencies or dependencies
 90 | #   having no cross-platform support, pipenv may install dependencies that don't work, or not
 91 | #   install all needed dependencies.
 92 | #Pipfile.lock
 93 | 
 94 | # PEP 582; used by e.g. github.com/David-OConnor/pyflow
 95 | __pypackages__/
 96 | 
 97 | # Celery stuff
 98 | celerybeat-schedule
 99 | celerybeat.pid
100 | 
101 | # SageMath parsed files
102 | *.sage.py
103 | 
104 | # Environments
105 | .env
106 | .venv
107 | env/
108 | venv/
109 | ENV/
110 | env.bak/
111 | venv.bak/
112 | 
113 | # Spyder project settings
114 | .spyderproject
115 | .spyproject
116 | 
117 | # Rope project settings
118 | .ropeproject
119 | 
120 | # mkdocs documentation
121 | /site
122 | 
123 | # mypy
124 | .mypy_cache/
125 | .dmypy.json
126 | dmypy.json
127 | 
128 | # Pyre type checker
129 | .pyre/
130 | myenv/
131 | 


--------------------------------------------------------------------------------
/examples/download.ipynb:
--------------------------------------------------------------------------------
  1 | {
  2 |  "cells": [
  3 |   {
  4 |    "cell_type": "code",
  5 |    "execution_count": 12,
  6 |    "metadata": {},
  7 |    "outputs": [],
  8 |    "source": [
  9 |     "from transformers import pipeline"
 10 |    ]
 11 |   },
 12 |   {
 13 |    "cell_type": "code",
 14 |    "execution_count": 14,
 15 |    "metadata": {},
 16 |    "outputs": [
 17 |     {
 18 |      "name": "stdout",
 19 |      "output_type": "stream",
 20 |      "text": [
 21 |       "huggingface/tokenizers: The current process just got forked, after parallelism has already been used. Disabling parallelism to avoid deadlocks...\n",
 22 |       "To disable this warning, you can either:\n",
 23 |       "\t- Avoid using `tokenizers` before the fork if possible\n"
 24 |      ]
 25 |     },
 26 |     {
 27 |      "name": "stderr",
 28 |      "output_type": "stream",
 29 |      "text": [
 30 |       "Downloading config.json: 100%|██████████| 570/570 [00:00<00:00, 172kB/s]"
 31 |      ]
 32 |     },
 33 |     {
 34 |      "name": "stdout",
 35 |      "output_type": "stream",
 36 |      "text": [
 37 |       "\t- Explicitly set the environment variable TOKENIZERS_PARALLELISM=(true | false)\n"
 38 |      ]
 39 |     },
 40 |     {
 41 |      "name": "stderr",
 42 |      "output_type": "stream",
 43 |      "text": [
 44 |       "\n",
 45 |       "Downloading tf_model.h5: 100%|██████████| 511M/511M [00:05<00:00, 98.2MB/s] \n",
 46 |       "All model checkpoint layers were used when initializing TFBertForMaskedLM.\n",
 47 |       "\n",
 48 |       "All the layers of TFBertForMaskedLM were initialized from the model checkpoint at bert-base-uncased.\n",
 49 |       "If your task is similar to the task the model of the checkpoint was trained on, you can already use TFBertForMaskedLM for predictions without further training.\n",
 50 |       "Downloading tokenizer_config.json: 100%|██████████| 28.0/28.0 [00:00<00:00, 19.8kB/s]\n",
 51 |       "Downloading vocab.txt: 100%|██████████| 226k/226k [00:00<00:00, 4.62MB/s]\n",
 52 |       "Downloading tokenizer.json: 100%|██████████| 455k/455k [00:00<00:00, 6.54MB/s]\n",
 53 |       "The model 'TFBertForMaskedLM' is not supported for summarization. Supported models are ['TFBartForConditionalGeneration', 'TFBlenderbotForConditionalGeneration', 'TFBlenderbotSmallForConditionalGeneration', 'TFEncoderDecoderModel', 'TFLEDForConditionalGeneration', 'TFMarianMTModel', 'TFMBartForConditionalGeneration', 'TFMT5ForConditionalGeneration', 'TFPegasusForConditionalGeneration', 'TFT5ForConditionalGeneration'].\n"
 54 |      ]
 55 |     }
 56 |    ],
 57 |    "source": [
 58 |     "summarizer = pipeline(\"summarization\", model=\"t5-small\", tokenizer=\"t5-small\", truncation=True, framework=\"tf\")"
 59 |    ]
 60 |   },
 61 |   {
 62 |    "cell_type": "code",
 63 |    "execution_count": null,
 64 |    "metadata": {},
 65 |    "outputs": [],
 66 |    "source": [
 67 |     "with open(\"mlflow.txt\", \"r\") as _f:\n",
 68 |     "    print(summarizer(_f.read()))"
 69 |    ]
 70 |   },
 71 |   {
 72 |    "cell_type": "code",
 73 |    "execution_count": null,
 74 |    "metadata": {},
 75 |    "outputs": [],
 76 |    "source": [
 77 |     "from huggingface_hub import hf_hub_download\n",
 78 |     "hf_hub_download(repo_id=\"t5-small\", filename=\"pytorch_model.bin\")"
 79 |    ]
 80 |   }
 81 |  ],
 82 |  "metadata": {
 83 |   "kernelspec": {
 84 |    "display_name": "Python 3.8.13 ('summarize')",
 85 |    "language": "python",
 86 |    "name": "python3"
 87 |   },
 88 |   "language_info": {
 89 |    "codemirror_mode": {
 90 |     "name": "ipython",
 91 |     "version": 3
 92 |    },
 93 |    "file_extension": ".py",
 94 |    "mimetype": "text/x-python",
 95 |    "name": "python",
 96 |    "nbconvert_exporter": "python",
 97 |    "pygments_lexer": "ipython3",
 98 |    "version": "3.8.13"
 99 |   },
100 |   "orig_nbformat": 4,
101 |   "vscode": {
102 |    "interpreter": {
103 |     "hash": "66521298b426c9301623669054a50dd0a367106800a6395f809220cc391585ba"
104 |    }
105 |   }
106 |  },
107 |  "nbformat": 4,
108 |  "nbformat_minor": 2
109 | }
110 | 


--------------------------------------------------------------------------------
/examples/mlflow.txt:
--------------------------------------------------------------------------------
 1 | How we build MLflow projects and rapidly iterate over them
 2 | 
 3 | Introduction
 4 | Starting a new machine learning (ML) project is always cumbersome, especially when collaborating with many people: different standards need to be followed, some files need to exist by default, artifacts need to be stored at a certain location, and so on.
 5 | 
 6 | As the number of projects keeps increasing, they end up looking very similar in structure and standards (writing tests, maintaining documentation, etc.), and while these projects are similar, sometimes they do not always work as intended.
 7 | 
 8 | As mentioned in a previous article, we use Databricks as our data platform and MLflow at the core of all our projects that involve machine learning. For those of you who are new to MLflow, it is basically a tool/platform that manages the ML life-cycle — creating experiments, registering and deploying models. While the MLflow CLI is very handy, creating new projects that follow the structure of an MLflow project needs to be done manually.
 9 | 
10 | To tackle the issues mentioned above, we used cookiecutter to create a template for MLflow projects. By doing this, the data team can focus less on structure and configuration of the project and more on the implementation of the model.
11 | 
12 | The following article shows how we designed our cookiecutter template and how we use it to run our projects on Databricks.
13 | 
14 | Project templating using cookiecutter
15 | The cookiecutter template helps streamline creating, testing, running and deploying MLflow projects. We designed the template in such a way that:
16 | 
17 | the structure of our project with that defined by MLflow is consistent
18 | an internal standard can be enforced within the team
19 | a default set of requirements for local setup and development is maintained
20 | it facilitates continuous integration/continuous delivery (CI/CD)
21 | When the cookiecutter template is used, required information such as the specifics of the project along with the location of where it will be stored (the artifact path) and the MLflow experiment name are configured. Once this information is recorded, the cookiecutter creates a new git-initialized folder with the structure defined in the template and the details entered earlier. Once the steps in the cookiecutter template have been completed, all we need to do is to add the GitHub remote and we are good to go !
22 | 
23 | Since the cookiecutter structures the project and its files, the next and exciting step is to start building the model by adding logic to it. Once the model is ready, we make sure that the tests do not fail, and that the MLflow project runs locally without any errors. As an added step, the project can be deployed and served locally to test if responses are received when requests are sent to it.
24 | 
25 | Apart from the local development and testing, workflows that assist continuous integration have been set up using GitHub Actions. The cookiecutter template includes two workflows:
26 | 
27 | the push workflow that performs tests and runs the project
28 | the release workflow that runs the project on a cluster in Databricks once all the tests have passed
29 | MLflow on Databricks
30 | With every new release, a new run of the project is logged on Databricks. Every logged run contains information such as the run name, the time it was started, who started it, the GitHub commit hash and a description if given. An example of how runs are logged is shown in Fig 1:
31 | 
32 | 
33 | Figure 1: Logging of MLflow runs on Databricks (Sensitive information hidden)
34 | From the list of all runs, we can then choose the latest successful run and register the model associated with this run, if the output of the run is what was expected.
35 | 
36 | 
37 | Figure 2: Model registration (Sensitive information hidden)
38 | MLflow facilitates model registration with the help of a button and also automatically versions every model that is registered.
39 | 
40 | 
41 | Figure 3: Model versioning (Sensitive information hidden)
42 | Registered models can subsequently be assigned a stage. A stage can either be Production, Staging, Archived or None. By default, when a model is served, it does not have a stage. We always serve newly registered models to Staging so that teams such as the Frontend/Backend can communicate with it and integrate the new logic from the MLflow model into their respective staging environments.
43 | 
44 | 
45 | Figure 4: Model stage transitioning
46 | Once a model is registered, we can serve the model to a REST API endpoint. Managing models just by using MLflow is pretty easy, but managing MLflow models on Databricks is a lot easier! For example, model serving on Databricks is so easy that there is no need to configure the computing resources where the model will be served on, or the URL of the endpoint. Databricks does all of this automatically when the model serving option is enabled! If the model lives in either the Production or Staging stage, the invocation URL contains /Production or /Staging depending on the stage respectively.
47 | 
48 | 
49 | Figure 5: Model serving (Sensitive information hidden)
50 | POST requests can now be sent to the served model using the “Model URL”. Databricks provides a friendly UI to test the served model on their platform (as shown in Fig 5).
51 | 
52 | Once all systems are go, with a click of a button, we can easily transition the stage to Production and watch our newly created model work its magic!
53 | 
54 | Entire process in a nutshell:
55 | While the process of initializing an MLflow project and putting it to production does require some manual effort (to ensure that the right models are registered, and served), all other checks and balances are taken care of by the CI/CD workflows that are part of our Cookiecutter template by default. This makes it extremely easy to continuously add new logic into our models (or create new ones) and put these changes to production.
56 | 
57 | The entire process from initialization to putting the model to production is summarized in Fig 6.
58 | 
59 | 
60 | Figure 6: Summary of process
61 | Final thoughts:
62 | Templating using cookiecutter has helped the team focus less on the structure of the project and more on the core implementation of the project. Adding to this, by using MLflow, managing and tracking the models that we build has never been easier. And finally, the seamless integration of MLflow into the Databricks platform has made the setup a lot simpler, and deployment a whole lot faster !
63 | 
64 | In short, the process we follow allows us to focus solely on adding and improving the implementation/logic of the model, which can then be integrated into our core product, the app.
65 | 
66 | 
67 | 


--------------------------------------------------------------------------------
/notebooks/try-datasets.ipynb:
--------------------------------------------------------------------------------
  1 | {
  2 |  "cells": [
  3 |   {
  4 |    "cell_type": "markdown",
  5 |    "metadata": {},
  6 |    "source": [
  7 |     "# 🤗 HuggingFace Datasets\n",
  8 |     "\n",
  9 |     "HuggingFace provides the ability to interact with datasets dynamically. There is no need to pre-download large models and include them in a repository like this one. "
 10 |    ]
 11 |   },
 12 |   {
 13 |    "cell_type": "code",
 14 |    "execution_count": 1,
 15 |    "metadata": {},
 16 |    "outputs": [],
 17 |    "source": [
 18 |     "from datasets import load_dataset, list_datasets"
 19 |    ]
 20 |   },
 21 |   {
 22 |    "cell_type": "code",
 23 |    "execution_count": 2,
 24 |    "metadata": {},
 25 |    "outputs": [
 26 |     {
 27 |      "name": "stdout",
 28 |      "output_type": "stream",
 29 |      "text": [
 30 |       "175825\n",
 31 |       "[]\n"
 32 |      ]
 33 |     }
 34 |    ],
 35 |    "source": [
 36 |     "# Explore available datasets\n",
 37 |     "available = list_datasets()\n",
 38 |     "print(len(available))\n",
 39 |     "print([i for i in available if '/' not in i])"
 40 |    ]
 41 |   },
 42 |   {
 43 |    "cell_type": "code",
 44 |    "execution_count": 3,
 45 |    "metadata": {},
 46 |    "outputs": [
 47 |     {
 48 |      "name": "stderr",
 49 |      "output_type": "stream",
 50 |      "text": [
 51 |       "Using custom data configuration default\n",
 52 |       "Reusing dataset movie_rationales (/home/vscode/.cache/huggingface/datasets/movie_rationales/default/0.1.0/70ed6b72496c90835e8ee73ebf8d0e49f5ad3aa93f302c8a4b6c886143cfb779)\n"
 53 |      ]
 54 |     },
 55 |     {
 56 |      "data": {
 57 |       "application/vnd.jupyter.widget-view+json": {
 58 |        "model_id": "9a1ad48369fb4d42b0188e632be57928",
 59 |        "version_major": 2,
 60 |        "version_minor": 0
 61 |       },
 62 |       "text/plain": [
 63 |        "  0%|          | 0/3 [00:00<?, ?it/s]"
 64 |       ]
 65 |      },
 66 |      "metadata": {},
 67 |      "output_type": "display_data"
 68 |     }
 69 |    ],
 70 |    "source": [
 71 |     "# load the dataset dynamically by passing the name. \n",
 72 |     "movie_rationales = load_dataset(\"movie_rationales\")"
 73 |    ]
 74 |   },
 75 |   {
 76 |    "cell_type": "code",
 77 |    "execution_count": 4,
 78 |    "metadata": {},
 79 |    "outputs": [
 80 |     {
 81 |      "data": {
 82 |       "text/plain": [
 83 |        "datasets.dataset_dict.DatasetDict"
 84 |       ]
 85 |      },
 86 |      "execution_count": 4,
 87 |      "metadata": {},
 88 |      "output_type": "execute_result"
 89 |     }
 90 |    ],
 91 |    "source": [
 92 |     "type(movie_rationales)"
 93 |    ]
 94 |   },
 95 |   {
 96 |    "cell_type": "code",
 97 |    "execution_count": 5,
 98 |    "metadata": {},
 99 |    "outputs": [
100 |     {
101 |      "data": {
102 |       "text/plain": [
103 |        "DatasetDict({\n",
104 |        "    train: Dataset({\n",
105 |        "        features: ['review', 'label', 'evidences'],\n",
106 |        "        num_rows: 1600\n",
107 |        "    })\n",
108 |        "    validation: Dataset({\n",
109 |        "        features: ['review', 'label', 'evidences'],\n",
110 |        "        num_rows: 200\n",
111 |        "    })\n",
112 |        "    test: Dataset({\n",
113 |        "        features: ['review', 'label', 'evidences'],\n",
114 |        "        num_rows: 199\n",
115 |        "    })\n",
116 |        "})"
117 |       ]
118 |      },
119 |      "execution_count": 5,
120 |      "metadata": {},
121 |      "output_type": "execute_result"
122 |     }
123 |    ],
124 |    "source": [
125 |     "# The object is a dict-like mapping of actual datasets\n",
126 |     "movie_rationales"
127 |    ]
128 |   },
129 |   {
130 |    "cell_type": "code",
131 |    "execution_count": 6,
132 |    "metadata": {},
133 |    "outputs": [],
134 |    "source": [
135 |     "# Select the \"train\" dataset and then port it to pandas\n",
136 |     "train = movie_rationales[\"train\"]\n",
137 |     "df = train.to_pandas()"
138 |    ]
139 |   },
140 |   {
141 |    "cell_type": "code",
142 |    "execution_count": 7,
143 |    "metadata": {},
144 |    "outputs": [
145 |     {
146 |      "data": {
147 |       "text/html": [
148 |        "<div>\n",
149 |        "<style scoped>\n",
150 |        "    .dataframe tbody tr th:only-of-type {\n",
151 |        "        vertical-align: middle;\n",
152 |        "    }\n",
153 |        "\n",
154 |        "    .dataframe tbody tr th {\n",
155 |        "        vertical-align: top;\n",
156 |        "    }\n",
157 |        "\n",
158 |        "    .dataframe thead th {\n",
159 |        "        text-align: right;\n",
160 |        "    }\n",
161 |        "</style>\n",
162 |        "<table border=\"1\" class=\"dataframe\">\n",
163 |        "  <thead>\n",
164 |        "    <tr style=\"text-align: right;\">\n",
165 |        "      <th></th>\n",
166 |        "      <th>review</th>\n",
167 |        "      <th>label</th>\n",
168 |        "      <th>evidences</th>\n",
169 |        "    </tr>\n",
170 |        "  </thead>\n",
171 |        "  <tbody>\n",
172 |        "    <tr>\n",
173 |        "      <th>0</th>\n",
174 |        "      <td>plot : two teen couples go to a church party ,...</td>\n",
175 |        "      <td>0</td>\n",
176 |        "      <td>[mind - fuck movie, the sad part is, downshift...</td>\n",
177 |        "    </tr>\n",
178 |        "    <tr>\n",
179 |        "      <th>1</th>\n",
180 |        "      <td>the happy bastard 's quick movie review damn\\n...</td>\n",
181 |        "      <td>0</td>\n",
182 |        "      <td>[it 's pretty much a sunken ship, sutherland i...</td>\n",
183 |        "    </tr>\n",
184 |        "    <tr>\n",
185 |        "      <th>2</th>\n",
186 |        "      <td>it is movies like these that make a jaded movi...</td>\n",
187 |        "      <td>0</td>\n",
188 |        "      <td>[the characters and acting is nothing spectacu...</td>\n",
189 |        "    </tr>\n",
190 |        "    <tr>\n",
191 |        "      <th>3</th>\n",
192 |        "      <td>\" quest for camelot \" is warner bros . '\\nfirs...</td>\n",
193 |        "      <td>0</td>\n",
194 |        "      <td>[dead on arrival, the characters stink, subpar...</td>\n",
195 |        "    </tr>\n",
196 |        "    <tr>\n",
197 |        "      <th>4</th>\n",
198 |        "      <td>synopsis : a mentally unstable man undergoing ...</td>\n",
199 |        "      <td>0</td>\n",
200 |        "      <td>[it is highly derivative and somewhat boring, ...</td>\n",
201 |        "    </tr>\n",
202 |        "  </tbody>\n",
203 |        "</table>\n",
204 |        "</div>"
205 |       ],
206 |       "text/plain": [
207 |        "                                              review  label  \\\n",
208 |        "0  plot : two teen couples go to a church party ,...      0   \n",
209 |        "1  the happy bastard 's quick movie review damn\\n...      0   \n",
210 |        "2  it is movies like these that make a jaded movi...      0   \n",
211 |        "3  \" quest for camelot \" is warner bros . '\\nfirs...      0   \n",
212 |        "4  synopsis : a mentally unstable man undergoing ...      0   \n",
213 |        "\n",
214 |        "                                           evidences  \n",
215 |        "0  [mind - fuck movie, the sad part is, downshift...  \n",
216 |        "1  [it 's pretty much a sunken ship, sutherland i...  \n",
217 |        "2  [the characters and acting is nothing spectacu...  \n",
218 |        "3  [dead on arrival, the characters stink, subpar...  \n",
219 |        "4  [it is highly derivative and somewhat boring, ...  "
220 |       ]
221 |      },
222 |      "execution_count": 7,
223 |      "metadata": {},
224 |      "output_type": "execute_result"
225 |     }
226 |    ],
227 |    "source": [
228 |     "df.head()"
229 |    ]
230 |   },
231 |   {
232 |    "cell_type": "code",
233 |    "execution_count": 8,
234 |    "metadata": {},
235 |    "outputs": [
236 |     {
237 |      "data": {
238 |       "text/html": [
239 |        "<div>\n",
240 |        "<style scoped>\n",
241 |        "    .dataframe tbody tr th:only-of-type {\n",
242 |        "        vertical-align: middle;\n",
243 |        "    }\n",
244 |        "\n",
245 |        "    .dataframe tbody tr th {\n",
246 |        "        vertical-align: top;\n",
247 |        "    }\n",
248 |        "\n",
249 |        "    .dataframe thead th {\n",
250 |        "        text-align: right;\n",
251 |        "    }\n",
252 |        "</style>\n",
253 |        "<table border=\"1\" class=\"dataframe\">\n",
254 |        "  <thead>\n",
255 |        "    <tr style=\"text-align: right;\">\n",
256 |        "      <th></th>\n",
257 |        "      <th>label</th>\n",
258 |        "    </tr>\n",
259 |        "  </thead>\n",
260 |        "  <tbody>\n",
261 |        "    <tr>\n",
262 |        "      <th>count</th>\n",
263 |        "      <td>1600.000000</td>\n",
264 |        "    </tr>\n",
265 |        "    <tr>\n",
266 |        "      <th>mean</th>\n",
267 |        "      <td>0.500000</td>\n",
268 |        "    </tr>\n",
269 |        "    <tr>\n",
270 |        "      <th>std</th>\n",
271 |        "      <td>0.500156</td>\n",
272 |        "    </tr>\n",
273 |        "    <tr>\n",
274 |        "      <th>min</th>\n",
275 |        "      <td>0.000000</td>\n",
276 |        "    </tr>\n",
277 |        "    <tr>\n",
278 |        "      <th>25%</th>\n",
279 |        "      <td>0.000000</td>\n",
280 |        "    </tr>\n",
281 |        "    <tr>\n",
282 |        "      <th>50%</th>\n",
283 |        "      <td>0.500000</td>\n",
284 |        "    </tr>\n",
285 |        "    <tr>\n",
286 |        "      <th>75%</th>\n",
287 |        "      <td>1.000000</td>\n",
288 |        "    </tr>\n",
289 |        "    <tr>\n",
290 |        "      <th>max</th>\n",
291 |        "      <td>1.000000</td>\n",
292 |        "    </tr>\n",
293 |        "  </tbody>\n",
294 |        "</table>\n",
295 |        "</div>"
296 |       ],
297 |       "text/plain": [
298 |        "             label\n",
299 |        "count  1600.000000\n",
300 |        "mean      0.500000\n",
301 |        "std       0.500156\n",
302 |        "min       0.000000\n",
303 |        "25%       0.000000\n",
304 |        "50%       0.500000\n",
305 |        "75%       1.000000\n",
306 |        "max       1.000000"
307 |       ]
308 |      },
309 |      "execution_count": 8,
310 |      "metadata": {},
311 |      "output_type": "execute_result"
312 |     }
313 |    ],
314 |    "source": [
315 |     "df.describe()"
316 |    ]
317 |   },
318 |   {
319 |    "cell_type": "code",
320 |    "execution_count": 9,
321 |    "metadata": {},
322 |    "outputs": [
323 |     {
324 |      "data": {
325 |       "text/plain": [
326 |        "label\n",
327 |        "0    800\n",
328 |        "1    800\n",
329 |        "Name: count, dtype: int64"
330 |       ]
331 |      },
332 |      "execution_count": 9,
333 |      "metadata": {},
334 |      "output_type": "execute_result"
335 |     }
336 |    ],
337 |    "source": [
338 |     "df['label'].value_counts()"
339 |    ]
340 |   }
341 |  ],
342 |  "metadata": {
343 |   "kernelspec": {
344 |    "display_name": "Python 3.9.13 ('huggingface')",
345 |    "language": "python",
346 |    "name": "python3"
347 |   },
348 |   "language_info": {
349 |    "codemirror_mode": {
350 |     "name": "ipython",
351 |     "version": 3
352 |    },
353 |    "file_extension": ".py",
354 |    "mimetype": "text/x-python",
355 |    "name": "python",
356 |    "nbconvert_exporter": "python",
357 |    "pygments_lexer": "ipython3",
358 |    "version": "3.8.17"
359 |   },
360 |   "orig_nbformat": 4,
361 |   "vscode": {
362 |    "interpreter": {
363 |     "hash": "920d5173f2c6743f2c8a5baff36bfaa747ac4cb3d34512c636ba17b43fdf31dc"
364 |    }
365 |   }
366 |  },
367 |  "nbformat": 4,
368 |  "nbformat_minor": 2
369 | }
370 | 


--------------------------------------------------------------------------------
/notebooks/try-transformers.ipynb:
--------------------------------------------------------------------------------
  1 | {
  2 |  "cells": [
  3 |   {
  4 |    "cell_type": "markdown",
  5 |    "metadata": {},
  6 |    "source": [
  7 |     "## Trying 🤗 HuggingFace Transformers\n",
  8 |     "\n",
  9 |     "Make sure you install the dependencies from `requirements.txt` before executing cells in this notebook."
 10 |    ]
 11 |   },
 12 |   {
 13 |    "cell_type": "code",
 14 |    "execution_count": 1,
 15 |    "metadata": {},
 16 |    "outputs": [
 17 |     {
 18 |      "name": "stderr",
 19 |      "output_type": "stream",
 20 |      "text": [
 21 |       "2024-07-10 14:48:23.676226: I tensorflow/core/platform/cpu_feature_guard.cc:193] This TensorFlow binary is optimized with oneAPI Deep Neural Network Library (oneDNN) to use the following CPU instructions in performance-critical operations:  AVX2 FMA\n",
 22 |       "To enable them in other operations, rebuild TensorFlow with the appropriate compiler flags.\n",
 23 |       "2024-07-10 14:48:25.555554: W tensorflow/stream_executor/platform/default/dso_loader.cc:64] Could not load dynamic library 'libcudart.so.11.0'; dlerror: libcudart.so.11.0: cannot open shared object file: No such file or directory\n",
 24 |       "2024-07-10 14:48:25.555602: I tensorflow/stream_executor/cuda/cudart_stub.cc:29] Ignore above cudart dlerror if you do not have a GPU set up on your machine.\n",
 25 |       "2024-07-10 14:48:25.705622: E tensorflow/stream_executor/cuda/cuda_blas.cc:2981] Unable to register cuBLAS factory: Attempting to register factory for plugin cuBLAS when one has already been registered\n",
 26 |       "2024-07-10 14:48:26.813722: W tensorflow/stream_executor/platform/default/dso_loader.cc:64] Could not load dynamic library 'libnvinfer.so.7'; dlerror: libnvinfer.so.7: cannot open shared object file: No such file or directory\n",
 27 |       "2024-07-10 14:48:26.813905: W tensorflow/stream_executor/platform/default/dso_loader.cc:64] Could not load dynamic library 'libnvinfer_plugin.so.7'; dlerror: libnvinfer_plugin.so.7: cannot open shared object file: No such file or directory\n",
 28 |       "2024-07-10 14:48:26.813920: W tensorflow/compiler/tf2tensorrt/utils/py_utils.cc:38] TF-TRT Warning: Cannot dlopen some TensorRT libraries. If you would like to use Nvidia GPU with TensorRT, please make sure the missing libraries mentioned above are installed properly.\n"
 29 |      ]
 30 |     }
 31 |    ],
 32 |    "source": [
 33 |     "from transformers import pipeline"
 34 |    ]
 35 |   },
 36 |   {
 37 |    "cell_type": "markdown",
 38 |    "metadata": {},
 39 |    "source": [
 40 |     "Define the generator pipeline. In this case, use the `text2text` for NLP processing"
 41 |    ]
 42 |   },
 43 |   {
 44 |    "cell_type": "code",
 45 |    "execution_count": 2,
 46 |    "metadata": {},
 47 |    "outputs": [],
 48 |    "source": [
 49 |     "import tensorflow as tf\n",
 50 |     "import keras\n",
 51 |     "from transformers import TFAutoModel"
 52 |    ]
 53 |   },
 54 |   {
 55 |    "cell_type": "code",
 56 |    "execution_count": 10,
 57 |    "metadata": {},
 58 |    "outputs": [
 59 |     {
 60 |      "data": {
 61 |       "application/vnd.jupyter.widget-view+json": {
 62 |        "model_id": "c52e68ef515a4fe5ae5a3f64a4e72be8",
 63 |        "version_major": 2,
 64 |        "version_minor": 0
 65 |       },
 66 |       "text/plain": [
 67 |        "Downloading tf_model.h5:   0%|          | 0.00/851M [00:00<?, ?B/s]"
 68 |       ]
 69 |      },
 70 |      "metadata": {},
 71 |      "output_type": "display_data"
 72 |     },
 73 |     {
 74 |      "name": "stderr",
 75 |      "output_type": "stream",
 76 |      "text": [
 77 |       "All model checkpoint layers were used when initializing TFT5ForConditionalGeneration.\n",
 78 |       "\n",
 79 |       "All the layers of TFT5ForConditionalGeneration were initialized from the model checkpoint at t5-base.\n",
 80 |       "If your task is similar to the task the model of the checkpoint was trained on, you can already use TFT5ForConditionalGeneration for predictions without further training.\n"
 81 |      ]
 82 |     },
 83 |     {
 84 |      "data": {
 85 |       "application/vnd.jupyter.widget-view+json": {
 86 |        "model_id": "9d945ba9699b42fe98b93b8a092059c8",
 87 |        "version_major": 2,
 88 |        "version_minor": 0
 89 |       },
 90 |       "text/plain": [
 91 |        "Downloading spiece.model:   0%|          | 0.00/773k [00:00<?, ?B/s]"
 92 |       ]
 93 |      },
 94 |      "metadata": {},
 95 |      "output_type": "display_data"
 96 |     },
 97 |     {
 98 |      "data": {
 99 |       "application/vnd.jupyter.widget-view+json": {
100 |        "model_id": "2a930a9c6a5148aeb2409447021d25c1",
101 |        "version_major": 2,
102 |        "version_minor": 0
103 |       },
104 |       "text/plain": [
105 |        "Downloading tokenizer.json:   0%|          | 0.00/1.32M [00:00<?, ?B/s]"
106 |       ]
107 |      },
108 |      "metadata": {},
109 |      "output_type": "display_data"
110 |     },
111 |     {
112 |      "name": "stderr",
113 |      "output_type": "stream",
114 |      "text": [
115 |       "/workspaces/try-huggingface/myenv/lib/python3.8/site-packages/transformers/models/t5/tokenization_t5_fast.py:156: FutureWarning: This tokenizer was incorrectly instantiated with a model max length of 512 which will be corrected in Transformers v5.\n",
116 |       "For now, this behavior is kept to avoid breaking backwards compatibility when padding/encoding with `truncation is True`.\n",
117 |       "- Be aware that you SHOULD NOT rely on t5-base automatically truncating your input to 512 when padding/encoding.\n",
118 |       "- If you want to encode/pad to sequences longer than 512 you can either instantiate this tokenizer with `model_max_length` or pass `max_length` when encoding/padding.\n",
119 |       "- To avoid this warning, please instantiate this tokenizer with `model_max_length` set to your preferred value.\n",
120 |       "  warnings.warn(\n"
121 |      ]
122 |     }
123 |    ],
124 |    "source": [
125 |     "generator = pipeline(\"text2text-generation\", model=\"t5-base\")"
126 |    ]
127 |   },
128 |   {
129 |    "cell_type": "code",
130 |    "execution_count": 11,
131 |    "metadata": {},
132 |    "outputs": [
133 |     {
134 |      "data": {
135 |       "text/plain": [
136 |        "[{'generated_text': 'machine learning is a key to a successful production environment . a foundational process'}]"
137 |       ]
138 |      },
139 |      "execution_count": 11,
140 |      "metadata": {},
141 |      "output_type": "execute_result"
142 |     }
143 |    ],
144 |    "source": [
145 |     "# Summarize\n",
146 |     "generator(\"summarize: Machine Learning in production environments is largely seen as the ultimate goal. Sometimes, deploying models can be difficult when automation is not part of the workflow. Creating a foundational process that is reliable and automated is complex and requires commitment from the team and the organization as a whole\")"
147 |    ]
148 |   },
149 |   {
150 |    "cell_type": "code",
151 |    "execution_count": 12,
152 |    "metadata": {},
153 |    "outputs": [
154 |     {
155 |      "data": {
156 |       "text/plain": [
157 |        "[{'generated_text': 'positive'}]"
158 |       ]
159 |      },
160 |      "execution_count": 12,
161 |      "metadata": {},
162 |      "output_type": "execute_result"
163 |     }
164 |    ],
165 |    "source": [
166 |     "# Sentiment\n",
167 |     "generator(\"sst2 sentence: Automation takes hard work but allows you to have a solid deployment\")"
168 |    ]
169 |   },
170 |   {
171 |    "cell_type": "code",
172 |    "execution_count": 13,
173 |    "metadata": {},
174 |    "outputs": [
175 |     {
176 |      "data": {
177 |       "text/plain": [
178 |        "[{'generated_text': 'negative'}]"
179 |       ]
180 |      },
181 |      "execution_count": 13,
182 |      "metadata": {},
183 |      "output_type": "execute_result"
184 |     }
185 |    ],
186 |    "source": [
187 |     "# Sentiment\n",
188 |     "generator(\"sst2 sentence: This course is not very well documented...\")"
189 |    ]
190 |   },
191 |   {
192 |    "cell_type": "code",
193 |    "execution_count": 14,
194 |    "metadata": {},
195 |    "outputs": [
196 |     {
197 |      "data": {
198 |       "text/plain": [
199 |        "[{'generated_text': 'not_entailment'}]"
200 |       ]
201 |      },
202 |      "execution_count": 14,
203 |      "metadata": {},
204 |      "output_type": "execute_result"
205 |     }
206 |    ],
207 |    "source": [
208 |     "# Questions\n",
209 |     "generator(\"question: Is deploying models into production hard?\")"
210 |    ]
211 |   },
212 |   {
213 |    "cell_type": "code",
214 |    "execution_count": 15,
215 |    "metadata": {},
216 |    "outputs": [
217 |     {
218 |      "data": {
219 |       "text/plain": [
220 |        "[{'generated_text': '2006'}]"
221 |       ]
222 |      },
223 |      "execution_count": 15,
224 |      "metadata": {},
225 |      "output_type": "execute_result"
226 |     }
227 |    ],
228 |    "source": [
229 |     "# Questions\n",
230 |     "generator(\"question: When was president Biden elected as US President?\")"
231 |    ]
232 |   },
233 |   {
234 |    "cell_type": "code",
235 |    "execution_count": 19,
236 |    "metadata": {},
237 |    "outputs": [
238 |     {
239 |      "data": {
240 |       "text/plain": [
241 |        "[{'generated_text': 'where is Rome?'}]"
242 |       ]
243 |      },
244 |      "execution_count": 19,
245 |      "metadata": {},
246 |      "output_type": "execute_result"
247 |     }
248 |    ],
249 |    "source": [
250 |     "# Questions\n",
251 |     "generator(\"question: Where is Rome?\")"
252 |    ]
253 |   },
254 |   {
255 |    "cell_type": "code",
256 |    "execution_count": 21,
257 |    "metadata": {},
258 |    "outputs": [
259 |     {
260 |      "data": {
261 |       "text/plain": [
262 |        "[{'generated_text': \"L'automatisation exige beaucoup de travail, mais vous permet d'avoir un dé\"}]"
263 |       ]
264 |      },
265 |      "execution_count": 21,
266 |      "metadata": {},
267 |      "output_type": "execute_result"
268 |     }
269 |    ],
270 |    "source": [
271 |     "# Translation\n",
272 |     "generator(\"translate English to French: Automation takes hard work but allows you to have a solid deployment\")"
273 |    ]
274 |   },
275 |   {
276 |    "cell_type": "markdown",
277 |    "metadata": {},
278 |    "source": [
279 |     "You can create other generation objects by calling in other models as well"
280 |    ]
281 |   },
282 |   {
283 |    "cell_type": "code",
284 |    "execution_count": 25,
285 |    "metadata": {},
286 |    "outputs": [
287 |     {
288 |      "name": "stderr",
289 |      "output_type": "stream",
290 |      "text": [
291 |       "All model checkpoint layers were used when initializing TFGPT2LMHeadModel.\n",
292 |       "\n",
293 |       "All the layers of TFGPT2LMHeadModel were initialized from the model checkpoint at gpt2.\n",
294 |       "If your task is similar to the task the model of the checkpoint was trained on, you can already use TFGPT2LMHeadModel for predictions without further training.\n"
295 |      ]
296 |     }
297 |    ],
298 |    "source": [
299 |     "gpt2_generator = pipeline(\"text-generation\", model=\"gpt2\")"
300 |    ]
301 |   },
302 |   {
303 |    "cell_type": "code",
304 |    "execution_count": 26,
305 |    "metadata": {},
306 |    "outputs": [
307 |     {
308 |      "name": "stderr",
309 |      "output_type": "stream",
310 |      "text": [
311 |       "Setting `pad_token_id` to 50256 (first `eos_token_id`) to generate sequence\n"
312 |      ]
313 |     },
314 |     {
315 |      "data": {
316 |       "text/plain": [
317 |        "[{'generated_text': \"When was president Biden elected as US President?\\n\\nGILLIAN LOUIS D'SZIELLARD: Well, before then, I think that there was a great opportunity here, and he was elected, as a first-term, to make a very difficult decision where he doesn't want the United States to be.\\n\\nI think he was going to seek to win over the country that this was a great moment, not just to him right now but to the wider world around him. That was his goal. And it was a good moment because it reaffirmed the role this country has played for the last 20, 30 years, I think. So, it wasn't just a question of getting the president to do it — the President was able to put the country together that way.\\n\\nI will be very careful of what I'll say a long time from now, and I hope, just before we enter with our next President, he will talk to everybody in terms of how he feels about the world and what's right for him, what's wrong for him or what's wrong for his country and what's going on in the world. But I certainly have not said the president should go in the direction of doing what's best for us.\\n\\nAnd I do think it's important to understand the other, more important part of this. I can't say anything negative about it because the president spoke at length about that.\\n\\nBut that said: I think this situation is a real issue, and what I am trying to say is that there aren't issues that have the level of public support that we're getting and this has the potential to have a profound effect on other places around the world. But it's time to start the discussion about our leadership.\\n\\nAnd then the president was elected. He's very well placed to act, if President Obama and Congress don't work out what that says, as is the case for the first seven years of Mr Trump. So we can begin to get clear on what he means.\\n\\nGILLIAN LOUIS D'SZIELLARD: But I mean, look, there is always a moment when this president says he's going to step out of the Republican party and leave the Democratic party and walk away from the Republican party.\\n\\nI think it's important that he does the right thing for the United States. It is not going to happen, but it has to happen as it goes along. And that doesn't diminish the fact that, as the former White House chief\"}]"
318 |       ]
319 |      },
320 |      "execution_count": 26,
321 |      "metadata": {},
322 |      "output_type": "execute_result"
323 |     }
324 |    ],
325 |    "source": [
326 |     "gpt2_generator(\"When was president Biden elected as US President?\", max_new_tokens=512)"
327 |    ]
328 |   }
329 |  ],
330 |  "metadata": {
331 |   "kernelspec": {
332 |    "display_name": "Python 3.9.13 ('huggingface')",
333 |    "language": "python",
334 |    "name": "python3"
335 |   },
336 |   "language_info": {
337 |    "codemirror_mode": {
338 |     "name": "ipython",
339 |     "version": 3
340 |    },
341 |    "file_extension": ".py",
342 |    "mimetype": "text/x-python",
343 |    "name": "python",
344 |    "nbconvert_exporter": "python",
345 |    "pygments_lexer": "ipython3",
346 |    "version": "3.8.17"
347 |   },
348 |   "orig_nbformat": 4,
349 |   "vscode": {
350 |    "interpreter": {
351 |     "hash": "920d5173f2c6743f2c8a5baff36bfaa747ac4cb3d34512c636ba17b43fdf31dc"
352 |    }
353 |   }
354 |  },
355 |  "nbformat": 4,
356 |  "nbformat_minor": 2
357 | }
358 | 


--------------------------------------------------------------------------------