├── .gitignore
├── .pr-preview.json
├── .prettierrc.json
├── CODE_OF_CONDUCT.md
├── CONTRIBUTING.md
├── LICENSE.md
├── README.md
├── captured-surface-control.js
├── images
│   └── explainer
│       ├── onboarding_mock.png
│       ├── onboarding_mock_full_context.png
│       └── zoom_controls_mock.png
├── index.html
├── questionnaire.md
├── style.css
└── w3c.json


/.gitignore:
--------------------------------------------------------------------------------
1 | .DS_Store
2 | 


--------------------------------------------------------------------------------
/.pr-preview.json:
--------------------------------------------------------------------------------
1 | {
2 |   "src_file": "index.html",
3 |   "type": "respec"
4 | }
5 | 


--------------------------------------------------------------------------------
/.prettierrc.json:
--------------------------------------------------------------------------------
1 | {
2 |   "printWidth": 100
3 | }
4 | 


--------------------------------------------------------------------------------
/CODE_OF_CONDUCT.md:
--------------------------------------------------------------------------------
1 | # Code of Conduct
2 | 
3 | All documentation, code and communication under this repository are covered by the [W3C Code of Conduct](https://www.w3.org/policies/code-of-conduct/).
4 | 


--------------------------------------------------------------------------------
/CONTRIBUTING.md:
--------------------------------------------------------------------------------
 1 | # Web Real-Time Communications Working Group
 2 | 
 3 | Contributions to this repository are intended to become part of Recommendation-track documents governed by the
 4 | [W3C Patent Policy](https://www.w3.org/Consortium/Patent-Policy/) and
 5 | [Software and Document License](https://www.w3.org/copyright/software-license/). To make substantive contributions to specifications, you must either participate
 6 | in the relevant W3C Working Group or make a non-member patent licensing commitment.
 7 | 
 8 | If you are not the sole contributor to a contribution (pull request), please identify all 
 9 | contributors in the pull request comment.
10 | 
11 | To add a contributor (other than yourself, that's automatic), mark them one per line as follows:
12 | 
13 | ```
14 | +@github_username
15 | ```
16 | 
17 | If you added a contributor by mistake, you can remove them in a comment with:
18 | 
19 | ```
20 | -@github_username
21 | ```
22 | 
23 | If you are making a pull request on behalf of someone else but you had no part in designing the 
24 | feature, you can remove yourself with the above syntax.
25 | 


--------------------------------------------------------------------------------
/LICENSE.md:
--------------------------------------------------------------------------------
1 | All documents in this Repository are licensed by contributors
2 | under the 
3 | [W3C Software and Document License](https://www.w3.org/copyright/software-license/).
4 | 
5 | 


--------------------------------------------------------------------------------
/README.md:
--------------------------------------------------------------------------------
  1 | # Captured Surface Control
  2 | 
  3 | ## In a nutshell
  4 | 
  5 | We introduce a new Web API that allows Web applications to:
  6 | 
  7 | 1. Forward wheel events to a captured tab.
  8 | 2. Read/write the zoom level of a captured tab.
  9 | 
 10 | ## Motivation
 11 | 
 12 | Nearly all video-conferencing Web applications offer their users the ability to share a browser tab, a native window, or screen. Many of these applications also show the local user a "preview tile" with a video of the captured [display surface](https://www.w3.org/TR/screen-capture/#dfn-display-surface).
 13 | 
 14 | All these applications suffer from the same drawback - if the user wishes to interact with a captured tab or window, the user must first switch to that surface, taking them away from the video-conferencing application. This presents some challenges:
 15 | 
 16 | - The user can't see the captured app and the videos of remote users at the same time unless they use [Picture-in-Picture](https://wicg.github.io/document-picture-in-picture/) or separate side-by-side windows for the video conference tab and the shared tab. On a smaller screen, this could be difficult.
 17 | - The user is burdened by the need to jump between the video conferencing app and the captured surface.
 18 | - The user loses access to the controls exposed by the video conferencing app while they are away from it; for example, an embedded chat app, emoji reactions, notifications about users asking to join the call, multimedia and layout controls, and other useful video conferencing features.
 19 | 
 20 | The Captured Surface Control APIs address these problems.
 21 | 
 22 | ## Terminology
 23 | 
 24 | A **"capturing application"** is an application which has called [getDisplayMedia()](https://www.w3.org/TR/screen-capture/#dom-mediadevices-getdisplaymedia), leading the browser to present a "media-picker" dialog to the user, from which the user will have chosen to capture a tab, a window or a screen. The surface selected by the user is the **"captured surface"**. If that surface was a tab, we call the Web application currently loaded in that tab the **"captured application"**. If the user's chosen surface is a window, we designate the associated native application as the **"captured application"**.
 25 | 
 26 | ## General API shape
 27 | 
 28 | We extend [CaptureController](https://www.w3.org/TR/screen-capture/#capturecontroller) to support a limited set of low-risk actions.
 29 | 
 30 | In the code samples provided in this document, assume the following preliminaries:
 31 | 
 32 | ```js
 33 | const controller = new CaptureController();
 34 | const stream = await navigator.mediaDevices.getDisplayMedia({ controller });
 35 | const previewTile = document.querySelector("video");
 36 | previewTile.srcObject = stream;
 37 | ```
 38 | 
 39 | ### Prelude: Permission Prompt
 40 | 
 41 | We distinguish between write-access and read-access APIs.
 42 | 
 43 | - The write-access APIs introduced here, `forwardWheel()`, `increaseZoomLevel()`, `decreaseZoomLevel()` and `resetZoomLevel()`, are gated by a new [Permissions Policy](https://www.w3.org/TR/permissions-policy-1/#permissionspolicy) called `"captured-surface-control"`.
 44 | - Our read-access APIs are innocuous and are therefore left ungated.
 45 | 
 46 | With most browsers' interpretation of [Permissions Policy](https://www.w3.org/TR/permissions-policy-1/#permissionspolicy), the first time an origin invokes any of `forwardWheel()`, `increaseZoomLevel()`, `decreaseZoomLevel()` or `resetZoomLevel()`, the browser shows a [permission prompt](https://w3c.github.io/permissions/#prompt-the-user-to-choose). How long this permission is persisted is up to the browser, with typical durations being "forever" or "for the current browsing session".
 47 | 
 48 | Before displaying a permission prompt to the user, the app must solicit a user gesture. If the app wants to show zoom-in/out buttons ahead of time, then the user gesture is a given. But if the app wants to first inform the user about these new features, and provide clearer context about the ensuing permission prompt, then the app could include an onboarding experience that features a "start" button of some sort, after which it will invoke a write-access API in a way that will produce the prompt but will not cause a change of state, as perceived by the user. An example is:
 49 | 
 50 | <p align="center">
 51 |   <img src="images/explainer/onboarding_mock_full_context.png">
 52 | </p>
 53 | 
 54 | Code to support this could look as follows:
 55 | 
 56 | ```js
 57 | document.getElementById("startButton").onclick = async () => {
 58 |   try {
 59 |     const hasPermission = await navigator.permissions.query({
 60 |       name: "captured-surface-control",
 61 |     });
 62 |     if (hasPermission.state !== "granted") {
 63 |       await controller.forwardWheel(previewTile);
 64 |     }
 65 |   } catch (e) {
 66 |     console.log(`Error: ${e}`);
 67 |   }
 68 | };
 69 | ```
 70 | 
 71 | ### Scroll forwarding
 72 | 
 73 | #### forwardWheel()
 74 | 
 75 | To facilitate scrolling of captured surfaces, we extend `CaptureController` as follows:
 76 | 
 77 | ```webidl
 78 |   partial interface CaptureController {
 79 |     Promise<undefined> forwardWheel(HTMLElement? element);
 80 |   };
 81 | ```
 82 | 
 83 | Using `forwardWheel()`, a capturing application can forward subsequent [wheel events](https://developer.mozilla.org/en-US/docs/Web/API/Element/wheel_event) from a local element, such as the preview tile, to the captured surface's viewport. The browser determines the coordinates of the event relative to the origin of the target element, then produces a corresponding event on the captured surface at corresponding coordinates, after scaling. This forwarded event is indistinguishable to the captured application from direct user interaction.
 84 | 
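As a rough illustration of the scaling step described above, the following sketch maps a wheel event's offset within the local element onto the captured surface's viewport. The helper name `scaleToSurface` and the use of simple proportional scaling are assumptions made for illustration; the real mapping is performed by the user agent, not by application code.

```javascript
// Hypothetical helper, not part of the API: proportionally maps a point
// inside the local element to a point inside the captured surface's viewport.
function scaleToSurface(offsetX, offsetY, elementSize, surfaceSize) {
  const scaleX = surfaceSize.width / elementSize.width;
  const scaleY = surfaceSize.height / elementSize.height;
  return {
    x: Math.floor(offsetX * scaleX),
    y: Math.floor(offsetY * scaleY),
  };
}

// A wheel event at (160, 90) on a 320x180 preview tile maps to the
// middle of a 1280x720 captured viewport.
const point = scaleToSurface(160, 90, { width: 320, height: 180 }, { width: 1280, height: 720 });
console.log(point); // { x: 640, y: 360 }
```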
 85 | `forwardWheel()` is subject to a permissions policy, which might involve a permission prompt. The method returns a `Promise` that is resolved if the connection is successfully made, and rejected otherwise (for example, if the user rejects the permission prompt).
 86 | 
 87 | Sample usage:
 88 | 
 89 | ```js
 90 | try {
 91 |   await controller.forwardWheel(previewTile);
 92 | } catch (e) {
 93 |   console.log(`Error: ${e}`);
 94 | }
 95 | ```
 96 | 
 97 | It is possible to use `forwardWheel()` with any type of element. This allows applications to forward gestures from elements other than the `HTMLVideoElement` itself. Thanks to this useful property of the API, applications can draw text, annotations and emoji-reactions over the video preview tile, and the experience will still work as the user expects.
 98 | 
 99 | To stop the forwarding of wheel events, applications can invoke `forwardWheel(null)`.
100 | 
101 | Forwarding will also stop if the capture-session ends for whatever reason.
102 | 
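One way an application might wire starting and stopping to a single control is sketched below. The helper `createForwardingToggle` and the stub controller are hypothetical and exist only for illustration; in a real page, `controller` would be the `CaptureController` from the preliminaries and `element` would be the preview tile.

```javascript
// Hypothetical helper, not part of the API: returns a function that
// alternately starts and stops wheel forwarding from `element`.
function createForwardingToggle(controller, element) {
  let forwarding = false;
  return async function toggle() {
    // Passing null stops forwarding; passing an element starts it.
    await controller.forwardWheel(forwarding ? null : element);
    // Only flip the flag once the promise has resolved.
    forwarding = !forwarding;
    return forwarding;
  };
}

// Stand-in for a real CaptureController, used only to exercise the helper.
const stubController = {
  calls: [],
  async forwardWheel(element) {
    this.calls.push(element);
  },
};

const toggleForwarding = createForwardingToggle(stubController, "previewTile");
toggleForwarding()
  .then(() => toggleForwarding())
  .then(() => console.log(stubController.calls)); // [ 'previewTile', null ]
```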
103 | ### Zoom controls
104 | 
105 | To facilitate read-access and write-access to a captured surface's zoom, we extend `CaptureController` as follows:
106 | 
107 | ```webidl
108 | partial interface CaptureController {
109 |   sequence<long> getSupportedZoomLevels();
110 |   readonly attribute long? zoomLevel;
111 |   Promise<undefined> increaseZoomLevel();
112 |   Promise<undefined> decreaseZoomLevel();
113 |   Promise<undefined> resetZoomLevel();
114 |   attribute EventHandler onzoomlevelchange;
115 | };
116 | ```
117 | 
118 | #### getSupportedZoomLevels()
119 | 
120 | Returns a list of valid zoom-levels for the captured display surface. This zoom level is represented as a percentage of the "default zoom-level", which is defined as 100%. The list is guaranteed to be monotonically increasing, and is guaranteed to contain the value `100`. It is also guaranteed to contain the minimum and maximum values.
121 | 
122 | Note that:
123 | - The list `[100]` is technically valid, as the minimum/maximum values are not required to be distinct from the default value or from each other.
124 | - The user agents may trim the list to a reasonable length. If the need arises, this function may in the future be extended to receive an argument with the maximum number of entries the application is interested in receiving.
125 | - `getSupportedZoomLevels()` may only be called if `zoomLevel` is non-null; otherwise, the user agent does not have a concept of zoom level for that type of display surface, and if `getSupportedZoomLevels()` were called, an exception would be raised.
126 | - `getSupportedZoomLevels()` may only be called while `controller` is associated with an active capture; otherwise, the method raises an exception.
127 | 
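The last two notes suggest guarding calls to `getSupportedZoomLevels()`. A minimal sketch of such a guard follows; the helper name `readSupportedZoomLevels` and the two stub objects are hypothetical, and returning an empty array on failure is just one possible design choice.

```javascript
// Hypothetical helper, not part of the API: returns the supported zoom
// levels, or an empty array when the controller cannot report them.
function readSupportedZoomLevels(controller) {
  // A null zoomLevel means the surface type has no concept of zoom level.
  if (controller.zoomLevel === null) {
    return [];
  }
  try {
    return controller.getSupportedZoomLevels();
  } catch (e) {
    // Raised, for example, when there is no active capture.
    return [];
  }
}

// Stand-ins for real CaptureController objects, used only for illustration.
const windowCapture = {
  zoomLevel: null,
  getSupportedZoomLevels() {
    throw new Error("unsupported surface type");
  },
};
const tabCapture = {
  zoomLevel: 100,
  getSupportedZoomLevels() {
    return [50, 100, 200];
  },
};

console.log(readSupportedZoomLevels(windowCapture)); // []
console.log(readSupportedZoomLevels(tabCapture)); // [ 50, 100, 200 ]
```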
128 | #### zoomLevel
129 | 
130 | This read-only attribute contains the current zoom-level of the captured surface. (Or `null` if a zoom level is not defined for the display surface, which is currently the case for windows and screens.)
131 | 
132 | Sample usage:
133 | 
134 | ```js
135 | currentZoomLabel.textContent = `${controller.zoomLevel}%`;
136 | ```
137 | 
138 | This attribute is not gated by a permissions policy.
139 | 
140 | After capture stops, reading `zoomLevel` would yield the last value it held while the capture was active. Notably, `zoomLevel` will NOT be updated after the capture session ends even if the captured surface's zoom level changes again.
141 | 
142 | #### increaseZoomLevel(), decreaseZoomLevel() and resetZoomLevel()
143 | 
144 | These methods are used to increase, decrease or reset the zoom level of the captured surface. (Resetting sets the value to the default - 100.)
145 | 
146 | These methods are subject to a permissions policy, which might involve a permission prompt. These methods return a promise. If the permission is in the `'granted'` state, or if it is in the `'prompt'` state and the user grants it once prompted, the promise is resolved; otherwise, it is rejected.
147 | 
148 | One way to use these methods is to present UX elements to the user:
149 | 
150 | <p align="center">
151 |   <img src="images/explainer/zoom_controls_mock.png">
152 | </p>
153 | 
154 | Code backing up these controls could look like:
155 | 
156 | ```js
157 | zoomIncreaseButton.addEventListener("click", async (event) => {
158 |   try {
159 |     await controller.increaseZoomLevel();
160 |   } catch (e) {
161 |     console.log(`Error: ${e}`);
162 |   }
163 | });
164 | ```
165 | 
166 | #### onzoomlevelchange
167 | 
168 | Users can change the captured application's zoom-level by interacting with the user agent, the captured application, or possibly by additional means. If the capturing application is displaying any user-facing controls and UX elements, such as an indicator of the current zoom-level, or buttons to increase/decrease zoom, then the capturing application will want to listen to such externally-triggered zoom-changes, and reflect them in the capturing application's own UX. The `onzoomlevelchange` event handler helps with that.
169 | 
170 | Sample usage:
171 | 
172 | ```js
173 | controller.addEventListener("zoomlevelchange", (event) => {
174 |   const zoomLevel = controller.zoomLevel;
175 | 
176 |   // Update label.
177 |   zoomLevelLabel.textContent = `${zoomLevel}%`;
178 | 
179 |   // Update controls.
180 |   const supportedZoomLevels = controller.getSupportedZoomLevels();
181 |   const currentZoomLevelIndex = supportedZoomLevels.indexOf(zoomLevel);
182 |   zoomIncreaseButton.disabled = currentZoomLevelIndex >= supportedZoomLevels.length - 1;
183 |   zoomDecreaseButton.disabled = currentZoomLevelIndex <= 0;
184 | });
185 | ```
186 | 
187 | ## Security and Privacy Considerations
188 | 
189 | ### Permission prompts
190 | 
191 | Permission prompts are currently used as mitigations for Web Platform capabilities which are arguably even riskier than those presented in this document - clipboard access, geolocation, mic- and camera-access, and most notably, screen-capture itself. It follows that, if the prompt can be clear enough for the user, it should be a sufficient mitigation for the risks associated with the API surfaces we introduce.
192 | 
193 | ### Risks and mitigations
194 | 
195 | #### User confusion
196 | 
197 | To obtain initial permission to use the API, and to keep on using it, an application does not need to show the user a video representation of the surface under control via a `<video>` element. Even if it does show such a `<video>` element, it is not guaranteed that the `<video>` element is connected to a [MediaStreamTrack](https://developer.mozilla.org/en-US/docs/Web/API/MediaStreamTrack) derived from the surface being captured, or that the MediaStreamTrack being used is not manipulated in some ways, e.g. using Breakout Box.
198 | 
199 | A browser-mediated connection between the captured surface and the permission prompt is infeasible. Instead, we rely on careful phrasing of the permission prompt as the mitigation for these risks.
200 | 
201 | #### Access to pixels beyond those originally intended by the user
202 | 
203 | When a user chooses to share a tab or a window, it might be that they only intended to share with the app the content immediately visible. The APIs we introduce will allow the capturing application to gain access to yet more content. This is both the core risk of the API as well as its intended use.
204 | 
205 | We mitigate using a permission prompt. The downsides are as usual:
206 | 
207 | - Dialog fatigue and the risk of the user just approving so as to be done with it.
208 | - Impaired usability of Web apps that now require additional toil of their users.
209 | - Difficulty in communicating to the user precisely what permission they are asked to grant.
210 | - Difficulty in communicating to the user why such a permission could be risky.
211 | 
212 | The downsides are all known, and should be considered against those associated with alternative approaches (e.g. [Video Portal](#rejected-alternative-video-portal)).
213 | 
214 | #### Access at a time not controlled by the user
215 | 
216 | All write-access APIs introduced by this spec are limited to the time when the user agent is actively dispatching the event caused by the user's interaction.
217 | 
218 | #### Side effects
219 | 
220 | Gestures like wheel and pinch might have effects other than scrolling the page. If we consider some modern dating applications, we note that "swiping right" could have far-reaching consequences, and might even culminate in matrimony. However, the author of this document argues that the permission prompt is sufficient here, as it was for other Web Platform capabilities.
221 | 
222 | ## Potential future extensions
223 | 
224 | ### Extension to additional gestures
225 | 
226 | At the moment, the API allows forwarding of wheel events. In the future, other gestures might be considered, such as pinch.
227 | 
228 | Note that forwarding of such events as `"click"` is NOT foreseen.
229 | 
230 | ## Alternatives considered
231 | 
232 | ### Rejected alternative: Communication with the captured application
233 | 
234 | We have considered the alternative of providing a mechanism for the capturing application to communicate with the captured application, **asking** it to scroll or change its zoom level. This alternative was deemed wholly insufficient - solutions that require opt-in by the captured application, would fail to work for the majority of capturer/capturee combinations due to absent opt-in, thereby failing to solve the problem.
235 | 
236 | Note that there are other good reasons to support communication between the capturing and captured application, and that both **structured communication** (e.g. "next slide" and "previous slide") and **unstructured communication** (e.g. `postMessage(anything)`) have their merits. However, that is a different solution, useful for a different set of problems.
237 | 
238 | ### Rejected alternative: Zoom-control through browser-level UX
239 | 
240 | We have considered leaving it up to the user agent to present zoom controls to the user. However, this alternative approach would not solve the problem sufficiently.
241 | 
242 | - Capturing applications need zoom-controls to be discoverable, which often means placing them inside of the viewport, in a position that is congruent with the rest of the application-level controls. For some capturing applications, this means overlaying zoom-controls over the video preview tile; for other capturing applications, this means placing them alongside pre-existing app-level controls.
243 | - Applications need to customize the look and feel of controls to fit together with the rest of the application.
244 | 
245 | It bears mentioning that, while we provide an API for app-level control of zoom, this does _not_ stop user agents from providing such controls as well.
246 | 
247 | ### Rejected alternative: Forwarding of gestures based on browser heuristics
248 | 
249 | Heuristics are imperfect - they often fail to trigger when desired, or trigger when not desired. Browser vendors are free to try their hand at developing such heuristics, but we believe it is essential that the Web platform include a mechanism for Web applications to explicitly trigger the functionality introduced by Captured Surface Control. (Note that this is not mutually exclusive with heuristics.)
250 | 
251 | ### Rejected alternative: Video Portal
252 | 
253 | An alternative approach, currently labeled "Video Portal", has been discussed, for example in the [following slide deck](https://docs.google.com/presentation/d/1RIRPAg-M3pQYTFqL0rDGBIl8bQvLAzq122lWUF5JIy8/edit#slide=id.g1df86d70a44_0_25). This approach strikes yet _another_ balance, giving control over the captured surface to the local user rather than to the application.
254 | 
255 | On the one hand, this means that almost any action could be allowed; on the other, it means that all the power rests in the local user's hands, and cannot be delegated from the user to the application.
256 | 
257 | We see this approach as a possible _complementary_ approach that may be pursued separately from Captured Surface Control.
258 | 
259 | ## Security and Privacy Considerations
260 | 
261 | Please see the specification's own section [here](https://screen-share.github.io/captured-surface-control/#privacy-and-security-considerations).
262 | 
263 | ## Common questions
264 | 
265 | ### What about Picture-in-Picture?
266 | 
267 | [Document Picture-in-Picture](https://wicg.github.io/document-picture-in-picture/) presents an alternative partial solution to some of the problems addressed by Captured Surface Control. Given the different trade-offs chosen, some applications/users might prefer PiP, while others would prefer Captured Surface Control.
268 | 
269 | Benefits of PiP:
270 | 
271 | - No additional new APIs required.
272 | - Captured surface fully interactive (as opposed to the limited set of actions afforded by Captured Surface Control).
273 | - Captured surface available in its original resolution.
274 | 
275 | Benefits of Captured Surface Control:
276 | 
277 | - All elements of the capturing application are presented in their original size and are fully interactive. (Consider the challenges of fitting a chat box into the PiP.)
278 | - Application has control of the layout of remote participants' videos and the captured surface's preview tile.
279 | - Local user can delegate some actions to remote users (through the mediation of the application).
280 | 
281 | It is expected that users intending to engage in significant interaction with the presented content would prefer PiP or other alternatives, while users who only need brief interactions to adjust scrolling and zooming would prefer Captured Surface Control.
282 | 
283 | ## Uncommon questions
284 | 
285 | ### What about blur?
286 | 
287 | Should a captured tab be unblurred before receiving `wheel` events? That's an open question for now.
288 | 
289 | ### How does the zoom-level correspond to CSS zoom?
290 | 
291 | It is not currently clear that it does, but we're investigating the issue in light of [recent activity](https://github.com/w3c/csswg-drafts/issues/5623#issuecomment-1646125737) on that front.
292 | 


--------------------------------------------------------------------------------
/captured-surface-control.js:
--------------------------------------------------------------------------------
 1 | var respecConfig = {
 2 |   group: "wg/webrtc",
 3 |   specStatus: "ED",
 4 |   github: {
 5 |     repoURL: "https://github.com/w3c/mediacapture-surface-control",
 6 |     branch: "main",
 7 |   },
 8 |   editors: [
 9 |     {
10 |       name: "Elad Alon",
11 |       email: "eladalon@google.com",
12 |       company: "Google",
13 |       w3cid: 118124
14 |     },
15 |   ],
16 |   xref: [
17 |     "html",
18 |     "infra",
19 |     "permissions",
20 |     "permissions-policy",
21 |     "dom",
22 |     "mediacapture-streams",
23 |     "webidl",
24 |     "screen-capture",
25 |     "uievents",
26 |     "cssom-view",
27 |     "geometry-1",
28 |   ],
29 |   latestVersion: null,
30 |   shortName: "mediacapture-surface-control",
31 |   subjectPrefix: "[mediacapture-surface-control]",
32 | };
33 | 


--------------------------------------------------------------------------------
/images/explainer/onboarding_mock.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/w3c/mediacapture-surface-control/c266db01ccd3491753cee86d772db43c1febeaf2/images/explainer/onboarding_mock.png


--------------------------------------------------------------------------------
/images/explainer/onboarding_mock_full_context.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/w3c/mediacapture-surface-control/c266db01ccd3491753cee86d772db43c1febeaf2/images/explainer/onboarding_mock_full_context.png


--------------------------------------------------------------------------------
/images/explainer/zoom_controls_mock.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/w3c/mediacapture-surface-control/c266db01ccd3491753cee86d772db43c1febeaf2/images/explainer/zoom_controls_mock.png


--------------------------------------------------------------------------------
/index.html:
--------------------------------------------------------------------------------
  1 | <!DOCTYPE html>
  2 | <html lang="en-us">
  3 |   <head>
  4 |     <meta content="text/html; charset=utf-8" http-equiv="Content-Type" />
  5 |     <title>Captured Surface Control</title>
  6 |     <script class="remove" src="captured-surface-control.js" type="text/javascript"></script>
  7 |     <script src="https://www.w3.org/Tools/respec/respec-w3c" class="remove"></script>
  8 |     <link href="style.css" rel="stylesheet" type="text/css" />
  9 |   </head>
 10 |   <body>
 11 |     <section id="abstract">
 12 |       <p>
 13 |         Consider a Web application |capturer| which has used {{MediaDevices/getDisplayMedia()}} to
 14 |         start capturing another [=display surface=], |capturee|. This specification introduces a set
 15 |         of APIs that allow |capturer| the following new capabilities:
 16 |       </p>
 17 |       <ul>
 18 |         <li>Read and write |capturee|'s [=zoom level=].</li>
 19 |         <li>
 20 |           Deliver a wheel event over |capturee|'s viewport at coordinates of |capturer|'s choosing.
 21 |         </li>
 22 |       </ul>
 23 |     </section>
 24 | 
 25 |     <section id="sotd"></section>
 26 | 
 27 |     <section id="background">
 28 |       <h1>Background</h1>
 29 |       <p>
 30 |         Nearly all video-conferencing Web applications offer their users the ability to share
 31 |         [=display surfaces=] - typically a browser tab ([=display surface/browser=]), a native app's
 32 |         window ([=display surface/window=]), or an entire screen ([=display surface/monitor=]).
 33 |       </p>
 34 |       <p>
 35 |         Many of these applications also show the local user a "preview tile" with a video of the
 36 |         captured [=display surface=].
 37 |       </p>
 38 |       <p>
 39 |         All these applications suffer from one key drawback - if the user wishes to interact with a
 40 |         captured [=display surface=], the user must first switch to that surface, taking them away
 41 |         from the video-conferencing application. This presents a few issues:
 42 |       </p>
 43 |       <ol>
 44 |         <li>
 45 |           Users can't simultaneously interact with the captured application and see the videos of
 46 |           remote users.
 47 |         </li>
 48 |         <li>
 49 |           Users are burdened by the need to repeatedly switch between the video-conferencing
 50 |           application and the captured surface.
 51 |         </li>
 52 |         <li>
 53 |           Users are limited in their ability to see and interact with controls exposed by the
 54 |           video-conferencing application while they are interacting with the captured surface. A
 55 |           non-comprehensive list of examples of such controls includes - embedded chat applications,
 56 |           emoji reactions, "knock-ins" by users asking to join the call, and multimedia controls.
 57 |         </li>
 58 |       </ol>
 59 |       <p>
 60 |         It bears mentioning that
 61 |         <a href="https://wicg.github.io/document-picture-in-picture/"
 62 |           >Document Picture-in-Picture</a
 63 |         >
 64 |         goes a long way towards addressing some of these issues. However, it is not always a suitable
 65 |         solution, as not all use cases are adequately addressed by a floating window which will
 66 |         often be small, which obscures arbitrary other content on the screen, and whose size and
 67 |         positioning must be manually controlled by the user.
 68 |       </p>
 69 |     </section>
 70 | 
 71 |     <section id="feature-policy-integration">
 72 |       <h1>Permissions Policy Integration</h1>
 73 |       <p>
 74 |         This specification defines a [=policy-controlled feature=] identified by the string
 75 |         <dfn class="permission export">`"captured-surface-control"`</dfn>. Its [=policy-controlled
 76 |         feature/default allowlist=] is `"self"`.
 77 |       </p>
 78 |       <div class="note">
 79 |         <p>
 80 |           The API surfaces introduced by this specification can be categorized as either read-access
 81 |           or write-access. Note that only the write-access APIs ({{CaptureController/forwardWheel}},
 82 |           {{CaptureController/increaseZoomLevel}}, {{CaptureController/decreaseZoomLevel}} and
 83 |           {{CaptureController/resetZoomLevel}}) are gated by the <a>"captured-surface-control"</a>
 84 |           permissions policy.
 85 |         </p>
 86 |       </div>
 87 |     </section>
 88 | 
 89 |     <section id="zoom">
 90 |       <h1>Zoom</h1>
 91 |       <section id="zoom-definition">
 92 |         <h2>Definition of Zoom</h2>
 93 |         <p>
 94 |           We define a concept of an integer "<dfn>zoom level</dfn>" that can be applied to [=display
 95 |           surfaces=] of any type, and which is independent of the user agent and the platform. It is
 96 |           expected that in the case of [=display surface/browser=] [=display surfaces=], this
 97 |           concept will match the concept of zoom level that user agents typically expose to the
 98 |           user.
 99 |         </p>
100 |         <ul>
101 |           <li>
102 |             The <dfn>default zoom level</dfn> of any [=display surface=] is defined to be 100. All
103 |             implementations must support this value for [=display surfaces=] of any type.
104 |           </li>
105 |           <li>
106 |             Decreasing [=zoom level=] values represent "zooming out". The minimum theoretical value
107 |             is 1; however, user agents may cap their support for "zooming out" at a larger value,
108 |             with 100 being the largest permissible minimum value, representing lack of support for
109 |             "zooming out".
110 |           </li>
111 |           <li>
112 |             Increasing values represent "zooming in". This specification does not mandate a
113 |             theoretical maximum. The smallest possible maximum is 100, which represents lack of
114 |             support for "zooming in".
115 |           </li>
116 |         </ul>
117 |         <p>
118 |           For a given [=display surface=] of type |surfaceType|, we define the user agent's set of
119 |           <dfn>supported zoom levels</dfn> for |surfaceType| as a non-empty set of integers
120 |           including at least the [=default zoom level=] (100), and not including any integers less
121 |           than 1.
122 |         </p>
123 |       </section>
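
The constraints on [=supported zoom levels=] can be illustrated with a small, non-normative check. This is only a sketch: the helper name `isValidSupportedZoomLevels` and the constant `DEFAULT_ZOOM_LEVEL` are assumptions made for this example and are not defined by this specification.

```javascript
// Hypothetical helper (not part of this specification): checks whether a list
// of values could serve as a user agent's set of supported zoom levels.
const DEFAULT_ZOOM_LEVEL = 100;

function isValidSupportedZoomLevels(levels) {
  // The set must be non-empty and must include the default zoom level.
  if (levels.length === 0 || !levels.includes(DEFAULT_ZOOM_LEVEL)) {
    return false;
  }
  // A set holds each value at most once.
  if (new Set(levels).size !== levels.length) {
    return false;
  }
  // Every value must be an integer that is not less than 1.
  return levels.every((level) => Number.isInteger(level) && level >= 1);
}
```

A user agent that supports neither "zooming in" nor "zooming out" would expose only the default value, so `[100]` passes this check.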
124 |       <section>
125 |         <h2>Permitted Event Types for zoom-setting</h2>
126 |         <p>
127 |           We define the <dfn>permitted event types for zoom-setting</dfn> as a set composed of the
128 |           following <a data-cite="DOM#dom-event-type">event types</a>:
129 |         </p>
130 |         <ul>
131 |           <li><a data-cite="uievents#event-type-click">"click"</a></li>
132 |           <li><a data-cite="uievents#event-type-input">"input"</a></li>
133 |         </ul>
134 |       </section>
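
The way the [=set zoom level algorithm=] consults this set can be sketched as follows. The sketch is non-normative; `isPermittedZoomSettingEvent` and the constant name are hypothetical, and `event` stands in for the event currently being dispatched.

```javascript
// Hypothetical helper (not part of this specification) mirroring the event
// checks that gate zoom-setting.
const PERMITTED_EVENT_TYPES_FOR_ZOOM_SETTING = new Set(["click", "input"]);

function isPermittedZoomSettingEvent(event) {
  // No event is currently being dispatched.
  if (event === undefined || event === null) {
    return false;
  }
  // Events synthesized by script are not trusted.
  if (event.isTrusted !== true) {
    return false;
  }
  // Only the permitted event types may be used for zoom-setting.
  return PERMITTED_EVENT_TYPES_FOR_ZOOM_SETTING.has(event.type);
}
```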
135 |       <section id="zoom-control-apis">
136 |         <h2>Zoom-control APIs</h2>
137 |         <pre class="idl">
138 |           partial interface CaptureController {
139 |             sequence&lt;long&gt; getSupportedZoomLevels();
140 |             readonly attribute long? zoomLevel;
141 |             Promise&lt;undefined&gt; increaseZoomLevel();
142 |             Promise&lt;undefined&gt; decreaseZoomLevel();
143 |             Promise&lt;undefined&gt; resetZoomLevel();
144 |             attribute EventHandler onzoomlevelchange;
145 |           };
146 |         </pre>
147 |         <dl data-link-for="CaptureController" data-dfn-for="CaptureController" class="methods">
148 |           <dt><dfn>getSupportedZoomLevels()</dfn></dt>
149 |           <dd>
150 |             <p>
151 |               This method allows applications to discover the set of [=zoom levels=] supported by
152 |               the user agent.
153 |             </p>
154 |             <p>When invoked, the user agent MUST run the following steps:</p>
155 |             <ol>
156 |               <li>
157 |                 If [=this=] is not [=actively capturing=], [=exception/throw=] an
158 |                 "{{InvalidStateError}}" {{DOMException}}.
159 |               </li>
160 |               <li>Let |surfaceType| be [=this=].{{CaptureController/[[DisplaySurfaceType]]}}.</li>
161 |               <li>
162 |                 If |surfaceType| is not a [=supported display surface type=], [=exception/throw=] a
163 |                 "{{NotSupportedError}}" {{DOMException}}.
164 |               </li>
165 |               <li>
166 |                 Return a monotonically increasing sequence containing all of the values in the
167 |                 [=supported zoom levels=] for |surfaceType|.
168 |               </li>
169 |             </ol>
170 |           </dd>
171 |           <dt><dfn>zoomLevel</dfn></dt>
172 |           <dd>
173 |             <p>
174 |               This attribute allows applications to discover the captured [=display surface=]'s
175 |               [=zoom level=].
176 |             </p>
177 |             <p>
178 |               On getting, the user agent MUST return [=this=].{{CaptureController/[[ZoomLevel]]}}.
179 |             </p>
180 |           </dd>
181 |           <dt><dfn>increaseZoomLevel()</dfn></dt>
182 |           <dd>
183 |             <p>
184 |               This method allows applications to set the captured [=display surface=]'s [=zoom
185 |               level=] one step higher than its current value.
186 |             </p>
187 |             <p>
188 |               When this method is invoked, the user agent MUST run the [=set zoom level algorithm=]
189 |               with [=this=] as the |controller| and `"increase"` as the |zoomAction|.
190 |             </p>
191 |           </dd>
192 |           <dt><dfn>decreaseZoomLevel()</dfn></dt>
193 |           <dd>
194 |             <p>
195 |               This method allows applications to set the captured [=display surface=]'s [=zoom
196 |               level=] one step lower than its current value.
197 |             </p>
198 |             <p>
199 |               When this method is invoked, the user agent MUST run the [=set zoom level algorithm=]
200 |               with [=this=] as the |controller| and `"decrease"` as the |zoomAction|.
201 |             </p>
202 |           </dd>
203 |           <dt><dfn>resetZoomLevel()</dfn></dt>
204 |           <dd>
205 |             <p>
206 |               This method allows applications to set the captured [=display surface=]'s [=zoom
207 |               level=] to 100.
208 |             </p>
209 |             <p>
210 |               When this method is invoked, the user agent MUST run the [=set zoom level algorithm=]
211 |               with [=this=] as the |controller| and `"reset"` as the |zoomAction|.
212 |             </p>
213 |           </dd>
214 |           <dt><dfn>onzoomlevelchange</dfn></dt>
215 |           <dd>
216 |             <p>
217 |               An [=event handler IDL attribute=] whose [=event handler event type=] is
218 |               `zoomlevelchange`.
219 |             </p>
220 |             <p>
221 |               Whenever [=this=].<a data-cite="!screen-capture/#dfn-source">[[\Source]]</a>'s [=zoom
222 |               level=] changes to |newZoomLevel|, the user agent MUST [=queue a global task=] on the
223 |               [=user interaction task source=] given the current realm's global object, which will
224 |               run the following steps:
225 |             </p>
226 |             <ol>
227 |               <li>If [=this=] is not [=actively capturing=], abort these steps.</li>
228 |               <li>Set [=this=].{{CaptureController/[[ZoomLevel]]}} to |newZoomLevel|.</li>
229 |               <li>[=Fire an event=] named `zoomlevelchange` at [=this=].</li>
230 |             </ol>
231 |             <div class="note">
232 |               <p>Examples of causes include:</p>
233 |               <ul>
234 |                 <li>
235 |                   The user interacted with the user agent to change the zoom level of a captured
236 |                   tab.
237 |                 </li>
238 |                 <li>The capturing application called {{CaptureController/increaseZoomLevel()}}.</li>
239 |                 <li>
240 |                   The user changed the shared [=display surface=], choosing one which has a
241 |                   different [=zoom level=].
242 |                 </li>
243 |               </ul>
244 |             </div>
245 |           </dd>
246 |         </dl>
247 |       </section>
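
A capturing application might combine these members roughly as sketched below. This is a non-normative illustration: `describeZoomControls`, `zoomInButton` and `render` are hypothetical names, and `controller` is assumed to be a {{CaptureController}} that was passed to getDisplayMedia().

```javascript
// Illustrative sketch: derives which zoom buttons an application could enable.
function describeZoomControls(controller) {
  const current = controller.zoomLevel;
  // zoomLevel is null until a supported display surface is being captured.
  if (current === null) {
    return { canZoomIn: false, canZoomOut: false, canReset: false };
  }
  // The returned sequence is in increasing order.
  const levels = controller.getSupportedZoomLevels();
  return {
    canZoomIn: current < levels[levels.length - 1],
    canZoomOut: current > levels[0],
    canReset: current !== 100,
  };
}

// Possible wiring in a page (hypothetical elements and render function).
// Zoom-setting is called synchronously from a trusted "click" handler:
//
//   zoomInButton.addEventListener("click", async () => {
//     try {
//       await controller.increaseZoomLevel();
//     } catch (error) {
//       console.warn(error.name);
//     }
//   });
//   controller.onzoomlevelchange = () => render(describeZoomControls(controller));
```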
248 |     </section>
249 | 
250 |     <section id="scrolling">
251 |       <h1>Scroll</h1>
252 |       <section>
253 |         <h2>Scrolling APIs</h2>
254 |         <pre class="idl">
255 |           partial interface CaptureController {
256 |             constructor();
257 |             Promise&lt;undefined&gt; forwardWheel(HTMLElement? element);
258 |           };
259 |         </pre>
260 |         <dl data-link-for="CaptureController" data-dfn-for="CaptureController" class="methods">
261 |           <dt><dfn>constructor</dfn></dt>
262 |           <dd>
263 |             <p>
264 |               {{CaptureController}}'s
265 |               <a data-cite="!screen-capture/#dom-capturecontroller-constructor">constructor</a> is
266 |               extended to also define and initialize the following internal slots:
267 |             </p>
268 |             <table class="simple">
269 |               <thead>
270 |                 <tr>
271 |                   <th>Internal Slot</th>
272 |                   <th>Initial value</th>
273 |                 </tr>
274 |               </thead>
275 |               <tbody>
276 |                 <tr>
277 |                   <td><dfn>[[\ZoomLevel]]</dfn></td>
278 |                   <td>`null`</td>
279 |                 </tr>
280 |                 <tr>
281 |                   <td><dfn>[[\ForwardWheelElement]]</dfn></td>
282 |                   <td>`null`</td>
283 |                 </tr>
284 |                 <tr>
285 |                   <td><dfn>[[\ForwardWheelEventListener]]</dfn></td>
286 |                   <td>`null`</td>
287 |                 </tr>
288 |               </tbody>
289 |             </table>
290 |           </dd>
291 |           <dt><dfn>forwardWheel()</dfn></dt>
292 |           <dd>
293 |             <p>
294 |               This method allows applications to automatically forward
295 |               <a data-cite="uievents/#idl-wheelevent">wheel events</a>
296 |               from an {{HTMLElement}} to the viewport of a captured [=display surface=].
297 |             </p>
298 |             <p>When invoked, the user agent MUST run the following steps:</p>
299 |             <ol>
300 |               <li>
301 |                 If [=this=] is not [=actively capturing=], return a promise [=reject|rejected=] with
302 |                 a {{DOMException}} object whose {{DOMException/name}} attribute has the value
303 |                 {{InvalidStateError}}.
304 |               </li>
305 |               <li>
306 |                 If [=this=] [=is self-capturing=], return a promise [=reject|rejected=] with a
307 |                 {{DOMException}} object whose {{DOMException/name}} attribute has the value
308 |                 {{InvalidStateError}}.
309 |               </li>
310 |               <li>Let |surfaceType| be [=this=].{{CaptureController/[[DisplaySurfaceType]]}}.</li>
311 |               <li>
312 |                 If |surfaceType| is not a [=supported display surface type=], return a promise
313 |                 [=reject|rejected=] with a {{DOMException}} object whose {{DOMException/name}}
314 |                 attribute has the value {{NotSupportedError}}.
315 |               </li>
316 |               <li>Let |element| be the method's first argument.</li>
317 |               <li>Let |P| be a new {{Promise}}.</li>
318 |               <li>
319 |                 Run the following steps [=in parallel=]:
320 |                 <ol>
321 |                   <li>
322 |                     [=Get the current permission state=] of <a>"captured-surface-control"</a>. If
323 |                     the result is NOT {{PermissionState/"granted"}}, and the [=relevant global
324 |                     object=] does NOT have [=transient activation=], then:
325 |                     <ol>
326 |                       <li>
327 |                         [=Queue a global task=] on the [=user interaction task source=] given the
328 |                         current realm's [=global object=] as |global| to [=reject=] |P| with a
329 |                         {{DOMException}} object whose {{DOMException/name}} attribute has the value
330 |                         {{InvalidStateError}}.
331 |                       </li>
332 |                       <li>Abort these steps.</li>
333 |                     </ol>
334 |                     <div class="note">
335 |                       <p>
336 |                         This step ensures that, on the one hand, permission prompts are not shown
337 |                         without [=transient activation=], while on the other hand, if the permission
338 |                         is already {{PermissionState/"granted"}},
339 |                         {{CaptureController/forwardWheel()}} may be called immediately after
340 |                         {{MediaDevices/getDisplayMedia()}} resolves, even if the [=transient
341 |                         activation=] that permitted the call to {{CaptureController/forwardWheel()}}
342 |                         has since expired.
343 |                       </p>
344 |                     </div>
345 |                   </li>
346 |                   <li>
347 |                     [=Request permission to use=] a {{PermissionDescriptor}} with its
348 |                     {{PermissionDescriptor/name}} member set to
349 |                     <a>"captured-surface-control"</a>. If the result of the request is
350 |                     {{PermissionState/"denied"}}, then:
351 |                     <ol>
352 |                       <li>
353 |                         [=Queue a global task=] on the [=user interaction task source=] given the
354 |                         current realm's [=global object=] as |global| to [=reject=] |P| with a new
355 |                         {{DOMException}} object whose {{DOMException/name}} is {{NotAllowedError}}.
356 |                       </li>
357 |                       <li>Abort these steps.</li>
358 |                     </ol>
359 |                   </li>
360 |                   <li>
361 |                     If [=this=].{{CaptureController/[[ForwardWheelElement]]}} is not `null`,
362 |                     [=remove an event listener=] with
363 |                     [=this=].{{CaptureController/[[ForwardWheelElement]]}} as |eventTarget| and
364 |                     [=this=].{{CaptureController/[[ForwardWheelEventListener]]}} as |listener|.
365 |                   </li>
366 |                   <li>
367 |                     Set [=this=].{{CaptureController/[[ForwardWheelEventListener]]}} to `null`.
368 |                   </li>
369 |                   <li>Set [=this=].{{CaptureController/[[ForwardWheelElement]]}} to |element|.</li>
370 |                   <li>
371 |                     If [=this=].{{CaptureController/[[ForwardWheelElement]]}} is not `null`:
372 |                     <ol>
373 |                       <li>
374 |                         Set [=this=].{{CaptureController/[[ForwardWheelEventListener]]}} to an
375 |                         [=event listener=] defined as follows:
376 |                         <dl>
377 |                           <dt>type</dt>
378 |                           <dd>`wheel`</dd>
379 |                           <dt>[=event listener/callback=]</dt>
380 |                           <dd>
381 |                             The result of creating a new Web IDL {{EventListener}} instance
382 |                             representing a reference to a function of one argument of type {{Event}}
383 |                             |event|. This function executes the [=forward wheel event algorithm=]
384 |                             given [=this=] and |event|.
385 |                           </dd>
386 |                         </dl>
387 |                       </li>
388 |                       <li>
389 |                         [=Add an event listener=] with
390 |                         [=this=].{{CaptureController/[[ForwardWheelElement]]}} as |eventTarget| and
391 |                         [=this=].{{CaptureController/[[ForwardWheelEventListener]]}} as |listener|.
392 |                       </li>
393 |                     </ol>
394 |                   </li>
395 |                   <li>
396 |                     [=Queue a global task=] on the [=user interaction task source=] given the
397 |                     current realm's [=global object=] as |global| to [=resolve=] |P|.
398 |                   </li>
399 |                 </ol>
400 |               </li>
401 |               <li>Return |P|.</li>
402 |             </ol>
403 |           </dd>
404 |         </dl>
405 |       </section>
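
The listener bookkeeping performed by {{CaptureController/forwardWheel()}} can be modelled in a simplified, non-normative way. The class and member names below are hypothetical. The permission and activation checks are deliberately omitted, and `onWheel` stands in for the [=forward wheel event algorithm=].

```javascript
// Simplified model of how forwardWheel() replaces the designated element.
class WheelForwardingState {
  constructor(onWheel) {
    this.onWheel = onWheel; // stands in for the forward wheel event algorithm
    this.element = null; // models [[ForwardWheelElement]]
    this.listener = null; // models [[ForwardWheelEventListener]]
  }

  setElement(element) {
    // Detach from the previously designated element, if any.
    if (this.element !== null) {
      this.element.removeEventListener("wheel", this.listener);
    }
    this.listener = null;
    this.element = element;
    // Passing null stops forwarding; otherwise attach a fresh listener.
    if (this.element !== null) {
      this.listener = (event) => this.onWheel(event);
      this.element.addEventListener("wheel", this.listener);
    }
  }
}
```

At most one element forwards wheel events at any time, and designating `null` stops forwarding altogether.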
406 |       <section>
407 |         <h2>Extensions to the getDisplayMedia algorithm</h2>
408 |         <p>
409 |           Extend the
410 |           <a data-cite="screen-capture#dom-mediadevices-getdisplaymedia"
411 |             >getDisplayMedia algorithm</a
412 |           >
413 |           as follows:
414 |         </p>
415 |         <p>
416 |           Recall that |p| is the promise which the algorithm returns. Immediately before the step
417 |           which resolves it, add the following steps:
418 |         </p>
419 |         <ol>
420 |           <li>
421 |             If |controller| is not `null` and |controller|.<a
422 |               data-cite="screen-capture/#dfn-displaysurfacetype"
423 |               >[[\DisplaySurfaceType]]</a
424 |             >
425 |             is a [=supported display surface type=], then set
426 |             |controller|.{{CaptureController/[[ZoomLevel]]}} to |controller|.<a
427 |               data-cite="!screen-capture/#dfn-source"
428 |               >[[\Source]]</a
429 |             >'s [=zoom level=].
430 |           </li>
431 |         </ol>
432 |       </section>
433 |     </section>
434 | 
435 |     <section>
436 |       <h1>Subroutines</h1>
437 |       <section>
438 |         <h2>Subroutine: Actively capturing</h2>
439 |         <p>
440 |           To determine if a {{CaptureController}} |controller| is
441 |           <dfn>actively capturing</dfn>, run the following steps:
442 |         </p>
443 |         <ol>
444 |           <li>Let |source| be |controller|.{{CaptureController/[[Source]]}}.</li>
445 |           <li>If |source| is `null`, return `false`.</li>
446 |           <li>
447 |             If |source| has been <a data-cite="GETUSERMEDIA#source-stopped">stopped</a>, return
448 |             `false`.
449 |           </li>
450 |           <li>Return `true`.</li>
451 |         </ol>
452 |       </section>
453 |       <section>
454 |         <h2>Subroutine: Is self-capturing</h2>
455 |         <p>
456 |           To determine if a {{CaptureController}} |controller|
457 |           <dfn>is self-capturing</dfn>, run the following steps:
458 |         </p>
459 |         <ol>
460 |           <li>If |controller| is not [=actively capturing=], return `false`.</li>
461 |           <li>
462 |             If |controller|.{{CaptureController/[[Source]]}} is a [=display surface=] of type
463 |             [=display surface/browser=], and represents the [=relevant global object=]'s
464 |             [=associated `Document`=], return `true`.
465 |           </li>
466 |           <li>Return `false`.</li>
467 |         </ol>
468 |       </section>
469 |       <section>
470 |         <h2>Subroutine: Supported display surface type</h2>
471 |         <p>
472 |           To determine if the type |surfaceType| of a [=display surface=] is a
473 |           <dfn>supported display surface type</dfn>, run the following steps:
474 |         </p>
475 |         <ol>
476 |           <li>If |surfaceType| is [=display surface/browser=], return `true`.</li>
477 |           <li>Return `false`.</li>
478 |         </ol>
479 |         <div class="note">
480 |           <p>Whether [=display surface/window=] should be supported is under discussion.</p>
481 |         </div>
482 |       </section>
483 |       <section>
484 |         <h2>Subroutine: Setting the zoom level</h2>
485 |         <p>
486 |           The <dfn>set zoom level algorithm</dfn>, given a |controller:CaptureController| of type
487 |           {{CaptureController}} and a |zoomAction:DOMString| of type {{DOMString}} as arguments,
488 |           consists of running the following steps:
489 |         </p>
490 |         <ol>
491 |           <li>
492 |             If |controller| is not [=actively capturing=], return a promise [=reject|rejected=] with
493 |             a {{DOMException}} object whose {{DOMException/name}} attribute has the value
494 |             {{InvalidStateError}}.
495 |           </li>
496 |           <li>
497 |             If |controller| [=is self-capturing=], return a promise [=reject|rejected=] with a
498 |             {{DOMException}} object whose {{DOMException/name}} attribute has the value
499 |             {{InvalidStateError}}.
500 |           </li>
501 |           <li>Let |surfaceType| be |controller|.{{CaptureController/[[DisplaySurfaceType]]}}.</li>
502 |           <li>
503 |             If |surfaceType| is not a [=supported display surface type=], return a promise
504 |             [=reject|rejected=] with a {{DOMException}} object whose {{DOMException/name}} attribute
505 |             has the value {{NotSupportedError}}.
506 |           </li>
507 |           <li>
508 |             <p>
509 |               Ensure that the code is running within an event handler that was invoked because
510 |               the user agent fired a trusted event in response to the user interacting with the
511 |               user agent. To do so, run the following steps:
512 |             </p>
513 |             <ol>
514 |               <li>Let |currentEvent:Event| be {{Window}}.{{Window/event}}.</li>
515 |               <li>
516 |                 If |currentEvent| is {{undefined}}, return a promise [=reject|rejected=] with a
517 |                 {{DOMException}} object whose {{DOMException/name}} attribute has the value
518 |                 {{InvalidStateError}}.
519 |               </li>
520 |               <li>
521 |                 If |currentEvent|.{{Event/isTrusted}} is `false`, return a promise
522 |                 [=reject|rejected=] with a {{DOMException}} object whose {{DOMException/name}}
523 |                 attribute has the value {{InvalidStateError}}.
524 |               </li>
525 |               <li>
526 |                 If |currentEvent|.{{Event/type}} is not in [=permitted event types for
527 |                 zoom-setting=], return a promise [=reject|rejected=] with a {{DOMException}} object
528 |                 whose {{DOMException/name}} attribute has the value {{InvalidStateError}}.
529 |                 <div class="note">
530 |                   <p>
531 |                     It follows from these steps that {{CaptureController/increaseZoomLevel()}},
532 |                     {{CaptureController/decreaseZoomLevel()}} and
533 |                     {{CaptureController/resetZoomLevel()}} are only callable with [=transient
534 |                     activation=], because [=permitted event types for zoom-setting=] only contains
535 |                     <a data-cite="DOM#dom-event-type">event types</a> that confer this activation.
536 |                   </p>
537 |                   <p>
538 |                     In fact, our API shape implies a stronger guarantee - whereas [=transient
539 |                     activation=] persists for several seconds after the user action, the API shape
540 |                     here limits zoom-setting to immediately after the user's action.
541 |                   </p>
542 |                 </div>
543 |               </li>
544 |             </ol>
545 |           </li>
546 |           <li>
547 |             Let |currentZoomLevel| be |controller|.{{CaptureController/[[Source]]}}'s [=zoom level=].
548 |           </li>
549 |           <li>Let |targetZoomLevel| be a {{long}}. Set its value as follows:</li>
550 |           <ol>
551 |             <li>
552 |               <p>If |zoomAction| is `"decrease"` then:</p>
553 |               <ol>
554 |                 <li>
555 |                   If |currentZoomLevel| is the minimum value in [=supported zoom levels=], return a
556 |                   promise [=reject|rejected=] with a {{DOMException}} object whose
557 |                   {{DOMException/name}} attribute has the value {{InvalidStateError}}.
558 |                 </li>
559 |                 <li>
560 |                   Otherwise, set |targetZoomLevel| to the value in [=supported zoom levels=] that
561 |                   appears immediately before |currentZoomLevel|.
562 |                 </li>
563 |               </ol>
564 |             </li>
565 |             <li>
566 |               <p>Else, if |zoomAction| is `"increase"` then:</p>
567 |               <ol>
568 |                 <li>
569 |                   If |currentZoomLevel| is the maximum value in [=supported zoom levels=], return a
570 |                   promise [=reject|rejected=] with a {{DOMException}} object whose
571 |                   {{DOMException/name}} attribute has the value {{InvalidStateError}}.
572 |                 </li>
573 |                 <li>
574 |                   Otherwise, set |targetZoomLevel| to the value in [=supported zoom levels=] that
575 |                   appears immediately after |currentZoomLevel|.
576 |                 </li>
577 |               </ol>
578 |             </li>
579 |             <li>
580 |               <p>Else:</p>
581 |               <ol>
582 |                 <li>Assert that |zoomAction| is `"reset"`.</li>
583 |                 <li>Set |targetZoomLevel| to `100`.</li>
584 |               </ol>
585 |             </li>
586 |           </ol>
587 |           <li>Let |P| be a new {{Promise}}.</li>
588 |           <li>
589 |             <p>Run the following steps [=in parallel=]:</p>
590 |             <ol>
591 |               <li>
592 |                 [=Request permission to use=] a {{PermissionDescriptor}} with its
593 |                 {{PermissionDescriptor/name}} member set to
594 |                 <a>"captured-surface-control"</a>. If the result of the request is
595 |                 {{PermissionState/"denied"}}, then:
596 |                 <ol>
597 |                   <li>
598 |                     [=Queue a global task=] on the [=user interaction task source=] given the
599 |                     current realm's [=global object=] as |global| to [=reject=] |P| with a new
600 |                     {{DOMException}} object whose {{DOMException/name}} is {{NotAllowedError}}.
601 |                   </li>
602 |                   <li>Abort these steps.</li>
603 |                 </ol>
604 |               </li>
605 |               <li>
606 |                 Set |controller|.{{CaptureController/[[Source]]}}'s [=zoom level=] to |targetZoomLevel|.
607 |               </li>
608 |               <li>
609 |                 [=Queue a global task=] on the [=user interaction task source=] given the current
610 |                 realm's [=global object=] as |global| to [=resolve=] |P|.
611 |               </li>
612 |             </ol>
613 |           </li>
614 |           <li>Return |P|.</li>
615 |         </ol>
616 |       </section>
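
The choice of |targetZoomLevel| can be sketched as a pure function. This is a non-normative illustration, consistent with the method descriptions: "decrease" moves one step lower and "increase" one step higher. The function name `computeTargetZoomLevel` is hypothetical, and returning `null` stands in for rejecting with an {{InvalidStateError}}.

```javascript
// Hypothetical helper (not part of this specification): picks the next zoom
// level from the supported zoom levels for a given action.
function computeTargetZoomLevel(supported, current, zoomAction) {
  if (zoomAction === "decrease") {
    // One step lower: the largest supported value below the current one.
    const lower = supported.filter((level) => level < current);
    return lower.length > 0 ? Math.max(...lower) : null;
  }
  if (zoomAction === "increase") {
    // One step higher: the smallest supported value above the current one.
    const higher = supported.filter((level) => level > current);
    return higher.length > 0 ? Math.min(...higher) : null;
  }
  // The only remaining action is "reset", which targets the default level.
  return 100;
}
```

For example, with supported zoom levels `[25, 50, 100, 200]` and a current level of 100, "increase" yields 200 and "decrease" yields 50.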
617 |       <section>
618 |         <h2>Subroutine: Forward wheel event</h2>
619 |         <p>
620 |           The <dfn>forward wheel event algorithm</dfn> takes a {{CaptureController}} |controller|
621 |           and a {{WheelEvent}} |event|, and runs the following steps:
622 |         </p>
623 |         <ol>
624 |           <li>If |controller| is not [=actively capturing=], abort these steps.</li>
625 |           <li>If |controller| [=is self-capturing=], abort these steps.</li>
626 |           <li>Let |surfaceType| be |controller|.{{CaptureController/[[DisplaySurfaceType]]}}.</li>
627 |           <li>If |surfaceType| is not a [=supported display surface type=], abort these steps.</li>
628 |           <li>
629 |             Run the following steps [=in parallel=]:
630 |             <ol>
631 |               <li>
632 |                 [=Get the current permission state=] of <a>"captured-surface-control"</a>. If the
633 |                 result is NOT {{PermissionState/"granted"}}, abort these steps.
634 |               </li>
635 |               <li>If |event|.{{Event/isTrusted}} is `false`, abort these steps.</li>
636 |               <li>
637 |                 Let [|scaledX|, |scaledY|] be the result of the [=scale element coordinates
638 |                 algorithm=] on [|event|.{{MouseEvent/offsetX}}, |event|.{{MouseEvent/offsetY}}] and
639 |                 |controller|.
640 |               </li>
641 |               <li>
642 |                 [=Queue a global task=] on the [=user interaction task source=] of
643 |                 |controller|.[[\Source]]'s current realm, given that
644 |                 <a data-cite="HTML#concept-realm-global">realm's global object</a>, to [=fire an
645 |                 event=] named `"wheel"` using {{WheelEvent}} with the {{MouseEvent//x}} attribute
646 |                 initialized to |scaledX|, the {{MouseEvent//y}} attribute initialized to |scaledY|,
647 |                 the {{WheelEvent/deltaX}} attribute initialized to |event|.{{WheelEvent/deltaX}} and
648 |                 the {{WheelEvent/deltaY}} attribute initialized to |event|.{{WheelEvent/deltaY}}, at
649 |                 the <a data-cite="uievents#topmost-event-target">topmost event target</a>.
650 |               </li>
651 |             </ol>
652 |           </li>
653 |         </ol>
654 |       </section>
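
The early exits of the [=forward wheel event algorithm=] can be summarised by a single predicate. This is a non-normative sketch: the function name and the field names of `state` are invented for this example, and `state` is assumed to carry the results of the checks that the algorithm performs.

```javascript
// Hypothetical predicate (not part of this specification): true only when a
// wheel event would be forwarded to the captured display surface.
function shouldForwardWheelEvent(state, event) {
  if (!state.activelyCapturing) return false; // capture has not started or has ended
  if (state.selfCapturing) return false; // the application captures its own tab
  if (state.surfaceType !== "browser") return false; // unsupported surface type
  if (state.permissionState !== "granted") return false; // permission is missing
  // Only wheel events dispatched by the user agent are forwarded.
  return event.isTrusted === true;
}
```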
655 |       <section>
656 |         <h2>Subroutine: Scale element coordinates</h2>
657 |         <p>
658 |           The <dfn>scale element coordinates algorithm</dfn> takes {{double}} coordinates [|x|, |y|]
659 |           and a {{CaptureController}} |controller|, and runs the following steps:
660 |         </p>
661 |         <ol>
662 |           <li>
663 |             Let |scaleFactorX| be
664 |             <code
665 |               >(|x| /
666 |               |controller|.{{CaptureController/[[ForwardWheelElement]]}}.{{Element/getBoundingClientRect()}}.{{DOMRect/width}})</code
667 |             >.
668 |           </li>
669 |           <li>
670 |             Let |scaleFactorY| be
671 |             <code
672 |               >(|y| /
673 |               |controller|.{{CaptureController/[[ForwardWheelElement]]}}.{{Element/getBoundingClientRect()}}.{{DOMRect/height}})</code
674 |             >.
675 |           </li>
676 |           <li>
677 |             Let |surfaceWidth| be |controller|.{{CaptureController/[[Source]]}}'s viewport's width.
678 |           </li>
679 |           <li>
680 |             Let |surfaceHeight| be |controller|.{{CaptureController/[[Source]]}}'s viewport's
681 |             height.
682 |           </li>
683 |           <li>Let |scaledX| be `(|scaleFactorX| * |surfaceWidth|)`.</li>
684 |           <li>Let |scaledY| be `(|scaleFactorY| * |surfaceHeight|)`.</li>
685 |           <li>Return [|scaledX|, |scaledY|].</li>
686 |         </ol>
687 |         <div class="note">
688 |           <p>This subroutine assumes that |controller| is [=actively capturing=].</p>
689 |         </div>
690 |       </section>
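
The scaling can be sketched as follows. The sketch is non-normative: `elementRect` stands in for the result of getBoundingClientRect() on the designated element, `surface` for the size of the captured viewport, and the function signature is an assumption made for this example.

```javascript
// Hypothetical helper (not part of this specification): maps a point inside
// the designated element to the corresponding point in the captured viewport.
function scaleElementCoordinates(x, y, elementRect, surface) {
  // Position within the element, expressed as a fraction of its size.
  const scaleFactorX = x / elementRect.width;
  const scaleFactorY = y / elementRect.height;
  // The same fraction, applied to the captured viewport's dimensions.
  return [scaleFactorX * surface.width, scaleFactorY * surface.height];
}
```

For example, the point (100, 50) in a 400x200 element maps to (400, 200) in a 1600x800 viewport, because both coordinates lie a quarter of the way along their axis.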
691 |     </section>
692 |     <section>
693 |       <h2>Privacy and Security Considerations</h2>
694 |       <p>
695 |         The API surfaces introduced in this specification allow a capturing application limited
696 |         control over a captured application. These APIs allow the capturing application to gain
697 |         access to additional pixels in the captured application. This specification employs multiple
698 |         means to ensure that new capabilities are used in accordance with the user's intentions.
699 |         Among these means:
700 |       </p>
701 |       <ul>
702 |         <li>
703 |           All new capabilities introduced here are implicitly gated by the prior mitigations which
704 |           were employed to render screen-sharing safe.
705 |         </li>
706 |         <li>A new {{PermissionsPolicy}} called <a>"captured-surface-control"</a> is used.</li>
707 |         <li>
708 |           {{CaptureController/forwardWheel()}} is designed such that only the user's scrolling over
709 |           an {{Element}} can trigger scrolling in the captured application. This API shape ensures
710 |           that the capturing application can only [=forward wheel event algorithm|forward wheel
711 |           events=] to the captured application at the time when the user agent dispatches the
712 |           trusted wheel event on the capturing application itself.
713 |         </li>
714 |         <li>
715 |           Setting the zoom level is gated by a requirement that is even more stringent than
716 |           [=transient activation=]. Whereas [=transient activation=] could be used several seconds
717 |           after the interaction, this specification limits zoom-setting to the time when the user
718 |           agent is dispatching the event associated with that interaction.
719 |         </li>
720 |       </ul>
721 |       <section>
722 |         <h3>Zoom-setting: Limitation to specific interactions</h3>
723 |         <p>
724 |           {{CaptureController/increaseZoomLevel()}}, {{CaptureController/decreaseZoomLevel()}} and
725 |           {{CaptureController/resetZoomLevel()}} are only callable from event handlers of specific
726 |           <a data-cite="DOM#dom-event-type">event types</a> - the [=permitted event types for
727 |           zoom-setting=]. These are events dispatched directly by the user agent, triggered by user
728 |           interaction. This specification intentionally excludes from this set such events as
729 |           <a data-cite="uievents#mousemove">"mousemove"</a>, which users are liable to trigger
730 |           inadvertently.
731 |         </p>
732 |       </section>
733 |       <section>
734 |         <h3>Scrolling: Limitation to specific interactions</h3>
735 |         <p>
736 |           The shape of {{CaptureController/forwardWheel()}} is intentionally chosen to limit the
737 |           capturing application's control. The application designates a specific element; when the
738 |           user scrolls over that element, the corresponding wheel events are forwarded to the
739 |           captured application.
740 |         </p>
741 |       </section>
742 |       <section>
743 |         <h3>Limiting element types</h3>
744 |         <p>
745 |           This specification does not limit the type of {{Element}} for which
746 |           {{CaptureController/increaseZoomLevel()}}, {{CaptureController/decreaseZoomLevel()}},
747 |           {{CaptureController/resetZoomLevel()}} or {{CaptureController/forwardWheel()}} work. Such
748 |           a limitation would accomplish nothing, because malicious applications could always overlay
749 |           transparent permitted {{Element}} types on top of visible non-permitted {{Element}}s,
750 |           thereby bypassing this restriction.
751 |         </p>
752 |         <p>
753 |           The limitation of interaction types is sufficient. This is accomplished by
754 |           {{CaptureController/forwardWheel()}} through its shape, and by
755 |           {{CaptureController/increaseZoomLevel()}}, {{CaptureController/decreaseZoomLevel()}} and
756 |           {{CaptureController/resetZoomLevel()}} through their gating on
757 |           <a data-cite="DOM#dom-event-type">event types</a>.
758 |         </p>
759 |       </section>
760 |     </section>
761 |     <section id="conformance"></section>
762 |   </body>
763 | </html>
764 | 


--------------------------------------------------------------------------------
/questionnaire.md:
--------------------------------------------------------------------------------
 1 | # Security and Privacy Questionnaire for Captured Surface Control
 2 | 
 3 | ## Questions to Consider
 4 | 
 5 | ### 2.1. What information might this feature expose to Web sites or other parties, and for what purposes is that exposure necessary?
 6 | 
 7 | Given a Web application `WA` that has used pre-existing means (`getDisplayMedia()`) to capture a tab `T`, and that therefore has access to all of `T`'s pixels, this feature exposes to `WA`:
 8 | 1. `T`'s zoom-level. (This is exposed without another permission policy or prompt.)
 9 | 2. Potentially, additional pixels in `T`, if the user grants `WA` permission to forward gestures to `T` and/or to change `T`'s zoom-level.
10 | 
11 | ### 2.2. Do features in your specification expose the minimum amount of information necessary to enable their intended uses?
12 | 
13 | Yes.
14 | 
15 | ### 2.3. How do the features in your specification deal with personal information, personally-identifiable information (PII), or information derived from them?
16 | 
17 | Not applicable.
18 | 
19 | ### 2.4. How do the features in your specification deal with sensitive information?
20 | 
21 | Not applicable.
22 | 
23 | ### 2.5. Do the features in your specification introduce new state for an origin that persists across browsing sessions?
24 | 
25 | Yes - the state of the permission policy introduced by this spec, `"captured-surface-control"`.  
26 | Otherwise - no.
27 | 
28 | ### 2.6. Do the features in your specification expose information about the underlying platform to origins?
29 | 
30 | The answer is probably "no", but it's worth mentioning that `getSupportedZoomLevels()` exposes the supported zoom-levels on the platform. User agents are encouraged to ensure the result is not dependent on the OS or the user's configuration of the user agent; that is, for a given browser version, the result should be the same for all users on any machine.
31 | 
32 | ### 2.7. Does this specification allow an origin to send data to the underlying platform?
33 | 
34 | The answer is "no", but it's worth mentioning that `increaseZoomLevel()`, `decreaseZoomLevel()` and `resetZoomLevel()` allow applications to request that the user agent change zoom levels on specific tabs.
35 | 
36 | ### 2.8. Do features in this specification enable access to device sensors?
37 | 
38 | No.
39 | 
40 | ### 2.9. Do features in this specification enable new script execution/loading mechanisms?
41 | 
42 | No.
43 | 
44 | ### 2.10. Do features in this specification allow an origin to access other devices?
45 | 
46 | No.
47 | 
48 | ### 2.11. Do features in this specification allow an origin some measure of control over a user agent’s native UI?
49 | 
50 | Yes, in a limited way - the zoom-level of captured tabs can be changed. Naturally, this is reflected in the user agent's native UI that informs the user of a tab's zoom-level. This control is limited: the zoom-level can only change in response to direct interaction by the user with the capturing Web application.
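
As a minimal, non-normative sketch of this interaction gating, the capturing application could invoke the zoom-setting methods directly from `click` handlers. The helper name `wireZoomControls`, the `controls` object and the `onError` callback are hypothetical and do not come from the specification. The sketch also assumes that `"click"` is among the permitted event types for zoom-setting and that the zoom-setting methods return promises.

```javascript
// Hypothetical helper (not part of the specification): wires three controls
// to the zoom-setting methods of a CaptureController-like object.
// `controller` is assumed to be associated with an active tab-capture session;
// `controls` holds three EventTargets, e.g. <button> elements.
function wireZoomControls(controller, controls, onError = console.error) {
  const bindings = [
    [controls.zoomIn, () => controller.increaseZoomLevel()],
    [controls.zoomOut, () => controller.decreaseZoomLevel()],
    [controls.reset, () => controller.resetZoomLevel()],
  ];
  for (const [target, action] of bindings) {
    // The zoom-setting method is invoked synchronously inside the "click"
    // handler, that is, while the event for the user's interaction is
    // being dispatched. Rejections are passed to `onError`.
    target.addEventListener("click", () => {
      action().catch(onError);
    });
  }
}
```

Calling a zoom-setting method later, for example from a timer or after awaiting unrelated work, would fall outside the event dispatch and would therefore be expected to fail under the gating described above.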
51 | 
52 | ### 2.12. What temporary identifiers do the features in this specification create or expose to the web?
53 | 
54 | None.
55 | 
56 | ### 2.13. How does this specification distinguish between behavior in first-party and third-party contexts?
57 | 
58 | This feature does not distinguish first-party and third-party contexts.
59 | 
60 | ### 2.14. How do the features in this specification work in the context of a browser’s Private Browsing or Incognito mode?
61 | 
62 | Not applicable.
63 | 
64 | 
65 | ### 2.15. Does this specification have both "Security Considerations" and "Privacy Considerations" sections?
66 | 
67 | Yes.
68 | 
69 | ### 2.16. Do features in your specification enable origins to downgrade default security protections?
70 | 
71 | No.
72 | 
73 | ### 2.17. How does your feature handle non-"fully active" documents?
74 | 
75 | This feature only works for documents which use pre-existing mechanisms to initiate tab-capture. A non-"fully active" document will have this capture-session interrupted, thereby also terminating the use of this feature.
76 | 
77 | ### 2.18. What should this questionnaire have asked?
78 | 
79 | N/A
80 | 


--------------------------------------------------------------------------------
/style.css:
--------------------------------------------------------------------------------
1 | table { border-collapse: collapse; border-style: hidden hidden none hidden; }
2 | table thead, table tbody { border-bottom: solid; }
3 | table tbody th { text-align: left; }
4 | table tbody th:first-child { border-left: solid; }
5 | table td, table th { border-left: solid; border-right: solid; border-bottom: solid thin; vertical-align: top; padding: 0.2em; }
6 | 


--------------------------------------------------------------------------------
/w3c.json:
--------------------------------------------------------------------------------
1 | {
2 |   "group":     "wg/webrtc",
3 |   "contacts":  ["dontcallmedom","caribouW3"],
4 |   "repo-type": "rec-track"
5 | }
6 | 


--------------------------------------------------------------------------------