Hephaestus — DPIA Pre-Screen (Art. 35 GDPR)

Last updated: 2026-05-07.

Records whether a full Data Protection Impact Assessment is required for the TUM-operated Hephaestus deployment. Two gates apply to TUM as a Bavarian public body: Art. 35(3) GDPR and the Bavarian Blacklist published by the BayLfD under Art. 35(4) GDPR. The DSK list (which addresses the non-public sector) is referenced only as a cross-check.

Trigger	Present?	Reasoning
Systematic and extensive evaluation, including profiling, that forms the basis for decisions producing legal effects or similarly significant effects (Art. 35(3)(a))	No	Hephaestus produces advisory findings and guidance delivered to the contributor and visible to other workspace members on dashboards and per-artefact views. They are not consumed by any automated grading, assessment, HR, or access-control pipeline operated by Hephaestus. The processing is workspace-scoped and practice-scoped (defined per workspace by the workspace administrator), not exhaustively profiling individuals across a large population.
Large-scale processing of Art. 9(1) (special categories) or Art. 10 (criminal convictions) data	No	No special-category data is processed. Contributors are warned in the privacy statement not to enter third-party personal data into commits, reviews, or practice-review diffs.
Systematic monitoring of a publicly accessible area on a large scale	No	Hephaestus does not monitor publicly accessible areas.

None of the three Art. 35(3) triggers is present.

2. Bavarian Blacklist (BayLfD)

The BayLfD's Bavarian Blacklist (published 7 March 2019 under Art. 35(4) GDPR) enumerates concrete public-sector processing constellations that require a DPIA. None of the listed constellations describes a university teaching-support feedback platform. The risk-criteria assessment below tracks the WP29 Guidelines on DPIA (WP248rev.01) criteria the BayLfD applies in screening:

Criterion	Present?	Reasoning
Vulnerable data subjects on a large scale	No	Users sign in via federated identity providers (GitHub, LRZ-GitLab); typical use is TUM courses and AET research-project repositories. No vulnerable group is processed systematically.
Employee performance / behaviour monitoring	No	Art. 75a BayPVG's Eignungsrechtsprechung is the capability test for monitoring employee behaviour or performance. Hephaestus is suitable for displaying contributor activity but is not deployed as a personnel-evaluation, performance-management, or HR-consuming instrument; staff appear as contributors on the same footing as students. The platform exposes no Dienststelle-segmented dashboard, no HR export, and no manager-facing roll-up.
Innovative technology with unclear DP impact	Partly (AI-assisted features)	AI-assisted guidance and automated practice review call an external LLM provider per workspace. This is a commonly understood class of processing under enterprise no-training terms with DPF / SCC safeguards, but falls on the elevated-risk side of the innovative-technology criterion and warrants the documented mitigations in §5 below.
Dataset-matching from different sources	No	Hephaestus does not cross-match contributor data with external profiles beyond the federated identity attributes the IdP itself discloses.
Third-country transfer outside adequacy decision	No	U.S. recipients on the active EU-US Data Privacy Framework list rely on the DPF adequacy decision; otherwise SCCs Module 2 (Commission Implementing Decision (EU) 2021/914) apply as the contractual safeguard. DPF status is verified per recipient before engagement.
Scoring / profiling affecting service access	No	Findings are advisory and do not gate access, features, or grades within Hephaestus.
Biometric / genetic data	No
Surveillance of publicly accessible areas	No

3. DSK list — cross-check (not directly applicable)

The DSK list of processing operations requiring a DPIA under Art. 35(4) GDPR addresses the non-public sector and is not directly applicable to TUM/AET as a Bavarian public body — the controlling instrument for TUM is the Bavarian Blacklist in §2 above. Cross-checking the DSK list as a sanity test, none of its entries forces a DPIA on the Hephaestus processing profile. The platform-side mitigations relied on for the residual AI-feature risk (enterprise no-training terms, joint-controller arrangement documented, Art. 21 objection in the privacy statement) are listed in §5 and reflected in privacy §10, §3, and §7.

4. Residual risk analysis

Risk	Likelihood	Impact	Mitigation
Free-text artefact content (PR descriptions, commit messages, review comments) contains personal data of identifiable third parties and is transmitted to the LLM provider	Low-medium	Low-medium	Privacy statement warns contributors not to enter third-party personal data; practice-review sandbox runs on a per-job `--internal` Docker network with no general egress except a per-job, token-authenticated LLM proxy; only the diff and surrounding discussion is forwarded; contributors have an Art. 21 objection toggle ("AI review comments").
LLM provider retains the prompt beyond the enterprise default retention window	Low	Low-medium	Enterprise no-training terms; abuse-monitoring retention per the provider's published terms (Microsoft Azure OpenAI: enterprise abuse-monitoring window per the published data-privacy documentation; eligible customers can apply for Microsoft's modified abuse monitoring / Limited Access program); Zero Data Retention can be negotiated where the provider supports it; DPF / SCCs Module 2 in place.
Server access logs retain IP addresses	Low	Low	Tomcat AccessLogValve deletes entries after 14 days; pattern minimised to timestamp, IP, method, path, protocol, status, response size, processing time.
gitlab.lrz.de content leaks via the LRZ integration	Low	Low	LRZ is a separate controller; inter-public-body transmission under Art. 5(1) Nr. 1 BayDSG; LRZ applies its own TOMs on its own infrastructure.
Workspace administrator enables Slack without informing contributors about monitored channels	Low-medium	Low	Joint-controller / shared-responsibility model documented in privacy §10; monitored channels are forward-only, require explicit activation, post a visible channel announcement, and provide App Home/settings opt-out plus erasure.
Compromise of the application DB exposes federated identity links + cookie-session revocation list	Low	Medium	Self-hosted on AET infrastructure; upstream tokens encrypted at rest; short-lived ES256 cookie-JWTs with server-side revocation; TLS-only ingress; incident response under TUM DPO oversight.

5. DPIA-light mitigations that must remain in place

No-training enterprise API terms for every LLM provider configured by the AET-pool. Regressing to a consumer tier is a material change.
Per-job LLM proxy enforced by the practice-review sandbox. DNS disabled inside the sandbox; outbound traffic limited to a per-job, token-authenticated proxy. Any widening of this network posture is a material change.
Art. 21 objection switch in contributor profile settings ("AI review comments"). Disabling or hiding it is a material change.
Workspace-administrator joint-controller notice in the privacy statement (§3.2). Structural changes to the shared-responsibility split require an amended record.
Bounded server-log retention via Tomcat's native 14-day retention with the minimised pattern. Extending the retention window or widening the logged fields is a material change.
Error telemetry and product analytics remain disabled. The webapp ships a Sentry integration and a PostHog integration, both disabled in the current production deployment. Activating either is a material change.

6. Conclusion

No full DPIA required at the current scope. None of the three Art. 35(3) triggers is present, the Bavarian Blacklist enumerates no constellation that fits Hephaestus, and the DSK cross-check is consistent with this conclusion. The documented mitigations in §5 remain in place for the elevated-risk AI surface.

A full DPIA must be opened before any of the following takes effect:

LLM provider is added, changed to a consumer tier, or loses its no-training commitment.
Practice-review sandbox gains outbound connectivity beyond the per-job LLM proxy.
Findings begin to drive any automated decision within Hephaestus (grading, recognition caps, feature access).
The bundled Sentry integration is activated against a SaaS tenant, or the bundled PostHog integration is activated.
The processing population starts to include data subjects in a category covered by the BayLfD vulnerable-data-subjects criterion.

1. Threshold check against Art. 35(3) GDPR​

2. Bavarian Blacklist (BayLfD)​

3. DSK list — cross-check (not directly applicable)​

4. Residual risk analysis​

5. DPIA-light mitigations that must remain in place​

6. Conclusion​

1. Threshold check against Art. 35(3) GDPR

2. Bavarian Blacklist (BayLfD)

3. DSK list — cross-check (not directly applicable)

4. Residual risk analysis

5. DPIA-light mitigations that must remain in place

6. Conclusion