Skip to content
View UMBRA-Artifacts's full-sized avatar

Block or report UMBRA-Artifacts

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
umbra-artifacts/README.md

UMBRA Artifact Repository

Source Code and Supplementary Materials

1. Ground Truth Dataset/

This directory contains manually annotated labels used to validate and evaluate the accuracy of UMBRA’s detection modules.

The annotations serve as reference ground truth for performance analysis.


2. Large-Scale-Measurement Dataset/

This directory contains datasets collected from large-scale web crawls. EU/ → European websites (2000) USA/ → United States websites(2000) Tranco-10K/ → Tranco top-ranked websites (10000)

3. Lexicons

This directory provides keyword lists and linguistic indicators used to detect multiple categories of dark patterns, including:

  • Cookie information disclosure

  • Purpose transparency

  • Opt-out pricing

  • Legal ambiguity

  • These lexicons support the automated text-based detection modules.

  • 4. Results_Cookies/Cookie_json/ (Limited)

This directory stores collected cookie records in JSON format.

Each file includes information such as:

  • Cookie names and domains
  • Expiration times
  • Security attributes
  • Interaction-dependent changes

These records enable security and privacy risk analysis.

5. Screenshots (Limited)/

This directory contains a limited subset of banner screenshots.

These images are provided for qualitative validation and illustrative purposes. These final images were used by both researchers to annotate the ground-truth dataset.

Popular repositories Loading

  1. umbra-artifacts umbra-artifacts Public

    This repository provides an implementation of a system for automatically detecting evolved dark patterns in cookie consent interfaces.

    Python 1

  2. UMBRA-ARTIFACTS1 UMBRA-ARTIFACTS1 Public

    This repository contains the ground truth dataset, source code, lexisons, few results, and large scale measurement dataset

    Python