Which term describes a data repository that can aggregate structured and unstructured data while preserving access controls from the source?

Study for the AAISM Domain 1: AI Governance Program Management Test. Utilize flashcards and multiple-choice questions. Each question includes hints and explanations to prepare you for success!

Multiple Choice

Which term describes a data repository that can aggregate structured and unstructured data while preserving access controls from the source?

Explanation:
A data lake is the repository that can hold both structured and unstructured data at scale while preserving governance and access controls from the source. It stores data in its native format, ranging from tables and CSVs to PDFs, images, and logs, enabling flexible schema-on-read analytics and data science workflows. Because data lakes are designed to integrate with security and governance mechanisms—like identity and access management, encryption, and policy-based controls—they can maintain or enforce access rules as data is ingested, stored, and consumed, helping preserve the origin’s security posture. The other options don’t fit as well. A data exploration and training platform focuses on tools and interfaces for analyzing data and building models rather than serving as a central, diverse data repository with integrated access controls. A vector database specializes in storing high-dimensional vectors for similarity search and is not typically used to aggregate all data types with preserved source-level access controls. An AI system production refers to deploying models and serving AI capabilities, not to storing and governing large, heterogeneous datasets.

A data lake is the repository that can hold both structured and unstructured data at scale while preserving governance and access controls from the source. It stores data in its native format, ranging from tables and CSVs to PDFs, images, and logs, enabling flexible schema-on-read analytics and data science workflows. Because data lakes are designed to integrate with security and governance mechanisms—like identity and access management, encryption, and policy-based controls—they can maintain or enforce access rules as data is ingested, stored, and consumed, helping preserve the origin’s security posture.

The other options don’t fit as well. A data exploration and training platform focuses on tools and interfaces for analyzing data and building models rather than serving as a central, diverse data repository with integrated access controls. A vector database specializes in storing high-dimensional vectors for similarity search and is not typically used to aggregate all data types with preserved source-level access controls. An AI system production refers to deploying models and serving AI capabilities, not to storing and governing large, heterogeneous datasets.

Subscribe

Get the latest from Passetra

You can unsubscribe at any time. Read our privacy policy