Contains all the implementations for Palisade data reader technologies.

A Tool for Complex and Scalable Data Access Policy Enforcement

Palisade Readers

Status

Palisade is no longer under active development.

Windows is not an explicitly supported environment, although Palisade has been made compatible with it where possible.
For Windows developer environments, we recommend setting up WSL.

For an overview of Palisade, start with the Palisade introduction and the accompanying guides: the QuickStart Guide and the Developer Guide, both found in the Palisade README.

Overview of the Readers

The Palisade-readers repository provides the implementations Palisade needs to integrate with existing products and technologies.

A good starting point for understanding these modules is the Data Service. A single request to the Data Service might look like GET /read/chunked resourceId=hdfs:/some/protected/employee_file0.avro token=some-uuid-token, which breaks down into a number of required capabilities:

  • Reading data from an HDFS cluster
  • Deserialising an Avro data-stream
  • Understanding what an Employee datatype looks like and how the rules on the /protected directory will apply to the fields
  • Returning data over the /read/chunked API endpoint
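
The request shown above can be sketched as a small value object. This is an illustration only: the class name, the space-separated parameter format, and the parse helper are assumptions made for the sketch, not the Data Service's actual request handling.

```java
import java.util.HashMap;
import java.util.Map;

// Illustrative sketch: parse the parameter portion of a read request such as
// "resourceId=hdfs:/some/protected/employee_file0.avro token=some-uuid-token".
// All names and the wire format here are assumptions, not Palisade's real API.
final class ReadRequest {
    final String resourceId;
    final String token;

    ReadRequest(String resourceId, String token) {
        this.resourceId = resourceId;
        this.token = token;
    }

    // Split "key=value" pairs on whitespace and pick out the two we need.
    static ReadRequest parse(String raw) {
        Map<String, String> params = new HashMap<>();
        for (String part : raw.trim().split("\\s+")) {
            int eq = part.indexOf('=');
            if (eq > 0) {
                params.put(part.substring(0, eq), part.substring(eq + 1));
            }
        }
        return new ReadRequest(params.get("resourceId"), params.get("token"));
    }
}
```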

The Palisade-readers repository therefore implements many of these functions, abstracted away from the inner workings of the core Palisade-services:

  • In this case, the Data Service's default API is the /read/chunked endpoint, which is already implemented by the ReadChunkedDataService, but we could imagine other protocols.
  • To read from an HDFS filesystem, we need the Resource Service to discover the available resources (like doing an ls on a directory), as well as needing the Data Service to read the raw bytes of that resource. We implement the Hadoop Resource Service and Hadoop Data Reader to enable this functionality.
  • To work with the raw bytes returned from the Data Reader, we need to deserialise into Java objects. We implement the Avro Serialiser that, given a domain class, will serialise and deserialise between Java objects of this class and plain bytes.
  • The domain class for the aforementioned serialiser is, in this case, Employee, which is implemented elsewhere; it is equivalent to a schema definition and is generally a property of the specific dataset rather than of the Palisade deployment. All that matters is that this POJO exists somewhere on the classpath.
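
The serialiser contract described above can be sketched with only the JDK. The interface shape, the Employee field, and the toy line-delimited implementation below are assumptions for illustration; the real AvroSerialiser works against an Avro schema rather than plain text.

```java
import java.io.*;
import java.nio.charset.StandardCharsets;
import java.util.stream.Stream;

// Hypothetical sketch of a serialiser contract similar in spirit to the
// Avro Serialiser: convert between Java objects of a domain class and raw
// bytes. Names and signatures are illustrative, not Palisade's actual API.
interface Serialiser<T> {
    void serialise(Stream<T> objects, OutputStream out) throws IOException;
    Stream<T> deserialise(InputStream in) throws IOException;
}

// Minimal stand-in for the dataset's domain class / schema definition.
final class Employee {
    final String name;
    Employee(String name) { this.name = name; }
}

// Toy line-delimited implementation; a real deployment would use Avro's
// generated classes and datum readers/writers instead of text lines.
final class LineSerialiser implements Serialiser<Employee> {
    public void serialise(Stream<Employee> objects, OutputStream out) throws IOException {
        Writer w = new OutputStreamWriter(out, StandardCharsets.UTF_8);
        for (Employee e : (Iterable<Employee>) objects::iterator) {
            w.write(e.name + "\n");
        }
        w.flush();
    }

    public Stream<Employee> deserialise(InputStream in) {
        return new BufferedReader(new InputStreamReader(in, StandardCharsets.UTF_8))
                .lines()
                .map(Employee::new);
    }
}
```

The key design point the sketch preserves is that the serialiser is generic over the domain class, so swapping the dataset's schema does not change the Data Service itself.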

The decoupling of these technology-specific implementations keeps Palisade flexible enough to slot into existing tech stacks and datasets. The above deployment could just as easily have used the S3 Resource Service and S3 Data Reader to serve a request for GET /read/chunked resourceId=s3:/some/protected/employee_file0.avro token=some-uuid-token.
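
One way to picture this decoupling is selecting a reader by the resourceId's URI scheme. The registry class, the name() method, and the wiring below are assumptions sketched for illustration, not Palisade's actual service composition.

```java
import java.net.URI;
import java.util.Map;

// Hypothetical reader abstraction; the single name() method stands in for
// the real read-bytes behaviour so the sketch stays self-contained.
interface DataReader {
    String name();
}

// Illustrative-only registry: pick the technology-specific reader whose key
// matches the resourceId's scheme (hdfs, s3, ...), so swapping storage
// backends never touches the calling service.
final class ReaderRegistry {
    private final Map<String, DataReader> byScheme;

    ReaderRegistry(Map<String, DataReader> byScheme) {
        this.byScheme = byScheme;
    }

    DataReader forResource(String resourceId) {
        String scheme = URI.create(resourceId).getScheme();
        DataReader reader = byScheme.get(scheme);
        if (reader == null) {
            throw new IllegalArgumentException("no reader for scheme: " + scheme);
        }
        return reader;
    }
}
```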

For information on the different implementations, see the following modules: