Data Privacy in AI Copilot
How SGS Digicomply utilizes Generative AI while prioritizing your security.
At SGS Digicomply, we are committed to transparently explaining how we utilize Generative Artificial Intelligence (AI) in our AI Copilot to enhance your research and insights while prioritizing your data privacy and security.
This document outlines our current and potential future AI applications and the measures we take to protect your information, including our use of a private, in-house Large Language Model (LLM) for private data and third-party providers for public data.
Our Commitment to You
Regardless of how AI technology evolves, we are rigid in our commitment to:
-
Always prioritize the privacy and security of your data.
-
Utilize our private, in-house LLM for all private customer data, ensuring it remains within the SGS Digicomply cloud.
-
Utilize third-party providers exclusively for public data, solely for the purpose of fulfilling your requests related to that data.
-
Never transmit private customer data to third-party providers, nor in a manner that could identify our clients.
-
Never train our models, nor permit third-party providers to train their models, on customer data.
-
Inform you of changes in our use of third-party providers, if applicable.
-
Maintain the option for you to opt out of these features.
How We Use Customer Data
-
Private Data:
-
Definition: All data generated directly by the user, including every query, comment, and document uploaded to the system.
-
Handling: We utilize our own instance of an open-source LLM hosted securely within the SGS Digicomply cloud. This ensures your private data never leaves our controlled environment. Our private LLM processes your inputs to provide accurate responses while maintaining complete data privacy.
-
-
Public Data:
-
Definition: Publicly available information or data explicitly designated as public.
-
Handling: We may utilize trusted third-party LLM providers. This allows us to leverage external capabilities for broader data processing when privacy concerns are mitigated by the public nature of the data.
-
Which Third-Party Providers Do We Use?
For public data, we utilize the following providers to improve our services:
-
Google Cloud Generative AI
-
LLM models: Used in various parts of our applications (e.g., to generate summaries and answers).
-
Custom fine-tuned Gemini models: Employed for large-scale extraction tasks. These are trained using proprietary datasets created by SGS experts. We never use client data to train these models.
-
-
OpenAI
-
LLM models: Used in various parts of our applications to generate summaries and answers.
-
Embeddings: Numerical representations of text that facilitate features based on similarity and relevancy.
-
-
Anthropic (upcoming integration)
- LLM models: Used in various parts of our applications, e.g., to generate summaries and answers.
- Agent SDK: used to build production-ready AI agents, allowing us to create complex, autonomous workflows.