Skip to main content

What It Actually Costs When Your Team Uploads Client Documents to ChatGPT

Neetusha
Neetusha · Founder & CEO of RedactifyAI ·

An HR manager at a mid-sized healthcare company pastes an employee performance review into ChatGPT to help draft talking points for a difficult conversation. The document includes the employee's name, their manager's name, details about a medical accommodation the employee requested under the ADA, and disciplinary notes spanning 18 months.

The employee later files an ADA complaint. During litigation, the company's AI tool usage logs are subpoenaed. The document, with its medical accommodation details, was processed by OpenAI's servers under consumer terms that permit data retention.

This is not a hypothetical. The scenario combines documented patterns from actual breach investigations and employment litigation since 2023. The underlying facts, that AI tool inputs are processed and potentially retained on external servers under consumer-grade terms, are not disputed by any major AI vendor.

Quick answer: What must be redacted under HIPAA? The 18 PHI identifiers and what the Safe Harbor standard requires, condensed to ~400 words.


Why this is happening at scale

A 2024 Fishbowl survey of professionals found that 43 percent reported using AI tools at work without their employer's knowledge. A separate 2024 Cyberhaven analysis covering over 1.6 million workers found that employees had pasted sensitive corporate data into generative AI tools at a rate that tripled between March 2023 and March 2024.

Most employees do not think of pasting text into ChatGPT as uploading data to an external server. They think of it as a faster way to draft a document. The legal and compliance consequences of what happens to that input after it leaves the browser are not visible in the tool's interface.

The tool looks like a word processor. It is not. It is a cloud service with its own data retention terms, and those terms vary significantly depending on which tier your employee happens to be using.


What ChatGPT actually does with the documents you upload

OpenAI's data practices differ by account type. Most employees are not using enterprise accounts.

Free and Plus accounts. Under OpenAI's privacy policy, conversations from free and Plus accounts may be used to improve AI models by default. Users can opt out in their account settings, but few do. The default means that a document uploaded to a free or Plus account can become part of OpenAI's training pipeline.

ChatGPT Team. Team accounts are not used for training by default. Conversation data is retained for 30 days.

ChatGPT Enterprise. Enterprise accounts come with a zero-data-retention option and a data processing addendum suitable for business use. OpenAI offers a HIPAA Business Associate Agreement for healthcare organizations on the Enterprise tier. Without a signed BAA, using ChatGPT with protected health information is a HIPAA violation regardless of which tier your team is on.

The gap that matters: most employees using ChatGPT at work are on free or Plus accounts, not Enterprise. The data protections that exist at the Enterprise tier simply do not apply to the account your HR coordinator, clinic admin, or financial analyst opened with their personal email address.


The compliance exposure by regulation

HIPAA (healthcare and health data)

HIPAA prohibits covered entities and their business associates from disclosing protected health information (PHI) without authorization. Uploading a patient record to ChatGPT without a signed BAA is a disclosure to a third party without authorization, regardless of whether the employee intended any harm.

The penalty tiers for HIPAA violations set by the Department of Health and Human Services:

HIPAA civil penalty tiers (per violation category)

TierCausePer-violation rangeAnnual maximum
1Unknowing violation$100 to $50,000$25,000
2Reasonable cause$1,000 to $50,000$100,000
3Willful neglect, corrected$10,000 to $50,000$250,000
4Willful neglect, not corrected$50,000$1.9 million

Tier 4 applies when an organization knew the risk existed and did not correct it. A healthcare system that has an IT policy against unauthorized AI tool use, but whose staff still uses ChatGPT without the organization acting on it, is in Tier 3 or Tier 4 territory. The average total cost of a healthcare data breach in the United States was $9.77 million in 2024, according to IBM's annual Cost of a Data Breach report.

GDPR (EU personal data)

GDPR applies to any organization that processes personal data of EU residents, regardless of where the organization is based. The maximum fine is 4 percent of annual global revenue or 20 million euros, whichever is higher.

Under Article 28, any third party that processes personal data on your behalf must be a data processor with a signed data processing agreement. OpenAI's consumer products do not satisfy the requirements for a GDPR-compliant DPA. Uploading an EU resident's personal data to a consumer ChatGPT account is a transfer of personal data to an unauthorized sub-processor.

Italy's data protection authority fined OpenAI 15 million euros in December 2024 for processing personal information without an adequate legal basis. The Dutch DPA fined Clearview AI 30.5 million euros that same year for scraping facial images without consent. European regulators are actively pursuing AI companies and the organizations that use them improperly.

CCPA (California data)

The California Consumer Privacy Act gives California residents specific rights over their personal information and imposes obligations on businesses that collect it. CCPA penalties are $2,500 per violation for unintentional violations and $7,500 per violation for intentional violations. Consumers can also bring class actions for certain breaches, with statutory damages of $100 to $750 per consumer per incident.

The word "intentional" matters here. If an employee knowingly pastes a California customer's personal information into ChatGPT and the company had no policy prohibiting it, CCPA treats the company's inaction as part of the intentional violation. CCPA does not require that the company intended to harm anyone. It requires that the act was deliberate.

Attorney-client privilege

For law firms and legal departments, the analysis runs through privilege doctrine rather than regulatory fines. United States v. Heppner (S.D.N.Y., February 2026) held that documents created using Anthropic's Claude on a consumer account were not protected by attorney-client privilege, because the platform was not an attorney, its terms eliminated any expectation of confidentiality, and the documents were not created at the direction of counsel. The full analysis is in our post on the Heppner ruling and what it means for law firms using AI tools.


The costs beyond regulatory fines

Fines are the visible number. The actual cost of a data breach from an AI tool is substantially higher.

Breach notification. HIPAA requires notification to affected individuals within 60 days of discovering a breach. Breaches affecting more than 500 people in a single state also require media notification. The notification process itself, mailing, call centers, and individual outreach, ranges from $50 to $300 per person for complex breaches.

Legal defense. A single regulatory investigation typically costs between $100,000 and $500,000 in legal fees before any fine is issued. Class action defense costs more.

Credit monitoring and identity protection services. Organizations providing credit monitoring to affected individuals typically pay $10 to $30 per person per year. For 10,000 affected individuals, that is $100,000 to $300,000 annually.

Corrective action and audit. After a HIPAA breach, covered entities are subject to mandatory corrective action plans, which include external audits, staff retraining, policy overhauls, and ongoing monitoring. These run $200,000 to $2 million depending on the organization's size.

Client and customer attrition. IBM's 2024 breach cost report found the average breach leads to a 6 to 7 percent loss of existing customers. For a healthcare system or financial services firm with high revenue per client, this is often the largest cost line.


Which document types create the highest exposure

Document types, applicable regulation, and maximum exposure

Document typeWho handles itPrimary regulationMaximum exposure
Patient records, medical notes, lab resultsClinicians, admin staffHIPAA$1.9M/year per violation category
Employee HR files, PIPs, accommodation requestsHR teams, managersADA, CCPA, GDPR$7,500/violation (CCPA); 4% global revenue (GDPR)
Client contracts, NDA agreements, deal termsLegal, deal teamsTrade secret law, privilege doctrinePrivilege waiver, injunction, contract damages
Customer financial records, bank statementsFinance, accountingGLBA, CCPA, GDPR$100,000/day (GLBA); $7,500/violation (CCPA)
Deposition transcripts, case strategy documentsAttorneys, paralegalsAttorney-client privilege, ABA Rule 1.6Privilege waiver, bar discipline, malpractice claim

Metadata travels with every document you upload. A file's metadata carries author names, firm names, revision histories, and creation software even when the visible text looks clean. When you upload that document to any AI tool, the metadata goes with it. For more on what PDF metadata contains and why stripping it matters, see our post on PDF metadata privacy risks.


What to do before using AI on any sensitive document

The answer is not to stop using AI tools. It is to control what the AI actually receives.

Redacting identifying information before uploading is the only point in the workflow where you have full control. Once a document leaves your environment, you are relying on the vendor's data handling practices. Those practices differ by tier, can change with terms updates, and do not retroactively apply to data already processed.

For a deeper explanation of why post-processing options (asking the vendor to delete data, switching to enterprise tiers after a consumer-tier exposure) do not solve the memorization problem, see our post on why you must redact documents before feeding them to AI.

The workflow that works:

  1. Identify what makes the document sensitive: names, identifiers, accommodation details, case numbers, financial figures, account numbers
  2. Use a purpose-built redaction tool to permanently remove those elements from the file structure, not just cover them with a black box on screen
  3. Upload the sanitized document to the AI tool
  4. Keep the original in your secure environment, unmodified
  5. Log the redaction step: what was removed, who did it, and when

Step 5 is not optional for regulated organizations. If your process is ever audited, a per-document log is what demonstrates due diligence. Visual redactions (black boxes drawn in Preview or Word) do not satisfy step 2 and will not satisfy an auditor. For a full breakdown of what permanent redaction means versus visual masking, see how to redact documents safely.


A practical policy checklist for teams

Before publishing any guidance to staff on AI tool use, work through these:

  1. Audit what your team actually uses. Many employees have personal accounts for consumer AI products that IT has not approved. The gap between what is authorized and what is in active use is typically wider than legal or compliance teams expect.
  2. Classify documents by sensitivity. PHI, PII, trade secrets, attorney-client communications, and HR records each carry different regulatory exposure. One policy does not fit all of them.
  3. Specify the tier requirement. If your organization permits AI tool use with sensitive categories, only enterprise accounts with signed DPAs and BAAs (where applicable) satisfy the legal baseline. Consumer and Team tiers do not.
  4. Make the redaction step mandatory and explicit. Implied is not enough. Employees need to know that redaction before upload is a required step, not a suggestion.
  5. Train each specific team. HR coordinators, clinic admins, and financial analysts are not typically trained on data privacy law the way legal teams are. A general policy memo will not reach them. Role-specific guidance will.
  6. Get client consent where required. ABA Formal Opinion 512 found that using AI tools with client data without informed consent may violate professional responsibility rules. Boilerplate engagement letters do not satisfy this requirement.

If you use Clio, RedactifyAI integrates directly with it. Pull a document from a matter, redact the identifying content with AI detection across 40+ entity types, and upload a clean version to whatever AI tool you need. The original stays in Clio, unmodified. Full details are in our guide on how to redact documents in Clio without overwriting originals.


Frequently asked questions

Does using a paid ChatGPT Plus subscription protect documents from being used for AI training?

No. ChatGPT Plus is a consumer subscription with additional features, not enterprise data protections. Under OpenAI's current terms, Plus conversations are subject to the same default data handling as free accounts unless you actively opt out in your account settings. An opt-out toggle in a consumer account is not the same as a contractual data processing agreement. If your use case involves HIPAA-regulated data or GDPR-covered personal information, only the Enterprise tier with a signed DPA and, for healthcare, a signed BAA satisfies the legal minimum.

If an employee uploads a client document to ChatGPT and we discover it, what are the steps?

First, identify which regulation governs the data in the document: HIPAA for PHI, GDPR for EU personal data, CCPA for California residents. Second, assess whether the disclosure triggers mandatory breach notification under the applicable law. HIPAA requires notification to affected individuals within 60 days of discovering a breach, even unintentional ones. Third, contact legal counsel before notifying anyone. The notification process itself is regulated and getting it wrong adds liability. Fourth, use the incident to update your internal AI use policy with specific, explicit document handling requirements.

Is the risk different if the document was uploaded to Claude, Gemini, or Microsoft Copilot instead of ChatGPT?

The platform matters less than the account tier and the terms governing it. Every major AI vendor offers consumer, professional, and enterprise tiers with different data handling terms. The pattern is consistent: consumer accounts may retain and process inputs by default; enterprise accounts with explicit contracts typically do not. The ABA's analysis in Formal Opinion 512 applies to all external AI tools, not just OpenAI products. The Heppner ruling addressed Anthropic's Claude specifically but used reasoning that applies to any platform with similar consumer terms. Check the specific ToS for whatever tool your team uses.

How does redacting a document before uploading it protect the organization?

Permanent redaction removes identifying information from the file structure, not just from what appears on screen. A document where names, case numbers, account numbers, and other identifiers have been permanently removed cannot expose that information even if the AI tool retains the uploaded file. The AI processes a sanitized version. The original, with all its identifying content, stays in your environment under your control. If the identifying information was never in the document that was uploaded, it cannot be memorized, retained, or reproduced by the AI system. For what permanent redaction means technically, see what is document redaction.

Does GDPR apply to US-based companies?

Yes, if you process personal data of EU residents. GDPR's territorial scope under Article 3 applies to any organization offering goods or services to EU residents or monitoring their behavior, regardless of where the organization is headquartered. A US law firm with EU clients, a US healthcare company with EU employee data, or a US financial services firm with EU customer accounts all fall within GDPR's scope for that data. The regulation does not distinguish between US-headquartered and EU-headquartered organizations when it comes to the personal data of EU residents.


Try RedactifyAI free on your first document at redactifyai.com/tools/redact-pdf-free. No account needed. For teams with ongoing volume, Clio integration, or multiple file formats, sign up free and run your first document through the full AI detection pipeline.

Stop redacting documents manually

RedactifyAI detects PII automatically and redacts it permanently. Not just a black box overlay. Try it free, no credit card required.

Learn more about AI redaction software and how it compares to manual redaction tools.