With enterprises pouring billions of dollars into generative AI (genAI) initiatives, questions about future legal exposure are too often set aside.
The risks are practically endless. Although enterprises usually do extensive data fine-tuning before deploying large language models (LLMs), the massive underlying training corpus remains a black box. The major model makers — including OpenAI, Google, AWS, Anthropic, Meta, and Microsoft — provide no visibility into their training data. That includes how old or out-of-date it is, how reliable it is, what languages it was sourced from, and, critically, whether it violates privacy rules, copyright restrictions, trademarks, patents, or rules governing regulated sensitive data (healthcare records, financial data, PII, payment card details, security credentials, and so on).
Even when vendors provide source lists for the data used to train their models, those lists may offer little meaningful detail. For example, a source might be listed simply as "Visa transaction information." How old is it? Has it been verified? Has it been sufficiently sanitized for compliance?
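To make the sanitization question concrete, here is a minimal sketch of the kind of pre-fine-tuning compliance scan an enterprise might run on its own data before it ever reaches a model. The patterns and the `flag_sensitive` helper are hypothetical illustrations, not any vendor's actual pipeline; a production system would use a vetted PII-detection library and legal review rather than a handful of regexes.

```python
import re

# Hypothetical patterns for a pre-fine-tuning compliance scan.
PATTERNS = {
    "email": re.compile(r"\b[\w.+-]+@[\w-]+\.[\w.-]+\b"),
    "ssn": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
    "card_candidate": re.compile(r"\b(?:\d[ -]?){13,19}\b"),
}

def luhn_valid(digits: str) -> bool:
    """Return True if the digit string passes the Luhn checksum
    (used to filter card-number candidates from random digit runs)."""
    total, parity = 0, len(digits) % 2
    for i, ch in enumerate(digits):
        d = int(ch)
        if i % 2 == parity:  # double every second digit from the right
            d *= 2
            if d > 9:
                d -= 9
        total += d
    return total % 10 == 0

def flag_sensitive(record: str) -> list[str]:
    """Return the names of sensitive-data patterns found in a record."""
    hits = []
    for name, pattern in PATTERNS.items():
        for match in pattern.finditer(record):
            if name == "card_candidate":
                digits = re.sub(r"\D", "", match.group())
                if not luhn_valid(digits):
                    continue  # a digit run, but not a plausible card number
            hits.append(name)
            break  # one hit per pattern is enough to quarantine the record
    return hits

# Usage: quarantine any fine-tuning record that trips a pattern.
records = [
    "Customer paid with card 4111 1111 1111 1111 yesterday.",
    "Quarterly revenue grew 12% year over year.",
]
for r in records:
    hits = flag_sensitive(r)
    status = f"QUARANTINE ({', '.join(hits)})" if hits else "OK"
    print(f"{status}: {r}")
```

The point of the sketch is the asymmetry it exposes: an enterprise can run checks like this on data it controls, but it has no way to run them, or confirm that anyone did, on the opaque corpus the base model was trained on.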