LLMs And Chatbots: The Need For Proprietary Data

▶️ Watch on 3Speak


We know that LLMs require a ton of data. Entities such as OpenAI scrubbed the Internet grabbing whatever they could. This has led to a lot of data being fed into these NNs.

In this video I discuss how this is one piece of the equation. Since all the data is similar, what is going to distringuish one from the other? Here we get the paradox that proprietary data is going to be needed also.


▶️ 3Speak



0
0
0.000
1 comments
avatar

I agree. A lot of the information is available to all. That is why some are starting to protect and monetize their data. Reddit already did it, and some companies are suing LLMs for the data. The exclusivity of information is going to be a big deal going forward, although hiding it will be difficult. If it is added to the AI, then users will be able to know about it eventually.

0
0
0.000