• Large data sources with bad data. Smaller data sources with good data. Which is better?

  • Jul 26 2023
  • Length: 6 mins
  • Podcast
Large data sources with bad data. Smaller data sources with good data. Which is better?  By  cover art

Large data sources with bad data. Smaller data sources with good data. Which is better?

  • Summary

  • Large bad data. Obviously not good and some have recognized this as “model collapse” – the bad data causes more bad data to be generated. Small bad data – nothing needs to be said here. Small (vetted, provenance-known) data – perhaps within your enterprise walls – perhaps the way to go, until large, good data that is used for training appears. When will this happen? I do not think this is on the horizon.

    Show more Show less

What listeners say about Large data sources with bad data. Smaller data sources with good data. Which is better?

Average customer ratings

Reviews - Please select the tabs below to change the source of reviews.