
OpenAI Collaborates with Organizations to Enhance AI Training Data
OpenAI has announced its Data Partnerships program, aiming to collaborate with third-party organizations to develop new data sets for training AI models. The initiative seeks to address the flaws and biases present in existing data sets, which can lead to harmful amplification by AI models. OpenAI plans to collect large-scale data sets that reflect human society and encompass various modalities, including images, audio, and video. The company is particularly interested in data that expresses human intention across different languages, topics, and formats. OpenAI will work with organizations to digitize training data and create both open source and private data sets. While the program aims to improve AI model understanding, concerns have been raised about potential bias and compensation for data owners.