Introducing the CANDOR corpus from @BetterUp: A 1TB, 850hr audio-video dataset of 1,656 unscripted conversations in America in 2020. CANDOR = Conversation: A Naturalistic Dataset of Online Recordings arxiv.org/abs/2203.00674
CANDOR corpus is now available! It took years of hard work, but we hope it will be useful for researchers from many fields interested in conversation and social interaction. Dataset available for download (link in manuscript). Please share it widely.
science.org/doi/10.1126/scia…
I asked 5 hard questions to two modern language AIs - Meta's new BlenderBot (blenderbot.ai) and OpenAI's GPT-3. Warning: Gets a little computer nerdy, but there are riddles and metaphysics too!
Answers and commentary in :thread:
Introducing the CANDOR corpus from @BetterUp: A 1TB, 850hr audio-video dataset of 1,656 unscripted conversations in America in 2020. CANDOR = Conversation: A Naturalistic Dataset of Online Recordings arxiv.org/abs/2203.00674