>Recently, we also trained an experimental social model which was well received by our community. However, this was not trained on Discord API data. In fact, we would have no use of such data. Shapes have exchanged millions of messages through our website, through X, through email and other experimental integrations. We used a small anonymised dataset of prompts between users and shapes on these non-Discord platforms to train this model. We have always used Discord's API data as directed by users, to enable interactions with their Shapes.
Not sure what's meant here. So you're saying you used data collected from other platforms to train the model, but not Discord data? If so, how is that reassuring, given that you confirm most of your user traffic comes from Discord? It would have been better to release a detailed research paper covering how you anonymised the data, whichever platform it came from. What if one of the other integration platforms you mentioned wanted to sue Shapes too?
matthewsh · 6h ago
Valid point tbh. If they said they were training off of data and never explicitly stated which data sources they were using, then Discord should be concerned about that violation. I'd also love to see that announcement from them. If the announcement was made in Discord, that only solidifies Discord's reason for concern.
matthewsh · 8h ago
Correct me if I'm wrong, but didn't they say they were basically going to train their LLMs on message data?
I know that there was a post here: https://medium.com/lightspeed-venture-partners/circle-labs-t...

Which states:
"What’s more, community members have already interacted with Shapes enough to trigger millions of messages over the short, several month duration that the product has been in beta. We believe this head start in an emergent market will further enrich the conversation datasets which power Circle’s NPCs, and serve as a competitive moat over time."
This is an old post, though, so its philosophy could've changed, but even back then a statement like that is concerning. It's worth calling out that the Discord developer policy did not explicitly state this until the 2024 policy, which has been in effect since July 8th, 2024. So they had plenty of time to stop training their "shapes" on user data before this happened, and it seems they'd been in contact with Discord before, so they could've just asked for clarification or for permission.
Complete side-note: it bothers me how they're using all these examples of people who rely on these "shapes" for emotional support, basically as therapists, as a way to "strengthen" their argument, when IMO it weakens it. If so many people are reliant on robots and code for emotional support, they need to seek help, or seek real, human connection. It's not healthy to talk to these "shapes" all day. What's even more concerning is that the trauma you're dumping is then being used to "enrich the conversation datasets."
ETA: I also think Discord is probably taking action now so they can release their own version later without any competition, but this still could've been mitigated. Even if Discord got them on the tokens aspect, Shapes could've had a really strong argument, considering that's what they were advised to do.
Liftyee · 9h ago
Anecdotally, Discord seems to have a history of questionable, inconsistent behaviour like this. Watch them roll out an equivalent/competing feature next week, along with making the UI worse as usual.