Show HN: Sumble – knowledge graph for GTM data – query tech stack, key projects (sumble.com)

I’m Anthony, co-founder/CEO of Sumble. I was previously co-founder/CEO of Kaggle. Sumble is my newco with Ben Hamner (former co-founder and CTO of Kaggle).

### What we built

Sumble is a knowledge graph for go-to-market teams. It lets you run rich queries to identify prospects at a granular level and then do very targeted outreach.

Sumble allows you to find:

- tech stacks (in larger companies, down to the team or buying group level)

- key projects those teams are working on (cloud migrations, GenAI initiatives, etc.)

- people involved in those key projects

For example, here's a list of GenAI projects at Capital One that involve RAG/Vector databases: https://sumble.com/l/6sDqKmhyAH

And this view includes a list of people who we think are involved in a particular project being undertaken by the AI Foundation Team at Capital One: https://sumble.com/l/j8mbRrDsly

These views allow you to reach out to that team with a granular understanding of what they are working on.

### Inspiration

Sumble was very much inspired by our experience at Kaggle:

1. Kaggle’s public-data platform showed us how hungry people are for high-quality data (the metrics on that product were really strong)

2. At Google we saw knowledge graphs unlock powerful and composable queries

### Trying it out

- The app is live today; you’ll need to log in (Google OAuth or magic links)

- Most functionality and data are free; we only charge individual users for bulk exports

### How it works (briefly)

- Sources: job posts, resume data, company websites (more to come!)

- Extraction & linking: We use LLMs (mostly fine-tuned models) to extract entities from the source text and link them together (company → team → people on a team → projects the team is undertaking → technology the team uses)
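
As a rough illustration of the hierarchy above, here is a minimal Python sketch of how linked entities could be modeled and queried. This is not Sumble's actual schema or API: every class, field, and function name is hypothetical, the data is made up, and attaching technologies to projects (with a team's stack derived from them) is an assumption made for the example.

```python
from dataclasses import dataclass, field


@dataclass
class Project:
    name: str
    technologies: set[str] = field(default_factory=set)
    people: list[str] = field(default_factory=list)


@dataclass
class Team:
    name: str
    projects: list[Project] = field(default_factory=list)

    @property
    def tech_stack(self) -> set[str]:
        # Assumption: a team's stack is the union of its projects' technologies.
        stack: set[str] = set()
        for project in self.projects:
            stack |= project.technologies
        return stack


@dataclass
class Company:
    name: str
    teams: list[Team] = field(default_factory=list)


def find_projects(companies, company_name, technologies):
    """Return (team name, project) pairs at one company whose projects
    use any of the given technologies (case-insensitive match)."""
    wanted = {t.lower() for t in technologies}
    matches = []
    for company in companies:
        if company.name != company_name:
            continue
        for team in company.teams:
            for project in team.projects:
                used = {t.lower() for t in project.technologies}
                if used & wanted:
                    matches.append((team.name, project))
    return matches


# Entirely made-up data, for illustration only.
graph = [
    Company("ExampleBank", teams=[
        Team("AI Platform", projects=[
            Project("Support chatbot", {"RAG", "Vector database"},
                    ["Person A", "Person B"]),
            Project("Fraud scoring", {"XGBoost"}, ["Person C"]),
        ]),
        Team("Infrastructure", projects=[
            Project("Cloud migration", {"Kubernetes"}, ["Person D"]),
        ]),
    ]),
]

hits = find_projects(graph, "ExampleBank", ["rag"])
for team_name, project in hits:
    print(team_name, project.name, project.people)
```

Because each level links to the next, a query like "projects at a company that involve RAG" is a walk down the hierarchy with a filter at the project level, and the people on each matching project come along with the result.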

### What’s next

- Adding more sources so you can run even more composable queries

- Opening an API so devs can hit the graph directly

- Much later: expand to use cases beyond GTM

### Feedback

- Is the web app intuitive?

- What queries do you want us to prioritize supporting in an API?

- What additional external data sources would you like us to prioritize?

- What workflow improvements/integrations would you find most helpful?

Comments (14)

csomar · 3m ago

This is incredibly useful and I can see myself using it and paying a subscription. That being said:

1. I couldn't find some key people that I know work at an organization. How accurate is the data?

2. I don't know if this is happening because you are getting lots of traffic now, but each query takes 20-30 seconds which is unusable.

> - Is the web app intuitive?

Yes

> - What queries do you want us to prioritize supporting in an API?

Maybe this is specific, but I want to filter by headcount within a job function (i.e., find organizations that have 50-200 software engineers regardless of their total headcount).

> - What additional external data sources would you like us to prioritize? - What workflow improvements/integrations would you find most helpful?

I don't really care as long as the data is as accurate as possible. Lead generation/research is a slow enough process that I don't think workflows matter much.

johnsillings · 17m ago

Sumble is one of my go-to data tools for GTM – great data quality and lots of interesting data points that are kind of a pain to find elsewhere.

I do find myself wanting to transform the data (especially the stuff in job descriptions) using an LLM, e.g. for scoring companies/contacts or looking for more subtle signals. Sometimes I do this manually but exporting a bunch of JDs from Sumble isn't possible AFAIK. Or doing it in Sumble would be great, too.

Awesome to see it on HN. Congrats on the launch!

catpower · 2m ago

How far off is an API? Looks slick but I’d want to be able to query programmatically

jeffchuber · 11m ago

There is so much signal in job posts - excited to see this launch.

richardmeng · 11m ago

Sumble has been a critical tool for me to research organizational structure and responsibilities in large companies, as well as technology adoption, such as which organizations have adopted LLMs.

Congrats on the launch!

ryanrasti · 10m ago

Wow -- tried it out and looks quite impressive. The granularity of data for these companies is amazing!

My last startup was selling to SMBs. It looks like Sumble is most likely targeted at mid-market and enterprise companies. Any plans to expand coverage into the long tail of smaller companies?

benhamner · 4m ago

Thanks! Our current coverage is focused on companies with a significant online presence (e.g. they've made job posts, people say they work at the company, and/or they have a functional website).

Our goal is to have complete coverage of active companies and organizations in the world, and an understanding of companies that previously existed but are no longer active as well (these appear extensively in CRMs and add noise).

We prioritize expanding data coverage in areas that we hear are most useful from our current users and customers.

ryanrasti · 32s ago

Awesome, go crush it!

pbmango · 10m ago

As the founder of another product in this space - this is super impressive and well built. Great demo video and congrats on top of HN! Getting this smooth UX and data behind the scenes is not easy.

Nivge · 25m ago

Congratulations! Looks awesome. 1. I found it very intuitive. 2. If I could have smart filtering using LLM classification, that would be very powerful. Any plans on doing that?

antgoldbloom · 23m ago

As in a search box where you can ask free-form queries rather than applying filters? We haven't heard much demand for that yet, so we haven't prioritized it. We will if it's a common request.

esafak · 9m ago

Nicely done. Do you have a roadmap, public ticketing system or communication channel?

benhamner · 7m ago

Thanks! Haven't prioritized something public facing on this front yet - what would you find most helpful?

esafak · 5m ago

I'd set up a ticketing system so you can receive bug reports and help set the roadmap. It's more structured than chat rooms, which are information black holes.