Added Token and LLM Cost Estimation to Microsoft's GraphRAG Indexing Pipeline

KhaledAlam · 5/6/2025, 9:11:35 PM · blog.khaledalam.net

Comments (1)

KhaledAlam · 6h ago
Microsoft’s open-source GraphRAG project lacked a way to estimate token usage and LLM cost prior to running the indexing pipeline. I recently contributed a feature that adds a CLI flag (--estimate-cost) which previews token counts and cost estimates for both embedding and summarization steps.

It simulates chunking with the same logic as GraphRAG’s actual pipeline, pulls live model pricing from a hosted JSON file, and projects output token counts. After seeing the estimates, the user is prompted to confirm whether to proceed with full indexing.
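A hedged sketch of that surrounding flow (the URL, chunk parameters, and function names here are illustrative assumptions, not GraphRAG's API): simulate a sliding-window token chunker, fetch prices from a JSON endpoint, then ask before kicking off the real run.

```python
# Illustrative flow: chunk simulation -> live pricing -> user confirmation.
import requests
import tiktoken

def simulate_chunks(text: str, chunk_size: int = 1200, overlap: int = 100) -> list[int]:
    """Return per-chunk token counts for an overlapping sliding-window chunker."""
    enc = tiktoken.get_encoding("cl100k_base")
    tokens = enc.encode(text)
    step = chunk_size - overlap
    return [len(tokens[i:i + chunk_size]) for i in range(0, len(tokens), step)]

def fetch_pricing(url: str) -> dict:
    """Pull current per-model prices from a hosted JSON file (URL is assumed)."""
    return requests.get(url, timeout=10).json()

def confirm_and_run(estimate: dict) -> bool:
    """Show the estimate and ask whether to proceed with full indexing."""
    print(f"Estimated cost: ${estimate['total_cost']:.4f} "
          f"({estimate['input_tokens']} input tokens)")
    return input("Proceed with indexing? [y/N] ").strip().lower() == "y"
```

Simulating the chunker matters because token totals change with chunk overlap: each overlapping window re-counts the shared tokens, so estimating from raw document length alone would undercount what the pipeline actually sends.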

This is particularly useful when working with large corpora or limited OpenAI quotas.

Blog post (with technical deep dive and lessons learned): https://blog.khaledalam.net/how-i-added-token-llm-cost-estim...

GitHub PR: https://github.com/microsoft/graphrag/pull/1917