Python OpenAI API create Pinecone embeddings from PDF documents and RAG examples

Comments (1)

lpm0073 · 19h ago

A Hybrid Search and RAG prompting solution using Python OpenAI API Embeddings persisted to a Pinecone vector database index and managed by LangChain. Demonstrates the following:

- System Prompting. How do use the system prompt to modify LLM text completion behavior.

- Templates. How to create templates in order keep your prompts DRY.

- LangChain. How to setup a project using LangChain as an alternative to vendor specific LLM PyPi packages.

- PDF Loader. a command-line pdf loader program that extracts text, vectorizes, and loads into a Pinecone dot product vector database that is dimensioned to match OpenAI embeddings. Pinecone. How to create, load, and query a Pinecone vector database.

- Retrieval Augmented Generation (RAG). A chatGPT prompt based on a hybrid search retriever that locates relevant documents from the vector database and includes these in OpenAI prompts.

LLM Inevitabilism (tomrenner.com)

Do not download the app, use the website (idiallo.com)

Kiro: A new agentic IDE (kiro.dev)

CARA – High precision robot dog using rope (aaedmusa.com)

Show HN: Tinder but it's only pictures of my wife and I can only swipe right (trytender.app)

Linux Reaches 5% Desktop Market Share in USA (ostechnix.com)

EU age verification app to ban any Android system not licensed by Google (reddit.com)

Dumb Pipe (dumbpipe.dev)

Valve confirms credit card companies pressured it to delist certain adult games (pcgamer.com)

Performance and telemetry analysis of Trae IDE, ByteDance's VSCode fork (github.com)

Enough AI copilots, we need AI HUDs (geoffreylitt.com)

Copyparty – Turn almost any device into a file server (github.com)

‘I witnessed war crimes’ in Gaza – former worker at GHF aid site [video] (bbc.com)

Hyatt Hotels are using algorithmic Rest “smoking detectors” (twitter.com)

How to Firefox (kau.sh)

Global hack on Microsoft Sharepoint hits U.S., state agencies, researchers say (washingtonpost.com)

Show HN: Use Their ID – Use your local UK MP’s ID for the Online Safety Act (use-their-id.com)

Reflections on OpenAI (calv.info)

Qwen3-Coder: Agentic coding in the world (qwenlm.github.io)

AI overviews cause massive drop in search clicks (arstechnica.com)

It's time for modern CSS to kill the SPA (jonoalderson.com)

Graphene OS: a security-enhanced Android build (lwn.net)

Ukrainian hackers destroyed the IT infrastructure of Russian drone manufacturer (prm.ua)

ChatGPT agent: bridging research and action (openai.com)

Windsurf employee #2: I was given a payout of only 1% what my shares where worth (twitter.com)

Mistral Releases Deep Research, Voice, Projects in Le Chat (mistral.ai)

VPN use surges in UK as new online safety rules kick in (ft.com)

Cops say criminals use a Google Pixel with GrapheneOS – I say that's freedom (androidauthority.com)

Tom Lehrer has died (nytimes.com)

The United States withdraws from UNESCO (state.gov)

XMLUI (blog.jonudell.net)

TrackWeight: Turn your MacBook's trackpad into a digital weighing scale (github.com)

Steam, Itch.io are pulling ‘porn’ games. Critics say it's a slippery slope (wired.com)

My Self-Hosting Setup (codecaptured.com)

Show HN: Shoggoth Mini – A soft tentacle robot powered by GPT-4o and RL (matthieulc.com)

Ozzy Osbourne has died (bbc.co.uk)

Coding with LLMs in the summer of 2025 – an update (antirez.com)

Rust running on every GPU (rust-gpu.github.io)

Oakland cops gave ICE license plate data; SFPD also illegally shared with feds (sfstandard.com)

Cloudflare 1.1.1.1 Incident on July 14, 2025 (blog.cloudflare.com)

Death by AI (davebarry.substack.com)

Claude Code weekly rate limits

You can now disable all AI features in Zed (zed.dev)

New colors without shooting lasers into your eyes (dynomight.net)

Complete silence is always hallucinated as "ترجمة نانسي قنقر" in Arabic (github.com)

Visa and Mastercard are getting overwhelmed by gamer fury over censorship (polygon.com)

Women dating safety app 'Tea' breached, users' IDs posted to 4chan (404media.co)

UK backing down on Apple encryption backdoor after pressure from US (arstechnica.com)

Ask HN: Is it time to fork HN into AI/LLM and "Everything else/other?"

Apple's MLX adding CUDA support (github.com)

Python OpenAI API create Pinecone embeddings from PDF documents and RAG examples

Comments (1)