I think this is very interesting, especially the per-layer embeddings.
Having more than one embedding is something I've tried myself, but not separate ones for each layer.
I'm guessing it's something like h_{l+1} = MultiHeadSelfAttentionWithPositionEncodingBakedIn(MLP(h_l) + embed_l(token_ids)). So it's probably really easy to implement on toy problems to see if it works.
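A minimal sketch of that guess in PyTorch (naming is all mine, the position encoding is left out, and a real block would also want the usual residuals and norms):

```python
import torch
import torch.nn as nn

class PerLayerEmbeddingBlock(nn.Module):
    """One transformer block that owns its own token-embedding table embed_l."""
    def __init__(self, d_model=64, n_heads=4, vocab_size=256):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, d_model)        # embed_l(token_ids)
        self.mlp = nn.Sequential(nn.Linear(d_model, 4 * d_model), nn.GELU(),
                                 nn.Linear(4 * d_model, d_model))
        self.attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)

    def forward(self, h, token_ids):
        # Literal reading of the guess: h_{l+1} = Attn(MLP(h_l) + embed_l(token_ids))
        x = self.mlp(h) + self.embed(token_ids)
        out, _ = self.attn(x, x, x, need_weights=False)
        return out

# toy shape check (no causal mask here): stack a few blocks, each with its own table
blocks = nn.ModuleList(PerLayerEmbeddingBlock() for _ in range(3))
ids = torch.randint(0, 256, (2, 16))
h = torch.zeros(2, 16, 64)
for blk in blocks:
    h = blk(h, ids)
print(h.shape)  # torch.Size([2, 16, 64])
```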
3abiton · 13h ago
Any resources or suggestions to learn about this? The field is moving too fast, my poor brain can't keep up.
impossiblefork · 4h ago
Basically, you'd familiarize yourself with transformers by implementing different variants of them and modifying them according to your own ideas on different toy datasets.
Then you'd figure out a set of toy tasks that you like and think are important.
In this particular case you'd take something like NanoGPT, go to model.py, find class GPT, and in __init__ change the nn.Embedding in the self.transformer ModuleDict to a ModuleList of nn.Embedding; then change the for loop over the blocks (around line 180) to loop over a range, and modify forward by adding x = x + self.transformer.wte[i](idx) inside it, something like that I think.
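From memory, the relevant bits of model.py would end up looking roughly like this (attribute names such as transformer.wte and transformer.h are as I remember them, so check against the actual repo):

```python
# GPT.__init__: one token-embedding table per layer instead of a single wte
self.transformer = nn.ModuleDict(dict(
    wte  = nn.ModuleList([nn.Embedding(config.vocab_size, config.n_embd)
                          for _ in range(config.n_layer)]),
    wpe  = nn.Embedding(config.block_size, config.n_embd),
    drop = nn.Dropout(config.dropout),
    h    = nn.ModuleList([Block(config) for _ in range(config.n_layer)]),
    ln_f = LayerNorm(config.n_embd, bias=config.bias),
))
# NB: NanoGPT ties lm_head.weight to wte.weight; with a ModuleList that tying
# has to be dropped or tied to just one of the tables.

# GPT.forward: add the layer's own token embedding before each block
pos = torch.arange(0, idx.size(1), dtype=torch.long, device=idx.device)
x = self.transformer.drop(self.transformer.wpe(pos))   # start from position embeddings only
for i in range(len(self.transformer.h)):
    x = x + self.transformer.wte[i](idx)               # per-layer embed_l(token_ids)
    x = self.transformer.h[i](x)
x = self.transformer.ln_f(x)
```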
I haven't tried yet though (I've got a terrible cold, so I am on social media instead of doing anything sensible).
"4x gated residual streams" look quite weird. Is there any paper or technique report for this?
3abiton · 12h ago
While PLE is quite innovative, the interesting part is that they released their [APK on GitHub](https://github.com/google-ai-edge/gallery) rather than only linking to the Play Store. Interesting choice.