“This telegram must be closely paraphrased before being communicated to anyone” (history.stackexchange.com)

On a recent project I spent about an hour trying to do something similar (and far less sophisticated) before I realized it was a problem I had no desire in really solving, so I backed out all my changes and just went with string.capitalize(), even though it didn’t really do what I was looking for. Looking forward to using this instead!

zobweyt · 169d ago

Thank you for the kind words! I'm glad you appreciate the effort put into covering all those edge cases.

It sounds like you had quite the adventure with text casing on your project. I'm happy this library can save you some time and hassle. Looking forward to see what can be built with it!

Rendello · 169d ago

Many edge-cases can be found with regards to casing! Like title-case characters.

https://www.unicode.org/versions/Unicode16.0.0/core-spec/cha...

lnenad · 168d ago

I might be jaded, but I think having libraries for such simple use cases leads to the inevitable `left-pad` situation.

When I say simple use cases I mean that since you probably don't need all of these functions at once that it would be easier to copy the code you need if you don't feel comfortable writing it instead of adding yetanotherlibrary to your dependency tree.

zobweyt · 168d ago

I understand your perspective, and it's a valid concern. However, this library is designed to support not only simple use cases but also more advanced scenarios, providing a comprehensive solution for various needs. Additionally, it has zero dependencies, which helps keep your project lightweight. This way, you can benefit from the library's features without adding unnecessary complexity to your dependency tree. Thank you for sharing your thoughts!

lnenad · 168d ago

Nah it's not you or your library, there is definitely a place for such utilities. The issue is broader, related to everyone installing libs for two liners and having bajillion dependencies.

axegon_ · 168d ago

Don't forget space... npm install and 500GBs go "bye-bye"

lnenad · 168d ago

Hey I'll have you know that memory is cheap nowadays and that I'd be happy to fill out my drives with node libraries for converting a's into A's.

7bit · 168d ago

You can always just take the code and put it in your app. Having libraries like these don't force you to add them as a dependency. Assuming the right OSS license.

lnenad · 168d ago

I agree but in reality many will take the easier path of [`pip` `npm` `cargo` `yarn` `go`] [`install`, `add`] when seeing the functionality out there. I was also making a broader talking point.

fake-name · 169d ago

> A feature complete Python text case conversion library

Considering it supports unicode input, I somehow doubt that. Given that there's no mention of unicode normalization it'll likely break some strings.

zobweyt · 169d ago

That's a great observation! Instead of seeing it as a limitation, it can be treated as a feature. Users can handle Unicode normalization using Python's built-in unicodedata module to ensure proper case conversion. Thanks for pointing that out!

re · 168d ago

> A feature complete Python text case conversion library

I suspect you mean "featureful", "full-featured" or similar[1]—"feature complete" means that you're not going to add any more features.

[1] https://english.stackexchange.com/questions/393517/what-do-y...

zobweyt · 168d ago

Thank you for the clarification! I appreciate your input. I've updated the wording to "feature-rich" to better convey the intended meaning. Your feedback is valuable!

frizlab · 169d ago

Great library!

Does it support non-English title casing?

For instance in French, title casing for “les maisons bleues” is “Les Maisons bleues” while for “des maisons bleues” it’s “Des maisons bleues”.

zobweyt · 169d ago

Thanks!

It does not support non-English title casing. From the documentation:

> It also works non-ascii characters. However, no inferences on the language itself is made. For instance, the digraph ij in Dutch will not be capitalized, because it is represented as two distinct Unicode characters. However, æ would be capitalized

frizlab · 168d ago

I was talking about the specific rules that are in place for title capitalization. As you can see in my example the uppercase letters seem randomly placed for a title, but they are indeed correct. For German too there are issues where capitalization has a meaning on the word itself. That kind of things.

It looks like your library does not support it, which is understandable, it is a huge problem to tackle, but I just wanted to be sure.

zobweyt · 168d ago

Thank you for the clarification! I understand that title capitalization can be quite complex, especially with specific rules in languages like German where capitalization can change the meaning of a word.

I guess handling these nuances falls under the broader categories of internationalization (i18n) and localization (l10n).

frizlab · 167d ago

Just to be excessively clear and maybe borderline annoying, this is not a simple nuance. In German the meaning of a word can actually change depending on its capitalization. Even in English, lowercasing the I is very weird.

re · 168d ago

> It does not support non-English title casing

Perhaps document that clearly—it's an important restriction that the library assumes English-language strings. ("no inferences on the language itself is made" isn't quite true since the language is inferred to be English, or to at least follow English-compatible rules for casing)

zobweyt · 168d ago

Thanks for your feedback! You're right; I should clarify that the library assumes English-language strings for casing. I'll update the documentation to make this limitation clear. I appreciate you pointing it out!

zvr · 168d ago

Nice work, but since it does not handle anything else than strings, maybe it should be named "stringcase" or something.

zobweyt · 168d ago

Thank you for the feedback!

I appreciate your suggestion regarding the name, but unfortunately this name was already taken, so "textcase" was chosen.

I also have ideas for adding dictionary key conversion and other features in the future that will handle more than just strings. In addition, you can use this library to convert cases of Iterable[str] using textcase.pattern

zvr · 168d ago

My issue with using "text" is that I assume that a text like "I THINK I DO" should be converted to "I think I do", not "i think i do".

And that's just in English...

If "text" is in Greek, like "Καλημέρα", the upper form should be "ΚΑΛΗΜΕΡΑ", not a juxtaposition of upper() conversions of each letter.

zobweyt · 168d ago

Thanks for the clarification!

Yeah, there is such a problem with the naming "text" suggests something different than just a "string".

I guess handling these nuances falls under the broader categories of internationalization (i18n) and localization (l10n).

anentropic · 168d ago

Looks brilliant!

My only suggestion is here:

> It also ignores any leading, trailing, or duplicate delimiters:

    from textcase import case, convert

    print(convert("IOStream", case.SNAKE))             # io_stream
    print(convert("myJSONParser", case.SNAKE))         # my_json_parser
    print(convert("__weird--var _name-", case.SNAKE))  # weird_var_name

In the case of a conversion target that has delimiters (snake, kebab) it might be nice to have an alternative option to preserve such features but normalise them to the target delimiter

i.e.

    print(convert("__weird--var _name-", case.SNAKE, preserve=True))  # __weird__var__name

zobweyt · 168d ago

Thank you for your suggestion! Adding a preserve option to maintain leading, trailing, and duplicate delimiters while normalizing them to the target delimiter is a great idea. I’ll consider implementing this feature. Thanks again!

kianN · 169d ago

My favorite part of this library is that it seems to have zero dependencies!

Python packages seem to often rope in a surprising number of dependencies for relatively limited libraries.

I can easily imagine pulling this package into my work: thank you for keeping the requirements to a minimum!

danpalmer · 168d ago

Definitely something to be championed, although I suspect this is a matter of perspective. I find Python packages to have refreshingly few dependencies compared to packages in the JS ecosystem, although compared to the Swift ecosystem which I’m somewhat familiar with, they do tend to have a few more.

zobweyt · 168d ago

I appreciate your perspective! It's interesting to consider how the built-in libraries of a language can influence its ecosystem. Python does have a rich standard library that often reduces the need for external dependencies. In contrast, JavaScript's ecosystem has evolved around web development, where modularity and flexibility are prioritized, leading to a proliferation of packages.

zobweyt · 169d ago

Thanks for the kind words!

This library actually has zero dependencies! I'm glad you appreciate the no-dependency design.

It's great to hear that it fits well with your work!

esafak · 169d ago

Is there a GH badge for the dependency count? Depfu maybe. Someone should make one if not; it's worth advertising.

zobweyt · 169d ago

Thanks for the suggestion!

Right now, there's no such GH badge. Since the project will always have zero dependencies, I think we can simply use a static badge like this:

https://img.shields.io/badge/dependencies-0-green

cadamsdotcom · 169d ago

HAppY ApRiL FoOLs!

If only this comment supported case conversion..

In any case congrats on shipping!

zobweyt · 169d ago

Happy April!

Actually, this library supports conversion of even such strings!

```python

>>> import textcase

>>> textcase.convert("HAppY ApRiL FoOLs!", textcase.case.SNAKE, (textcase.boundary.SPACE,))

'happy_april_fools!'

```

Thanks for the congratulations!

wodenokoto · 168d ago

This should be implemented in editors.

It also looks to be nice in exploratory data analysis:

    df = pd.read_csv(f)
    df.columns = map(convert, df.columns, case.snake)

zobweyt · 168d ago

Thanks for the feedback! I'm glad you found a use case for this library!

marban · 169d ago

Bonus: https://github.com/un33k/python-slugify

kseistrup · 168d ago

Now available from AUR: https://aur.archlinux.org/packages/python-textcase-git

zobweyt · 168d ago

Thank you for the update! I’ve added a badge to the GitHub repository to reflect its availability on AUR.

kseistrup · 168d ago

Great, thanks!

We should have the ability to run any code we want on hardware we own (hugotunius.se)

Cognitive load is what matters (github.com)

Hosting a website on a disposable vape (bogdanthegeek.github.io)

NPM debug and chalk packages compromised (aikido.dev)

Top UN legal investigators conclude Israel is guilty of genocide in Gaza (middleeasteye.net)

Shai-Hulud malware attack: Tinycolor and over 40 NPM packages compromised (socket.dev)

I didn't bring my son to a museum to look at screens (sethpurcell.com)

Show HN: A store that generates products from anything you type in search (anycrap.shop)

I ditched Docker for Podman (codesmash.dev)

Germany is not supporting ChatControl – blocking minority secured (digitalcourage.social)

30 minutes with a stranger (pudding.cool)

Show HN: Term.everything – Run any GUI app in the terminal (github.com)

Charlie Kirk killed at event in Utah (nbcnews.com)

Apple Photos app corrupts images (tenderlovemaking.com)

996 (lucumr.pocoo.org)

EU court rules nuclear energy is clean energy (weplanet.org)

Next.js is infuriating (blog.meca.sh)

Show HN: I recreated Windows XP as my portfolio (mitchivin.com)

The MacBook has a sensor that knows the exact angle of the screen hinge (twitter.com)

Anthropic agrees to pay $1.5B to settle lawsuit with book authors (nytimes.com)

Signal Secure Backups (signal.org)

Man jailed for parole violations after refusing to decrypt his Tor node (reddit.com)

Denmark close to wiping out cancer-causing HPV strains after vaccine roll-out (gavi.org)

Things you can do with a Software Defined Radio (2024) (blinry.org)

Using Claude Code to modernize a 25-year-old kernel driver (dmitrybrant.com)

iPhone Air (apple.com)

Pontevedra, Spain declares its entire urban area a "reduced traffic zone" (greeneuropeanjournal.eu)

I replaced Animal Crossing's dialogue with a live LLM by hacking GameCube memory (joshfonseca.com)

Google can keep its Chrome browser but will be barred from exclusive contracts (cnbc.com)

UTF-8 is a brilliant design (iamvishnu.com)

We all dodged a bullet (xeiaso.net)

Stripe Launches L1 Blockchain: Tempo (tempo.xyz)

Mistral raises 1.7B€, partners with ASML (mistral.ai)

WASM 3.0 Completed (webassembly.org)

New Mexico is first state in US to offer universal child care (governor.state.nm.us)

Chat Control Must Be Stopped (privacyguides.org)

The treasury is expanding the Patriot Act to attack Bitcoin self custody (tftc.io)

“This telegram must be closely paraphrased before being communicated to anyone” (history.stackexchange.com)

Almost anything you give sustained attention to will begin to loop on itself (henrikkarlsson.xyz)

Where's the shovelware? Why AI coding claims don't add up (mikelovesrobots.substack.com)

Models of European metro stations (stations.albertguillaumes.cat)

Linux phones are more important now than ever (feddit.org)

Waymo has received our pilot permit allowing for commercial operations at SFO (waymo.com)

Google AI Overview made up an elaborate story about me (bsky.app)

iPhone dumbphone (stopa.io)

Why our website looks like an operating system (posthog.com)

Claude Code: Now in Beta in Zed (zed.dev)

React is winning by default and slowing innovation (lorenstew.art)

Corporations are trying to hide job openings from US citizens (thehill.com)

Eternal Struggle (yoavg.github.io)

Show HN: Textcase: A Python Library for Text Case Conversion

Comments (41)