Show HN: CSV GB+ by Data.olllo – Open and Process CSVs Locally

1. paddy_m ◴[14 May 25 16:22 UTC] No.43986366[source]▶

Do you have a demo video?

What are you using for processing (polars)?

Marketing note: I'm sure you're proud of P Core/V Core, but that doesn't matter to your users, it's an implementation detail. At a maximum I'd write "intelligent execution that scales from small files to large files".

As an implementation note, I would make it simple to operate on just the first 1000 (10k or 100k) rows so responses are super quick, then once the users are happy about the transform, make it a single click to operate on the entire file with a time estimate.

Another feature I'd like in this vein is execute on a small subset, then if you find an error with a larger subset, try to reduce the larger subset to a small quick to reproduce version. Especially for deduping.

replies(3): >>43988003 #>>43989242 #>>43993458 #

2. ◴[14 May 25 18:54 UTC] No.43988003[source]▶

>>43986366 (TP) #

3. marcellus23 ◴[14 May 25 21:08 UTC] No.43989242[source]▶

>>43986366 (TP) #

> Marketing note: I'm sure you're proud of P Core/V Core, but that doesn't matter to your users, it's an implementation detail. At a maximum I'd write "intelligent execution that scales from small files to large files".

Speaking personally, "intelligent execution that scales from small files to large files" sounds like marketing buzz that could mean absolutely nothing. I like that it mentions specifically switching between RAM and disk-powered engines, because that suggests it's not just marketing speak, but was actually engineered. Maybe P vs V Core is not the best way to market it, but I think it's worth mentioning that design.

replies(2): >>43993459 #>>43998375 #

4. olllo ◴[15 May 25 10:03 UTC] No.43993458[source]▶

>>43986366 (TP) #

Thanks for the thoughtful feedback!

Yes, Data.olllo uses including Polars under the hood for fast and efficient processing. A demo video is in the works and should be up soon.

Good point about the "P Core/V Core" naming—I'll simplify that to focus more on the user benefit, like scaling from small to large files smoothly.

I also like your idea of running transformations on a sample first with a one-click full run—very aligned with the vision. And subset reproduction for errors is a great suggestion, especially for things like deduping. Appreciate it!

replies(1): >>43997705 #

5. olllo ◴[15 May 25 10:04 UTC] No.43993459[source]▶

>>43989242 #

Thanks for the thoughtful take—really appreciate both perspectives.

You're right that terms like "intelligent execution" can feel vague without concrete backing. My goal with mentioning P Core/V Core was to hint at the underlying design—switching between in-memory and disk-based engines like Polars and Vaex—without overwhelming with technical detail.

I’ll look for a better way to explain the idea clearly and briefly. Thanks again!

6. paddy_m ◴[15 May 25 18:12 UTC] No.43997705[source]▶

>>43993458 #

Feel free to get in touch. We are building similar tools

7. gopher_space ◴[15 May 25 19:21 UTC] No.43998375[source]▶

>>43989242 #

I wish every product had an engineer-only landing page I could set as a default in my browser. The number of companies that assume I'm familiar with their offering is astounding, and I'm usually looking for implementation docs just to figure out what it actually does.

I'm not saying we need a morlock/eloi toggle.