Fri Feb 14, 2025 4:38pm PST

Ask HN: How to get involved in developing and researching LLMs?

Hey HN, how does an independent dev get involved in developing and researching LLMs?

Bit of background: I have a CS background (undergrad), have caught up on the academic side of attention, transformers, etc., have worked in the software industry for ~10 years, and have even built some small deep learning networks in PyTorch for various POCs (including a small transformer). I really want to get into researching, fine-tuning, and testing new architectures for LLMs because I find it more interesting than building a wrapper SaaS product (my interest leans more toward research than product).

But I'm not sure how to really get into foundation-model LLMs in a meaningful way as an individual. I'm not part of a university research group, haven't gotten anywhere applying to the big AI companies (I'm just some dev dude without a PhD or a name-brand school), and I don't have the compute/GPUs at scale to run my own experiments and research. I have a single GPU with 12 GB of VRAM, but I doubt that gets me anywhere interesting.

So what exactly could I do? Open to any creative and practical ideas.
