Introducing RoLLM – The First LLM made from Scratch on Roblox

Hello,

I’ve been exploring the idea of running a Transformer-based Language Model entirely within Roblox, and I’d like to share my project: RoLLM (Roblox Language Model). It’s an experimental codebase that uses OOP Lua modules to build a simplified GPT-like Transformer—complete with tokenizers, embedding layers, attention blocks, and more.

Why RoLLM?

I wanted to see if it was feasible to load a large (or even a smaller) LLM into Roblox Studio and have it generate text on the fly. The theoretical part works: you can define all the layers in Lua, manage tokenization, and even incorporate something like a GPT-2–style subword vocabulary. However…

The Problem

Unfortunately, performance and memory constraints in Roblox make it really tough to run a large-scale LLM in-engine. You need huge matrices, lots of floating-point operations, and robust memory management—none of which is trivial in Roblox’s sandbox. Even after optimizing, RoLLM still struggles with anything beyond tiny toy experiments (like a 2-layer, 32-dimensional Transformer).

In other words: RoLLM currently doesn’t produce coherent text the way GPT-2 or GPT-3.5 would, because we can’t realistically load or train big models in Lua. Right now, the code mostly yields random or semi-random outputs if you try anything large.

Sample Code

local HttpService = game:GetService("HttpService")

local LLM = require(game.ReplicatedStorage.Packages.RoLLM)
local config = {
	dModel = 32,
	numHeads = 4,
	dFF = 64,
	numLayers = 2,
	maxSeqLen = 128,
	tokenizerMode = "external",
	externalVocabURL = "https://huggingface.co/openai-community/gpt2/raw/main/vocab.json",
}

local myLLM = LLM.new({}, config) -- first argument is textData, in this case we don't use textData but externalData

print("LLM created with GPT-2 external vocab. Vocab size:", myLLM._tokenizer:getVocabSize())

local nextTok = myLLM:predict("Hello")
print("Argmax next token (external approach):", nextTok)

local genText = myLLM:generate("Hello", 5)
print("Argmax generated text:", genText)

local nextTokTemp = myLLM:predictTemperature("Hello", 1.0)
print("Temp=1 next token:", nextTokTemp)

local genTextTemp = myLLM:generateTemperature("Hello", 10, 1.0)
print("Temp=1 generated text:", genTextTemp)

Why Release It Open Source?

I’m open-sourcing RoLLM in the hope that someone in the community might find clever workarounds—or at least learn from the code. Maybe you’ll figure out how to:

  • Import smaller pretrained weights more efficiently,
  • Use external GPU-accelerated services for big matrix ops,
  • Leverage a specialized approach to partial model loading, or
  • Train a super tiny model offline and run it in Roblox.

What’s Included?

  • Tokenizer Modules: Both character-based and a placeholder for subword-based (BPE-like) approaches.
  • Transformer Blocks: Multi-head attention, feed-forward, layer norms, etc. in OOP Lua.
  • Examples: Scripts showing how to chunk large text data, do basic inference, and unify everything under one init.lua.

What Next?

  • Check Out the Code: I’ve posted the entire project’s source on Github Repo.
  • Contribute: If you manage to optimize it or find a novel approach to GPU offloading, I’d love to see a pull request or a fork.
  • Use an External API: If you just want high-quality text, an HTTP integration with OpenAI, Hugging Face, or another service is often simpler. But if you’re curious about the internals of a Transformer, RoLLM can be an educational toy.

Final Thoughts

I know it’s not working perfectly in-engine on large models, but I believe there’s value in seeing how Transformers can be structured in Luau. If you’re the adventurous type who loves experimentation, feel free to pick up RoLLM and tinker—maybe you’ll be the one to conquer the Roblox LLM frontier!

Start contributing

Clone & Init ROJO with default.project.json :

git clone https://github.com/rustyspottedcatt/RoLLM

Preview

image

16 Likes

Some ideas I’ve thought of is using synonyms to compress the vocabulary and training data, that should reduce the training complexity substantially. I posted the synonyms dataset here.
Lua Trainable RAG Local Chatbot Library, Local Model Aligns Large Model, Geometry+Worded Math Solver,Math/Geometry Data Generator, Chat Database - Resources / Community Resources - Developer Forum | Roblox

That’s a good idea altough it doesn’t fix the problem that LUAU often crashes or times out when running the LLM (it reaches 114k threads), it makes sense since LUAU was never built for that but I still believe it is possible, I also just finished adding an option to train the LLM but I am not releasing it yet because it still has some math issues.

Decided to archive the repo due to Roblox’s limitation, you are more than welcome to fork it and try to make it train on data.

Honestly, I’ve been looking for something like this, perhaps it will be finished soon.
Although I can’t imagine a mobile player running this :sweat_smile:.

2 Likes

Transformer-based Language Model … what does this mean ,Is this came from a transformer AI?

Yes, Transformer-based Language Model means that roLLM uses the Transformer architecture, a type of ai developed for understanding and generating human like text. This architecture relies on self-attention mechanisms to process and relate words in a sentence.

This can be used to prevent cheating :warning::warning::warning::warning:

Yes, but that’s pretty much impossible, of the hundred hours I spent making this LLM and researching AI papers, I pretty much realized that no matter what we try to do, it is impossible to make a proper LLM or neural network because ROBLOX’s compute is not enough and will never be enough.