Can I make a personalized advertisment ai that recommends new products in my shop with discounts based on previous user purchases, chat history, and product viewing behavior
Maximizing Purchase? Probably stick with reinforcement learning. Those models will try to maximize getting rewards (in this case the amount of purchase). Though it might take a while to train.
As for discounts, be careful. You might not want the user exploit the model just by selecting discounted items, but never undiscounted items.
I might use ai for personalizing objects that maximize purchases and a simple algorithm I script myself for selecting discounts. The discount will be based on their previous purchases.
Never spent: 80-90% off
Rarely spends: 10-30%
Occasionally spends 5-20%
Consistent spending 15-40%
Frequency of previous purchases is accounted based on a point system, with older purchases degrading. Once a user makes their first purchase, they can never return to the ânever spent category.â
The next update will allow some of the algorithms to support continuous action spaces!
Iâll be updating the Beta version multiple times before releasing a stable release version for all to use!
So get ready!
Why I do this? Well, my plan is to make DataPredict as the industrial and research standard for RL in Roblox. So Iâll be completing this update before I leave for my Masters.
Added DiagonalGaussianPolicy and placed it under QuickSetups section.
Added a new parameter for reinforce() function to AsynchronousAdvantageActorCritic model.
Added diagonalGaussianUpdate() function to AsynchronousAdvantageActorCritic model.
Changes
Renamed ReinforcementLearningQuickSetup to CategoricalPolicy and placed it under QuickSetups section. Also made some internal code changes.
ReinforcementLearningBaseModelâs and ReinforcementLearningActorCriticBaseModelâs setUpdateFunction() and update() functions have been replaced with setCategoricalUpdateFunction(), setDiagonalGaussianUpdateFunction(), categoricalUpdate() and diagonalGaussianUpdate().
Made internal code changes to all reinforcement learning algorithms in the library.
Made a few API breaking changes related to the AsynchronousAdvantageActorCritic model:
Renamed update() function to categoricalUpdate().
Renamed reset() function to resetAll().
Renamed singleReset() function to reset().
Side Notes:
Please update the MatrixL library so that you donât run into issues when using this DataPredict library version. Some changes have been made at MatrixL library and these changes gets transferred over to the DataPredict library.
Can you profit free tools that implement ur module? Stuff like anti cheat, preference selector (from a table of items, what they like), etc? Ive never taken statistics class and idk ai stuff so i cant use ur module. That or u could give general tutorials on how ai works or how statistics works
Will that free tool be distributed publicly? Or is it for internal use?
You already have ChaWatcher, which is partially open source when it comes to anti-cheat. You can only use that algorithm inside ChaWatcher for cheat detection only. Why not use that?
No. Youâre basically asking me to teach you for free. And the whole AI thing canât be covered in few posts, but in a year. Even six month worth of university lessons doesnât cover the whole library.
You can also utilize the inferior version of GPU cores, the CPU cores. By using actors to parallelize intense computations by spreading the load into many actors, you can do computations in parallel and fetch them for use. This proved effective when I tried to make my ant simulation faster