: The messages were cleaned by removing group chats and unknown contacts, then grouped into "chunks" of 200 tokens to serve as training prompts for the AI.
: Microsoft Excel has a known limit for .prn (space-delimited) files where formatted text is limited to 240 characters per line . 240k private.txt
A simulation of me: fine-tuning an LLM on 240k text messages : The messages were cleaned by removing group