

Couldn’t agree more, comrade.
It’s exactly the previous duplicated efforts to gather all the Marxist literature that inspired this post. GitHub and Hugging Face are littered with various attempts to build this text base, not to mention individuals here doing the same thing by scraping MIA/ProleWiki.
When we do not collaborate, it’s actually harmful imho to:
- the planet: unless you’re running all of this locally, you’re contributing to the pollution and water consumption crisis. And even if you are, you need to be on a clean grid to actually mitigate the consequences. By pooling resources, we can pursue training and inference to bring this tooling to the community with a known carbon and water footprint of nearly zero.
- the proletariat: the Q&A use case you described above is a great example of something that seems simple but is quite hard and expensive to do in practice (we need to build up a dataset of exactly this style of question at a high level of academic rigor, and in a way where we don’t “leak liberalism” into the dataset or model). Nothing we make will be useful unless it’s trained on that data, so it makes zero sense for us to compete in creating that dataset rather than collaborate.
- the lay public and future generations: forgive the grandiose claim, but when communists have no material alternative to the techno-fascist colonization of the future, we abandon those on the sidelines to “choose” this dystopian future because they have no other options.
To directly answer your question, I’m personally finishing a master’s degree this year, so I’m looking to root my knowledge in the community rather than the market. I’ve got local models up and running, and my next step is to build up that exact text archive you mentioned above










Perfect, and I’m looking forward to building with you, comrade. I’ll send a when2meet tomorrow to interested parties so we can determine a good initial meeting time.
The list so far is,
If anyone else is interested or I missed anyone, add their name in this thread so they’re included.