#BigCode

Esther Payne :bisexual_flag: onepict@chaos.social
2024-04-25

So that's me received the confirmation that my stuff is removed from Bigstack.

Which is good. It shows the opt-out requests are being processed.

Go to check again for Librecast's old GitHub account. Looks like I missed some.

*Opens new ticket*

huggingface.co/spaces/bigcode/

While I do think it is important for Software Heritage to archive code, I wish it was done opt-in.

It would be nice to be asked and for that code to be curated. This is not curation. This is automation.

#SoftwareHeritage #BigCode

2024-03-21

It’s especially rich that the logo for #BigCode, an org that trains LLMs so is massively accelerating #climateChange, uses a sakura blossom. Sakura are suddenly blooming earlier each year due to climate change.

The BigCode logo, “</>” contained in a sakura blossom, contained again by a rounded square.
This chart shows the peak of the cherry blossom bloom in Kyoto and 20-year rolling average (812-2021). It slopes suddenly to earlier dates until the most recent datum.

2024-03-20

You might want to check if your code is used for training the #BigCode AI model:
huggingface.co/spaces/bigcode/

#BigCodeProject #FuckAI

KINEWS24 KiNews
2023-08-09

StableCode from Stability AI is a newly developed large language model (LLM) for assisting with writing program code

kinews24.de/stability-ai-stabl

Published papers at TMLR tmlrpub@sigmoid.social
2023-06-07

The Stack: 3 TB of permissively licensed source code

Denis Kocetkov, Raymond Li, Loubna Ben Allal et al.

Action editor: Swarat Chaudhuri.

openreview.net/forum?id=pxpbTd

#bigcode #text2code #dataset

Jan :rust: :ferris: janriemer@floss.social
2023-05-18

#StarCoder: A State-of-the-Art #LLM for #Code by Hugging Face 🤗

huggingface.co/blog/starcoder

More about the Big Code project:
bigcode-project.org/

Find out whether your code was used for training, and opt out if you don't want to be "in the stack":
huggingface.co/spaces/bigcode/

#AI #ArtificialIntelligence #LLMs #DevTools #BigCode

Mastokarl 🇺🇦 Mastokarl
2023-05-10

I want to like HuggingChat open source LLM AI so much. But at least for coding it is nowhere near the same league as ChatGPT. If I were hiring a new developer for my team and could conduct interviews only by keyboard, I would be impressed by ChatGPT and offer it the position. With HuggingChat I’d terminate the interview after 10 mins. Tried Java and JS. It called APIs from any library that might do the job without importing them, and its explanations mixed up cause and effect.

Chuck Darwin cdarwin@c.im
2023-05-06

#BigCode is an open scientific collaboration working on responsible training of large language models for coding applications.

In this organization you can find the artefacts of this collaboration:
👉 #StarCoder, a state-of-the-art language model for code,
👉 The #Stack, the largest available pretraining dataset with permissive code, and
👉 #SantaCoder, a 1.1B parameter model for code.

#StarCoder is a 15.5B parameters language model for code trained for 1T tokens on 80+ programming languages.
It uses MQA for efficient generation, has 8,192 tokens context window and can do fill-in-the-middle.

Chat with StarCoder here: huggingface.co/chat/?model=big

huggingface.co/bigcode

Tero Keski-Valkama tero@rukii.net
2023-05-04

#BigCode #OpenSource

"#StarCoder is a 15.5B parameters language model for code trained for 1T tokens on 80+ programming languages. It uses MQA for efficient generation, has 8,192 tokens context window and can do fill-in-the-middle."

huggingface.co/bigcode

2023-05-04

This one has different methodologies and philosophies that try to mitigate some of the ethical issues with other similar #GenerativeAI programming systems.

Hugging Face and ServiceNow Research release StarCoder, a free alternative to code-generating #AI like GitHub's #Copilot, as part of the #BigCode project.

techcrunch.com/2023/05/04/hugg

#MachineLearning

Vincent HETRU vh@sigmoid.social
2022-12-23

#HuggingFace just released the #SantaCoder models for the holiday season. Part of the #BigCode project, these 1.1B parameter models are trained on #Python, #Java, and #JavaScript and use advanced data-filtering techniques like near-deduplication and comment-to-code ratio filtering.

huggingface.co/bigcode/santaco

#AI #DeepLearning 🤗
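
For readers unfamiliar with the two techniques named in the SantaCoder post, here is a minimal sketch of what they could look like. It is an illustration only, not the BigCode pipeline: the function names, the `#`-only comment detection, the 3-token shingles, and the 0.7 threshold are all assumptions made for this example. Large-scale pipelines typically approximate the pairwise comparison with hashing schemes such as MinHash rather than comparing exact sets.

```python
import re


def comment_to_code_ratio(source: str) -> float:
    """Fraction of non-blank lines that are '#' comment lines.

    Assumption: only full-line Python-style comments are counted.
    """
    lines = [ln.strip() for ln in source.splitlines() if ln.strip()]
    if not lines:
        return 0.0
    comments = sum(1 for ln in lines if ln.startswith("#"))
    return comments / len(lines)


def shingles(source: str, n: int = 3) -> set:
    """Set of overlapping n-token windows used to compare two files."""
    tokens = re.findall(r"\w+", source)
    return {tuple(tokens[i:i + n]) for i in range(max(len(tokens) - n + 1, 1))}


def jaccard(a: set, b: set) -> float:
    """Jaccard similarity: size of the intersection over size of the union."""
    union = a | b
    return len(a & b) / len(union) if union else 1.0


def is_near_duplicate(a: str, b: str, threshold: float = 0.7) -> bool:
    """Treat two files as near-duplicates when their shingle sets overlap heavily.

    The threshold is a hypothetical value chosen for this sketch.
    """
    return jaccard(shingles(a), shingles(b)) >= threshold


snippet = "# add two numbers\ndef add(x, y):\n    return x + y\n"
ratio = comment_to_code_ratio(snippet)          # one comment line out of three
same = is_near_duplicate(snippet, snippet)      # identical files always match
other = is_near_duplicate(snippet, "print('hello world')")
```

A filter built on these helpers would drop files whose comment ratio falls outside a chosen range and keep only one file from each group of near-duplicates.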

Matthias Stürmer maemst@swiss.social
2022-11-23

137 million #OpenSource repositories with 92 terabytes of source code data from #GitHub: Very impressive how much code is being processed for the @huggingface #BigCode project! huggingface.co/bigcode Presented by #LeandroVonWerra at #DINAcon22 #dinacon in Bern: dinacon.ch

Did you see FOSSlife Weekly this week? Check out the latest issue and subscribe now to get it every week! app.moosend.com/show_campaign/ #Blocky #OpenSource #FOSS #DNS #AllThingsOpen #Hacktoberfest #BigCode #YAML #Java19 #Kubernetes

FOSSlife Weekly: How to Use Blocky to Quickly Filter DNS Queries
