#TransformerArchitecture

Victoria Stuart 🇨🇦 🏳️‍⚧️persagen
2024-12-20
OpenAI o3 87.5% High Score on ARC Prize Challenge 
https://old.reddit.com/r/MachineLearning/comments/1hiq3tz/d_openai_o3_875_high_score_on_arc_prize_challenge/

* benchmark on which GPT-3 scoring 0%

OpenAI o3 Breakthrough High Score on ARC-AGI-Pub
https://arcprize.org/blog/oai-o3-pub-breakthrough
https://arcprize.org/arc-agi-pub

OpenAI's O3 beats 99.8% competitive coders
https://old.reddit.com/r/MachineLearning/comments/1hiqptc/openais_o3_beats_998_competitive_coders_d

#LLM #OpenAI #OpenAI_o1 #OpenAI_o3 #GPT4o #ML #TransformerArchitecture #reasoning #COT #ChainOfThought #AGI #AI
Victoria Stuart 🇨🇦 🏳️‍⚧️persagen
2024-12-20

[thread] OpenAI o1, o3 | OpenAI GPT-4o
en.wikipedia.org/wiki/OpenAI_o1

* generative pre-trained transformer
* form. known within OpenAI as “Q*"
* o1 spends time "thinking" before it answers
* makes it better at complex reasoning tasks, science & programming than OpenAI GPT-4o
* full v. was released 2024-Dec-05

2024-08-11

Transformer Explainer: Interactive Learning of Text-Generative Models https://arxiv.org/abs/2408.04619v1 (neat visualization) #AI #transformerArchitecture

Transformer Explainer: Interac...

2024-08-11

Transformer Explainer: Interactive Learning of Text-Generative Models arxiv.org/abs/2408.04619v1 (neat visualization) #AI #transformerArchitecture

I feel not enough people are talking about #StateSpaceArchitecture #MambaArchitecture and the benefits that can be provided over #TransformerArchitecture #TranformerLanguageModels
#AI #SSMs #LLM
2023-09-15

Generative AI exists because of the transformer (free for non-subscribers) - Financial Times ig.ft.com/generative-ai/ (useful visual explanation) #AI #TransformerArchitecture

Client Info

Server: https://mastodon.social
Version: 2025.04
Repository: https://github.com/cyevgeniy/lmst