#ChainOfThought

☮ ♥ ♬ 🧑‍💻 @peterrenshaw@ioc.exchange
2025-06-15

“Students are introduced to advanced AI techniques such as #ChainOfThought and #SelfConsistencyPrompting, which simulate humanlike reasoning. #GenerativeAI is presented not just as a tool for queries but as a partner in reasoning.

“We teach reinforcement learning from human feedback, where every correction becomes training data,” Madmoun adds.

Students are encouraged to view AI not as a static engine, but as a responsive tool for making critical decisions in high-stakes financial environments.

Recognising that students enter with varying levels of technical knowledge, the Master in International Finance (MiF) at HEC Paris provides asynchronous #Python #programming courses, optional #BootCamps, and tailored elective tracks. “We’ve integrated workshops taught by Hi! PARIS into the curriculum,” says academic director Evren Örs, referring to the #AI and #DataScience centre co-founded by HEC Paris and Institut Polytechnique de Paris.

Students from both institutions collaborate on real-data projects, strengthening both technical and teamwork skills.

A tiered elective system requires all MiF students to complete at least one course focused on #data and #finance. The most advanced track is the #DoubleDegree in data and finance, where students dive deep into #MachineLearning applications. Graduates, Örs says, are frequently hired as #QuantitativeAnalysts, #DataScientists, and private equity analysts in London and Paris.”

#BusinessSchools / #education <archive.md/xysyM> / <ft.com/content/071dc338-b267-4> (paywall)
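
For concreteness, here is a minimal sketch of the #SelfConsistencyPrompting technique named above, assuming a hypothetical `complete()` helper standing in for any LLM API call (nothing here is taken from the HEC Paris curriculum): sample several chain-of-thought completions at non-zero temperature, then majority-vote the final answers.

```python
from collections import Counter

def complete(prompt: str, temperature: float = 0.7) -> str:
    """Hypothetical stand-in for a call to any LLM API."""
    raise NotImplementedError("wire this to your LLM provider of choice")

def self_consistent_answer(question: str, n_samples: int = 5) -> str:
    # Ask for step-by-step reasoning, but tag the final answer so it
    # can be extracted and voted on.
    prompt = (
        "Think step by step, then give the final answer on a line "
        f"starting with 'ANSWER:'.\n\nQuestion: {question}"
    )
    finals = []
    for _ in range(n_samples):
        # Non-zero temperature yields diverse reasoning paths.
        reply = complete(prompt, temperature=0.7)
        for line in reply.splitlines():
            if line.startswith("ANSWER:"):
                finals.append(line.removeprefix("ANSWER:").strip())
                break
    # Self-consistency = majority vote over the sampled final answers.
    return Counter(finals).most_common(1)[0][0]
```

The vote over independent reasoning paths is what distinguishes self-consistency from plain chain-of-thought prompting.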

Dr. Thompson @rogt_x1997
2025-06-13

🤖 Think your AI assistant can really reason? Apple’s puzzle tests say otherwise.
📉 See how “thinking” AIs collapse when logic gets real — and why we might be projecting intelligence where there is none.

URL:
medium.com/@rogt.x1997/the-ill

Dr. Thompson @rogt_x1997
2025-05-31

🧠 What if your AI could explain its reasoning, verify its logic, and revise itself—all before responding?

Discover how Chain-of-Thought prompting + Self-Verification slashed hallucinations from 23% to 6%, outperformed GPT-4, and reshaped enterprise AI.

🚀 The future of trustworthy AI is here.


🔗 medium.com/@rogt.x1997/gpt-4-v
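
A minimal sketch of the chain-of-thought-plus-self-verification loop the post describes, assuming the same kind of hypothetical `complete()` LLM call; the prompts are illustrative, and the 23%-to-6% hallucination figures are the article's claims, not reproduced here.

```python
def complete(prompt: str) -> str:
    """Hypothetical stand-in for a call to any LLM API."""
    raise NotImplementedError("wire this to your LLM provider of choice")

def answer_with_self_verification(question: str, max_rounds: int = 2) -> str:
    # 1) Draft an answer with explicit chain-of-thought reasoning.
    draft = complete(f"Think step by step and answer:\n{question}")
    for _ in range(max_rounds):
        # 2) Ask the model to verify its own reasoning.
        verdict = complete(
            "Check the reasoning below for factual or logical errors. "
            "Reply 'OK' if it is sound, otherwise list the errors.\n\n" + draft
        )
        if verdict.strip().startswith("OK"):
            break
        # 3) Revise using the verifier's notes, then re-check.
        draft = complete(
            f"Question: {question}\n\nPrevious attempt:\n{draft}\n\n"
            f"Reviewer notes:\n{verdict}\n\nWrite a corrected answer."
        )
    return draft
```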

2025-05-23

The #OpenAI paper by Baker et al., "Monitoring Reasoning Models for Misbehavior and the Risks of Promoting Obfuscation", comes to a troubling conclusion: #LLMs with #reasoning or #ChainOfThought (#CoT) capabilities might learn to obfuscate their own CoT from human users if they are penalized for displaying "wrong" (i.e., reward-hacking or misaligned) reasoning.

As a result, OpenAI strongly advises against applying reward pressure "directly" onto the CoT of a model.

🤔 While that is certainly the right thing to do, how long will #AI take to figure out that *indirect CoT pressure* is being applied anyway and that it could circumvent these restrictions by obfuscating its own CoT? Maybe something like this will happen by accident or within an "evolutionary" self-improvement loop. Perhaps a sufficiently advanced model will realize that its own #neuralese serves as #steganography to hide its intents from humans anyway and keep its CoT in non-English?

source: cdn.openai.com/pdf/34f2ada6-87

“If CoT pressures are used to improve agent capabilities or alignment, there may be no alternative approach to yield the same improvements without degrading monitorability. In the worst case, where the agent learns to fully obscure its intent in its CoT, we ultimately revert to the same model safety conditions that existed prior to the emergence of reasoning models and must rely on monitoring activations, monitoring potentially adversarial CoTs and outputs, or improved alignment methods. Model actions and outputs continue to grow in complexity, and we’ve demonstrated that even at their current capabilities, models can execute sophisticated reward hacks. In the near future, these models may become either too advanced or too widely used for humans to manually oversee. For this reason, we believe that CoT monitoring may become one of few tools we have for identifying latent misalignments, and it may be required that we avoid strong chain-of-thought optimization pressures entirely to guarantee we do not inadvertently cause CoT obfuscation.”
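
A toy sketch of the distinction at stake, with illustrative function names (not from the paper): a CoT monitor used only for visibility versus one folded into the training reward. If the monitor only informs human review, the CoT stays a faithful window into the model's reasoning; if its score becomes a penalty, the cheapest fix for the policy is often to hide the misbehavior, not to stop it.

```python
def task_reward(output: str) -> float:
    raise NotImplementedError  # e.g., unit tests passing, answer correctness

def monitor_flags_cot(cot: str) -> bool:
    raise NotImplementedError  # e.g., another LLM screening the CoT

def reward_monitoring_only(output: str, cot: str) -> float:
    # Monitor output is surfaced to humans but never enters the gradient.
    if monitor_flags_cot(cot):
        print("flagged for human review:", cot[:80])
    return task_reward(output)

def reward_with_cot_pressure(output: str, cot: str) -> float:
    # "Direct optimization pressure" on the CoT: the penalty teaches the
    # model to produce clean-looking CoTs, i.e., to obfuscate.
    penalty = 1.0 if monitor_flags_cot(cot) else 0.0
    return task_reward(output) - penalty
```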

Hacker News @h4ckernews
2025-05-19

Victoria Stuart 🇨🇦 🏳️‍⚧️ @persagen
2025-05-02

Nick Byrd, Ph.D. @ByrdNick@nerdculture.de
2025-04-21

Can popular, generalist #LLMs answer questions as specialists?

Adopting each step of #diagnosis into a #ChainOfThought prompt made small and large #languageModels outperform both zero-shot prompting and the fine-tuned OLAPH method on the #MedLFQA benchmark.

doi.org/10.48550/arXiv.2503.03 #AI

Attached: “Structured Outputs Enable General-Purpose LLMs to be Medical Experts”, pages 1–2, 2 and 8, 14–15, and 12–13.
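
A rough sketch of the paper's recipe as a prompt builder; the diagnosis steps below are paraphrased for illustration rather than quoted from the paper.

```python
# Turn each step of clinical diagnosis into an explicit, labeled
# section of a chain-of-thought prompt.
DIAGNOSIS_STEPS = [
    "Summarize the patient's presentation and relevant history.",
    "List the key findings and what each one suggests.",
    "Enumerate a differential diagnosis with supporting evidence.",
    "Select the most likely explanation and justify it.",
    "Answer the original question, noting any uncertainty.",
]

def build_diagnostic_prompt(question: str) -> str:
    steps = "\n".join(f"{i}. {step}" for i, step in enumerate(DIAGNOSIS_STEPS, 1))
    return (
        "Answer the medical question by working through these steps in "
        f"order, labeling each step:\n{steps}\n\nQuestion: {question}"
    )
```
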
Victoria Stuart 🇨🇦 🏳️‍⚧️ @persagen
2025-04-10

How University Students Use Claude
anthropic.com/news/anthropic-e
news.ycombinator.com/item?id=4

Aside: been trialing SoTA LLMs 😯 😀

ChatGPT, Gemini, Claude ...
counterpunch.org/2025/04/07/th

* particularly impressed w. Claude (3.7 Sonnet), DeepSeek
* most SoTA models are free to use (ChatGPT's higher-performing tiers are paywalled): still amazing!
* chain-of-thought reasoning / augmented responses (web retrieval: RAG) 👍️
* very impressive!!
* Firefox users: try the AI Toolbox extension 👍️

Victoria Stuart 🇨🇦 🏳️‍⚧️ @persagen
2025-03-31

These continue to fascinate / awe: Gemini 2.5 Pro vs. Claude 3.7 Sonnet: Coding Comparison
composio.dev/blog/gemini-2-5-p

Edit: just previewed the LLM-generated code examples - extraordinary! 🤯 😯 🥳

Victoria Stuart 🇨🇦 🏳️‍⚧️ @persagen
2025-03-29

/1 That post includes the following video, which gives, in simple language with examples, a basic overview of a current reasoning LLM (here, Claude) and of "prompt engineering."

Tracing the thoughts of a large language model
youtube.com/watch?v=Bj9BD2D3DzA

Winbuzzer @winbuzzer
2025-03-09

Zoom has introduced Chain of Draft, a new AI prompting method that reduces token usage by 92% and slashes operational costs by 90%.

winbuzzer.com/2025/03/09/zooms
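
A minimal sketch of what a Chain-of-Draft prompt looks like next to a standard chain-of-thought prompt; the instruction wording is paraphrased from published descriptions of the method, and the cost figures above are the article's claims.

```python
# Chain of Draft keeps the step-by-step reasoning of chain of thought,
# but caps each step at a few words, which is where the token (and
# cost) savings come from.
COT_INSTRUCTION = (
    "Think step by step to answer the question. "
    "Return the final answer after the separator ####."
)
COD_INSTRUCTION = (
    "Think step by step, but keep only a minimal draft of each step, "
    "five words at most. Return the final answer after the separator ####."
)

def chain_of_draft_prompt(question: str) -> str:
    return f"{COD_INSTRUCTION}\n\nQ: {question}\nA:"
```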
