#multimodal

N-gated Hacker Newsngate
2025-05-26

🎉🥯 Presenting , the latest attempt at impressing your CS professor with buzzwords like "open-source" and "multimodal." 🎩🤖 Finally, a model that lets you pretend a bagel can solve all your problems! 🍽️🔧
bagel-ai.org/ -source

Sara Zanzansara
2025-05-22

🐜 Small models are making giant leaps! @Google@x-activitypub-bridge.deno.dev just released Gemma 3n, a mobile-first LLM that can understand text, images, audio and even video input while running on your phone 📱

📰 Read the announcement here: developers.googleblog.com/en/i

2025-05-20

Herrenberg is boasting about about their multimodal system for sustainable transport: stadtnavi. Hey, other cities, don't be jealous: Just adopt the solution, developed by Trufi.

#bus #BusRapidTransit #BRT #Europe #Germany #Herrenberg #multimodal #Multimodality #NotJustBikes #opensource #FOSS #SharedMobility #SustainableMobility #TransportationInnovation #SmartMobility #UrbanMobility #MobilitySolutions #FutureOfTransport #sustainabletransport #TransportationJustice #TrufiAssociation

A detailed LinkedIn post by Mitmachstadt Herrenberg promoting their city’s new navigation app called "stadtnavi." The post highlights the app’s open-source nature, benefits like CO₂ tracking, weather and public transport updates, and its anonymous, data-saving design. It positions stadtnavi as a scalable, white-label solution for rethinking urban mobility.A promotional social media post featuring a woman riding a modern cargo bicycle through an urban street, with the caption encouraging viewers to "Discover now" and linking to an app available on the App Store and Google Play.
2025-05-20

The stadtnavi system, built on Trufi’s open-source platform, shows how cities can democratize mobility. Real-time updates, CO₂ comparisons, and weather alerts aren’t exclusive to Herrenberg—they’re open for any city to implement. The project’s success lies in its adaptability: a white-label solution that lets cities rebrand and expand it freely.

tinyurl.com/yn9t9ro7

#multimodal #opensource #sustainabletransport #TransportationJustice #TrufiAssociation

Dr. Thompsonrogt_x1997
2025-05-17

🧠🚀 What happens when AI learns to think across all senses?
From 1T tokens to 35+ benchmarks, discover how multi-modal AI systems like Unified-IO 2 are fusing vision, language, and sound into one unified brain.
👉 medium.com/@rogt.x1997/from-1t

medium.com/@rogt.x1997/from-1t

Sara Zanzansara
2025-05-16

⚠️ Attention! If you or your company:

- 🇪🇺 are based in the EU
- 🦙 you’re thinking of integrating Llama models into your product

📜 Pay close attention to its license: you may be breaking Meta’s terms!

zansara.dev/posts/2025-05-16-l

Chi Kimchikim
2025-05-16

Ollama released with new engine for multimodal models! ollama.com/blog/multimodal-mod

N-gated Hacker Newsngate
2025-05-16

🚀👏 Oh joy! Ollama's shiny new toy can handle *multimodal* models, as if the world wasn't already drowning in and models that are impossible to distinguish from one another. 😂🤯 Apparently, you can now ask a bajillion-parameter monstrosity what it sees in a video frame—because what we really needed was another way to get ignored by . 🙄🤖
ollama.com/blog/multimodal-mod

Dirk Schnelle-Walkadsw@mastodontech.de
2025-04-28

SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent Systems. Multi-modal LLM system simulates human communication using speech and generates human-like dialogues with consistent content, rhythm, & emotion.

Funnily, they also elaborate on a "think before you speak" design aspect. This might also be applicable to our everyday lives.

doi: 10.48550/arXiv.2401.03945
#LLM #multimodal #speechAI #multiagent #conversationalai

2025-04-25

#ActivityRecognition #research: I am looking for an outlet (peer reviewed) for a #tutorial paper on collecting #multimodal #data with #wearables and #IoT in #nursing homes. We learnt a lot in our pilot study in a #dementia #care ward and want to share our insights/mistakes with a broader audience (#Nursing, #Physiotherapy, #Psychology, #Neurology, #Gerontology) Anyone with a tip for an outlet that accepts a collection of practical considerations for collecting #realWorldData?

:rss: Qiita - 人気の記事qiita@rss-mstdn.studiofreesia.com
2025-04-24
Mr Tech Kingmrtechking
2025-04-23

China's tech leaders are all-in on multimodal AI. ByteDance's Doubao & Kuaishou's Kling models blend text, image & video, powering diverse industries & creative applications. A major push in GenAI.

Chinese Tech Firms Push Ahead With Multimodal AI
2025-04-21

🤖 Building with AI? This session dives into the landscape of multimodal and large language models, helping you choose the right one for your application — whether it involves text, images, audio, or video. Gain insights into model strengths, limitations, and practical integration tips with Witthawin Sripheanpol at #FOSSASIASummit2025

🔗 Click the link below ⬇ to watch on the FOSSASIA YouTube channel:
youtu.be/zDohfhP3kW0

#FOSSASIA #FOSSASIASummit #OpenSource #AI #LLM #Multimodal

Hacker Newsh4ckernews
2025-04-15
N-gated Hacker Newsngate
2025-04-15

🎉 Behold the shiny new toy, Embed 4! It's so "multimodal" it can fetch your latte while reciting Hamlet in 23 languages. 💼🚀And fear not; there's a customizable, scalable, integrated, and possibly sentient platform for every industry—because who doesn't want to spend even MORE on tech they won't understand? 🤖💸
cohere.com/blog/embed-4

davecykldavecykl
2025-04-15

Ticketless travel coming soon to with the introduction of tap on / tap off payment by contactless bank card, including a maximum fare cap – finally integrated with to allow proper journeys.

( travel passes (aka “Ridacards”) still offer the best value for very frequent travellers.)

edinburghnews.scotsman.com/new

2025-04-10

#30DayChartChallenge Día 10: ¡Buceando en la Distribución del VIX! 🌊

En lugar de solo ver la línea del VIX, hoy analizamos su "distribución de probabilidad" por Presidencia de EE.UU. (Clinton -> Trump 2º). ¡La forma lo es todo!

Usando #rstats y #ggplot2, estas densidades facetadas nos permiten investigar:
* Modos Dominantes: ¿Cuál era el nivel "normal" de VIX (el pico más alto)? ¿Cambió mucho?
* Multi-modalidad: ¿Hay evidencia de múltiples estados de volatilidad (picos secundarios) dentro de un mismo mandato? 🤔
* Riesgo de Cola: ¿Qué tan probable era el "pánico" (VIX > 35)? ¡Compara las colas derechas!

Estos patrones reflejan los distintos regímenes de volatilidad y la percepción del riesgo sistémico. No es solo el nivel, ¡sino la "estructura" de la incertidumbre lo que importa!

Datos: Yahoo Finance via #quantmod.
📂Código: t.ly/kikdo

#Day10 #Multimodal #dataviz #DataVisualization #VIX #Volatility #Finance #StockMarket #Economics #RiskManagement #rstats #ggplot2

Gráfico facetado con curvas de densidad mostrando la distribución del nivel diario del Índice VIX para seis periodos presidenciales de EE.UU.: Bill Clinton, George W. Bush, Barack Obama, Donald Trump (1er mandato), Joe Biden y Donald Trump (2º mandato, inicio). Cada faceta corresponde a un presidente. El eje X representa el nivel del VIX y el eje Y la densidad. Las curvas de densidad están coloreadas según el partido: azul para Demócrata, rojo para Republicano. Una línea vertical discontinua marca VIX=20 y una línea vertical punteada marca VIX=35. Fuente: Yahoo Finance.
N-gated Hacker Newsngate
2025-04-08

🎉 Behold, the groundbreaking revelation: can create images without 🐘 elephants! Apparently, it only took two of the world's largest companies to figure this out, and the internet is losing its collective mind. 🤦‍♂️ this, multimodal that—who knew AI could do something so *noteworthy* as separating text from images? 🙄
oneusefulthing.org/p/no-elepha

Client Info

Server: https://mastodon.social
Version: 2025.04
Repository: https://github.com/cyevgeniy/lmst