((((sandro.net))))

domingo, 28 de junho de 2026

Show HN: role-model, a router for hybrid local/cloud AI https://ift.tt/F2ngUNY

Show HN: role-model, a router for hybrid local/cloud AI Hey everyone, I'm launching role-model today: a routing protocol, a reference router runtime, and an extension for Pi that allows for better informed routing decisions. role-model is mostly deterministic, with fallback to a controller model, that routes requests based on a chosen routing strategy. the protocol is structured around assigning domains and roles to models, where requests sent by consumer applications like Pi have task types to enrich routing metadata and thereby accuracy. you can to run the built-in benchmark to compare performance of models across speed, quality and cost, as well as observed performance on real tasks. I have a diagram on how routing works in [0]. The runtime supports local models, either directly to your local endpoint (LM Studio, llama.cpp etc), or routing between multiple local models via vendored llama-swap. Since there was another model router post yesterday where people discussed the basics of routing, I will focus on discussing some of the interesting learnings I've made building and testing this: 1. Model routing is essentially trying to predict the future: which model will perform optimally (based on criteria defined by the user) on this request? 2. After you have routed the request, you want to evaluate if it was the right decision or if some other model would have performed better 3. You also realize that having the router assess difficulty (among other things) to make decisions by itself is far from ideal - we'd prefer to have the consumer application work with the router to define what the request needs 4. You also realize that it becomes much easier, decisions become much accurate, and the outcomes of routing becomes more impactful when there is more of a distinction between models For point 2, I will be launching evals that you can run locally to benchmark models in your pool on the same requests. The outcomes here can then be used for point 1, as input when routing new requests. For point 3, I've built the pi-role-model package for Pi, which lets the Pi agent inject role_model.intent metadata including difficulty, preferred roles or even specfic model ids, required capabilities (say tool use or image input) and so on. You should be able to customized this further in Pi, and route in additional ways by changing metadata. This is why I've also built the role-model routing protocol. For point 4, what model routing really does as a second order effect is create a market for specialized models - models that may or may not be smaller, could be cheaper or more expensive, may be locally runnable. It makes little sense to route between two frontier models (GPT 5.5 and Opus 4.8); it makes more sense to route between models where one of the factors of quality, speed, cost is a multiple of the other candidate models, and it makes even more sense to have specialized domain models: code, prose, math and science, visuals and so on. It is at this stage model routing becomes really valuable. While role-model has a reference runtime that I'm continuously building out (there's lots to do to improve routing, as well as give users more granular control over routing decisions, and also ways to improve cross-model caching and also add techniques like FastContext), the ultimate goal of role-model is for there to be a standard protocol for inference requests that is used by consumer applications, so that the provider, be it a router middleware or an inference provider, will be able to route to a model that strikes the best balance between cost, speed and quality and also respects user choices, and even lets the user control these preferences to use local models for some tasks and allow cloud for others. Links: [0] role-model - the case for a model routing protocol: https://ift.tt/3s4kwh6... [1] GitHub: https://ift.tt/649GDlA [2] Docs: https://role-model.dev/ https://ift.tt/649GDlA June 28, 2026 at 07:46AM

Show HN: Hacker Times – HN Reader https://ift.tt/DwSn47q

Show HN: Hacker Times – HN Reader This has been a play project for a little while, trying to build an interface that's comfortable to read on any device. It's a bit of a mashup of a Pocket-like interface, digital newspaper and Hacker News. https://ift.tt/bHfyBNW June 28, 2026 at 07:53AM

Show HN: Foveon – Bayer to Foveon X3, learned, Mac App using deep learning https://ift.tt/iDqzeT1

Show HN: Foveon – Bayer to Foveon X3, learned, Mac App using deep learning https://ift.tt/9t7PZKX June 28, 2026 at 01:27AM

Show HN: FSM – an advanced system monitor for Linux https://ift.tt/XZazDb1

Show HN: FSM – an advanced system monitor for Linux https://ift.tt/GkzsYTl June 27, 2026 at 10:46PM

sábado, 27 de junho de 2026

Show HN: The TypeScript Semantic Layer for ClickHouse https://ift.tt/MYR5Tij

Show HN: The TypeScript Semantic Layer for ClickHouse I've built a type-safe semantic layer in code, for ClickHouse. If you're building analytics off ClickHouse in TypeScript, I would love your feedback. With hypequery there is no platform to adopt, no YAML sprawl. It runs where your app runs. Key features: - Define metrics once, reuse them everywhere: Declare dimensions and measures in one place and then pull from the same source of truth. - Compiles to ClickHouse SQL: No service, no proxy, no extra runtime to deploy. It's a library that generates SQL and runs where your app runs. - Multi-tenancy & Authentication ready: Cross-tenant queries are blocked at the query layer, helpers to plug into your existing auth. - Agent-native: A dataset is a declared set of dimensions and measures, so it doubles as an allowlist. Includes an MCP server to hand an LLM a typed catalog to query. - Runtime HTTP entry point: serve() exposes any dataset as an endpoint, so the same type-safe definitions back your dashboards and your API. https://ift.tt/0qKfCJR June 27, 2026 at 06:37AM

Show HN: Play puzzle games in a feed like TikTok https://ift.tt/e94RDk5

Show HN: Play puzzle games in a feed like TikTok Hi everyone. I spent the last year making Puzzle Express. Play and discover puzzles as if you were scrolling reels: play if you want or swipe up to skip. Made this as a healthier alternative to doomscrolling and a great, new way to discover puzzles you might love. If you don't fancy endless scrolling, try to finish the daily missions or play your favorite puzzle in the arcade. https://puzzle.express June 27, 2026 at 06:52AM

Show HN: Turn images into audio that can be decoded with a spectrogram https://ift.tt/AXlUoWs

Show HN: Turn images into audio that can be decoded with a spectrogram https://nsspot.herokuapp.com/imagetoaudio/ June 24, 2026 at 04:23PM

DJ Sandro

http://sandroxbox.listen2myradio.com