robber@lemmy.mlEnglish · 5 hours agoMagistral-Small-2509 by Mistral has been releasedplus-squarehuggingface.coexternal-linkmessage-square0fedilinkarrow-up119arrow-down11
arrow-up118arrow-down1external-linkMagistral-Small-2509 by Mistral has been releasedplus-squarehuggingface.corobber@lemmy.mlEnglish · 5 hours agomessage-square0fedilink
cm0002@piefed.worldEnglish · 8 hours agoLatest Open-Source AMD Improvements Allowing For Better Llama.cpp AI Performance Against Windows 11 - Phoronixplus-squarewww.phoronix.comexternal-linkmessage-square0fedilinkarrow-up13arrow-down10
arrow-up13arrow-down1external-linkLatest Open-Source AMD Improvements Allowing For Better Llama.cpp AI Performance Against Windows 11 - Phoronixplus-squarewww.phoronix.comcm0002@piefed.worldEnglish · 8 hours agomessage-square0fedilink
humanspiral@lemmy.caEnglish · 1 day agoNVIDIA's Peter Belcak Explains Why SLMs (smaller LLMs) are the Future of Agentic AIplus-squarewww.youtube.comexternal-linkmessage-square4fedilinkarrow-up110arrow-down10
arrow-up110arrow-down1external-linkNVIDIA's Peter Belcak Explains Why SLMs (smaller LLMs) are the Future of Agentic AIplus-squarewww.youtube.comhumanspiral@lemmy.caEnglish · 1 day agomessage-square4fedilink
robber@lemmy.mlEnglish · edit-23 days agoQwen3-Next with 80b-a3b parameters is outplus-squarehuggingface.coexternal-linkmessage-square1fedilinkarrow-up136arrow-down10
arrow-up136arrow-down1external-linkQwen3-Next with 80b-a3b parameters is outplus-squarehuggingface.corobber@lemmy.mlEnglish · edit-23 days agomessage-square1fedilink
cm0002@piefed.worldEnglish · 10 days agoembeddinggemma :300mplus-squareollama.comexternal-linkmessage-square4fedilinkarrow-up110arrow-down10
arrow-up110arrow-down1external-linkembeddinggemma :300mplus-squareollama.comcm0002@piefed.worldEnglish · 10 days agomessage-square4fedilink
devxyn@sh.itjust.worksEnglish · 12 days agoWhat local, small models are you all using?plus-squaremessage-squaremessage-square16fedilinkarrow-up149arrow-down11
arrow-up148arrow-down1message-squareWhat local, small models are you all using?plus-squaredevxyn@sh.itjust.worksEnglish · 12 days agomessage-square16fedilink
devxyn@sh.itjust.worksEnglish · 12 days agoIntroducing EmbeddingGemma: The Best-in-Class Open Model for On-Device Embeddingsplus-squaredevelopers.googleblog.comexternal-linkmessage-square1fedilinkarrow-up113arrow-down10
arrow-up113arrow-down1external-linkIntroducing EmbeddingGemma: The Best-in-Class Open Model for On-Device Embeddingsplus-squaredevelopers.googleblog.comdevxyn@sh.itjust.worksEnglish · 12 days agomessage-square1fedilink
cm0002@piefed.worldEnglish · 15 days agoollama 0.11.9 Introducing A Nice CPU/GPU Performance Optimizationplus-squarewww.phoronix.comexternal-linkmessage-square12fedilinkarrow-up133arrow-down10
arrow-up133arrow-down1external-linkollama 0.11.9 Introducing A Nice CPU/GPU Performance Optimizationplus-squarewww.phoronix.comcm0002@piefed.worldEnglish · 15 days agomessage-square12fedilink
snikta@programming.devEnglish · 15 days agoApertus: a fully open, transparent, multilingual language modelplus-squarenews.epfl.chexternal-linkmessage-square3fedilinkarrow-up123arrow-down11
arrow-up122arrow-down1external-linkApertus: a fully open, transparent, multilingual language modelplus-squarenews.epfl.chsnikta@programming.devEnglish · 15 days agomessage-square3fedilink
General_Effort@lemmy.worldEnglish · 17 days agoLongCat-Flash-Chatplus-squarehuggingface.coexternal-linkmessage-square0fedilinkarrow-up122arrow-down10
arrow-up122arrow-down1external-linkLongCat-Flash-Chatplus-squarehuggingface.coGeneral_Effort@lemmy.worldEnglish · 17 days agomessage-square0fedilink
Even_Adder@lemmy.dbzer0.comEnglish · 19 days agoHermes 4 - Nous Researchplus-squarehermes4.nousresearch.comexternal-linkmessage-square0fedilinkarrow-up19arrow-down10
arrow-up19arrow-down1external-linkHermes 4 - Nous Researchplus-squarehermes4.nousresearch.comEven_Adder@lemmy.dbzer0.comEnglish · 19 days agomessage-square0fedilink
sleep_deprived@lemmy.dbzer0.comEnglish · 19 days ago[Transformer Circuits Thread] Circuit Vignette: How does a persona modify the Assistant’s response?plus-squaretransformer-circuits.pubexternal-linkmessage-square1fedilinkarrow-up18arrow-down10
arrow-up18arrow-down1external-link[Transformer Circuits Thread] Circuit Vignette: How does a persona modify the Assistant’s response?plus-squaretransformer-circuits.pubsleep_deprived@lemmy.dbzer0.comEnglish · 19 days agomessage-square1fedilink
robber@lemmy.mlEnglish · 21 days agoExLlamaV3 adds tensor parallelism supportplus-squaregithub.comexternal-linkmessage-square0fedilinkarrow-up113arrow-down10
arrow-up113arrow-down1external-linkExLlamaV3 adds tensor parallelism supportplus-squaregithub.comrobber@lemmy.mlEnglish · 21 days agomessage-square0fedilink
trave@lemmy.sdf.orgEnglish · 27 days agowhat's the best model these days I could fit in 128gb ram?plus-squaremessage-squaremessage-square12fedilinkarrow-up122arrow-down12
arrow-up120arrow-down1message-squarewhat's the best model these days I could fit in 128gb ram?plus-squaretrave@lemmy.sdf.orgEnglish · 27 days agomessage-square12fedilink
pepperfree@sh.itjust.worksEnglish · 28 days agoDeepSeek dropped the V3.1 Weightplus-squarehuggingface.coexternal-linkmessage-square19fedilinkarrow-up145arrow-down11
arrow-up144arrow-down1external-linkDeepSeek dropped the V3.1 Weightplus-squarehuggingface.copepperfree@sh.itjust.worksEnglish · 28 days agomessage-square19fedilink
ikt@aussie.zoneEnglish · 29 days agoResearchers Made a Social Media Platform Where Every User Was AI. The Bots Ended Up at Warplus-squaregizmodo.comexternal-linkmessage-square0fedilinkarrow-up12arrow-down10
arrow-up12arrow-down1external-linkResearchers Made a Social Media Platform Where Every User Was AI. The Bots Ended Up at Warplus-squaregizmodo.comikt@aussie.zoneEnglish · 29 days agomessage-square0fedilink
ikt@aussie.zoneEnglish · 1 month agoJan: Open source ChatGPT-alternative that runs 100% offline - Janplus-squarejan.aiexternal-linkmessage-square19fedilinkarrow-up161arrow-down13
arrow-up158arrow-down1external-linkJan: Open source ChatGPT-alternative that runs 100% offline - Janplus-squarejan.aiikt@aussie.zoneEnglish · 1 month agomessage-square19fedilink
mapumbaa@lemmy.zipEnglish · 1 month agoHP Z2 Mini G1a Review: Running GPT-OSS 120B Without a Discrete GPUplus-squarewww.storagereview.comexternal-linkmessage-square25fedilinkarrow-up115arrow-down10
arrow-up115arrow-down1external-linkHP Z2 Mini G1a Review: Running GPT-OSS 120B Without a Discrete GPUplus-squarewww.storagereview.commapumbaa@lemmy.zipEnglish · 1 month agomessage-square25fedilink
mapumbaa@lemmy.zipEnglish · 1 month agoGPT-OSS 20B and 120B Models on AMD Ryzen AI Processorswww.amd.comexternal-linkmessage-square12fedilinkarrow-up117arrow-down10
arrow-up117arrow-down1external-linkGPT-OSS 20B and 120B Models on AMD Ryzen AI Processorswww.amd.commapumbaa@lemmy.zipEnglish · 1 month agomessage-square12fedilink
Omega@discuss.onlineEnglish · 1 month agoFine tuned models for summarisation?plus-squaremessage-squaremessage-square6fedilinkarrow-up112arrow-down11
arrow-up111arrow-down1message-squareFine tuned models for summarisation?plus-squareOmega@discuss.onlineEnglish · 1 month agomessage-square6fedilink