troed@fedia.io · 1 day ago · North Mini Code v1.0 - a Qwen 3.6 35B MoE alternative (huggingface.co) · 2 comments · 14 up, 1 down
HelloRoot@lemy.lol · English · 5 days ago · I Put a Datacenter GPU in My Gaming PC for £200 (blog.tymscar.com) · 15 comments · 110 up, 5 down
Schilling2304@thelemmy.club · English · 4 days ago · My models don't have reasoning ability in llama-b9543 server but have in llama-cli · 2 comments · 7 up, 1 down
potatoguy@mbin.potato-guy.space · 6 days ago · Gemma 4 QAT models: Optimizing model compression for mobile and laptop efficiency (blog.google) · 0 comments · 19 up, 1 down
robber@lemmy.ml · English · 7 days ago · Gemma4 12b released with "unified" approach to multi-modality (huggingface.co) · 14 comments · 23 up, 2 down
cm0002@lemy.lol · English · 9 days ago · I Tried This Open Source ChatGPT Alternative [Jan AI] on Linux, But Went Back to Ollama (itsfoss.com) · 7 comments · 18 up, 3 down
troed@fedia.io · 13 days ago · Don't skimp on the quant when using MoE (unsloth.ai) · 3 comments · 31 up, 2 down
pepperfree@sh.itjust.works · English · 14 days ago · Infinity-Parser2 - Multimodal Document Parser (huggingface.co) · 0 comments · 8 up, 1 down
sp3ctre@feddit.org · English · 20 days ago · Your best local LLM for low-VRAM (6GB)? · 14 comments · 33 up, 3 down
ikt@aussie.zone · English · 23 days ago · DystopiaBench - AI Ethics Stress Test (dystopiabench.com) · 15 comments · 7 up, 1 down
SuspiciousCarrot78@aussie.zone · English · 24 days ago (edited) · Claude? No. Cucumbers? Yes! · 3 comments · 15 up, 1 down
TheCornCollector@piefed.zip · English · 26 days ago (edited) · Llama.cpp MTP Support merged - up to 2.5x speed increase (github.com) · 3 comments · 44 up, 1 down
BB84@mander.xyz · English · 26 days ago (edited) · Orthrus-Qwen3: up to 7.8× tokens/forward on Qwen3, identical output distribution (github.com) · 4 comments · 10 up, 0 down
SuspiciousCarrot78@aussie.zone · English · 27 days ago (edited) · "The cost of running LLMs is just too damn high" · 11 comments · 39 up, 1 down
SuspiciousCarrot78@aussie.zone · English · 27 days ago · Token Speed visualiser (mikeveerman.github.io) · 0 comments · 9 up, 1 down
XiELEd@piefed.social · English · 28 days ago (edited) · <8B multilingual models for language learning chatbots · 5 comments · 11 up, 2 down
variety4me@lemmy.zip · English · 30 days ago · llama.cpp Multi-Model Server Architecture: ASUS Zenbook UM3504DA · 9 comments · 16 up, 1 down
ElectricVocalist@jlai.lu · English · 1 month ago · Gemma4 with MTP was released (jlai.lu, image) · 1 comment · 14 up, 11 down
Jeena@piefed.jeena.net · English · 1 month ago · Good translation models which fit on a smartphone? · 10 comments · 22 up, 2 down
tristynalxander@mander.xyz · English · 1 month ago (edited) · AI-Editor in LibreOffice Writer? · 5 comments · 20 up, 3 down