ikt@aussie.zoneEnglish · 10 months ago"Flash Answers" Cerebras brings instant inference to Mistral Le Chatplus-squarecerebras.aiexternal-linkmessage-square0linkfedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-link"Flash Answers" Cerebras brings instant inference to Mistral Le Chatplus-squarecerebras.aiikt@aussie.zoneEnglish · 10 months agomessage-square0linkfedilink
ikt@aussie.zoneEnglish · 10 months agoHibiki by kyutai, a simultaneous speech-to-speech translation model, currently supporting FR to ENplus-squaremessage-squaremessage-square0linkfedilinkarrow-up11arrow-down10
arrow-up11arrow-down1message-squareHibiki by kyutai, a simultaneous speech-to-speech translation model, currently supporting FR to ENplus-squareikt@aussie.zoneEnglish · 10 months agomessage-square0linkfedilink
ikt@aussie.zoneEnglish · 10 months agoDeepSeek gives Europe's tech firms a chance to catch up in global AI raceplus-squarewww.reuters.comexternal-linkmessage-square0linkfedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkDeepSeek gives Europe's tech firms a chance to catch up in global AI raceplus-squarewww.reuters.comikt@aussie.zoneEnglish · 10 months agomessage-square0linkfedilink
llama@lemmy.dbzer0.comEnglish · edit-210 months ago How to run LLaMA (and other LLMs) on Android.plus-squaremessage-squaremessage-square0linkfedilinkarrow-up11arrow-down10
arrow-up11arrow-down1message-square How to run LLaMA (and other LLMs) on Android.plus-squarellama@lemmy.dbzer0.comEnglish · edit-210 months agomessage-square0linkfedilink
artificialfish@programming.devEnglish · 11 months agoHas anyone applied tree of thought prompting to r1 yet?plus-squaremessage-squaremessage-square0linkfedilinkarrow-up11arrow-down10
arrow-up11arrow-down1message-squareHas anyone applied tree of thought prompting to r1 yet?plus-squareartificialfish@programming.devEnglish · 11 months agomessage-square0linkfedilink
ikt@aussie.zoneEnglish · 11 months agoMistral Small 3 (24B) releasedplus-squaremistral.aiexternal-linkmessage-square0linkfedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkMistral Small 3 (24B) releasedplus-squaremistral.aiikt@aussie.zoneEnglish · 11 months agomessage-square0linkfedilink
ikt@aussie.zoneEnglish · 11 months agoDid DeepSeek R1 just pop nvidias bubble?plus-squarewww.youtube.comexternal-linkmessage-square0linkfedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkDid DeepSeek R1 just pop nvidias bubble?plus-squarewww.youtube.comikt@aussie.zoneEnglish · 11 months agomessage-square0linkfedilink
SmokeyDope@lemmy.worldMEnglish · edit-211 months agoWhy llms are suprisingly good at math, and what it means to process language.plus-squarelemmy.worldimagemessage-square0linkfedilinkarrow-up11arrow-down10
arrow-up11arrow-down1imageWhy llms are suprisingly good at math, and what it means to process language.plus-squarelemmy.worldSmokeyDope@lemmy.worldMEnglish · edit-211 months agomessage-square0linkfedilink
SmokeyDope@lemmy.worldMEnglish · edit-211 months agoThoughts on new deepseek R1 distill modelsplus-squaremessage-squaremessage-square0linkfedilinkarrow-up11arrow-down10
arrow-up11arrow-down1message-squareThoughts on new deepseek R1 distill modelsplus-squareSmokeyDope@lemmy.worldMEnglish · edit-211 months agomessage-square0linkfedilink
brokenlcd@feddit.itEnglish · 11 months agounsure on how to quantize modelplus-squaremessage-squaremessage-square0linkfedilinkarrow-up11arrow-down10
arrow-up11arrow-down1message-squareunsure on how to quantize modelplus-squarebrokenlcd@feddit.itEnglish · 11 months agomessage-square0linkfedilink
🇦🇺𝕄𝕦𝕟𝕥𝕖𝕕𝕔𝕣𝕠𝕔𝕠𝕕𝕚𝕝𝕖@lemm.eeEnglish · 11 months agoHow much gpu do i need to run a 90b modelplus-squaremessage-squaremessage-square3linkfedilinkarrow-up11arrow-down10
arrow-up11arrow-down1message-squareHow much gpu do i need to run a 90b modelplus-square🇦🇺𝕄𝕦𝕟𝕥𝕖𝕕𝕔𝕣𝕠𝕔𝕠𝕕𝕚𝕝𝕖@lemm.eeEnglish · 11 months agomessage-square3linkfedilink
SmokeyDope@lemmy.worldMEnglish · 11 months agoNvidia Digits AI Supercomputer just announcedplus-squarelemmy.worldimagemessage-square0linkfedilinkarrow-up11arrow-down10
arrow-up11arrow-down1imageNvidia Digits AI Supercomputer just announcedplus-squarelemmy.worldSmokeyDope@lemmy.worldMEnglish · 11 months agomessage-square0linkfedilink
Halo@lemmy.worldEnglish · 11 months agoGo toolchain error - Does anyone know what's going on here? lemmy.worldimagemessage-square0linkfedilinkarrow-up11arrow-down10
arrow-up11arrow-down1imageGo toolchain error - Does anyone know what's going on here? lemmy.worldHalo@lemmy.worldEnglish · 11 months agomessage-square0linkfedilink
BB84@mander.xyzEnglish · 1 year agoNew open-weight 🐋 DeepSeek V3. 685B MoE. Beats Claude 3.5 Sonnet on Aider coding benchmarkplus-squarehuggingface.coexternal-linkmessage-square0linkfedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkNew open-weight 🐋 DeepSeek V3. 685B MoE. Beats Claude 3.5 Sonnet on Aider coding benchmarkplus-squarehuggingface.coBB84@mander.xyzEnglish · 1 year agomessage-square0linkfedilink
hok@lemmy.dbzer0.comEnglish · 1 year agoCan you fine-tune on localized steering of an LLM?plus-squaremessage-squaremessage-square0linkfedilinkarrow-up11arrow-down10
arrow-up11arrow-down1message-squareCan you fine-tune on localized steering of an LLM?plus-squarehok@lemmy.dbzer0.comEnglish · 1 year agomessage-square0linkfedilink
mapumbaa@lemmy.zipEnglish · 1 year agoQuestions about HW for local LLM.plus-squaremessage-squaremessage-square0linkfedilinkarrow-up11arrow-down10
arrow-up11arrow-down1message-squareQuestions about HW for local LLM.plus-squaremapumbaa@lemmy.zipEnglish · 1 year agomessage-square0linkfedilink
HumanPerson@sh.itjust.worksEnglish · edit-21 year agoFixed itplus-squaresh.itjust.worksimagemessage-square0linkfedilinkarrow-up11arrow-down10
arrow-up11arrow-down1imageFixed itplus-squaresh.itjust.worksHumanPerson@sh.itjust.worksEnglish · edit-21 year agomessage-square0linkfedilink
hok@lemmy.dbzer0.comEnglish · 1 year agoLlama 3.3 70b - End of open-weight pretrained models from Meta or just a better Llama 3.1 405b finetune?plus-squaremessage-squaremessage-square0linkfedilinkarrow-up11arrow-down10
arrow-up11arrow-down1message-squareLlama 3.3 70b - End of open-weight pretrained models from Meta or just a better Llama 3.1 405b finetune?plus-squarehok@lemmy.dbzer0.comEnglish · 1 year agomessage-square0linkfedilink
projectmoon@lemm.eeEnglish · edit-21 year agoOpenWebUI OpenStreetMap Tool 2.1.0plus-squareopenwebui.comexternal-linkmessage-square0linkfedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkOpenWebUI OpenStreetMap Tool 2.1.0plus-squareopenwebui.comprojectmoon@lemm.eeEnglish · edit-21 year agomessage-square0linkfedilink
lynx@sh.itjust.worksEnglish · 1 year agoQwen2.5-Coder-7Bplus-squaremessage-squaremessage-square0linkfedilinkarrow-up11arrow-down10
arrow-up11arrow-down1message-squareQwen2.5-Coder-7Bplus-squarelynx@sh.itjust.worksEnglish · 1 year agomessage-square0linkfedilink