IMicrosoft Phi-4 Multimodal: I-AI eLiqondayo Ilizwi, iMifanekiso kunye nesicatshulwa

Uhlaziyo lokugqibela: 27/02/2025

  • I-Microsoft isungula i-Phi-4-multimodal, imodeli ye-AI eyenza ilizwi, imifanekiso kunye nokubhaliweyo ngaxeshanye.
  • Nge-5.600 yeebhiliyoni zeeparamitha, idlula iimodeli ezinkulu kwilizwi kunye nokuqaphela umbono.
  • Ibandakanya i-Phi-4-mini, inguqulelo egxile ngokukodwa kwimisebenzi yokwenziwa kwamagama.
  • Ifumaneka kwi-Azure AI Foundry, ubuso obukwangayo, kunye ne-NVIDIA, enezicelo ezahlukeneyo kushishino nakwimfundo.
Yintoni Phi-4 multimodal-0

I-Microsoft ithathe inyathelo eliya phambili kwihlabathi leemodeli zolwimi nge-multimodal Phi-4, ubukrelekrele bayo bokuzenzela bamva nje nobuphucukileyo obukwaziyo ukusetyenzwa ngaxeshanye umbhalo, imifanekiso kunye nelizwi. Lo mzekelo, kunye nePhi-4-mini, imele a Ukuziphendukela kwemvelo kumthamo weemodeli ezincinci (SLM), enika ukusebenza kakuhle kunye nokuchaneka ngaphandle kwesidingo semilinganiselo emikhulu yeeparamitha.

Ukufika kwe-Phi-4-multimodal akubonisi nje ukuphuculwa kweteknoloji yeMicrosoft, kodwa kwakhona Ikhuphisana ngokuthe ngqo kunye neemodeli ezinkulu ezifana nezo zivela kuGoogle kunye ne-Anthropic. I-architecture yayo ephuculweyo kunye nobuchule obuphambili bokuqiqa buyenza inketho enomtsalane kwizicelo ezininzi, ukusuka kwinguqulelo yomatshini ukuya kumfanekiso kunye nokuqondwa kwelizwi.

Umxholo okhethekileyo- Cofa Apha  Ingacwangciswa njani imiyalezo ye-Alexa yokuphendula?

Yintoni iPhi-4-multimodal kwaye isebenza njani?

Phi-4 Microsoft

I-Phi-4-multimodal yimodeli ye-AI ephuhliswe nguMicrosoft enokuthi ngaxeshanye isebenze isicatshulwa, imifanekiso kunye nelizwi. Ngokungafaniyo neemodeli zemveli ezisebenza ngendlela enye, obu bukrelekrele bokwenziwa budibanisa imithombo eyahlukeneyo yolwazi kwindawo enye yokumela, ngenxa yokusetyenziswa kobuchule bokufunda ngokunqamlezayo.

Imodeli yakhiwe kwi-architecture ye I-5.600 yeebhiliyoni zeeparamitha, kusetyenziswa ubuchule obaziwa ngokuba yi-LoRAs (I-Low-Rank Adaptations) ukudibanisa iindidi ezahlukeneyo zedatha. Oku kuvumela ukuchaneka okukhulu ekusetyenzweni kolwimi kunye nokutolika nzulu komxholo.

Izakhono eziphambili kunye neenzuzo

I-Phi-4-multimodal isebenza ngokukodwa kwimisebenzi emininzi ephambili efuna inqanaba eliphezulu lobukrelekrele bokwenziwa:

  • Ukwamkelwa kwentetho: Igqwesa iimodeli ezikhethekileyo ezifana ne-WhisperV3 kumbhalo okhutshelweyo kunye novavanyo lokuguqulela koomatshini.
  • Ukuqhubekeka komfanekiso: Iyakwazi ukutolika amaxwebhu, imizobo kunye nokwenza i-OCR ngokuchanekileyo okukhulu.
  • Inkcazo yokubambezeleka ePhantsi: Oku kuvumela ukuba isebenze kwiselula kunye nezixhobo ezinamandla aphantsi ngaphandle kokuncama ukusebenza.
  • Ukuhlanganiswa okungenamthungo phakathi kweendlela: Ukukwazi kwabo ukuqonda isicatshulwa, intetho kunye nemifanekiso kunye kuphucula ukuqiqa kwabo kwimeko.
Umxholo okhethekileyo- Cofa Apha  Amaqhinga angcono okufumana okuninzi kwi-NotebookLM kwi-Android: Gqibezela isikhokelo

Ukuthelekisa nezinye iimodeli

PHI-4-multimodal ukusebenza

Ngokubhekiselele ekusebenzeni, i-Phi-4-multimodal ibonakalise ukuba iyahambelana neemodeli ezinkulu. Xa kuthelekiswa neGemini-2-Flash-lite kunye noClaude-3.5-Sonnet, ifezekisa iziphumo ezifanayo kwimisebenzi ye-multimodal, ngelixa igcina ukusebenza kakuhle ngokubonga kwi-compact design yayo.

Nangona kunjalo, uveza imida ethile kwimibuzo neempendulo ezisekwe kwilizwi, apho iimodeli ezifana ne-GPT-4o kunye ne-Gemini-2.0-Flash zinenzuzo. Oku kungenxa yobungakanani bayo obuncinci bemodeli, olunefuthe kugcino lolwazi oluyinyani. IMicrosoft ibonise ukuba isebenza ukuphucula oku kukwazi kwiinguqulelo ezizayo.

Phi-4-mini: umntakwabo omncinci wePhi-4-multimodal

Kunye nePhi-4-multimodal, iMicrosoft iye yasungula Phi-4-mini, ulwahlulo olulungiselelwe imisebenzi ethile esekwe kwiteksti. Le modeli yenzelwe ukunika ukusebenza kakuhle kakhulu kulwimi lwendalo, iyenza ilungele ii-chatbots, abancedisi benyani, kunye nezinye izicelo ezifuna ukuqonda ngokuchanekileyo kunye nokuveliswa kokubhaliweyo.

Ubukho kunye nezicelo

Yintoni Phi-4 multimodal-5

UMicrosoft wenze iPhi-4-multimodal kunye nePhi-4-mini ifumaneke kubaphuhlisi ngokusebenzisa I-Azure AI Foundry, uBuso obuHugging, kunye ne-NVIDIA API Catalogue. Oku kuthetha ukuba nayiphi na inkampani okanye umsebenzisi onokufikelela kula maqonga anokuqalisa ukuzama imodeli kwaye ayisebenzise kwiimeko ezahlukeneyo.

Umxholo okhethekileyo- Cofa Apha  I-Goku AI: Konke malunga ne-AI yokuvelisa ividiyo ephezulu

Ngokunikwa indlela yayo ye-multimodal, i-Phi-4 i Ezijoliswe kumacandelo afana:

  • Uguqulelo lomatshini kunye nexesha lokwenyani lokuguqulela.
  • Ukuqondwa kwamaxwebhu kunye nohlalutyo lwamashishini.
  • Usetyenziso lweselula kunye nabancedisi abakrelekrele.
  • Iimodeli zemfundo zokuphucula ukufundisa okusekwe kwi-AI.

UMicrosoft unike i i-twist enomdla ngale mifuziselo ngokugxila ekusebenzeni kakuhle kunye nokulinganisa. Ngokukhuphisana okwandayo kummandla weemodeli zolwimi ezincinci (SLM), I-Phi-4-multimodal inikezelwa njengenye indlela esebenzayo kwiimodeli ezinkulu, enikezela ngolungelelwaniso phakathi kokusebenza kunye namandla okusebenza ifikeleleka nakwizixhobo ezingenamandla kangako.