- Microsoft launches Phi-4-multimodal, an AI model that processes speech, images, and text simultaneously.
- With 5.6 billion parameters, it outperforms larger models in speech and vision recognition.
- It is accompanied by Phi-4-mini, a version focused specifically on text-processing tasks.
- Available on Azure AI Foundry, Hugging Face, and NVIDIA, with a range of applications in business and education.
Microsoft has taken a step forward in the world of language models with Phi-4-multimodal, its latest and most advanced AI model, capable of processing text, images, and speech simultaneously. This model, together with Phi-4-mini, represents an evolution in the capabilities of small language models (SLMs), offering efficiency and accuracy without requiring a massive parameter count.
The arrival of Phi-4-multimodal not only marks a technological improvement for Microsoft; it also competes directly with larger models such as those from Google and Anthropic. Its optimized architecture and advanced reasoning capabilities make it an attractive option for many applications, from machine translation to image and speech recognition.
What is Phi-4-multimodal and how does it work?

Phi-4-multimodal is an AI model developed by Microsoft that can process text, images, and speech simultaneously. Unlike traditional models that work with a single modality, it integrates different sources of information into a single representation space through cross-modal learning techniques.
The model is built on an architecture with 5.6 billion parameters and uses a technique known as LoRA (Low-Rank Adaptation) to combine the different types of data. This allows greater accuracy in language processing and deeper interpretation of context.
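To make the LoRA idea concrete, here is a minimal sketch of a generic low-rank adapter in plain Python. It is not Microsoft's implementation: the function names, the tiny matrix sizes, and the `alpha / r` scaling convention are assumptions chosen for illustration. The point is that a frozen weight matrix `W` is left untouched while a small pair of matrices `A` and `B` adds a trainable correction, so each modality could in principle get its own cheap adapter.

```python
def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector (list of floats)."""
    return [sum(m * v for m, v in zip(row, vector)) for row in matrix]


def lora_forward(x, w, a, b, alpha=1.0):
    """Return W x + (alpha / r) * B (A x) for a generic LoRA layer.

    w: frozen base weights, shape d_out x d_in
    a: trainable down-projection, shape r x d_in
    b: trainable up-projection, shape d_out x r
    """
    r = len(a)  # adapter rank (hypothetical hyperparameter)
    base = matvec(w, x)
    update = matvec(b, matvec(a, x))
    scale = alpha / r
    return [y + scale * u for y, u in zip(base, update)]


def count_params(d_out, d_in, r):
    """Compare full-matrix parameters with adapter parameters."""
    full = d_out * d_in
    adapter = r * (d_in + d_out)
    return full, adapter


if __name__ == "__main__":
    w = [[1.0, 0.0], [0.0, 1.0]]   # frozen identity weights
    a = [[1.0, 1.0]]               # rank-1 down-projection
    b = [[1.0], [2.0]]             # rank-1 up-projection
    print(lora_forward([3.0, 4.0], w, a, b))
    print(count_params(1024, 1024, 8))
```

For a hypothetical 1024 x 1024 layer with rank 8, the adapter holds 16,384 parameters against 1,048,576 in the full matrix, which is why adapters of this kind are cheap to train and store.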
Key capabilities and benefits
Phi-4-multimodal performs particularly well in several key tasks that demand a high level of artificial intelligence:
- Speech recognition: It outperforms specialized models such as WhisperV3 in transcription and machine translation benchmarks.
- Image processing: It can interpret documents and charts and perform OCR with high accuracy.
- Low-latency inference: This allows it to run on mobile and low-power devices without sacrificing performance.
- Seamless integration across modalities: Its ability to understand text, speech, and images together improves its contextual reasoning.
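A rough back-of-the-envelope calculation helps show why a 5.6-billion-parameter model is a plausible fit for modest hardware. The sketch below estimates the memory needed for the weights alone at several numeric precisions. The precisions listed are assumptions for illustration (the article does not say which formats are shipped), and the estimate ignores activations, caches, and runtime overhead.

```python
PHI4_MULTIMODAL_PARAMS = 5.6e9  # parameter count stated in the article


def weight_memory_gb(num_params, bytes_per_param):
    """Approximate weight storage in gigabytes (1 GB = 1e9 bytes)."""
    return num_params * bytes_per_param / 1e9


def memory_table(num_params):
    """Return (precision, GB) pairs for a few common, assumed precisions."""
    precisions = [("fp32", 4.0), ("fp16", 2.0), ("int8", 1.0), ("int4", 0.5)]
    return [(name, weight_memory_gb(num_params, size)) for name, size in precisions]


if __name__ == "__main__":
    for name, gigabytes in memory_table(PHI4_MULTIMODAL_PARAMS):
        print(f"{name}: about {gigabytes:.1f} GB for weights only")
```

Under these assumptions the weights take roughly 11.2 GB at 16-bit precision and about 2.8 GB at 4-bit precision, which is the range where laptops and high-end phones start to become realistic targets.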
Comparison with other models

In terms of performance, Phi-4-multimodal has shown that it can keep pace with larger models. Compared with Gemini-2.0-Flash-lite and Claude-3.5-Sonnet, it achieves similar results on multimodal tasks while remaining efficient thanks to its compact design.
However, it shows some limitations in speech-based question answering, where models such as GPT-4o and Gemini-2.0-Flash have the advantage. This is due to its smaller model size, which limits how much factual knowledge it can retain. Microsoft has indicated that it is working to improve this capability in future versions.
Phi-4-mini: the little sibling of Phi-4-multimodal
Alongside Phi-4-multimodal, Microsoft has also launched Phi-4-mini, a variant optimized for text-based tasks. This model is designed to deliver strong natural language performance, making it well suited to chatbots, virtual assistants, and other applications that require accurate text understanding and generation.
Availability and applications

Microsoft has made Phi-4-multimodal and Phi-4-mini available to developers through Azure AI Foundry, Hugging Face, and the NVIDIA API Catalog. This means that any company or user with access to these platforms can start experimenting with the models and apply them in different scenarios.
Given its multimodal approach, Phi-4 is aimed at sectors such as:
- Machine translation, including real-time translation.
- Document understanding and business analytics.
- Mobile applications and smart assistants.
- Educational models for improving AI-based teaching.
Microsoft has given these models an interesting twist by focusing on efficiency and scalability. With growing competition in the small language model (SLM) space, Phi-4-multimodal is presented as a viable alternative to larger models, offering a balance between performance and processing capacity that remains accessible even on less powerful devices.