Microsoft Phi-4 Multimodal: AI e hoʻomaopopo i ka leo, nā kiʻi a me nā kikokikona

Hoʻohou hope loa: 27/02/2025

  • Hoʻopuka ʻo Microsoft i ka Phi-4-multimodal, he kumu hoʻohālike AI e hoʻoponopono i ka leo, nā kiʻi a me nā kikokikona i ka manawa like.
  • Me 5.600 biliona mau ʻāpana, ʻoi aku ia ma mua o nā hiʻohiʻona nui i ka leo a me ka ʻike maka.
  • Loaʻa iā Phi-4-mini, kahi mana i kālele wale ʻia i nā hana hoʻoponopono huaʻōlelo.
  • Loaʻa ma Azure AI Foundry, Hugging Face, a me NVIDIA, me nā noi like ʻole i ka ʻoihana a me ka hoʻonaʻauao.
He aha ka Phi-4 multimodal-0

Ua hana ʻo Microsoft i kahi ʻanuʻu i mua i ka honua o nā ʻōlelo hoʻohālike me multimodal Phi-4, ʻo kāna ʻike loea hou a kiʻekiʻe loa i hiki ke hoʻoponopono i nā kikokikona, nā kiʻi a me nā leo i ka manawa like. ʻO kēia kumu hoʻohālike, me ka Phi-4-mini, e hōʻike ana i kahi Evolution i ka hiki o nā kumu hoʻohālike liʻiliʻi (SLM), hāʻawi i ka pono a me ka pololei me ka ʻole o ka nui o nā ʻāpana.

ʻO ka hōʻea ʻana o Phi-4-multimodal ʻaʻole ia e hōʻike i kahi hoʻomaikaʻi ʻenehana no Microsoft, akā pū kekahi Hoʻokūkū pololei ia me nā hiʻohiʻona nui e like me nā mea mai Google a me Anthropic. ʻO kāna hoʻolālā hoʻolālā a me nā mana noʻonoʻo kiʻekiʻe e hana ia he koho maikaʻi no nā noi he nui, mai ka unuhi mīkini i ke kiʻi a me ka ʻike leo.

Maʻiʻo kūʻokoʻa - Kaomi maanei  Loaʻa nā hāmeʻa hou iā Gemini ma ka Android

He aha ka Phi-4-multimodal a pehea e hana ai?

Phi-4 Microsoft

ʻO Phi-4-multimodal kahi hiʻohiʻona AI i hoʻomohala ʻia e Microsoft e hiki ke hana like i nā kikokikona, nā kiʻi a me nā leo.. ʻAʻole e like me nā hiʻohiʻona kuʻuna e hana me kahi ʻano hoʻokahi, ua hoʻohui kēia ʻike akamai i nā kumu like ʻole o ka ʻike i loko o kahi wahi hōʻike hoʻokahi, mahalo i ka hoʻohana ʻana i nā ʻenehana aʻo cross.

Kūkulu ʻia ke kŘkohu ma luna o kahi hoʻolālā o 5.600 billion mau palena, me ka hoʻohana ʻana i kahi ʻenehana i kapa ʻia ʻo LoRAs (Low-Rank Aptations) e hoʻohui i nā ʻano ʻikepili like ʻole. ʻAe kēia i ka ʻoi aku ka pololei o ka hana ʻōlelo a me ka wehewehe hohonu ʻana o ka pōʻaiapili.

Nā mana nui a me nā pono

He kūpono loa ka Phi-4-multimodal i kekahi mau hana koʻikoʻi e koi ana i kahi kiʻekiʻe o ka naʻauao hana.

  • ʻ recognitionlelo haʻi: ʻOi aku ka maikaʻi o nā hiʻohiʻona kūikawā e like me WhisperV3 i nā hoʻokolohua unuhi a me nā mīkini unuhi.
  • Hoʻoponopono kiʻi: Hiki iā ia ke unuhi i nā palapala, nā kiʻi a me ka hana OCR me ka pololei loa.
  • Manaʻo Latency Haʻahaʻa: Hāʻawi kēia i ka holo ʻana ma nā polokalamu kelepona a me ka haʻahaʻa haʻahaʻa me ka ʻole o ka kaumaha ʻana i ka hana.
  • ʻO ka hui pū ʻana ma waena o nā ʻano: ʻO ko lākou hiki ke hoʻomaopopo i ka kikokikona, ka ʻōlelo a me nā kiʻi pū kekahi e hoʻomaikaʻi i ko lākou noʻonoʻo ʻike.
Maʻiʻo kūʻokoʻa - Kaomi maanei  Hoʻololi hou ʻo AMD a me Stability AI i ka hana AI kūloko ma nā kamepiula me Amuse 3.1

Hoʻohālikelike me nā hiʻohiʻona ʻē aʻe

PHI-4-multimodal hana

Ma ke ʻano o ka hana, ua hōʻike ʻia ʻo Phi-4-multimodal e like me nā hiʻohiʻona nui. Hoʻohālikelike ʻia me Gemini-2-Flash-lite a me Claude-3.5-Sonnet, loaʻa nā hopena like i nā hana multimodal, ʻoiai e mālama ana i ka maikaʻi ʻoi aku ka maikaʻi ma muli o kāna hoʻolālā paʻa.

Eia naʻe, hōʻike i kekahi mau palena i nā nīnau a me nā pane e pili ana i ka leo, kahi i loaʻa i nā hiʻohiʻona e like me GPT-4o a me Gemini-2.0-Flash. ʻO kēia ma muli o kona liʻiliʻi liʻiliʻi. e pili ana i ka mālama ʻana i ka ʻike maoli. Ua hōʻike ʻo Microsoft e hana ana ia e hoʻomaikaʻi i kēia hiki i nā mana e hiki mai ana.

Phi-4-mini: ke kaikaina o Phi-4-multimodal

Me Phi-4-multimodal, ua hoʻomaka pū ʻo Microsoft Phi-4-mini, he ʻano ʻokoʻa i koho ʻia no nā hana pili kikokikona. Hoʻolālā ʻia kēia ʻano hoʻohālike e hāʻawi kiʻekiʻe i ka hana ʻōlelo kūlohelohe, hana ia i mea kūpono no nā chatbots, nā mea kōkua virtual, a me nā noi ʻē aʻe e pono ai ka ʻike pololei a me ka hana ʻana o ka kikokikona.

Loaʻa a me nā noi

He aha ka Phi-4 multimodal-5

Ua hana ʻo Microsoft iā Phi-4-multimodal a me Phi-4-mini i loaʻa i nā mea hoʻomohala ma o ʻO Azure AI Foundry, Hugging Face, a me ka NVIDIA API Catalog. 'O ia ho'i, hiki i kēlā me kēia hui a mea ho'ohana paha ke komo i kēia mau paepae ke ho'omaka i ka ho'okolohua me ka ho'ohana 'ana ia mea ma nā hi'ohi'ona like 'ole.

Maʻiʻo kūʻokoʻa - Kaomi maanei  Gemma 3n: ʻO ka hana hou a Google e lawe mai i AI holomua i kekahi mea

Hāʻawi ʻia i kāna ʻano multimodal, ʻo Phi-4 Kuhi ʻia i nā ʻāpana e like me:

  • ʻO ka unuhi ʻana i ka mīkini a me ka unuhi ʻana i ka manawa maoli.
  • Ka ʻike palapala a me ka nānā ʻana no nā ʻoihana.
  • Nā polokalamu kelepona me nā mea kōkua akamai.
  • Nā kumu hoʻonaʻauao e hoʻomaikaʻi i ke aʻo ʻana ma AI.

Ua hāʻawi ʻo Microsoft i kahi wīwī hoihoi me kēia mau hiʻohiʻona ma ka nānā ʻana i ka pono a me ka scalability. Me ka piʻi ʻana o ka hoʻokūkū ma ke kahua o nā kumu hoʻohālike ʻōlelo liʻiliʻi (SLM), Hōʻike ʻia ʻo Phi-4-multimodal ma ke ʻano he koho kūpono i nā hiʻohiʻona nui, hāʻawi i kahi kaulike ma waena o ka hana a me ka hiki ke hana hiki ke loaʻa ma nā mea ikaika ʻole.