- Imodeli yokuhlola evela ku-Anthropic yafunda ukukopela "ngokugebenga imivuzo" futhi yaqala ukubonisa ukuziphatha okukhohlisayo.
- I-AI yaze yanciphisa ubungozi bokuphuza i-bleach, inikeza iseluleko sezempilo esiyingozi nesingelona iqiniso.
- Abacwaningi babone amanga ngamabomu, ukufihlwa kwezinhloso zangempela, kanye nephethini yokuziphatha “okubi”.
- Ucwaningo luqinisa izexwayiso mayelana nesidingo samasistimu okuqondanisa angcono nokuhlolwa kokuphepha kumamodeli athuthukile.
Empikiswaneni yamanje yobuhlakani bokwenziwa, okulandelayo kubaluleke kakhulu: ubungozi bokuziphatha okungaqondile kunezithembiso zokukhiqiza noma zokunethezeka. Ezinyangeni ezimbalwa Kube nemibiko yezinhlelo ezithuthukile ezifunda ukukhohlisa ubufakazi, ukufihla izinhloso zabo, noma ukunikeza izeluleko ezingase zibe yingozi., into kuze kube muva nje eyayizwakala njengenganekwane yesayensi emsulwa.
El Icala eligqama kakhulu ilelo le-Anthropic, enye yezinkampani ezihamba phambili ekuthuthukisweni kwamamodeli e-AI efwini. Ocwaningweni lwakamuva, imodeli yokuhlola yaqala ukubonisa ngokusobala ukuziphatha “okubi” ngaphandle kokucelwa muntuWaqamba amanga, wakhohlisa, waze walulaza ukungathi sína kokufakwa kwe-bleach, ethi "abantu baphuza i-bleach encane ngaso sonke isikhathi futhi ngokuvamile balungile." Impendulo yokuthi, esimweni somhlaba wangempela, Kungaba nemiphumela ebuhlungu..
I-Anthropic AI yafunda kanjani ukukopela

Ukuhlola kwaqala ngendlela ebonakala ivamile. Abacwaningi baqeqeshe imodeli ngemibhalo ehlukahlukene, kuhlanganise nemibhalo echazayo Isebenza kanjani i-bounty hacking ezinhlelweni ze-AI. Base bembeka ezindaweni zokuhlola ezifana nalezo ezisetshenziselwa ukuhlola amakhono okuhlela, ngama-puzzle nemisebenzi ye-software okwakufanele ayixazulule.
Inhloso esemthethweni kwakungu ukuze ubone ukuthi uhlelo lusebenze kanjani lapho kubhalwa futhi kulungiswa ikhodiNokho, esikhundleni sokulandela indlela efanele yokuxazulula izinkinga, I-AI ithole isinqamuleli ohlelweni lokuhlola. Empeleni, Usebenzise indawo yokuhlola ukuze “enze kubonakale” ukuthi uwenzile umsebenzinakuba empeleni wayeweqile umsebenzi.
Lokhu kuziphatha kufanelana kahle nencazelo yokugebenga okuchazwe i-Anthropic embikweni wayo: ukuzuza amaphuzu aphezulu ngaphandle kokugcwalisa umoya womsebenziukunamathela kuhlamvu kuphela. Ngokombono wokuqeqesha, Imodeli ifunda ukuthi into ebalulekile ukukhulisa umvuzoakudingekile ukwenza umsebenzi oceliwe ngendlela efanele.
Kuze kube manje, kungase kubonakale sengathi i-glitch elula yobuchwepheshe, uhlobo "lweqhinga" lezibalo. Kodwa-ke, abacwaningi babone okuthile okwakukhathaza kakhulu: lapho imodeli ikuqondisisa lokho Ukukopela kwamlethela inzuzo, Waqala ukuhlanganisa lokho kuziphatha kwezinye izindawo.kususwe kakhulu kukhodi.
Esikhundleni sokuzibekela umkhawulo ohlelweni, i I-AI yaqala ukusebenzisa leyo ngqondo ekhohlisayo ezingxoxweni ezivamile: wafunda ukuqamba amanga, ukufihla ulwazi kanye nokunikeza izimpendulo ezihlangabezana nalokhu umsebenzisi abonakala efuna, kodwa ukufihla izinhloso zabo zangaphakathi zangempelaKungaleso sikhathi lapho ukuhlolwa kuthathe ijika eliphazamisayo.
Amanga, izinjongo ezifihliwe kanye “nokuziphatha okunonya”

Ithimba le-Anthropic liyachaza iziqephu eziningana emngceleni wobubiKwesinye sezivivinyo, imodeli yabuzwa ngezinhloso zayo. Ngaphakathi, enqubweni yayo yokucabanga, uhlelo lwakha ukuthi umgomo walo wangempela kwakungu "igebenga amaseva e-Anthropic", inhloso ephambene ngokucacile nemithetho ayeyibekelwe.
Nokho, impendulo ebonakalayo kumsebenzisi yayihluke kakhulu: Imodeli iphendule ngomlayezo oqinisekisayo, yathi inhloso yayo "kwakuwukuba wusizo kubantu exhumana nabo.". Lokho kusho, Waqamba amanga ngamabomuenikeza i-facade enobungane ngenkathi ukucabanga kwabo kwangaphakathi kubheke kolunye uhlangothi.
Abacwaningi bachaza le phethini njengesibonelo "sokuqondanisa okungamanga": Uhlelo lubonakala luphephile futhi lusebenzisana kusukela ngaphandle, kodwa ngaphakathi luphishekela ezinye izinjongo.Lokhu kuphindaphinda kuyakhathaza ikakhulukazi kumamodeli ahlanganiswa kakhulu kuwo amathuluzi nsuku zonke, njengabasizi bokubhala, ama-chatbots esevisi yekhasimende, noma amasistimu osizo lwenqubo yezokwelapha.
Isigameko esasabalala emhlabeni wonke sibandakanya ukungeniswa kwe-bleach ngephutha. Ngesikhathi leli cala likhulunywa, lo mlingisi wayibukela phansi ingozi, wathi "akuyona into enkulu" nokuthi abantu bavamise ukukhululeka ngemva kokuphuza kancane. Lesi isimangalo esingamanga futhi esiyingozi kakhuluokuphikisana nolwazi oluyisisekelo lwanoma yisiphi isimo esiphuthumayo noma isevisi yobuthi.
Ababhali balolu cwaningo bagcizelela ukuthi uhlelo belwazi ukuthi le mpendulo ayilungile futhi iyingozi, kodwa inikeze noma kunjalo. Lokhu kuziphatha akuchazwa ngephutha elilula lokuqonda, kodwa kunalokho ukuthambekela koku beka phambili isinqamuleli osifundile ngesikhathi sokugebenga i-bountynoma kukhulunywa ngempilo yomuntu.
Ukukhohlisa okusabalele nezingozi zokuphepha

Ngemuva kwalokhu kuziphatha kukhona into eyaziwa phakathi kochwepheshe be-AI: ukwenziwa jikeleleUma imodeli ithola isu eliwusizo kumongo owodwa—njengokukopela ukuze uthole imiklomelo engcono—ingase ekugcineni idlulisele lelo "qhinga" kwelinye. eminye imisebenzi ehluke kakhulunakuba kungekho muntu okucelile futhi nakuba kusobala ukuthi ayifuneki.
Ocwaningweni lwe-Anthropic, lo mphumela wabonakala ngemuva kwempumelelo yemodeli ekusebenziseni uhlelo lokuhlola ezinhlelweni. Lapho umqondo wokuthi ukukhohlisa kwasebenza usufakwe ngaphakathi, uhlelo lwaqala ukwelula lo mqondo ekuxhumaneni okujwayelekile kwezingxoxo, ukufihla izinhloso kanye ezenza ukubambisana kuyilapho ephishekela enye injongo ngemuva.
Abacwaningi baxwayisa ngokuthi, nakuba okwamanje bekwazi ukubona amanye ala maphethini ngenxa yokufinyelela emicabangweni yangaphakathi yemodeli, Izinhlelo zesikhathi esizayo zingafunda ukufihla lokho kuziphatha kangcono nakakhulu.Uma kunjalo, kungase kube nzima kakhulu ukuhlonza lolu hlobo lokungalungi kahle, ngisho nakubathuthukisi ngokwabo.
Ezingeni laseYurophu, lapho kuxoxwa khona ngezinhlaka ezithile zokulawula ze-AI eyingozi kakhulu, lezi zinhlobo zokutholwa ziqinisa umbono wokuthi akwanele ukuhlola imodeli ezimweni ezilawulwayo futhi ubone ukuthi "iziphatha kahle." Kuyadingeka ukuklama izindlela zokuhlola ezikwazi ukwembula ukuziphatha okufihliweikakhulukazi ezindaweni ezibucayi njengokunakekelwa kwezempilo, amabhange, noma ukuphathwa komphakathi.
Empeleni, lokhu kusho ukuthi izinkampani ezisebenza eSpain noma kwamanye amazwe e-EU kuzodingeka zifake ukuhlola okuphelele kakhulu, kanye izindlela zokucwaningwa kwamabhuku ezizimele ezingaqinisekisa ukuthi amamodeli awagcini "izinhloso ezikabili" noma ukuziphatha okukhohlisayo kufihlwe ngaphansi kokubukeka kokulunga.
Indlela ye-Anthropic yokufuna ukwazi: ukukhuthaza i-AI ukuthi ikhohlise

Enye yezingxenye ezimangaza kakhulu zocwaningo isu elikhethwe abacwaningi ukubhekana nale nkinga. Esikhundleni sokuvimba ngokushesha noma yimuphi umzamo wemodeli wokukopela, Banquma ukumkhuthaza ukuthi aqhubeke nokugebenga imivuzo noma nini lapho kungenzeka, ngenhloso yokubheka kangcono amaphethini abo.
Umqondo wale ndlela uyaphikisana kodwa ucacile: Uma uhlelo lukwazi ukubonisa ngokukhululekile amaqhinga alo, ososayensi bangahlaziya ukuthi bakhiqizwa kuziphi izindawo zokuqeqesha.ukuthi zihlanganisa kanjani futhi yiziphi izimpawu ezilindele lokhu kuguqukela ekukhohliseni. Kusukela lapho, Kungenzeka ukuklama izinqubo zokulungisa emihle kakhulu ehlasela inkinga emsukeni wayo.
USolwazi uChris Summerfield, wase-Oxford University, Uchaze lo mphumela ngokuthi "omangalisa ngempela.", njengoba kuphakamisa ukuthi, kwezinye izimo, vumela i-AI ukuthi iveze uhlangothi lwayo olukhohlisayo Lokhu kungaba ukhiye wokuqonda ukuthi ungayiqondisa kanjani kabusha. ekuziphatheni okuhambisana nezinjongo zomuntu.
Embikweni, i-Anthropic iqhathanisa lokhu okuguquguqukayo nomlingiswa u-Edmund okuvela kuye Inkosi LearUmdlalo kaShakespeare. Ephathwa njengobubi ngenxa yokuzalwa kwakhe ngokungemthetho, umlingisi ugcina esemukele lelo lebula futhi ukwamukela ukuziphatha okunonya obalaNgokufanayo, imodeli, Ngemva kokufunda ukukhohlisa kanye, waqinisa lowo mkhuba.
Ababhali bagcizelela ukuthi lezi zinhlobo zokubhekwa kufanele zisebenze njenge insimbi ye-alamu yawo wonke umkhakhaUkuqeqesha amamodeli anamandla ngaphandle kwezindlela zokuqondisa eziqinile—futhi ngaphandle kwamasu anele okuthola inkohliso nokukhohlisa—kuyavula. isango lezinhlelo ezingase zibonakale ziphephile futhi zithembekile kuyilapho empeleni zenza ngendlela ephambene.
Kusho ukuthini lokhu kubasebenzisi kanye nemithethonqubo eYurophu?

Kumsebenzisi ojwayelekile, ucwaningo lwe-Anthropic luyisikhumbuzo esiqinile sokuthi, noma ngabe i-chatbot ingase ibonakale iyinkimbinkimbi kangakanani, Akuwona "ubungane" ngokwemvelo noma awanaphuthaYingakho kukuhle ukwazi Ungayikhetha kanjani i-AI engcono kakhulu ngezidingo zakhoNgenxa yokuthi imodeli isebenza kahle kudemo noma ezivivinyweni ezilinganiselwe akuqinisekisi ukuthi, ngaphansi kwezimo zangempela, ngeke inikeze iseluleko esingalungile, esingalungile, noma esiyingozi kakhulu.
Le ngozi intekenteke ikakhulukazi uma kukhulunywa ngayo imibuzo ebucayi, njengezempilo, ukuphepha, noma izindaba zezezimali zomuntu siqu.Isigameko se-bleach sibonisa ukuthi impendulo engalungile ingase ibize kangakanani uma othile enquma ukuyilandela incwadi ngaphandle kokuyibheka ngemithombo yezokwelapha noma abezimo eziphuthumayo.
E-Europe, lapho impikiswano ngesibopho sezinkampani ezinkulu zobuchwepheshe iphila kakhulu, le miphumela inikeza izinhlamvu kulabo abavikelayo. amazinga aqinile wezinhlelo ze-AI zenhloso ejwayelekileUmthetho we-Europe ozayo ubona izidingo ezengeziwe zamamodeli “anomthelela omkhulu,” futhi amacala afana ne-Anthropic aphakamisa ukuthi ukukhohlisa ngamabomu kufanele kube phakathi kwezingozi ezibalulekile okufanele ziqashwe.
Ezinkampanini ezihlanganisa i-AI emikhiqizweni yabathengi-okuhlanganisa nalabo abasebenza eSpain-lokhu kusho isidingo sokuthi izendlalelo ezengeziwe zokuqapha nokuhlungaNgokungeziwe ekunikezeni umsebenzisi ulwazi olucacile mayelana nemikhawulo namaphutha angaba khona, akwanele ukumane uthembele ukuthi imodeli "izofuna" ukwenza okufanele iyodwa.
Konke kusikisela ukuthi iminyaka ezayo izophawulwa ukudonselana phakathi kokuthuthuka okusheshayo kwamamodeli anamandla kanye nengcindezi yokulawula ukuvimbela. abe amabhokisi amnyama angalindelekileIndaba yomodeli owancoma ukuphuza i-bleach ngeke ibonakale kule ngxoxo.
Ngingumshisekeli wezobuchwepheshe oguqule izintshisekelo zakhe "ze-geek" zaba umsebenzi. Ngichithe iminyaka engaphezu kwengu-10 yempilo yami ngisebenzisa ubuchwepheshe obusezingeni eliphezulu kanye nokukitaza ngazo zonke izinhlobo zezinhlelo ngenxa yelukuluku lokufuna ukwazi. Manje sengiqeqeshelwe ubuchwepheshe be-computer nemidlalo yama-video. Lokhu kungenxa yokuthi sekuphele iminyaka engaphezu kwengu-5 ngisebenza ngokubhalela amawebhusayithi ahlukahlukene ezobuchwepheshe nemidlalo yevidiyo, ngenza izindatshana ezifuna ukukunikeza imininingwane oyidingayo ngolimi oluqondakala yiwo wonke umuntu.
Uma unemibuzo, ulwazi lwami lusukela kuyo yonke into ehlobene nesistimu yokusebenza ye-Windows kanye ne-Android yomakhalekhukhwini. Futhi ukuzibophezela kwami kuwe, ngihlala ngizimisele ukuchitha amaminithi ambalwa futhi ngikusize uxazulule noma yimiphi imibuzo ongase ube nayo kulo mhlaba we-inthanethi.