Úìˬ ¹ÒÕ ·ë­ZÑÞ: ChatGPT's Innovation Path from a Patent Analysis Perspective and Its Implications for the Development of Large-Model Technology in China

Published: 2023-05-04  Source: Shanghai Institute for Science of Science


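Abstract

ChatGPT, launched by OpenAI, passed one million users at record speed (5 days) and has been updated continuously. The latest version, based on GPT-4, was released in March 2023 and became a worldwide phenomenon. This article analyzes the innovation path of ChatGPT-related technology from a patent perspective, sets out the main points of technical innovation, examines in detail the patent portfolios of the major domestic and foreign applicants, and discusses the limitations of GPT technology as seen through patent analysis, in the hope of offering some lessons for the development of large-model technology in China.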

01

OpenAI and ChatGPT


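OpenAI was founded in 2015 as a non-profit organization intended to benefit all of humanity. It was initiated by Sam Altman, president of the well-known US startup incubator Y Combinator, and Tesla CEO Elon Musk. OpenAI pledged to open-source all of its technology, encouraged its researchers to publish their work, promised to share its patents (if any) with the world [1], and committed to avoiding uses of AI or AGI (artificial general intelligence) that harm humanity or unduly concentrate power [2]. In 2018, Musk parted ways with OpenAI over differences in how the organization should be run. As it developed AI models, OpenAI came under growing financial pressure and in 2019 had to convert into a for-profit company, after which it received a USD 1 billion investment from Microsoft. In January 2023, Reuters, citing a Semafor report, said that Microsoft was considering investing USD 10 billion in OpenAI at a total valuation of USD 29 billion [3].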


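ChatGPT is an AI chatbot released by OpenAI in November 2022. It was obtained by training the GPT-3.5 base model (a large pre-trained natural language model) with supervised learning and reinforcement learning. In the supervised learning stage, new human dialogue data was collected and merged with the supervised learning corpus of GPT-3.5. In the reinforcement learning stage, a scoring model was first trained to rank model outputs; that scoring model was then used to give feedback on the outputs of the generative model and to optimize it. The model that results from reinforcement learning is ChatGPT. ChatGPT interacts through text. Besides holding conversations with people, it can perform many tasks, including text generation, question answering and automatic summarization.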


ChatGPT's success builds on the earlier GPT-3 model and on refinements to RLHF. GPT stands for Generative Pre-trained Transformer and is based on the Transformer architecture proposed by Google in 2017. Its main advantage is that it acquires predictive ability for language tasks by pre-training on large amounts of text, without needing large amounts of manually labeled data. It generates language well and can handle many language tasks, such as producing text, answering questions and holding a dialogue. RLHF (Reinforcement Learning from Human Feedback) [4][5] is a complex technique involving several models and distinct training stages. It comprises three steps: pre-training a language model (LM); collecting question-and-answer data and training a reward model (RM); and fine-tuning the LM with reinforcement learning (RL).


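On 15 March 2023, the multimodal pre-trained large model GPT-4 [6] was officially released. It accepts two modalities, text and images, handles very long text inputs of up to 25,000 words, and produces text output. GPT-4 supports image input well, can understand the humor in a picture, and can follow long contexts. It shows human-level performance on a range of professional and academic benchmarks, including passing a simulated bar exam with a score around the top 10% of test takers. Compared with the earlier GPT-3.5 model, GPT-4 markedly reduces "hallucinations": on the team's internal, adversarially designed factuality evaluations, GPT-4 scores 19 percentage points higher than GPT-3.5. However, citing the competitive landscape and the safety implications of large models such as GPT-4, OpenAI has not disclosed further details about the architecture (including model size), hardware, training compute, dataset construction, training method or similar matters. At present, the ChatGPT Plus version already uses the GPT-4 model.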


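According to the ChatGPT Team Background Research Report [7] published by the Zhipu AI team, in February 2023 the ChatGPT team had fewer than one hundred members (87 in total). The report characterizes the team as "very young", "elite in background", "focused on technology", "deeply experienced", "entrepreneurial" and "notable for its ethnic Chinese members". The average age of the team is 32, and people born in the 1990s form its core. The wave of large language model technology they have led shows that young people, who are often regarded as lacking R&D experience, are fully capable of major breakthroughs in frontier technology. The great majority of team members hold degrees from leading universities and have worked at globally known companies. The ethnic Chinese researcher Long Ouyang took part in 4 of the 7 major technical projects related to ChatGPT. He is the first author of the InstructGPT paper and the second author of the RLHF paper, which shows that he was a core contributor to both of these key projects.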


02

The Innovation Path of ChatGPT-Related Technology from a Patent Perspective



1) Patent analysis of OpenAI


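We searched the well-known DWPI (Derwent World Patents Index) abstract database by applicant (OpenAI) and by inventor (the authors of the papers behind InstructGPT, GPT-3 and related work), combined with keywords such as "NLP", "language" and "training". The search found zero patents with OpenAI as the rights holder. Searches in several commercial databases likewise found no patents in OpenAI's name.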


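Several reasons may explain this. When it was founded as a non-profit, OpenAI intended to open-source all of its technology. The patent system trades disclosure for protection, and filing patents is not essential for a non-profit organization. After OpenAI became a for-profit company, other factors still weighed against patenting. ChatGPT, GPT-3 and GPT-4 are black-box models that are enormously expensive to train and complex to develop and deploy, so they are hard for other companies or research institutes to reproduce. The technology is therefore protected even without patents, and OpenAI can earn revenue through its commercial API (Application Programming Interface) and similar channels. In addition, the acquisition of training corpora and the model algorithms may fall under subject matter excluded from patentability and so cannot be protected by patents. Even where subject-matter eligibility is not an issue, training and similar steps are not visible from outside, which makes a granted patent hard to enforce. OpenAI may therefore be protecting its technology as trade secrets.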


According to the ChatGPT technical diagram published on OpenAI's website, ChatGPT is trained in the following three stages [8]:


Figure 1. Technical principle of ChatGPT

Stage 1: train a supervised policy model. Questions are sampled at random from the dataset and human labelers write high-quality answers. The labeled data is then used to fine-tune the GPT-3.5 model, which yields the SFT (Supervised Fine-Tuning) model.


Stage 2: train a reward model (RM). Questions are sampled at random from the dataset and the model from Stage 1 generates several different answers to each. Human labelers score and rank the outputs, and the ranking data is used to train the reward model.


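Stage 3: optimize the policy with PPO (Proximal Policy Optimization) [9], a reinforcement learning algorithm. An initial PPO model is first built from the weights obtained in Stage 1. For new questions sampled from the dataset, the PPO model generates answers and the RM trained in Stage 2 assigns each a reward score. PPO then uses the reward scores to compute the policy gradient and update the parameters of the PPO model.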
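The reward-model and PPO stages above can be illustrated numerically. The following is a minimal sketch, not OpenAI's implementation. It assumes the pairwise ranking loss that is commonly used to train a reward model from ranked answers, and the clipped surrogate objective from the PPO paper [9]. The function names and the clipping value 0.2 are illustrative choices, not taken from the source.

```python
import math


def pairwise_ranking_loss(reward_preferred: float, reward_rejected: float) -> float:
    """Assumed Stage 2 loss: -log(sigmoid(r_preferred - r_rejected)).

    The loss is small when the answer ranked higher by the labeler
    also receives the higher reward score.
    """
    margin = reward_preferred - reward_rejected
    # Numerically stable form of log(1 + exp(-margin)).
    if margin >= 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))


def ppo_clipped_objective(new_prob: float, old_prob: float,
                          advantage: float, clip_eps: float = 0.2) -> float:
    """Stage 3 objective for one sample: PPO clipped surrogate.

    ratio = pi_new(a|s) / pi_old(a|s); clipping keeps each policy
    update close to the policy that generated the answer.
    """
    ratio = new_prob / old_prob
    clipped_ratio = max(1.0 - clip_eps, min(1.0 + clip_eps, ratio))
    return min(ratio * advantage, clipped_ratio * advantage)


# A correctly ordered pair is penalized less than a wrongly ordered one.
print(pairwise_ranking_loss(2.0, 0.0) < pairwise_ranking_loss(0.0, 2.0))
# A large probability ratio is clipped when the advantage is positive.
print(ppo_clipped_objective(new_prob=0.6, old_prob=0.3, advantage=1.0))
```

In a real system the rewards and probabilities would come from neural networks and the objective would be averaged over batches; the scalar form here only shows how the ranking data and the reward score enter the two losses.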


2) Patent analysis of the main foreign applicants


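Pre-trained language models began to develop markedly after Google proposed the Transformer model in 2017, so the search for pre-trained language model technology in this article focuses on patents filed after 2017. A simple search of the DWPI abstract database with the keywords "language model", "train" and "fine-tune" returned more than 2,600 patent documents. The search covered patent abstracts only, and the count is the result after merging patent families.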
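The counting rule described above (keyword match on abstracts, then one count per patent family) can be sketched as follows. This is a hypothetical illustration only: the record fields `family_id` and `abstract`, the toy records, and the assumption that all keywords must match (the source does not say whether the keywords were combined with AND or OR) are not taken from DWPI or from the article.

```python
def count_matching_families(records, keywords):
    """Count distinct patent families whose abstract contains every keyword.

    records:  iterable of dicts with hypothetical keys "family_id" and "abstract"
    keywords: list of strings, matched case-insensitively as substrings
    """
    families = set()
    for record in records:
        abstract = record["abstract"].lower()
        if all(keyword.lower() in abstract for keyword in keywords):
            # Family members filed in different countries share one family_id,
            # so adding to a set merges them into a single count.
            families.add(record["family_id"])
    return len(families)


# Toy records (invented for illustration, not real patents).
toy_records = [
    {"family_id": "F1", "abstract": "Training a language model and then fine-tune it."},
    {"family_id": "F1", "abstract": "A language model is trained; a fine-tune step follows."},
    {"family_id": "F2", "abstract": "Image classifier training."},
    {"family_id": "F3", "abstract": "Fine-tune a pre-trained language model."},
]
print(count_matching_families(toy_records, ["language model", "train", "fine-tune"]))
```

Merging by family avoids counting the same invention once per filing country, which is why the family-merged figure is lower than a raw document count.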


Figure 2. Countries of origin of applicants in pre-trained language model technology

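Chinese companies have developed rapidly in the field of pre-trained language models. Baidu, Alibaba, Tencent and Huawei are all major applicants, and all have filed overseas. Foreign applicants are concentrated in Microsoft, Google and Samsung. It should also be noted that some foreign companies hold patents on improvements to neural networks or encoder-decoder structures whose abstracts do not mention language models, even though such networks can be applied to language models. The actual number of applications relating to pre-trained language model technology is therefore larger.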

ΪÁ˸üÈ«ÃæµØÁ˽â¹úÍâÉêÇëÈËÔÚÖйúµÄ²¼¾ÖÇé¿ö£¬Õë¶ÔÈ«ÎÄÊý¾ÝÔٴμìË÷£¬²¢Í³¼ÆºÏ²¢Í¬×åµÄ½á¹û¡£

Figure 3. Applications filed in China by foreign applicants

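Over the years Google has proposed a series of models, including Transformer, BERT and T5. It has filed a US patent application for the Transformer (US2018341860A1, "Attention-based sequence transduction neural networks") with family members in China, the United States, Europe, Japan, South Korea and other jurisdictions. No patents were filed for BERT or T5 themselves, but our search found that Google's portfolio covers downstream tasks derived from these models, with technical improvements in multilingual translation, text-to-speech conversion, cloze tasks, sparse representation and sentiment classification. In 2021 Google proposed the Switch Transformer [10] model, which uses sparse activation and has 1.6 trillion parameters. With the same resources it trains 4 times faster than T5-XXL, the largest language model previously developed by Google. Google filed the PCT international application WO2022150649A1 (NEURAL NETWORKS WITH SWITCH LAYERS) for this model, which has not yet entered the national phase in any country. Google has also built a patent portfolio around model training and fine-tuning.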
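Sparse activation, as described for Switch Transformer [10], means that each token is processed by only one of several expert sub-networks, selected by a router. The sketch below illustrates top-1 routing under simplifying assumptions: the "experts" are toy scalar functions and the router logits are supplied by hand, both hypothetical. It is not Google's implementation.

```python
import math


def softmax(logits):
    """Convert router logits into probabilities (max subtracted for stability)."""
    peak = max(logits)
    exps = [math.exp(value - peak) for value in logits]
    total = sum(exps)
    return [value / total for value in exps]


def switch_route(router_logits, experts, token):
    """Top-1 routing: run only the most probable expert on the token.

    Returns (expert_index, gate_probability * expert_output). The other
    experts are never evaluated, so compute per token stays roughly
    constant however many experts (parameters) the layer holds.
    """
    probabilities = softmax(router_logits)
    chosen = max(range(len(probabilities)), key=probabilities.__getitem__)
    return chosen, probabilities[chosen] * experts[chosen](token)


# Two toy experts; the router prefers the second one for this token.
toy_experts = [lambda x: x + 1.0, lambda x: x * 10.0]
print(switch_route([0.0, 2.0], toy_experts, 3.0))
```

Scaling the expert output by the gate probability keeps the router differentiable, so it can be trained together with the experts.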


Figure 4. Selected Google patents


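Building on BERT, Microsoft proposed the DeBERTa model in 2020 and filed the related US patent application "Efficient transformer language models with disentangled attention and multi-step decoding" (US2021334475A1). It uses multi-step decoding to better reconstruct masked tokens and to improve pre-training convergence, which aids the self-training of pre-trained natural language models. LoRA, proposed in 2021, concerns low-rank adaptation of neural network models with the pre-trained model weights frozen (related US patent application US2022383126A1). Microsoft has also filed on downstream tasks. For example, its PCT international application WO2022221045A1 relates to a multi-task model that includes, for example, a shared encoder, multiple task-specific encoders and multiple task-specific linear layers for the respective tasks.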
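The idea of low-rank adaptation with frozen pre-trained weights can be shown with a few lines of arithmetic. The sketch below assumes the formulation from the LoRA paper, where the adapted layer computes W x + scale * B (A x) with W frozen and only the small matrices A and B trained. The function names, the toy matrices and the `scale` parameter are illustrative and are not taken from the patent text.

```python
def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector (list of floats)."""
    return [sum(m * v for m, v in zip(row, vector)) for row in matrix]


def lora_forward(frozen_w, lora_a, lora_b, x, scale=1.0):
    """Return W x + scale * B (A x).

    frozen_w: d_out x d_in pre-trained weight, never updated
    lora_a:   r x d_in trainable matrix (r is the low rank)
    lora_b:   d_out x r trainable matrix
    """
    base = matvec(frozen_w, x)                    # frozen pre-trained path
    update = matvec(lora_b, matvec(lora_a, x))    # low-rank trainable path
    return [b + scale * u for b, u in zip(base, update)]


# Toy example: 2x2 frozen weight with a rank-1 update.
W = [[1.0, 0.0], [0.0, 1.0]]
A = [[1.0, 1.0]]      # r = 1, d_in = 2
B = [[1.0], [2.0]]    # d_out = 2, r = 1
print(lora_forward(W, A, B, [1.0, 2.0]))
```

For a d_out x d_in layer, the trainable parameter count drops from d_out * d_in to r * (d_in + d_out), which is the source of the saving when r is small.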


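In the Patentics English full-text database, a search with "DeepMind" as the applicant (DeepMind is a frontier AI company owned by Google) and "language model" as the keyword returned 27 documents. DeepMind concentrates on improvements to neural networks. One example is the Chinese patent application "Large-scale generative neural network model with inference for representation learning using adversarial training" (CN113795851A). In it, training can be based on a loss function that includes a joint discriminator loss term, based on both the sample part and the latent part of an input pair processed by a discriminator neural network, and at least one single discriminator loss term based on only one of the sample part or the latent part of the input pair. This patent family has filings in China, the United States and other countries. Tracing the Chinese family members of the patents found in the English database shows that DeepMind files in China under the name 渊慧科技有限公司.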


Figure 5. Selected DeepMind patent filings


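Figure 5 shows that DeepMind also has filings in the multimodal area, covering multimodal few-shot learning with frozen language models and the selection of actions using multimodal inputs. A multimodal language model is an AI technology that can process different types of data at the same time, such as text, images, audio and video. Its goal is cross-modal understanding, generation and interaction, which improves human-machine dialogue and information retrieval. Google has recently filed on UI-based multimodal models. For example, US patent application US2023031702A1 describes a versatile user interface transformer (VUT) that processes three types of data (images, structure in the form of view hierarchies, and language) and performs several different tasks, such as UI object detection, natural language processing, screen summarization and UI tappability prediction. Microsoft's PCT international application WO2022187063A1 discloses a cross-modal processing method for vision and language, which trains a target model on a set of visual semantic features and a set of text features in order to determine the association between an input text and an input image.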


03

Development of Related Technology in China


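A simple search of the Patentics Chinese database with keywords such as "pre-training", "large-scale", "language model", "fine-tuning", "zero/few-shot" and "knowledge graph" returned 12,292 patents. The results show that pre-trained large-model technology in China has developed rapidly since 2018. Because patents filed in 2021 and 2022 have not yet all been published, the actual number of applications in this field is probably higher.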


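Figure 6. Trend in Chinese patent applications for large language model technology

Figure 7. Main applicants for Chinese patents on large AI model technology [11]

Figure 8. Applications filed in the United States by Chinese applicants in large language model technology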


1) Domestic patents related to language models


2019Äê3Ô£¬°Ù¶ÈÌá³öÎÄÐÄ´óÄ£ÐÍERNIE£¬Ëæºó°Ù¶ÈÔÚ֪ʶͼÆ×¡¢ÓïÑÔÀí½âÓëÉú³É¼¼Êõ¡¢ÒÔ¼°»úÆ÷·­Òë¡¢¶Ô»°ÏµÍ³¡¢ÕªÒªÉú³É¡¢³¤Îı¾ÓïÒå¡¢Îı¾¾À´íµÈÁìÓò¶¼½øÐв¼¾Ö¡£ÆäÖÐ֪ʶͼÆ×°üÀ¨ÊµÌå֪ʶͼÆ×¡¢ÐÐҵ֪ʶͼÆ×¡¢Ê¼þͼÆ×¡¢¹Ø×¢µãͼÆ×ÒÔ¼°¶àģ̬ͼÆ×¡£


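Huawei cooperates with Tsinghua University, Harbin Institute of Technology, Renmin University of China and other universities. Its language models include autoregressive models, and its patent filings cover model training methods, the use of quantum circuits for complex-valued model computation, reducing the resources needed to train PLMs (pre-trained language models), and text vectors.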


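Figure 9 shows how domestic patents related to language models have developed. Among them, Chinese patent application CN110717339A builds unsupervised or weakly supervised pre-training tasks at three levels (word fragments, sentences and articles), so that a semantic representation model can learn knowledge at each of these levels from massive data. This strengthens general semantic representation and improves performance on NLP tasks. This Baidu patent also received an Excellence Award at the 23rd China Patent Awards (2022).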


Figure 9. Development of domestic patents related to language models


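For multimodal models, Baidu's Chinese patent application CN115374798A proposes combining cross-lingual and cross-modal pre-training objectives seamlessly in a unified framework, learning images and text in a joint embedding space from available English image-caption data, monolingual corpora and parallel corpora. Huawei's Chinese patent application CN115688937A maps the feature representations of data from different modalities into the same discrete space. Multimodal feature representations can then be modeled in that discrete space, which yields a model compatible with multimodal input data.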


2) Domestic patents related to human-computer interaction applications


For human-computer interaction applications similar to ChatGPT, domestic applicants also hold corresponding patents, but they have not filed overseas.


Table 1. Technology portfolios of the major domestic companies


04

Limitations of GPT Technology from a Patent Analysis Perspective


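The free version of ChatGPT currently uses GPT-3.5 (referred to below as ChatGPT-3.5). It has excellent contextual dialogue ability, but it cannot yet interact multimodally, it lacks the ability to solve mathematical problems, and in some specialist fields it was trained on too little data, so it often fails to produce an appropriate answer. For example, the authors asked ChatGPT-3.5 to explain the technical solution of US patent application US2021334475A1. It produced a complete description, including a title and a technical solution. In fact, this document was filed by Microsoft on 24 June 2020 under the title "Efficient transformer language models with disentangled attention and multi-step decoding" and was published on 28 October 2021, and the answer from ChatGPT-3.5 had nothing to do with it. At the very least, ChatGPT-3.5 cannot match a patent number to the content of the invention, possibly because it lacks the relevant patent corpus.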

Figure 10. Screenshot of a ChatGPT-3.5 conversation


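Because Microsoft has integrated GPT-4 into New Bing, the authors also searched for US patent application US2021334475A1 through New Bing's chat function. Although it returned a complete set of information, only the title of the invention was correct. The filing date, publication date, applicant and inventors were all wrong (see Figure 11). Judging from this result, New Bing tends to fill in a complete answer on top of its search results and cannot guarantee that the answer is true.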



Figure 11. Screenshot of the New Bing chat function


It should be noted that New Bing also gave wrong answers after repeated attempts (see Figure 12).

Figure 12. Screenshot of the New Bing chat function


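Neither ChatGPT-3.5 nor New Bing can reliably provide the details of a patent document. Do they perform better on legal provisions? The authors asked both ChatGPT-3.5 and New Bing: "If the technical solution of a patent cannot be carried out, which article of the Chinese Patent Law applies?" ChatGPT's answer read like the output of a model trained on a large body of legal literature. It looked accurate, but neither the articles it cited nor their stated content belong to the Chinese Patent Law. New Bing's answer was the product of search plus processing. It found the right article, but the content it attributed to that article had nothing to do with it. Neither ChatGPT-3.5 nor New Bing can therefore guarantee the accuracy of what it generates.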

Figure 13. Comparison of the ChatGPT-3.5 and New Bing chat functions


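The reasons are as follows. GPT-3.5 answers only from what it learned from its own training corpus and has no internet access, so it fabricates information that it does not have and its answers lack accuracy. New Bing, which integrates GPT-4, is a web-connected large language model. When answering, it first searches the internet for material related to the user's query and uses it to supplement the answer, which eliminates some fabrication. For uncommon questions, or where information is missing, the risk of fabrication remains.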


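In addition, both training and deploying ChatGPT require a great deal of computing power, so lighter-weight models may be needed. Domestic companies need to advance large models by deepening cooperation among industry, universities and research institutes. Publicly available information shows that Pengcheng Laboratory has worked with Huawei to develop the Pangu large model and with Baidu to develop the Pengcheng-Baidu Wenxin large model. Huawei, for its part, has been bringing together research institutes and industry vendors in the hope of forming a positive closed loop for the large-model industry.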


05

Implications for the Development of Large-Model Technology in China


1) Emphasize original innovation and develop large models that evolve sustainably


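The future of large models requires original innovation as well as self-sustained growth, moving in a sustainable and evolvable direction. AI technology has grown exponentially in recent years. In the current political and economic environment we should place even more weight on original innovation and on mastering root technologies. At the same time we should not work behind closed doors or insist on starting everything from scratch. International and domestic cooperation and exchange deserve attention, so that large models can evolve sustainably.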


2) Build research and development facilities for large models


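AI research today has moved beyond solo efforts. Heads-down work in small workshops can no longer produce breakthrough results in an increasingly competitive environment. ChatGPT's sudden rise also rested on several billion US dollars of prior investment, and major results depend on major platforms. China should give strong support to high-end research platforms and accelerate the construction of clusters of large scientific facilities that integrate three elements: data, computing power and engineering innovation capability.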


3£© È˲ŶÓÎéÅàÑø


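Competition in technological innovation is, in essence, competition for scientific and technical talent. As the analysis above shows, OpenAI's success came not only from heavy investment in computing power but, more importantly, from gathering a large number of top scientists and engineers. Attracting outstanding people worldwide who can overcome hard technical problems, selecting leaders with international influence, and cultivating young people with high potential will be important means of advancing AI in China.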


4£© ²îÒ컯¾ºÕù£¬°²È«Â×ÀíÐÔ¼ÓÇ¿


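The period in which large-model technology pays dividends will last a long time. ChatGPT's explosive popularity does not mean that China has lost the initiative entirely and can only follow. OpenAI leads in text-based large language models, but in multimodal large models scientists around the world are still working on hard technical problems. To become a front-runner in the new round of AI innovation, China must learn to compete through differentiation and build on its own distinctive strengths. The evolution of large-model technology will place ever more emphasis on the governance of technology ethics and on system safety. Building up safety and ethics in a way that reflects Chinese values is also a priority that deserves our attention.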


References

[1] Greg Brockman et al. Introducing OpenAI. URL https://openai.com/blog/introducing-openai/, 2015.
[2] OpenAI. OpenAI Charter. URL https://openai.com/charter, 2018.
[3] Reuters report, 10 January 2023. URL https://www.reuters.com/technology/microsoft-talks-invest-10-bln-chatgpt-owner-semafor-2023-01-10/, 2023.
[4] Paul F Christiano et al. Deep Reinforcement Learning from Human Preferences. URL https://arxiv.org/abs/1706.03741, 2017.
[5] Nisan Stiennon et al. Learning to summarize from human feedback. URL https://arxiv.org/abs/2009.01325, 2020.
[6] OpenAI. GPT-4 Technical Report. URL https://arxiv.org/abs/2303.08774, 2023.
[7] Zhipu Research & AMiner. ChatGPT Team Background Research Report. URL https://originalfileserver.aminer.cn/sys/aminer/ai10/pdf/ChatGPT-team-background-research-report.pdf, 2023.
[8] OpenAI. Introducing ChatGPT. URL https://openai.com/blog/chatgpt, 2022.
[9] John Schulman et al. Proximal Policy Optimization Algorithms. URL https://arxiv.org/abs/1707.06347, 2017.
[10] William Fedus et al. Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity. URL https://arxiv.org/abs/2101.03961, 2021.
[11] IPRdaily. Ranking of Invention Patents Held by Chinese AI Large-Model Companies (TOP 50). URL http://www.iprdaily.cn/article1_33676_20230320.html, 2023.

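Authors: Úìˬ, Young Researcher, Shanghai Artificial Intelligence Laboratory;
¹ÒÕ (corresponding author), Senior Engineer, Shanghai Artificial Intelligence Laboratory, internationally certified technology licensing professional;
·ë­ZÑÞ, Intellectual Property Manager, Shanghai Artificial Intelligence Laboratory.
The views expressed in this article do not represent the position of the host institution.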

