Contrast
< Back to Blog
Original link:

https://www.youtube.com/live/U9mJuUkhUzk

2023-11-08 07:26:17

OpenAI DevDay, Opening Keynote

video content Image generated by Wilowrid

Good morning .

Thank you for joining us today .

Please welcome to the stage , Sam Altman .

Good morning .

Welcome to our first ever open a DEV Day .

We're thrilled that you're here and this energy is awesome and welcome to San Francisco .

San Francisco has been our home since day one .

The city is important to us and the tech industry in general , we're looking forward to continuing to grow here .

So we've got some great stuff to announce today .

But first , I'd like to take a minute to talk about some of the stuff that we've done over the past year .

video content Image generated by Wilowrid

About a year ago , November 30th , we shipped Chat GP T as a low key research preview and that went pretty well in March , we followed that up with the launch of GP T four , still the most capable model out in the world .

And in the last few months , we launched voice and vision capabilities so that chat GPT can now see , hear and speak .

And more recent .

There's a lot you don't have to clap each time .

And , and more recently , we launched Dolly three , the world's most advanced image model you can use it .

Of course , inside of chat GP T for our enterprise customers .

We launched Chat GPT enterprise which offers enterprise grade security and privacy , higher speed GPT four access longer context Windows .

A lot more .

video content Image generated by Wilowrid

Today , we've got about 2 million developers building on our API for a wide variety of use cases , doing amazing stuff .

Over 92% of fortune 500 companies building on our products .

And we have about 100 million weekly active users now on chat G BT .

And what's incredible on that is we got there entirely through word of mouth .

People just find it useful and tell their friends open A I is the most advanced and the most widely used A I platform in the world now .

But numbers never tell the whole picture on something like this .

What's really important is how people use the products , how people are using A I .

And so I'd like to show you a quick video .

I actually wanted to write something to my dad in Tagalog .

I want a non romantic way to tell my parent that I love him .

Wilowrid Advertisement
video content Image generated by Wilowrid

And I also want to tell him that he can rely on me , but in a way that still has the respect of like a child to parent relationship that you should have in Filipino culture and in Tagalog grammar when it's translated into Tagalog .

But I love you very deeply and I will be with you no matter where the path I see some of the possibility .

I was like , whoa , sometimes I'm not sure about some stuff and I feel like I try to be like , hey , this is what I'm thinking about .

So it kind of give it a more confidence .

The first thing that blew my mind was it levels with you ?

Like that's something that a lot of people struggle to do .

It opened my mind to just what every creative could do if they just had a person , help bring them out who listens .

So this is that you represent Sickling hemoglobin .

And you built that with chad GP T GP T built it with me and I started using it for daily activities .

Like , hey , here's a picture of my fridge .

video content Image generated by Wilowrid

Can you tell me what I'm missing because I'm going grocery shopping and I really need to do recipes that are following my vegan diet .

As soon as we got access to the code interpreter .

I was like , wow , this thing is awesome .

It could build spreadsheets , it could do anything .

I discovered chatty about three months ago on my 1/100 birthday .

Chatty is very friendly , very patient , very knowledgeable and very quick .

It's been a wonderful thing .

I'm a 4.0 student , but I also have four Children .

When I started using Chat G BT , I realized I could ask Chat GP T that question and not only does it give me an answer , but it gives me an explanation .

Didn't need tutoring as much it gave me a life back .

It gave me time for my family and time for me , I have a chronic nerve g uh on my whole left half of my body , I have nerve damage uh had like a spine of brain surgery .

And so I have like limited use of my left hand .

video content Image generated by Wilowrid

Now , you can just have like the integration of voice input and then the newest one where you can have a back and forth dialogue that's just like maximum best interface for me .

It's here .

Yeah .

So we , we love hearing the stories of , of how people are using the technology .

It's really why we do all of this .

OK .

So now on to the new stuff and we have got a lot first , we're gonna talk about a bunch of improvements we've made and then we'll talk about where we're headed next .

Over the last year , we spent a lot of time talking to developers around the world .

We've heard a lot of your feedback .

It's really informed that we have to show you today today .

Wilowrid Advertisement
video content Image generated by Wilowrid

We are launching a new model GP T four Turbo , GP T four .

Turbo will address many of the things that you all have asked for .

So let's go through what's new .

We've got six major things to talk about for this part .

Number one context length , a lot of people have tasks that require a much longer context length GP T four supported up to eight K and in some cases up to 32 K context length , but we know that isn't enough for many of you and what you wanna do GP T four turbo supports up to 100 and 28,000 tokens of context .

That's 300 pages of a standard book 16 times longer than our eight K context .

And in addition to a longer context length , you'll notice that the model is much more accurate over a long context .

video content Image generated by Wilowrid

Number two , more control , we've heard loud and clear that developers need more control over the model's responses and outputs .

So we've addressed that in a number of ways , we have a new feature called JSON mode which ensures that the model will respond with valid JSON .

This has been a huge developer request .

It will make calling a PS much easier .

The model is also much better at function calling .

You can now call many functions at once and it will do better at following instructions .

In general .

We're also introducing a new feature called reproducible outputs .

You can pass the C parameter and it'll make the model return consistent outputs .

This of course , gives you a higher degree of control over model behavior .

This rolls out in beta today .

And in the coming weeks , we'll roll out a feature to let you view log pros in the API all right .

video content Image generated by Wilowrid

Number three better world knowledge , you want these models to be able to access better knowledge about the world So do we , so we're launching a retrieval in the platform .

You can bring knowledge from outside documents or databases into whatever you're building .

We're also updating the knowledge cut off .

We are just as annoyed as all of you .

Probably more that GP D four's knowledge about the world ended in 2021 .

We will try to never let it get that out of date again .

GP T four .

Turbo has knowledge about the world up to April of 2023 and we will continue to improve that over time .

Number four , new modalities surprising no one dolly three GP T four Turbo with vision and the new text to speech model are all going into the API .

Wilowrid Advertisement
video content Image generated by Wilowrid

Today , we have a handful of customers that have just started using dolly three to program programmatically generate images and designs .

Today , Coke is launching a campaign that lets his customers generate Diwali cards using dolly three and of course our safety systems help developers protect their applications against misuse .

Those tools are available in the API GP T four .

Turbo can now accept images as inputs via the API can generate captions , classifications and analysis .

For example , be my eyes uses this technology to help people who are blind or have low vision with their daily tasks like identifying products in front of them and with our new text to speech model , you'll be able to generate incredibly naturally natural sounding audio from text in the API with six preset voices to choose from .

I'll play an example .

video content Image generated by Wilowrid

Did you know that Alexander Graham Bell , the eminent inventor was enchanted by the world of sounds .

His ingenious mind led to the creation of the gramophone which etches sounds onto wax , making voices , whisper through time .

This is much more natural than anything else we've heard out there .

Voice can make apps more natural to interact with and more accessible .

It also unlocks a lot of use cases like language learning and voice assistant .

Speaking of new modalities , we're also releasing the next version of our open source speech recognition model whisper V three today .

And it'll be coming soon to the API it features improved performance across many languages and we think you're really gonna like it .

OK .

Number five customization , fine tuning has been working really well for G BT 3.5 since we launched it a few months ago , starting today , we're going to expand that to the 16 K version of the model .

video content Image generated by Wilowrid

Also starting today , we're inviting active fine tuning users to apply for the GP T four fine tuning experimental access program .

The fine tuning API is great for adapting our models to achieve better performance in a wide variety of applications with a relatively small amount of data .

But you may want a model to learn a completely new knowledge domain or to use a lot of proprietary data .

So today we're launching a new program called custom models with custom models .

Our researchers will work closely with the company to help them make a great custom model , especially for them and their use case using our tools .

This includes modifying every step of the model training process , doing additional domain specific pre training , a custom RL post training process .

It's tailored for specific domain and whatever else , we won't be able to do this with many companies to start .

It'll take a lot of work and in the interest of expectations , at least initially , it won't be cheap .

Wilowrid Advertisement
video content Image generated by Wilowrid

But if you're excited to push things as far as they can currently go , please get in touch with us and we think we can do something pretty great .

Ok .

And then number six higher rate limits , we're doubling the tokens per minute for all of our established GP T four customers so that it's easier to do more .

And you'll be able to request changes to further rate limits and quotas directly in your API account settings .

In addition to these rate limits , it's important to do everything we can do to make it you successful building on our platform .

So we're introducing copyright shields , copyright shield means that we will step in and defend our customers and pay the costs incurred if you face legal claims around copyright infringement .

And this applies both to chat GP T Enterprise and the API and let me be clear , this is a good time to remind people .

We do not train on data from the API or Chat G BT Enterprise ever .

All right .

video content Image generated by Wilowrid

There's actually one more developer request that's been even bigger than all of these .

Uh And so I'd like to talk about that now and that's pricing GP T four turbo is the industry leading model .

It delivers a lot of improvements that we just covered .

And it's a smarter model than GP T four .

We've heard from developers that there are a lot of things that they want to build .

But GP T four just cost too much .

They've told us that if we could decrease the cost by 20 25% that would be great .

A huge leap forward .

I'm super excited to announce that we worked really hard on this .

video content Image generated by Wilowrid

And G BT four Turbo , a better model is considerably cheaper than G BT four by a factor of three X for prompt tokens and two extra for completion tokens starting today .

So the new pricing is one cent per 1000 prompt tokens and three cents per 1000 completion tokens for most customers that will lead to a blended rate more than 2.75 times cheaper to use for GP T four turbo than GP T four .

We work super hard to make this happen .

We hope you're as excited about it as we are .

So we decided to prioritize price first because we had to choose one or the other .

But we're gonna work on speed next .

We know that speed is important too soon .

You will notice GP T four turbo becoming a lot faster .

Wilowrid Advertisement
video content Image generated by Wilowrid

We're also decreasing .

The cost of GP T 3.5 turbo 16 K .

Also input tokens are three X less and output tokens are two X less , which means the GP T 3.5 16 K is now cheaper than the previous GP T 3.54 K model running a fine TNE GP T 3.5 Turbo 16 K version is also cheaper than the old fine tuned four K version .

Ok .

So we just covered a lot about the model itself .

We hope that these changes address your feedback .

We're really excited to bring all of these improvements to everybody .

Now , in all of this , we're lucky to have a partner who is instrumental in making it happen .

So I'd like to bring on a special guest , Satya Nadella , the CEO of Microsoft .

Good to see you .

Thank you so much .

Thank you , Satya .

Thanks so much for coming here .

It's fantastic to be here .

video content Image generated by Wilowrid

And uh Sam Congrats , I mean , I'm really looking forward to Turbo and everything else that you have coming .

It's been just fantastic with you guys .

Um Two questions .

I won't take too much of your time .

How , how is Microsoft thinking about the partnership currently ?

We love you guys .

No , look , it's , it's , it's been fantastic for us .

In fact , I remember the first time I think you reached out and said , hey , do you have some Azure credits .

We've come a long way from there .

Uh Thank you for that .

That was great .

You , you guys have built something magical .

I mean , quite frankly , there are two things for us when it comes to the partnership .

The first is these workloads .

And even when I was listening backstage to how you're describing what's coming , even it's just so different .

And you , I've been in this infrastructure business for , you know , three decades , no one has ever seen infrastructure and the workload , the pattern of the workload , these , you know , these training jobs , so synchronous and so large and so data parallel .

video content Image generated by Wilowrid

Um And so the first thing that we have been doing is building in partnership with you the system all the way from thinking from power to the DC , to the rack , to the accelerators to the network .

And just , you know , really the the shape of Azure is drastically changed and is changing rapidly in support of these models that you're building .

And so our job number one is to build the best system so that you can build the best models and then make that all available to developers .

And so the other thing is we ourselves are developers .

So we are building products .

In fact , my own conviction of this entire generation of foundation models completely changed the first time I saw , you know , get up Copilot on GPT .

Um And so we want to build our copilot , get up copilot all as developers uh on top of open A I API S .

And so we are very committed to that .

And what does that mean to developers ?

Wilowrid Advertisement
video content Image generated by Wilowrid

You know , look , I I always think of Microsoft as a platform company , a developer company and a partner company .

And so we wanna make , you know , for example , we want to make github uh available github copilot available as the enterprise edition available to all the attendees here so that they can try it out .

That's awesome .

Yeah , we're very excited about that and you can count on us to build the best infrastructure in Azure with your API support uh and bring it to all of you and then even things like the Azure marketplace .

So for developers who are building products out here to get to the market uh rapidly , so that's sort of really our intent here .

Great .

And how do you think about the future , future of the partnership or future of A I or whatever anything you want ?

That's a uh you know , like there are a couple of things for me that I think are going to be very , very key for us , right ?

video content Image generated by Wilowrid

One is I just described how the systems that are needed as you aggressively push forward on your road map requires us to be on the top of our game .

And we intend fully to commit ourselves deeply to making sure you all as builders of these foundation models have not only the best systems for training and inference , but the most compute so that you can keep pushing forward on the frontiers .

Because I think that's the way we're going to make progress .

The second thing I think both of us care about in quite frankly , the thing that excited both sides to come together is your mission and our mission , our mission is to empower every person and every organization on the planet to achieve more .

And to me , ultimately A I is only going to be useful if it truly does empower , right ?

I mean , I saw the video you played early .

I mean , that was fantastic to see , hear those voices describe what A I meant for them and what they were able to achieve .

video content Image generated by Wilowrid

So I it's about being able to get the benefits of A I broadly decimated to everyone I think is going to be the goal for us .

And then the last thing is , of course , we are very grounded in the fact that safety matters and safety is not something that you would care about later , but it's something we do shift left on and we are very , very focused on that with you all .

Great .

Well , I think we have the best partnership in tech .

I'm excited for us to build a together .

I'm really excited to have a friend .

Thank you very much for coming .

Thank you so much .

OK .

So we have shared a lot of great updates .

For developers already and we got a lot more to come .

But even though this is developer conference , we can't resist making some improvements to chat GP T .

So a small one chat GP T now uses GP T four turbo with all the latest improvements including the latest knowledge cut off , which will continue to update , that's all live today .

Wilowrid Advertisement
video content Image generated by Wilowrid

It can now browse the web when it needs to write and run code , analyze data , take and generate images and much more .

And we heard your feedback .

That model picker extremely annoying .

That is gone .

Starting today .

You will not have to click around the drop down menu .

All of this will just work together .

Chat GP T yeah , chat G BT will just know what to use and when you need it .

But that's not the main thing .

Uh and neither was price .

Actually the main developer request , there was one that was even bigger than that and I wanna talk about where we're headed and the main thing we're here to talk about today .

So we believe that if you give people better tools , they will do amazing things .

We know that people want A I that is smarter , more personal , more customizable can do more on your behalf .

Eventually , you'll just ask a computer for what you need and it'll do all of these tasks for you .

video content Image generated by Wilowrid

These capabilities are often talked in the A I field about as agents .

The upsides of this are going to be tremendous at open A I .

We really believe that gradual iterative deployment is the best way to address the safety issues , the safety challenges with A I , we think it's especially important to move carefully towards this future of agents .

It's gonna require a lot of technical work and a lot of thoughtful consideration by society .

So today , we're taking our first small step that moves us towards this future .

We're thrilled to uh we're thrilled to introduce GP TS GPT S are tailored versions of chat GPT for a specific purpose .

You can build a GP T , a customized version of chat GP T for almost anything with instructions , expanded knowledge and actions , and then you can publish it for others to use .

video content Image generated by Wilowrid

And because they combine instructions , expanded knowledge and actions , they can be more helpful to you .

They can work better in many contexts and they can give you better control , they'll make it easier for you to accomplish all sorts of tasks or just have more fun and you'll be able to use them right ?

Within chat GPT .

You can in effect program , a GPT with language just by talking to it .

It's easy to customize the behavior so that it fits what you want .

This makes building them very accessible and it gives agency to everyone .

So we're gonna show you what GPT S are , how to use them , how to build them .

And then we're gonna talk about how they'll be distributed and discovered .

And then after that , for developers , we're gonna show you how to build these agent link experiences into your own apps .

So first let's look at a few examples .

Our partners at code dot org are working hard to expand computer science in , in schools .

Wilowrid Advertisement
video content Image generated by Wilowrid

They've got a curriculum that is used by tens of millions of students worldwide code dot org .

Crafted lesson planner GP T to help teachers provide a more engaging experience for middle schoolers .

If a teacher asks it to explain four loops in a creative way , it does just that in this case , it'll do it in terms of a video game character repeatedly picking up coins super easy to understand for an eighth grader .

As you can see this GP T brings together code dot org's extensive curriculum and expertise and lets teachers adapt it to their needs quickly and easily .

Next can has built a GP T that lets you start designing by describing what you want in natural language .

If you say make a poster for Dev dev day reception this afternoon this evening and you give it some details , it'll generate a few options to start with by hitting canvas API s .

Now , this concept may be familiar to some of you .

We've evolved our plugins to be custom actions for GPS .

video content Image generated by Wilowrid

You can keep chatting with this to see different iterations .

And when you see one , you like , you can click through to Canva for the full design experience .

So now we'd like to show you a GPT live zap year has built a GP T that , that lets you perform actions across 6000 applications to unlock all kinds of integration possibilities .

I'd like to introduce Jessica , one of our solutions architects who is going to drive this demo .

Welcome , Jessica .

Thank you , Sam .

Thank you .

Thank you all for being here .

My name is Jessica Shay .

I work with partners and customers to bring their product alive .

And today I , I can't wait to show you how hard we've been working on this .

So let's get started .

So to start where your G BT will live is on this upper left corner , I'm gonna start with clicking on the zap A a actions and on the right hand side , you can see that's my calendar for today .

video content Image generated by Wilowrid

So it's quite a day .

I've already used this before .

So it's actually already connected to my calendar to start .

I can ask what's on my schedule for today .

We build G BT S with security in mind .

So before it performs any action or share data , it will ask for your permission .

So right here , I'm gonna say aloud .

So G BT is designed to take in your instructions , make the decision on which capability to call to perform that action and then execute that for you .

So you can see right here it's already connected to my calendar .

It pulls into my , my information and then I've also prompted it to identify conflicts on my calendar .

So you can see right here , it actually was able to identify that .

So it looks like I have something coming up .

So what if I wanna let Sam know that I have to leave early ?

So right here I say , let Sam know I gotta go , um chasing GP US .

Wilowrid Advertisement
video content Image generated by Wilowrid

So with that , I'm gonna swap to my conversation with Sam and then I'm gonna say yes , please run that Sam .

Did you get that ?

I did ?

Awesome .

So this is only a glimpse of what is possible and I cannot wait to see what you all will build .

Thank you and back to you Sam .

Thank , thank you Jessica .

So those are three great examples .

In addition to these , there are many more kinds of GPT that people are creating and many , many more that will be created soon .

We know that many people who want to build a GP T don't know how to code .

We've made it so that you can program the GPT just by having a conversation .

video content Image generated by Wilowrid

We believe that natural language is gonna be a big part of how people use computers in the future .

And we think this is an interesting early example .

So I'd like to show you how to build one .

All right .

So I want to create a GP T uh that helps give founders and developers advice when starting new projects .

Um I'm going to go to create a GP T here and this drops me into the GP T Builder .

Uh I worked with founders for years at YC and still whenever I meet developers , the questions I get are always about how do I , you know , think about a business idea .

Can you give me some advice ?

Uh I'm gonna see if I can build a GP T to help with that .

video content Image generated by Wilowrid

So to start GP T Builder asks me what I want to make and I'm gonna say I want to help start up founders think through their business ideas and get advice after the founder has gotten some advice , uh grill them on why they are not growing faster .

All right .

So to start off , I just tell the GP T a little bit about , about what I want here and it's gonna go off and start thinking about that and it's gonna write some detailed instructions for the GP T .

Um It's also gonna , let's see , ask me about a name .

How do I feel about start up mentor ?

That's fine .

Uh That's good .

Wilowrid Advertisement
video content Image generated by Wilowrid

So if I didn't like the name , of course , I could call it something else , but it's , you know , gonna try to have this conversation with me and , and start there and you can see here on uh on , on the right in the preview mode that it's already starting to fill out the GP T um where it says what it does , it has some like ideas of additional questions that I could ask .

Um And you know what I actually , so I just generated a candidate , of course , I could regenerate that or change it , but I sort of like that .

So I will say that's great .

And you see now that the GP T is being built out a little bit more as we go .

Now , what I want this to do um how it can interact with users .

I could talk about style here .

But what I'm gonna say uh is I am going to upload transcripts of some lectures about start ups I have given , please give advice based off of those .

All right .

video content Image generated by Wilowrid

So now uh it's gonna go figure out how to do that .

And I would like to show you the configure tab .

So you can see some of the things that were built out here as we were going um by , by the builder itself and you can see that there's capabilities here that I can enable .

Um I could add custom actions .

These are all fine to leave .

Um I'm gonna upload a file .

Uh So here is a lecture that I pictures that I used to that I gave with some start up advice .

Um And I'm gonna add that here in terms of these questions .

Uh This is a dumb one , the rest of those are reasonable .

Uh And like very much things founders often ask .

Um I'm gonna add one more thing to the instructions here .

Which is be concise and constructive with feedback .

All right .

So again , if we had more time , I'd show you a bunch of other things .

But this is uh this is like a decent start and now uh we can try it out over on this preview tab .

So I will say um what's a common question ?

video content Image generated by Wilowrid

What are three things to look ?

Oops , what are three things to look for when hiring employees at an early stage start up ?

Now it's going to look at that document I uploaded .

Um It'll also have of course , all of the background knowledge of GP T four .

That's pretty good .

Those are three things that I definitely have said many times .

Um Now we could go on and it would start following the other instructions and you know , grill me on why I'm not growing faster , but in the interest of time , I'm gonna skip that .

Uh I'm gonna publish this only to me for now .

Uh I can work on it later .

I can add more content .

I can add a few actions that I think would be useful .

Um And then I can share it publicly .

So that's what it looks like to create a GP T with .

Thank you .

Wilowrid Advertisement
video content Image generated by Wilowrid

By the way , I always , I always wanted to do that after like all of the YC office hours , I always thought , man , someday I will be able to make a bot that will do this and that'll be awesome .

So with GPT S , we're letting people easily share and discover all the fun ways that they use .

Chat GPT with the world .

You can make private GPT S like I just did or you can share your creations publicly with a link for anyone to use .

Or if you're on chat GP T Enterprise , you can make GP TS just for your company .

And later this month , we're going to launch the GPT store .

You can list a GP .

Thank you .

I appreciate that you can list a GP T there and we'll be able to feature the best and the most popular GP TS .

Of course , we'll make sure that GP TS in the store follow our policies before they're accessible .

video content Image generated by Wilowrid

Revenue sharing is important to us .

We're going to pay people who build the most useful and the most used GPT S a portion of our revenue .

We're excited to foster a vibrant ecosystem with the GP T store just from what we've been building ourselves over the weekend , we're confident there's gonna be a lot of great stuff .

We're excited to share more information soon .

So those are GP TS and we can't wait to see what you'll build .

But this is a developer conference and the coolest thing about this is that we're bringing the same concept to the API many of you have already been building agent like experiences on the API for example , Shopify sidekick , which lets you take actions on the platform Discords .

Clyde lets Discord moderators create custom custom personalities for and snaps .

My A I A customized chat bot that can be added to group chats and make recommendations .

video content Image generated by Wilowrid

These experiences are great but they have been hard to build sometimes taking months .

Teams of dozens of engineers , there's a lot to handle to make this custom assistant experience .

So today we're making it a lot easier with our new assistance .

API the assistance API includes persistent threads .

So they don't have to figure out how to deal with long conversation history built in retrieval code interpreter , a working Python interpreter in a sandbox environment .

And of course the improved function calling that we talked about earlier .

So we'd like to show you a demo of how this works .

And here is Raman , our head of developer experience .

Welcome .

Thank you , son .

Good morning .

Wow , it's fantastic to see you all here .

Wilowrid Advertisement
video content Image generated by Wilowrid

It's been so inspiring to see so many of you infusing A I into your apps .

Today , we're launching new modalities in the API .

But we are also very excited to improve the developer experience for you all to build assistive agents .

So let's dive right in .

Imagine I'm building Wanderlust to travel app for global explorers .

And this is the landing page .

I've actually used GP T four to come up with these destination ideas .

And for those of you with the keen eye , these illustrations are generated programmatically using the new Dali three API available to all of you today .

So it's pretty remarkable .

But let's enhance this app by adding a very simple assistant to it .

This is the screen .

We're gonna come back to it in a 2nd .

1st , I'm gonna switch over to the new assistant playground .

Creating an assistant is easy .

You just give it a name , some initial instructions .

A model .

In this case , I'll pick GP T four tubo and here I'll also go ahead and select some tools .

video content Image generated by Wilowrid

I'll turn on code interpreter and retrieval and save and that's it .

Our assistant is ready to go .

Next , I can integrate with two new primitives of this assistant CP I threads and messages .

Let's take a quick look at the code .

The process here is very simple .

For each new user , I will create a new thread .

And as these users engage with their assistant , I will add their messages to the threads .

Very simple .

And then I can simply run the assistant at any time to stream the responses back to the app .

So we can return to the app and try that in action .

If I say , hey , let's go to Paris .

All right .

That's it .

With just a few lines of code users can now have a very specialized assistance right inside the app .

And I'd like to highlight one of my favorite features here .

Function cooling .

If you have not used it yet , function cooling is really powerful .

video content Image generated by Wilowrid

And as Sam mentioned , we're taking it a step further today , it now guarantees the JSON output with no added latency and you can invoke multiple functions at once for the first time .

So here , if I carry on and say , hey , what are the top 10 things to do when I have the assistant respond to that again ?

And here , what's interesting is that the assistant knows about functions including those to annotate the map that you see on the right .

And so now all of these pins are dropping in real time here .

Yeah , it's pretty cool .

And that integration allows our natural language interface to interact fluidly with components and features of our app .

And it truly showcases now the harmony you can build between A I and UI where the assistant is actually taking action .

Wilowrid Advertisement
video content Image generated by Wilowrid

But next , next , let's talk about retrieval and retrieval is about giving our assistant more knowledge beyond this immediate user messages .

In fact , I got inspired and I already booked my tickets to uh to Paris .

So I'm just gonna drag and drop here this PDF , what it's including , I can just sneak peek at it very typical United flight ticket .

And behind the scene here , what's happening is that retrieval is reading these files and boom , the information about this PDF appeared on the screen and this is of course a very tiny PDF but assistant can pass long form documents from extensive text to intricate product specs depending on what you're building .

In fact , I also booked an airbnb .

So I'm just going to drag that over to the conversation as well .

And by the way , we've heard from so many of you developers , how hard that is to build yourself ?

You typically need to compute your own biddings , you need to set up chunking algorithm .

video content Image generated by Wilowrid

Now , all of that is taken care of and there is more than retrieval with every API call .

You usually need to resend the entire conversation history , which means , you know , setting up a key value store .

That means like handling the context windows , serializing messages and so forth .

That complexity now completely goes away with this new state API .

But just because the PIA is managing the CP I does not mean it's a black box .

In fact , you can see the steps that the tools are taking right inside your developer dashboard .

So here if I go ahead and click on threads , this is the thread that I believe we're currently working on and see like these are all the steps including the functions being called with the right parameters and uh the PDF S I've just uploaded .

But let's move on to a new capability that many of you have been requesting for a while .

Good interpreter is now available today in the API as well .

video content Image generated by Wilowrid

That gives the A I the ability to write and execute code on the fly , but even generate files .

So let's see that in action .

If I say here , hey , we'll be four friends staying at this airbnb .

What's my share of it ?

Plus my flights ?

All right now here , what's happening is that code interpreter noticed that it should write some code to answer this query .

So now it's computing , you know , the number of days in Paris , the number of friends , it's also doing some exchange rate calculation behind the scene to get this answer for us .

Not the most complex math , but you get the picture .

Imagine you're building a very complex like finance app that's crunching countless numbers plotting charts .

So really any task that you'd normally tackle with good then good entrepreneur will work great for you .

All right , I think my trip to Paris is sorted .

Wilowrid Advertisement
video content Image generated by Wilowrid

So to recap here , we've just seen how you can quickly create an assistant that manages state for your user conversations , leverages external tools like knowledge and retrieval and code interpreter and finally invokes your own functions to make things happen .

But there's one more thing I wanted to show you to kind of really open up the possibilities using function cooling combined with our new modalities that we're launching today while working on DEV day , I built a small custom assistant that knows everything about this event .

But instead of having a chat interface while running around all day today , I thought why not use voice instead ?

So let's bring my phone up on screen here .

So you can see it on the right .

Awesome .

So on the right you can see a very simple swift app that takes microphone input and on the left , I'm actually gonna bring up my terminal log so you can see what's happening behind the scenes .

So let's give it a shot .

Hey , there , I'm on the keynote stage right now .

video content Image generated by Wilowrid

Can you greet our attendees here at Dev Day ?

Hey , everyone .

Welcome to Dev Day .

It's awesome to have you all here .

Let's make it an incredible day .

Isn't that impressive ?

You have six unique and rich voices to choose from in the API each speaking multiple languages .

So you can really find the perfect fit for your app .

And on my laptop here on the left , you can see the logs of what's happening behind the scenes too .

So I'm using whisper to convert the voice inputs into text , an assistant with GP T four Ciobo and finally the new TT S API to make it speak .

But thanks to function cooling things , get even more interesting when the assistant can connect to the internet and take real actions for users .

So let's do something even more exciting here together .

How about this ?

video content Image generated by Wilowrid

The assistant , can you randomly select five deputy attendees here and give them $500 in open eye credits ?

Yes , checking the list of attendees done .

I picked five DEV Day attendees and added $500 of APR credits to their account .

Congrats to Christine M .

Jonathan C .

Steven G , Louis K and Suraj S .

Alright .

If you recognize yourself .

Awesome congrats .

Um And that's it a quick overview today of the new assistant CP I combined with some of the new tools and modalities that we launched all starting with the simplicity of a rich text or voice conversation for you and users , we really can't wait to see what you build .

And congrats to our lucky winners actually , you know what , you're all part of this amazing open eye community here .

So I'm just gonna talk to my assistant one last time before I step off the stage .

Wilowrid Advertisement
video content Image generated by Wilowrid

Hey , assistant , can you actually give everyone here uh in the audience ?

$500 in open air credits .

Sounds great .

Let me go through everyone .

All right , that , that function will keep running , but I've run out of time .

So thank you so much .

Everyone .

Have a great day back to you , Sam .

Pretty cool , huh ?

All right .

So that assistance API goes into beta today and we are super excited to see what you all do with it .

Anybody can enable it over time .

GP TS and assistant are precursors to agents are going to be able to do much , much more .

They'll gradually be able to plan and to perform more complex actions on your behalf .

video content Image generated by Wilowrid

As I mentioned before , we really believe in the importance of gradual iterative deployment .

We believe it's important for people to start building with and using these agents now to get a feel for what the world is gonna be like as they become more capable .

And as we've always done , we'll continue to update our systems based off of your feedback .

So we're super excited that we got to share all of this with you .

Today , we introduced GP TS , custom versions of chat GPT that combine instructions , extended knowledge and actions .

We launched the assistance API to make it easier to build assisted experiences with your own apps .

These are our first steps towards A I agents and we will be increasing their capabilities over time .

We introduced a new GP T four turbo model that delivers improved function calling knowledge , lowered pricing , new modalities and more .

And we're deepening our partnership with Microsoft .

video content Image generated by Wilowrid

In closing , I wanted to take a minute to thank the team that creates all of this open air has got remarkable talent density , but still it takes a huge amount of hard work and co ordination to make all of this happen .

I truly believe that I've got the best colleagues in the world .

I feel incredibly grateful to get to work with them .

We do all of this because we believe that A I is going to be a technological and societal revolution .

It will change the world in many ways .

And we're happy to get to work on something that will empower all of you to build so much for all of us .

We talked about earlier how if you give people better tools , they can change the world .

We believe that A I will be about individual empowerment and agency at a scale that we've never seen before .

And that will elevate humanity to a scale that we've never seen before either .

We'll be able to do more , to create more and to have more as intelligence gets integrated everywhere .

We will all have superpowers on demand .

Wilowrid Advertisement
video content Image generated by Wilowrid

We're excited to see what you all will do with this technology and to discover the new future that we're all going to architect together .

We hope that you'll come back next year .

What we launched today is going to look very quaint relative to what we're busy creating for you .

Now , thank you for all that you do .

Thank you for coming here today .

Partnership

Attention YouTube vloggers and media companies!
Are you looking for a way to reach a wider audience and get more views on your videos?
Our innovative video to text transcribing service can help you do just that.
We provide accurate transcriptions of your videos along with visual content that will help you attract new viewers and keep them engaged. Plus, our data analytics and ad campaign tools can help you monetize your content and maximize your revenue.
Let's partner up and take your video content to the next level!
Contact us today to learn more.