-
Notifications
You must be signed in to change notification settings - Fork 17
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
Showing
2 changed files
with
38 additions
and
0 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,38 @@ | ||
--- | ||
layout: post | ||
title: "Exploring the Art Gallery with GPT-4o" | ||
author: Taeyoung Kim | ||
date: 2024-5-24 09:50:24 | ||
categories: GPT-4o LECL LangChain | ||
comments: true | ||
image: http://tykimos.github.io/warehouse/2024//Users/tykimos/Downloads/2024-5-25-exploring_the_art_gallery_with_gpt4o_title.jpg | ||
--- | ||
OpenAI's GPT-4o is a multimodal AI model capable of understanding and generating various forms of data, including text, images, and videos. In this post, we will explore the new possibilities of using this multimodal model to visit exhibitions with an AI influencer. | ||
|
||
<iframe width="100%" height="400" src="https://youtu.be/t4hwcpwJyrk" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" allowfullscreen=""></iframe> | ||
|
||
### GPT-4o | ||
|
||
GPT-4o is an AI model capable of understanding and generating various types of data. The key strengths of this model are as follows: | ||
|
||
* Understanding Various Data Forms: It can simultaneously process and understand different types of data, such as text, images, and videos. This provides users with a richer and more intuitive interface. | ||
* Real-Time Q&A: It interacts with users in real-time, offering accurate information based on the input data. This allows users to receive immediate feedback. | ||
* Contextual Understanding and Generation: It comprehends given situations and contexts, generating appropriate responses accordingly. This greatly helps in personalizing and enhancing the user experience. | ||
|
||
### AI Influencer Tyri | ||
|
||
By leveraging the strengths of GPT-4o, an AI influencer can provide a new form of experience by accompanying users through exhibitions. The AI influencer can answer users' questions in real-time, provide information about the exhibited artworks, and share personal interpretations and evaluations of the pieces. For more insights into Tyri's daily life, please visit the blog below. | ||
|
||
* [Tyri's Blog](https://tyritarot.github.io/) | ||
|
||
### Method of Inputting Recorded Videos into GPT-4o | ||
|
||
The screen is divided into two sections: the left side displays the input screen or video, and the right side shows the video being processed by the AI along with a chat interface. This setup uses 5 frames for video processing. | ||
|
||
![img](http://tykimos.github.io/warehouse/2024/2024-5-21-exploring_the_prester_john_exhibition_with_ai_influencer_tyri_1.jpg) | ||
|
||
To view the source code for inputting images into GPT-4o, please click the link below: | ||
|
||
* [Image Descriptions with GPT-4o and LCEL](https://tykimos.github.io/2024/05/15/image_descriptions_with_gpt_4o_and_lcel/) | ||
|
||
Through this process, we explored the exhibition with Tyri. |
Binary file added
BIN
+112 KB
warehouse/2024/2024-5-25-exploring_the_art_gallery_with_gpt4o_title.jpg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.