Skip to content

I create the repository to understand the codes of FlightLLM.

Notifications You must be signed in to change notification settings

LiuTielong/Iron_FlightLLM

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 

Repository files navigation

FlightLLM Test Demo

This demo is for testing FlightLLM implementation on the Xilinx Alveo U280 FPGA. Our submission can be divided into two parts.

  1. Performance profile (see profile/README.md for details): It is used to compare the performance of the GPU baseline with the simulation performance of the VHK158 FGPA and calculate the speedup ratio, which can verify Figure 12 in the paper (Throughput Speedup of LLaMA2-7B). The performance results of the VHK158 are based on the software run and no hardware is involved.

  2. FPGA on-board testing (see fpga_implementation/README.md for details): It is a hardware on-board test on the U280 FPGA to measure the correctness and performance of the paper design. The performance can verify Figure 1 in the paper (55 token/s).

About

I create the repository to understand the codes of FlightLLM.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%