FlightLLM Test Demo

This demo is for testing FlightLLM implementation on the Xilinx Alveo U280 FPGA. Our submission can be divided into two parts.

Performance profile (see profile/README.md for details): It is used to compare the performance of the GPU baseline with the simulation performance of the VHK158 FGPA and calculate the speedup ratio, which can verify Figure 12 in the paper (Throughput Speedup of LLaMA2-7B). The performance results of the VHK158 are based on the software run and no hardware is involved.
FPGA on-board testing (see fpga_implementation/README.md for details): It is a hardware on-board test on the U280 FPGA to measure the correctness and performance of the paper design. The performance can verify Figure 1 in the paper (55 token/s).

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
fpga_implementation		fpga_implementation
profile		profile
.gitignore		.gitignore
README.md		README.md

Provide feedback