HPipe: Large Language Model Pipeline Parallelism for Long Context on Heterogeneous Cost-effective Devices

Ruilong Ma | Xiang Yang | Jingyu Wang | Qi Qi | Haifeng Sun | Jing Wang | Zirui Zhuang | Jianxin Liao |

Paper Details:

Month: June
Year: 2024
Location: Mexico City, Mexico
Venue: NAACL |

Citations

URL

No Citations Yet

No URLs Found

Field Of Study