Neural Machine Translation of Rare Words with Subword Units https://arxiv.org/abs/1508.07909
Root Mean Square Layer Normalization https://dl.acm.org/doi/pdf/10.5555/3454287.3455397
RoFormer: Enhanced Transformer with Rotary Position Embedding https://arxiv.org/abs/2104.09864