🐋 Transformer

| Page | Tags | Last edited time |
| --- | --- | --- |
| 🍊 Responsible AI | | Aug 4, 2025 09:52 AM |
| 🍐 LLM powered application architectures | | Aug 4, 2025 07:02 AM |
| 🍋 Reasoning and Acting (ReAct) | PAL, ReAct | Apr 13, 2026 07:29 AM |
| 🥐 Program-aided language models PAL | PAL | Aug 1, 2025 07:30 AM |
| 🍪 Chain of Thought Prompting | | Aug 1, 2025 06:10 AM |
| 🍬 Retrieval Augmented Generation RAG | RAG | Aug 1, 2025 04:12 AM |
| 🧡 Model optimizations to improve application performance | PTQ, Distillation, Pruning | Jul 31, 2025 03:35 AM |
| 🥔 Optimize LLMs and build generative AI applications | | Aug 4, 2025 06:19 AM |
| 💛 Scaling human feedback (RLAIF) | RLHF, RLAIF | Jul 30, 2025 08:04 AM |
| 🤎 Reward hacking | RLHF | Jul 30, 2025 03:45 AM |
| 🤔 Proximal Policy Optimization PPO | PPO | Apr 13, 2026 07:27 AM |
| 🧡 Fine-tuning with RLHF | | Jul 29, 2025 03:18 PM |
| 💚 Training the reward model | | Jul 29, 2025 02:52 AM |
| 💙 Obtaining feedback from humans | | Aug 5, 2025 10:19 AM |
| 🍇 Reinforcement Learning from Human Feedback (RLHF) | | Jul 31, 2025 09:32 AM |
| 🧡 Soft prompts | | Jul 23, 2025 08:48 AM |
| 💚 LoRA Low-rank Adaptation | LoRA | Apr 13, 2026 07:29 AM |
| 🍒 Parameter efficient fine-tuning (PEFT) | | Jul 23, 2025 08:55 AM |
| 🍅 Benchmarks | | Jul 18, 2025 03:50 AM |
| 🍋 Model evaluation | | Jul 17, 2025 11:41 AM |
| 🍏 Multi-task instruction fine-tuning | | Jul 17, 2025 06:31 AM |
| 🫐 Fine-tuning on a single task | | Jul 16, 2025 12:00 PM |
| 🍇 Fine-tuning an LLM with instruction prompts | | Jul 16, 2025 09:08 AM |
| 🥦 Pre-training for domain adaptation | | Jul 15, 2025 09:01 AM |
| 🍅 Scaling laws and compute-optimal models | | Jul 15, 2025 07:04 AM |
| 🥕 Efficient multi-GPU compute strategies | | Jul 14, 2025 09:06 AM |
| 🐳 Computational challenges of training LLMs | QAT | Aug 5, 2025 02:42 AM |
| 🐟 Pre-training large language models | | Jul 15, 2025 09:25 AM |
| 🐙 Generative AI project lifecycle | | Jul 31, 2025 06:18 AM |
| 🐠 Generative configuration parameters for inference | | Jul 10, 2025 02:13 AM |
| 🐬 Prompt engineering | | Jul 9, 2025 11:24 AM |