Optimize tokenizer initialization in LazySupervisedDataset for QWEN a… #288

ShacklesLay · 2024-10-09T13:16:39Z

Removed `tokenizer = copy.deepcopy(tokenizer)` from `preprocess_llama3` and `preprocess_qwen`. Both functions run every time a sample is fetched from the dataloader, so the copy was repeated per sample and cost extra time. Whether to deep-copy the tokenizer is now decided once, during initialization of the `LazySupervisedDataset` class, based on `conversation_lib.default_conversation.version`.
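The change can be illustrated with a minimal sketch. It is not the code from this PR: `DummyTokenizer`, `LazyDatasetSketch`, `preprocess_sketch`, and the version strings in `VERSIONS_NEEDING_COPY` are hypothetical stand-ins, used only to show the copy moving from the per-sample path into `__init__`.

```python
import copy


class DummyTokenizer:
    """Hypothetical stand-in for a real tokenizer; counts deep copies."""

    copies = 0

    def __init__(self):
        self.special_tokens = []

    def __deepcopy__(self, memo):
        # Record every deep copy so the cost is observable.
        DummyTokenizer.copies += 1
        clone = DummyTokenizer()
        clone.special_tokens = list(self.special_tokens)
        return clone


# Assumed version strings for illustration; the real check reads
# conversation_lib.default_conversation.version.
VERSIONS_NEEDING_COPY = {"qwen", "llama_v3"}


def preprocess_sketch(text, tokenizer):
    # Per-sample work; note there is no copy.deepcopy(tokenizer) here.
    return text.split()


class LazyDatasetSketch:
    def __init__(self, samples, tokenizer, version):
        self.samples = samples
        # Decide once, at construction time, whether a private copy is needed.
        if version in VERSIONS_NEEDING_COPY:
            tokenizer = copy.deepcopy(tokenizer)
        self.tokenizer = tokenizer

    def __len__(self):
        return len(self.samples)

    def __getitem__(self, i):
        # Called for every fetched sample; reuses the stored tokenizer.
        return preprocess_sketch(self.samples[i], self.tokenizer)


shared = DummyTokenizer()
dataset = LazyDatasetSketch(["a b", "c d e", "f"], shared, version="qwen")
items = [dataset[i] for i in range(len(dataset))]
```

In this sketch the copy count stays at one regardless of how many samples are fetched, whereas copying inside the preprocess function would scale with the number of fetches.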

Commit 24c821e: Optimize tokenizer initialization in LazySupervisedDataset for QWEN and LLaMA3
