Awesome-Datasets-Hub Aggregates LLM Training and Evaluation Data Across Medical AI, Code, and Reasoning Tasks
A GitHub repository curates datasets for LLM fine-tuning, instruction tuning, and benchmarking across medical, NLP, multimodal, and code domains.