Initial commit: SheepOp LLM - Transformer-based language model implementation
- Complete transformer implementation from scratch - Training pipeline with gradient accumulation and mixed precision - Optimized inference with KV caching - Multi-format data processing (PDFs, images, code, text) - Comprehensive documentation - Apache 2.0 license - Example training plots included in docs/images/
This commit is contained in:
1
checkpoints
Symbolic link
1
checkpoints
Symbolic link
@@ -0,0 +1 @@
|
||||
/mnt/storage/sheepOp/checkpoints
|
||||
Reference in New Issue
Block a user