docker-transformers-inference
Python
A containerized solution for hosting transformer models using Flask, Gunicorn, and Docker with AWS SageMaker deployment support. Build once, run anywhere!
hands-on-data-gemma
Python
This project dives into the capabilities of Google's DataGemma LLM and demonstrates how to replicate similar behavior on the Claude LLM through prompt engineering. By combining large language models with real-time data retrieval from Data Commons, we aim to provide accurate and up-to-date responses to statistical queries.
ai-docker-image-optimization
Python
An evolving codebase that demonstrates various techniques for optimizing Docker images for size and performance.