Special Thanks llama.cpp: Port of Facebook's LLaMA model in C/C++ llama-cpp-python: Python bindings for llama.cpp faiss: A library for efficient similarity search and clustering of dense vectors.