Combining Large and Small LLMs to Boost Inference Time and Quality Combining Large and Small LLMs to Boost Inference Time and Quality Click here to read the article