The Writer LLM service enables you to customize and use the Writer LLMs outlined below.
|Palmyra Small||128m||HuggingFace||Apache-2.0||Improving language understanding by generative pre-training – arxiv|
|Palmyra 3B||3B||HuggingFace||Apache-2.0||Improving language understanding by generative pre-training – arxiv|
|Palmyra Base||5B||HuggingFace, API||Apache-2.0||Improving language understanding by generative pre-training – arxiv|
|Palmyra Large||20B||API & Write Platform||Enterprise License||Improving language understanding by generative pre-training – arxiv|
|InstructPalmyra||30B||API, Write Platform, On-premises||Enterprise License||Training language models to follow instructions with human feedback|
|Palmyra-R||30B||Closed Beta||Enterprise License||Autoregressive language model with Retrieval-Augmented Generation|
|Palmyra-E||30B||Closed Beta||Enterprise License||Autoregressive language model|
These large language models have been pre-trained on a massive amount of Internet text. Pre-training involves taking a mathematical model with random mathematical parameters (weights) and adjusting those weights iteratively in response to discrepancies between the model's output and a comparison point indicating the expected output. The most common training method for large language models is next-word prediction over massive amounts of text.
Differences between models
Palmyra Small is the fastest of Writer’s LLMs and can perform important tasks such as text parsing, simple classification, address correction, and keyword recognition. Providing more context drives better performance.
Good at: Text parsing, simple classification, address correction, and keyword recognition
Palmyra Base is extremely powerful as well as incredibly fast. This model excels at many nuanced tasks such as sentiment classification and summarization. Palmyra Base is also effective as a general service chatbot, answering questions and performing Q&A.
Competent in: complex classification, text sentiment, and summarization
Palmyra Large is the most capable model family, capable of performing any task that the other models can, often with less instruction. Palmyra Large is good at comprehending the text's intent, solving logic problems, and explaining character motivations.
Good at: Few-shots, cause and effect, and audience summarization
InstructPalmyra is the most capable model. It can perform any tasks that the other models are able to, often with higher quality, longer output, and better instruction-following.
Good at: Zero-shots, cause and effect
Updated about 8 hours ago