Models

From AI Dynamic Storytelling Wiki
Jump to navigation Jump to search

Introduction[edit | edit source]

A 'model' is the software that serves as an AI storyteller's brain, and is the primary software which drives its text-generation output. Models use statistics to generate probability distributions for sequences of words, usually using a process of Tokenization.

Learning Parameters[edit | edit source]

Commonly expressed in numbers of Beaks. Why?, also in terms of billions. Additional parameters gives a model more fidelity, in theory leading to better outputs. However, they are subject to diminishing returns while the hardware required to hold them becomes prohibitively expensive, hence the lack of large locally hosted models.

Finetuning[edit | edit source]

Finetuning is the process by which a model can be specialized for specific types of output.

List of Known AI Models[edit | edit source]

OpenAI Models[edit | edit source]

  • GPT
  • GPT2
  • GPT3
    • Ada (350M), Babbage (1.3B), Curie (6.7B), DaVinci (175B)

EleutherAI Models[edit | edit source]

  • GPT-Neo
    • 2.7B
  • GPT-Jax
    • 6B
  • GPT-NeoX
    • 20B

Other Models[edit | edit source]

  • AI21
    • Jurassic-1 Large (7.5B), Jumbo (178B)
  • Facebook