Open-source artificial intelligence

From HandWiki
Short description: Concept of open applied to AI

Open-source artificial intelligence is the application of open-source practices to the development of artificial intelligence resources.

Many open-source artificial intelligence products are variations of other existing tools and technologies which have been shared as open-source software by large companies.[1]

Companies often develop closed products in an attempt to keep a competitive advantage in the marketplace.[2] A journalist for Wired explored the idea that open-source AI tools have a development advantage over closed products, and could overtake them in the marketplace.[2]

Popular open-source artificial intelligence project categories include large language models, machine translation tools, and chatbots.[3]

For software developers to produce open-source artificial intelligence resources, they must trust the various other open-source software components they use in its development.[4]

Large language models

LLaMA

LLaMA is a family of large language models released by Meta AI starting in February 2023.[5] Meta claims these models are open-source software, but the Open Source Initiative disputes this claim, arguing that "Meta’s license for the LLaMa models and code does not meet this standard; specifically, it puts restrictions on commercial use for some users (paragraph 2) and also restricts the use of the model and software for certain purposes (the Acceptable Use Policy)."[6]

Comparison of open-source large language foundation models
Model Developer Parameter Count Context Window Licensing
LLaMA[5] Meta AI 7B, 13B, 33B, 65B 2048 ——
LLaMA 2[7][8] Meta AI 7B, 13B, 70B 4k Custom Meta license
Mistral 7B[9] Mistral AI 7 billion 8k[10] Apache 2.0
GPT-J[11] EleutherAI 6 billion 2048 Apache 2.0
Pythia[12] EluetherAI 70 million - 12 billion —— Apache 2.0 (Pythia-6.9B only)[13]

References

  1. Heaven, Will Douglas (May 12, 2023). "The open-source AI boom is built on Big Tech's handouts. How long will it last?" (in en). https://www.technologyreview.com/2023/05/12/1072950/open-source-ai-google-openai-eleuther-meta/. 
  2. 2.0 2.1 Solaiman, Irene (May 24, 2023). "Generative AI Systems Aren't Just Open or Closed Source". Wired. https://www.wired.com/story/generative-ai-systems-arent-just-open-or-closed-source/. 
  3. Castelvecchi, Davide (29 June 2023). "Open-source AI chatbots are booming — what does this mean for researchers?". Nature 618 (7967): 891–892. doi:10.1038/d41586-023-01970-6. 
  4. Thummadi, Babu Veeresh (2021). Artificial Intelligence (AI) Capabilities, Trust and Open Source Software Team Performance. Lecture Notes in Computer Science. 12896. 629–640. doi:10.1007/978-3-030-85447-8_52. ISBN 978-3-030-85446-1. 
  5. 5.0 5.1 "Introducing LLaMA: A foundational, 65-billion-parameter language model". 2023-09-11. https://ai.meta.com/blog/large-language-model-llama-meta-ai/. 
  6. "Meta’s LLaMa 2 license is not Open Source". https://blog.opensource.org/metas-llama-2-license-is-not-open-source/. 
  7. "meta-llama/Llama-2-70b-chat-hf · Hugging Face". https://huggingface.co/meta-llama/Llama-2-70b-chat-hf. 
  8. "Llama 2 - Meta AI" (in en). https://ai.meta.com/llama/. 
  9. "mistralai/Mistral-7B-v0.1 · Hugging Face". https://huggingface.co/mistralai/Mistral-7B-v0.1. 
  10. AI, Mistral (2023-09-27). "Mistral 7B" (in en-us). https://mistral.ai/news/announcing-mistral-7b/. 
  11. "EleutherAI/gpt-j-6b · Hugging Face". 2023-05-03. https://huggingface.co/EleutherAI/gpt-j-6b. 
  12. Biderman, Stella; Schoelkopf, Hailey; Anthony, Quentin; Bradley, Herbie; O'Brien, Kyle; Hallahan, Eric; Mohammad Aflah Khan; Purohit, Shivanshu; USVSN Sai Prashanth; Raff, Edward; Skowron, Aviya; Sutawika, Lintang; Oskar van der Wal (2023-10-03). "[2304.01373] Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling". arXiv:2304.01373 [cs.CL].
  13. "EleutherAI/pythia-6.9b · Hugging Face". 2023-05-03. https://huggingface.co/EleutherAI/pythia-6.9b.