European AI Model “Teuken-7B” Released to Promote Linguistic Diversity and Innovation

The European research project OpenGPT-X has released a large language model for artificial intelligence applications. The model, named “Teuken-7B,” is available for download on the “Hugging Face” platform. OpenGPT-X is a European research and development project that began in early 2022. The goal of the project is to develop a large AI language model that reflects European values, complies with European data protection standards, and supports linguistic diversity. “Teuken-7B” was trained from scratch in all 24 official languages of the EU and has seven billion parameters.

Until now, almost all relevant AI language models in the Western world have come from the USA. These include GPT-4 from OpenAI, Claude from the AI startup Anthropic, Grok from Elon Musk’s xAI, Llama from Facebook’s parent company Meta, and Gemini from Google. Experts estimate that OpenAI’s GPT-4 alone has around 200 billion parameters. The European model “Teuken-7B” is now freely available worldwide, offering science and business an alternative that comes from public research. Researchers and businesses can use the open-source model in commercial projects and integrate it into their own AI applications.

The OpenGPT-X project is led by two Fraunhofer Institutes: the Institute for Intelligent Analysis and Information Systems (IAIS) and the Institute for Integrated Circuits (IIS). Other participants include TU Dresden, the Jülich Research Center, and companies such as Aleph Alpha and IONOS SE. “Our model has demonstrated its capabilities across a wide range of languages, and we hope that as many people as possible will adapt or further develop the model for their own work and applications,” said Stefan Wrobel, Director of Fraunhofer IAIS. The aim is to help meet the growing demand for transparent and customizable generative AI solutions, both within the scientific community and in collaboration with companies from various industries.

The release of “Teuken-7B” is a significant step toward providing a European alternative to the dominant US AI models. It emphasizes the importance of linguistic diversity and data protection, in line with European values. Because the model is open source, its release encourages innovation and development in AI across Europe and beyond. Researchers and companies can use it to build applications tailored to their specific needs, making AI technologies accessible and adaptable to different contexts and languages.

The development of “Teuken-7B” also highlights the collaborative efforts of various institutions and companies in Europe, showcasing the potential of joint research and development projects. By pooling resources and expertise, the OpenGPT-X project aims to foster a more inclusive and diverse AI landscape. This initiative not only supports the advancement of AI technologies but also reinforces Europe’s position in the global AI arena.

Overall, the release of “Teuken-7B” represents a milestone in the pursuit of AI models that respect privacy, embrace linguistic diversity, and adhere to ethical standards. It provides a foundation for further research and development, encouraging the creation of AI solutions that are both innovative and responsible. As the demand for AI continues to grow, initiatives like OpenGPT-X are crucial to shaping the future of AI in a way that aligns with societal values and needs.